Towards Agile and Judicious Metadata Load Balancing for Ceph File System via Matrix-based Modeling

Shao, Xinyang; Wang, Yiduo; Li, Cheng; Liang, Hengyu; Wang, Chenhan; Yan, Feng; Xu, Yinlong

doi:10.1145/3721483

This content will become publicly available on March 5, 2026

To scale out massive metadata access, the Ceph distributed file system (CephFS) adopts a dynamic subtree partitioning method, splitting the hierarchical namespace and distributing subtrees across multiple metadata servers. However, this method suffers from a severe imbalance problem that may result in poor performance, owing to its inaccurate imbalance prediction, its ignorance of workload characteristics, and its unnecessary or invalid migration activities. To eliminate these inefficiencies, we propose Lunule, a novel CephFS metadata load balancer, which employs an imbalance factor model to determine accurately when to trigger a re-balance and when to tolerate harmless imbalance. Lunule further adopts a workload-aware migration planner to select appropriate subtree migration candidates. Finally, we extend Lunule to Lunule⁺, which models metadata accesses as matrices and employs matrix-based formulas for more accurate load prediction and re-balance decisions. Compared to baselines, Lunule achieves better load balance, increases metadata throughput by up to 315.8%, and shortens the tail job completion time by up to 64.6% across five real-world workloads and their mixture. In addition, Lunule is capable of handling metadata cluster expansion and workload growth, and scales linearly on a 16-node cluster. Compared to Lunule, Lunule⁺ achieves up to 64.96% better metadata load balance and 13.53-86.09% higher throughput.
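The idea of triggering a re-balance only when imbalance is actually harmful can be illustrated with a minimal sketch. This is not Lunule's imbalance factor model, whose formulas the abstract does not give. It assumes, purely for illustration, that imbalance is measured as the coefficient of variation of per-server load and that imbalance is harmless while no server is near saturation. The function names, the `capacity` parameter, and both thresholds are hypothetical.

```python
from statistics import mean, pstdev


def imbalance_factor(loads):
    """Coefficient of variation of per-server load (0.0 means perfectly even).

    Assumption: a stand-in metric, not the model defined in the paper.
    """
    avg = mean(loads)
    if avg == 0:
        return 0.0
    return pstdev(loads) / avg


def should_rebalance(loads, capacity, cv_threshold=0.3, busy_fraction=0.8):
    """Trigger only when load is uneven AND some server is close to saturation.

    cv_threshold and busy_fraction are hypothetical tuning knobs.
    """
    uneven = imbalance_factor(loads) > cv_threshold
    saturated = max(loads) >= busy_fraction * capacity
    return uneven and saturated
```

Under these assumptions, a skewed but lightly loaded cluster such as `[90, 10, 10, 10]` requests per second with a capacity of 1000 does not trigger migration, whereas `[900, 100, 100, 100]` does, because one server is both far above the mean and near its limit.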

Award ID(s): 2305491

PAR ID: 10577864

Author(s) / Creator(s): Shao, Xinyang; Wang, Yiduo; Li, Cheng; Liang, Hengyu; Wang, Chenhan; Yan, Feng; Xu, Yinlong

Publisher / Repository: ACM

Date Published: 2025-03-05

Journal Name: ACM Transactions on Storage

ISSN: 1553-3077

Format(s): Medium: X

Sponsoring Org: National Science Foundation

Free Publicly Accessible Full Text
Journal Article:
https://doi.org/10.1145/3721483
