Using flash-based solid state drives (SSDs) as main memory has been proposed as a practical solution towards scaling memory capacity for data-intensive applications. However, almost all existing approaches rely on the paging mechanism to move data between SSDs and host DRAM. This inevitably incurs significant performance overhead and extra I/O traffic. Thanks to the byte-addressability supported by the PCIe interconnect and the internal memory in SSD controllers, it is feasible to access SSDs in both byte and block granularity today. Exploiting the benefits of SSD's byte-accessibility in today's memory-storage hierarchy is, however, challenging as it lacks systems support and abstractionsmore »
Chameleon: An Adaptive Wear Balancer for Flash Clusters
NAND flash-based Solid State Devices (SSDs) offer
the desirable features of high performance, energy efficiency,
and fast growing capacity. Thus, the use of SSDs is increasing in
distributed storage systems. A key obstacle in this context is that
the natural unbalance in distributed I/O workloads can result in
wear imbalance across the SSDs in a distributed setting. This, in
turn can have significant impact on the reliability, performance,
and lifetime of the storage deployment. Extant load balancers
for storage systems do not consider SSD wear imbalance when
placing data, as the main design goal of such balancers is to
extract higher performance. Consequently, data migration is
the only common technique for tackling wear imbalance, where
existing data is moved from highly loaded servers to the least
loaded ones.
In this paper, we explore an innovative holistic approach,
Chameleon, that employs data redundancy techniques such as
replication and erasure-coding, coupled with endurance-aware
write offloading, to mitigate wear level imbalance in distributed
SSD-based storage. Chameleon aims to balance the wear among
different flash servers while meeting desirable objectives of:
extending life of flash servers; improving I/O performance; and
avoiding bottlenecks. Evaluation with a 50 node SSD cluster
shows that Chameleon reduces the wear distribution deviation
by 81% while improving the write performance by up to 33%.
- Publication Date:
- NSF-PAR ID:
- 10065105
- Journal Name:
- the 32nd IEEE International Parallel and Distributed Processing Symposium (IPDPS)
- Sponsoring Org:
- National Science Foundation
More Like this
-
-
Although flash cells wear out, a typical SSD has enough cells and sufficiently sophisticated firmware that its lifetime generally exceeds the expected lifetime of its host system. Even under heavy use, SSDs last for years and can be replaced upon failure. On a smartphone, in contrast, the hardware is more limited and we show that, under heavy use, one can easily, and more quickly, wear out smartphone flash storage. Consequently, a simple, unprivileged, malicious application can render a smartphone unbootable ("bricked") in a few weeks with no warning signs to the user. This bleak result becomes more worrisome when consideringmore »
-
Abstract. Flash based solid state drives (SSDs) have established them- selves as a higher-performance alternative to hard disk drives in cloud and mobile environments. Nevertheless, SSDs remain a performance bot- tleneck of computer systems due to their high I/O access latency. A com- mon approach for improving the access latency is prefetching. Prefetch- ing predicts future block accesses and preloads them into main memory ahead of time. In this paper, we discuss the challenges of prefetching in SSDs, explain why prior approaches fail to achieve high accuracy, and present a neural network based prefetching approach that signi cantly outperforms themore »
-
Log-based data management systems use storage as if it were an append-only medium, transforming random writes into sequential writes, which delivers significant benefits when logs are persisted on hard disks. Although solid-state drives (SSDs) offer improved random write capabilities, sequential writes continue to be advan- tageous due to locality and space efficiency. However, the inherent properties of flash-based SSDs induce major disadvantages when used with a random write block interface, causing write amplifica- tion, uneven wear, log stacking, and garbage collection overheads. To eliminate these disadvantages, Zoned Namespace (ZNS) SSDs have recently been introduced. They offer increased capacity, re- ducedmore »
-
The freshness of web page indices is the key to improving searching quality of search engines. In Baidu, the major search engine in China, we have developed DirectLoad, an index updating system for efficiently delivering the webscale indices to nationwide data centers. However, the web-scale index updating suffers from increasingly high data volumes during network transmission and inefficient I/O transactions due to slow disk operations. DirectLoad accelerates the index updating streams from two aspects: 1) DirectLoad effectively cuts down the overwhelmingly high volume of indices in transmission by removing the redundant data across versions, and mutates regular operations in amore »