skip to main content
US FlagAn official website of the United States government
dot gov icon
Official websites use .gov
A .gov website belongs to an official government organization in the United States.
https lock icon
Secure .gov websites use HTTPS
A lock ( lock ) or https:// means you've safely connected to the .gov website. Share sensitive information only on official, secure websites.


Search for: All records

Creators/Authors contains: "Zhang, Cheng"

Note: When clicking on a Digital Object Identifier (DOI) number, you will be taken to an external site maintained by the publisher. Some full text articles may not yet be available without a charge during the embargo (administrative interval).
What is a DOI Number?

Some links on this page may take you to non-federal websites. Their policies may differ from this site.

  1. Free, publicly-accessible full text available October 6, 2027
  2. Free, publicly-accessible full text available May 1, 2027
  3. Free, publicly-accessible full text available May 1, 2027
  4. Modern model hubs, such as Hugging Face, store tens of petabytes of LLMs, with fine-tuned variants vastly outnumbering base models and dominating storage consumption. Existing storage reduction techniques---such as deduplication and compression---are either LLM-oblivious or not compatible with each other, limiting data reduction effectiveness. Our large-scale characterization study across all publicly available Hugging Face LLM repositories reveals several key insights: (1) fine-tuned models within the same family exhibit highly structured, sparse parameter differences suitable for delta compression; (2) bitwise similarity enables LLM family clustering; and (3) tensor-level deduplication is better aligned with model storage workloads, achieving high data reduction with low metadata overhead. Building on these insights, we design BitX, an effective, fast, lossless delta compression algorithm that compresses XORed difference between fine-tuned and base LLMs. We build ZipLLM, a model storage reduction pipeline that unifies tensor-level deduplication and lossless BitX compression. By synergizing deduplication and compression around LLM family clustering, ZipLLM reduces model storage consumption by 54%, over 20% higher than state-of-the-art deduplication and compression approaches. 
    more » « less
    Free, publicly-accessible full text available May 4, 2027
  5. Free, publicly-accessible full text available March 31, 2027
  6. Free, publicly-accessible full text available March 1, 2027
  7. Free, publicly-accessible full text available January 20, 2027
  8. Free, publicly-accessible full text available January 1, 2027
  9. Here we provide percent contribution of mineral associated (i.e., heavy fraction - HF) and relatively more labile (i.e., light fraction - LF) organic matter through soil profiles and along hillslope catena within sites in the Critical Zone Network (CZNet) Geomicrobiology cluster. Each sample is separated into a HF an a LF utilizing a 1.85 g cm-3 sodium polytungstate (3Na2WO4·9WO3·H2O or Na6 [H2W12O40]) solution. The resultant fractions are run for percent carbon (C) and nitrogen (N) and their associated stable isotopes (δ13C and δ15N) to offer novel insights in soil organic matter processes. Samples that were either too small for analytical analysis or below instrument detection limit are labeled with BDL. 
    more » « less
  10. Free, publicly-accessible full text available December 1, 2026