This content will become publicly available on January 1, 2026
MiniKV: Pushing the Limits of 2-Bit KV Cache via Compression and System Co-Design for Efficient Long Context Inference
- Award ID(s):
- 2441601
- PAR ID:
- 10632221
- Publisher / Repository:
- Association for Computational Linguistics
- Date Published:
- Page Range / eLocation ID:
- 18506 to 18523
- Format(s):
- Medium: X
- Location:
- Vienna, Austria
- Sponsoring Org:
- National Science Foundation
More Like this
No document suggestions found
An official website of the United States government
