This content will become publicly available on May 12, 2026
QServe: W4A8KV4 Quantization and System Co-design for Efficient LLM Serving
- Award ID(s):
- 2112562
- PAR ID:
- 10647406
- Publisher / Repository:
- The Eighth Annual Conference on Machine Learning and Systems
- Date Published:
- Format(s):
- Medium: X
- Sponsoring Org:
- National Science Foundation
More Like this
No document suggestions found
An official website of the United States government
