This content will become publicly available on May 12, 2026
FlashInfer: Efficient and Customizable Attention Engine for LLM Inference Serving
- Award ID(s):
- 2211882
- PAR ID:
- 10639392
- Publisher / Repository:
- MLsys 2025
- Date Published:
- Format(s):
- Medium: X
- Sponsoring Org:
- National Science Foundation
More Like this
No document suggestions found
An official website of the United States government
