FIER: Fine-Grained and Efficient KV Cache Retrieval for Long-context LLM Inference

This content will become publicly available on November 4, 2026

Author(s) / Creator(s): Wang, Dongwei; Liu, Zijie; Wang, Song; Ren, Yuxin; Deng, Jianing; Hu, Jingtong; Chen, Tianlong; Yang, Huanrui