Title: LatticeGen: Hiding Generated Text in a Lattice for Privacy-Aware Large Language Model Generation on Cloud
In the current user-server interaction paradigm of prompted generation with large language models (LLMs) on cloud, the server fully controls the generation process, leaving no option for users who want to keep the generated text private to themselves. For privacy-aware text generation on cloud, we propose LatticeGen, a cooperative protocol in which the server still handles most of the computation while the client controls the sampling operation. The key idea is that the client mixes the true generated sequence with noise tokens and hides it in a noised lattice; only the client knows which tokens are the true ones. To account for a hypothetically malicious server and the client's possible defenses, we propose the repeated beam-search attack and the mixing noise scheme. In our experiments we apply LatticeGen to protect both the prompt and the generation. Results show that while the noised lattice degrades generation quality, LatticeGen successfully protects the true generation to a remarkable degree under strong attacks (more than 50% of the semantics remain hidden, as measured by BERTScore).
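The client-side mixing step can be sketched in a few lines. The following is a minimal illustration under our own assumptions: `noise_sampler` is a hypothetical stand-in for the paper's noise generation scheme, and the real protocol interleaves this step with server-side inference over the whole lattice.

```python
import random

def lattice_step(true_token, noise_sampler, width=2):
    """Mix the true token with decoy tokens for one lattice position.

    The server sees only the shuffled column; the client privately keeps
    the index of the true token. A minimal sketch of the mixing idea,
    not the full LatticeGen protocol.
    """
    tokens = [true_token] + [noise_sampler() for _ in range(width - 1)]
    order = list(range(width))
    random.shuffle(order)
    column = [tokens[i] for i in order]
    secret_index = order.index(0)  # where the true token landed; client-only
    return column, secret_index

# Toy usage with a hypothetical noise sampler.
column, idx = lattice_step("cat", lambda: random.choice(["dog", "tree", "blue"]))
```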
Award ID(s): 2142739, 2203097, 2125201
PAR ID: 10520232
Author(s) / Creator(s):
Publisher / Repository: NAACL
Date Published:
Format(s): Medium: X
Sponsoring Org: National Science Foundation
More Like this
  1. Boldyreva, A.; Kolesnikov, V. (Ed.)
    In recent work, Backendal, Haller, and Paterson identified several exploitable vulnerabilities in the cloud storage provider MEGA. They demonstrated an RSA key recovery attack in which a malicious server could recover a client’s private RSA key after 512 client login attempts. We show how to exploit additional information revealed by MEGA’s protocol vulnerabilities to give an attack that requires only six client logins to recover the secret key. Our optimized attack combines several cryptanalytic techniques. In particular, we formulate and give a solution to a variant of the hidden number problem with small unknown multipliers, which may be of independent interest. We show that our lattice construction for this problem can be used to give improved results for the implicit factorization problem of May and Ritzenhofen. 
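For orientation, here is a rough statement of the hidden number problem with small unknown multipliers, paraphrased under our own notation (the bounds A and E are our placeholders; the exact formulation is in the paper):

```latex
% Hidden number problem with small unknown multipliers (HNP-SUM),
% paraphrased sketch; exact formulation and bounds are in the paper.
\[
  t_i \equiv a_i \, x + e_i \pmod{N}, \qquad i = 1, \dots, k,
\]
% where the multipliers $a_i$ (with $|a_i| \le A$) and the errors $e_i$
% (with $|e_i| \le E$) are small and unknown; the goal is to recover $x$.
```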
  2. Motivated by ever-increasing concerns about personal data privacy and the rapidly growing data volume at local clients, federated learning (FL) has emerged as a new machine learning setting. An FL system comprises a central parameter server and multiple local clients. It keeps data at the local clients and learns a centralized model by sharing the locally learned model parameters. No local data needs to be shared, so privacy can be well protected. Nevertheless, since the model rather than the raw data is shared, the system can be exposed to model poisoning attacks launched by malicious clients. Furthermore, it is challenging to identify malicious clients, since no local client data is available on the server. Moreover, membership inference attacks can still be performed on the uploaded model to estimate a client's local data, leading to privacy disclosure. In this work, we first propose a model-update-based federated averaging algorithm to defend against Byzantine attacks such as additive-noise attacks and sign-flipping attacks. We then present an individual client model initialization method that provides further protection against membership inference attacks by hiding each individual local model. Combining the two schemes effectively enhances both privacy and security. The proposed schemes are shown experimentally to converge under non-IID data distributions when there are no attacks, and under Byzantine attacks they perform much better than the classical model-based FedAvg algorithm.
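As a rough illustration of update-based aggregation with a Byzantine-robust tweak, the sketch below uses generic L2-norm clipping of client updates, which is not necessarily the paper's exact defense:

```python
import numpy as np

def aggregate_updates(updates, clip=1.0):
    """Average client *model updates* (deltas) with L2-norm clipping.

    Clipping bounds the influence of any single client, blunting
    additive-noise and sign-flipping attacks. A generic illustration,
    not the paper's exact scheme.
    """
    clipped = []
    for u in updates:
        norm = np.linalg.norm(u)
        clipped.append(u * min(1.0, clip / (norm + 1e-12)))
    return np.mean(clipped, axis=0)

# Toy round: three honest clients plus one sign-flipping attacker.
rng = np.random.default_rng(0)
honest = [rng.normal(0, 0.01, size=10) for _ in range(3)]
attacker = -100.0 * honest[0]  # flipped, scaled malicious update
new_delta = aggregate_updates(honest + [attacker])
```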
  3. Recent studies show that large language models (LLMs) unintentionally memorize part of their training data, which brings serious privacy risks. For example, it has been shown that over 1% of tokens generated unprompted by an LLM are part of sequences in the training data. However, current studies mainly focus on exact memorization behaviors. In this paper, we propose to evaluate how many generated texts have near-duplicates (e.g., differing by only a couple of tokens out of 100) in the training corpus. A major challenge of conducting this evaluation is the huge computational cost of near-duplicate sequence search: modern LLMs are trained on ever-larger corpora with up to 1 trillion tokens, and the number of sequences in a text is quadratic in the text length. To address this issue, we develop an efficient and scalable near-duplicate sequence search algorithm that can find (almost) all the near-duplicate sequences of a query sequence in a large corpus, with guarantees. Specifically, the algorithm generates and groups the min-hash values of all sequences with at least t tokens (since very short near-duplicates are often irrelevant noise) in time linear in the corpus size. We formally prove that, in expectation, only 2(n+1)/(t+1) − 1 min-hash values are generated for a text with n tokens, so the index time and size are reasonable. When a query arrives, we find all sequences sharing enough min-hash values with the query using inverted indexes and prefix filtering. Extensive experiments on several large real-world LLM training corpora show that our near-duplicate sequence search algorithm is efficient and scalable.
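A sliding-window minimum conveys the flavor of the min-hash grouping. The sketch below indexes every length-t window by its smallest token hash; this is a generic simplification of the paper's algorithm, and `token_hash` is an illustrative choice of hash function:

```python
import hashlib
from collections import deque

def token_hash(tok):
    # Stable 64-bit per-token hash (illustrative choice).
    return int.from_bytes(hashlib.blake2b(tok.encode(), digest_size=8).digest(), "big")

def window_min_hashes(tokens, t):
    """Minimum token hash of every length-t window, in O(n) via a deque.

    Distinct minima act as index keys: sequences sharing a key are
    candidate near-duplicates to verify. A generic sliding-window-minimum
    sketch, not the paper's exact indexing algorithm.
    """
    hs = [token_hash(tok) for tok in tokens]
    dq, keys = deque(), set()
    for i, h in enumerate(hs):
        while dq and hs[dq[-1]] >= h:
            dq.pop()             # drop hashes that can never be a minimum
        dq.append(i)
        if dq[0] <= i - t:
            dq.popleft()         # front index fell out of the window
        if i >= t - 1:
            keys.add(hs[dq[0]])  # this window's minimum hash
    return keys

keys = window_min_hashes("the cat sat on the mat".split(), t=3)
```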
  4. ShadowTLS is a new type of circumvention tool in which the relay forwards traffic to a legitimate (unblocked) TLS server until the end of the handshake, and then connects the client to a hidden proxy server (e.g., Shadowsocks). In contrast to previous probe-resistant proxies, this design can evade SNI-based blocking, since to the censor it appears as a legitimate TLS connection to an unblocked domain. In this paper, we describe several attacks against ShadowTLS that would allow a censor to identify whether a suspected IP is hosting a ShadowTLS relay (and block it accordingly), distinguishing it from the legitimate TLS servers it mimics. Our attacks require only a few TCP connections to the suspected IP, a capability that censors, including China, have already demonstrated in order to block previous proxies. We evaluate these vulnerabilities by performing Internet-wide scans to discover potential ShadowTLS relays, and find over 15K of them. We also describe mitigations against these attacks that ShadowTLS (and proxies like it) can implement, and work with the ShadowTLS developers to deploy these fixes.
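To convey the flavor of such active probing, here is a hedged sketch, not a reproduction of the paper's specific probes; the SNI and payload are placeholders of our own:

```python
import socket
import ssl

def probe(host, port=443, sni="cloudflare.com"):
    """One active probe against a suspected relay (illustrative only).

    Generic idea: complete a normal TLS handshake, then send bytes no
    real web server would accept. A genuine server answers with an HTTP
    error or TLS alert; a relay that blindly forwards post-handshake
    traffic to a hidden proxy may hang, reset, or respond differently.
    """
    ctx = ssl.create_default_context()
    ctx.check_hostname = False
    ctx.verify_mode = ssl.CERT_NONE
    with socket.create_connection((host, port), timeout=5) as raw:
        with ctx.wrap_socket(raw, server_hostname=sni) as tls:
            tls.sendall(b"\x00" * 64)  # garbage application data
            try:
                data = tls.recv(4096)
            except (ssl.SSLError, ConnectionResetError, socket.timeout) as exc:
                return f"error: {exc!r}"
            return f"received {len(data)} bytes"
```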
  5. We analyze the Secure Remote Password (SRP) protocol for structural weaknesses using the Cryptographic Protocol Shapes Analyzer (CPSA) in the first formal analysis of SRP (specifically, Version 3). SRP is a widely deployed Password Authenticated Key Exchange (PAKE) protocol used in 1Password, iCloud Keychain, and other products. As with many PAKE protocols, two participants use knowledge of a preshared password to authenticate each other and establish a session key. SRP aims to resist dictionary attacks, not store plaintext-equivalent passwords on the server, avoid patent infringement, and avoid export controls by not using encryption. Formal analysis of SRP is challenging in part because existing tools provide no simple way to reason about its use of the mathematical expression "v + g^b mod q". Modeling v + g^b as encryption, we complete an exhaustive study of all possible execution sequences of SRP. Ignoring possible algebraic attacks, this analysis detects no major structural weakness, and in particular no leakage of any secrets. We do uncover one notable weakness of SRP, which follows from its design constraints. It is possible for a malicious server to fake an authentication session with a client, without the client's participation. This action might facilitate an escalation of privilege attack, if the client has higher privileges than does the server. We conceived of this attack before we used CPSA and confirmed it by generating corresponding execution shapes using CPSA.
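For orientation, a simplified sketch of the exponentials behind that expression, with notation adapted from standard SRP descriptions (the derivation of the public scrambling value u is omitted); both sides arrive at the same session secret:

```latex
% SRP-3 exponentials, simplified sketch (notation adapted; u is a public
% scrambling value derived by hashing, details omitted here).
\begin{align*}
  v &= g^{x} \bmod q      && \text{password verifier, } x = H(s, P)\\
  A &= g^{a} \bmod q      && \text{client ephemeral}\\
  B &= v + g^{b} \bmod q  && \text{server ephemeral: the troublesome sum}\\
  S &= (B - g^{x})^{a + u x} = (A \cdot v^{u})^{b} = g^{ab + ubx} \bmod q
\end{align*}
```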