
Title: GenAIPABench: A Benchmark for Generative AI-based Privacy Assistants

Website privacy policies are often lengthy and intricate. Privacy assistants help simplify policies, making them more accessible and user-friendly. The emergence of generative AI (genAI) offers new opportunities to build privacy assistants that can answer users’ questions about privacy policies. However, genAI’s reliability is a concern due to its potential for producing inaccurate information. This study introduces GenAIPABench, a benchmark for evaluating Generative AI-based Privacy Assistants (GenAIPAs). GenAIPABench includes: 1) a set of curated questions about privacy policies, along with annotated answers, for various organizations and regulations; 2) metrics to assess the accuracy, relevance, and consistency of responses; and 3) a tool for generating prompts to introduce privacy policies and paraphrased variants of the curated questions. We evaluated three leading genAI systems—ChatGPT-4, Bard, and Bing AI—using GenAIPABench to gauge their effectiveness as GenAIPAs. Our results demonstrate significant promise in genAI capabilities in the privacy domain while also highlighting challenges in managing complex queries, ensuring consistency, and verifying source accuracy.
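One of the benchmark components described above is a consistency metric over answers to paraphrased variants of the same question. The sketch below illustrates the general idea only; the function names and the token-overlap (Jaccard) similarity measure are illustrative assumptions, not the actual GenAIPABench metrics.

```python
from itertools import combinations


def jaccard_similarity(a: str, b: str) -> float:
    """Token-level Jaccard similarity between two answer strings."""
    tokens_a, tokens_b = set(a.lower().split()), set(b.lower().split())
    if not tokens_a and not tokens_b:
        return 1.0
    return len(tokens_a & tokens_b) / len(tokens_a | tokens_b)


def consistency_score(answers: list[str]) -> float:
    """Mean pairwise similarity across a system's answers to paraphrased
    variants of one privacy question (1.0 = perfectly consistent)."""
    if len(answers) < 2:
        return 1.0
    pairs = list(combinations(answers, 2))
    return sum(jaccard_similarity(a, b) for a, b in pairs) / len(pairs)
```

In such a setup, the assistant would be queried once per paraphrase of a curated question, and the resulting answers scored together; low scores would flag the kind of consistency failures the abstract reports.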
Award ID(s): 2115040
PAR ID: 10529113
Author(s) / Creator(s): ; ; ; ;
Publisher / Repository: PoPETs
Date Published:
Journal Name: Proceedings on Privacy Enhancing Technologies
Volume: 2024
Issue: 3
ISSN: 2299-0984
Page Range / eLocation ID: 336–352
Format(s): Medium: X
Sponsoring Org: National Science Foundation
More Like this
  1. Privacy policies are often lengthy and complex legal documents, and are difficult for many people to read and comprehend. Recent research efforts have explored automated assistants that process the language in policies and answer people’s privacy questions. This study documents the importance of two different types of reasoning necessary to generate accurate answers to people’s privacy questions. The first is the need to support taxonomic reasoning about related terms commonly found in privacy policies. The second is the need to reason about regulatory disclosure requirements, given the prevalence of silence in privacy policy texts. Specifically, we report on a study involving the collection of 749 sets of expert annotations to answer privacy questions in the context of 210 different policy/question pairs. The study highlights the importance of taxonomic reasoning and of reasoning about regulatory disclosure requirements when it comes to accurately answering everyday privacy questions. Next, we explore to what extent current generative AI tools are able to reliably handle this type of reasoning. Our results suggest that, in their current form and in the absence of additional help, current models cannot reliably support the type of reasoning about regulatory disclosure requirements necessary to accurately answer privacy questions. We proceed to introduce and evaluate different approaches to improving their performance. Through this work, we aim to provide a richer understanding of the capabilities automated systems need to have to provide accurate answers to everyday privacy questions and, in the process, outline paths for adapting AI models for this purpose.
  2. Abstract: The recent development and use of generative AI (GenAI) has signaled a significant shift in research activities such as brainstorming, proposal writing, dissemination, and even reviewing. This has raised questions about how to balance the seemingly productive uses of GenAI with ethical concerns such as authorship and copyright issues, use of biased training data, lack of transparency, and impact on user privacy. To address these concerns, many Higher Education Institutions (HEIs) have released institutional guidance for researchers. To better understand the guidance that is being provided, we report findings from a thematic analysis of guidelines from thirty HEIs in the United States that are classified as R1, or “very high research activity.” We found that guidance provided to researchers: (1) asks them to refer to external sources of information, such as funding agencies and publishers, to keep updated, and to use institutional resources for training and education; (2) asks them to understand and learn about specific GenAI attributes that shape research, such as predictive modeling, knowledge cutoff date, data provenance, and model limitations, and to educate themselves about ethical concerns such as authorship, attribution, privacy, and intellectual property issues; and (3) includes instructions on how to acknowledge sources and disclose the use of GenAI, how to communicate effectively about their GenAI use, and alerts researchers to long-term implications such as overreliance on GenAI, legal consequences, and risks to their institutions from GenAI use. Overall, guidance places the onus of compliance on individual researchers, making them accountable for any lapses and thereby increasing their responsibility.
  3. Since the release of ChatGPT in 2022, Generative AI (GenAI) has been increasingly used in higher education computing classrooms across the United States. While scholars have looked at overall institutional guidance for the use of GenAI, and reports have documented the response from schools in the form of broad guidance to instructors, we do not know what policies and practices instructors are actually adopting and how they are being communicated to students through course syllabi. To study instructors’ policy guidance, we collected 98 computing course syllabi from 54 R1 institutions in the U.S. and studied the GenAI policies they adopted and the surrounding discourse. Our analysis shows that 1) most instructions related to GenAI use were part of the academic integrity policy for the course, and 2) most syllabi prohibited or restricted GenAI use, often warning students about the broader implications of using GenAI, e.g., lack of veracity, privacy risks, and hindering learning. Beyond this, there was wide variation in how instructors approached GenAI, including a focus on how to cite GenAI use, conceptualizing GenAI as an assistant (often in an anthropomorphic manner), and mentioning specific GenAI tools for use. We discuss the implications of our findings and conclude with current best practices for instructors.
  4. The introduction of generative artificial intelligence (GenAI) has been met with a mix of reactions by higher education institutions, ranging from consternation and resistance to wholehearted acceptance. Previous work has looked at the discourse and policies adopted by universities across the U.S. as well as educators, along with the inclusion of GenAI-related content and topics in higher education. Building on previous research, this study reports findings from a survey of engineering educators on their use of and perspectives toward generative AI. Specifically, we surveyed 98 educators from engineering, computer science, and education who participated in a workshop on GenAI in Engineering Education to learn about their perspectives on using these tools for teaching and research. We asked them about their use of and comfort with GenAI, their overall perspectives on GenAI, and the challenges and potential harms of using it for teaching, learning, and research, and we examined whether their approach to using and integrating GenAI in their classroom influenced their experiences with GenAI and perceptions of it. Consistent with other research in GenAI education, we found that while the majority of participants were somewhat familiar with GenAI, reported use varied considerably. We found that educators harbored mostly hopeful and positive views about the potential of GenAI. We also found that those who engaged more with their students on the topic of GenAI, both as communicators (those who spoke directly with their students) and as incorporators (those who included it in their syllabus), tend to be more positive about its contribution to learning, while also being more attuned to its potential abuses. These findings suggest that integrating and engaging with generative AI is essential to foster productive interactions between instructors and students around this technology.
Our work ultimately contributes to the evolving discourse on GenAI use, integration, and avoidance within educational settings. Through exploratory quantitative research, we have identified specific areas for further investigation.