Question Answering for Privacy Policies: Combining Computational and Legal Perspectives

Ravichander, Abhilasha; Black, Alan; Wilson, Shomir; Norton, Thomas; Sadeh, Norman

doi:10.18653/v1/D19-1500

Citation Details

Question Answering for Privacy Policies: Combining Computational and Legal Perspectives

Privacy policies are long and complex documents that are difficult for users to read and understand, and yet, they have legal effects on how user data is collected, managed and used. Ideally, we would like to empower users to inform themselves about issues that matter to them, and enable them to selective explore those issues. We present PRIVACYQA, a corpus consisting of 1750 questions about the privacy policies of mobile applications, and over 3500 expert annotations of relevant answers. We observe that a strong neural baseline underperforms human performance by almost 0.3 F1 on PRIVACYQA, suggesting considerable room for improvement for future systems. Further, we use this dataset to shed light on challenges to question answerability, with domain-general implications for any question answering system. The PRIVACYQA corpus offers a challenging corpus for question answering, with genuine real-world utility. more »

Award ID(s):: 1914486

PAR ID:: 10169866

Author(s) / Creator(s):: Ravichander, Abhilasha; Black, Alan; Wilson, Shomir; Norton, Thomas; Sadeh, Norman

Date Published:: 2019-11-01

Journal Name:: 2019 Conference on Empirical Methods in Natural Language Processing

Format(s):: Medium: X

Sponsoring Org:: National Science Foundation

Free Publicly Accessible Full Text
Accepted Manuscript1.0
Conference Paper:
https://doi.org/10.18653/v1/D19-1500

More Like this