Automated Detection and Analysis of Data Practices Using A Real-World Corpus

Srinath, Mukund; Narayanan_Venkit, Pranav; Badillo, Maria; Schaub, Florian; Giles, C; Wilson, Shomir

doi:10.18653/v1/2024.findings-acl.271

Citation Details

Automated Detection and Analysis of Data Practices Using A Real-World Corpus

Privacy policies are crucial for informing users about data practices, yet their length and complexity often deter users from reading them. In this paper, we propose an automated approach to identify and visualize data practices within privacy policies at different levels of detail. Leveraging crowd-sourced annotations from the ToS;DR platform, we experiment with various methods to match policy excerpts with predefined data practice descriptions. We further conduct a case study to evaluate our approach on a real-world policy, demonstrating its effectiveness in simplifying complex policies. Experiments show that our approach accurately matches data practice descriptions with policy excerpts, facilitating the presentation of simplified privacy information to users. more »

Award ID(s):: 2105736 2237574

PAR ID:: 10599512

Author(s) / Creator(s):: Srinath, Mukund; Narayanan_Venkit, Pranav; Badillo, Maria; Schaub, Florian; Giles, C; Wilson, Shomir

Publisher / Repository:: Association for Computational Linguistics

Date Published:: 2024-01-01

Page Range / eLocation ID:: 4567 to 4574

Format(s):: Medium: X

Location:: Bangkok, Thailand and virtual meeting

Sponsoring Org:: National Science Foundation

Free Publicly Accessible Full Text
Accepted Manuscript1.0
Conference Paper:
https://doi.org/10.18653/v1/2024.findings-acl.271

More Like this