Retrofitting GDPR compliance onto legacy databases

Agarwal, Archita; George, Marilyn; Jeyaraj, Aaron; Schwarzkopf, Malte

doi:10.14778/3503585.3503603

Citation Details

Retrofitting GDPR compliance onto legacy databases

New privacy laws like the European Union's General Data Protection Regulation (GDPR) require database administrators (DBAs) to identify all information related to an individual on request, e.g. , to return or delete it. This requires time-consuming manual labor today, particularly for legacy schemas and applications. In this paper, we investigate what it takes to provide mostly-automated tools that assist DBAs in GDPR-compliant data extraction for legacy databases. We find that a combination of techniques is needed to realize a tool that works for the databases of real-world applications, such as web applications, which may violate strict normal forms or encode data relationships in bespoke ways. Our tool, GDPRizer, relies on foreign keys, query logs that identify implied relationships, data-driven methods, and coarse-grained annotations provided by the DBA to extract an individual's data. In a case study with three popular web applications, GDPRizer achieves 100% precision and 96--100% recall. GDPRizer saves work compared to hand-written queries, and while manual verification of its outputs is required, GDPRizer simplifies privacy compliance. more »

Award ID(s):: 2045170

PAR ID:: 10404621

Author(s) / Creator(s):: Agarwal, Archita; George, Marilyn; Jeyaraj, Aaron; Schwarzkopf, Malte

Date Published:: 2021-12-01

Journal Name:: Proceedings of the VLDB Endowment

Volume:: 15

Issue:: 4

ISSN:: 2150-8097

Page Range / eLocation ID:: 958 to 970

Format(s):: Medium: X

Sponsoring Org:: National Science Foundation

Free Publicly Accessible Full Text
Accepted Manuscript1.0
Journal Article:
https://doi.org/10.14778/3503585.3503603

More Like this