Presto-Native Noisy Aggregations for Privacy-Preserving Workflows

Monday, June 03, 2024 - 10:50 am11:05 am

Kien Nguyen and Chen-Kuei Lee, Meta

Abstract: 

At Meta, large-scale data analysis happens constantly, across varied surfaces, platforms, and systems. Differential privacy (DP), because of its strong protection, is one of the privacy-enhancing technologies deployed by Meta to protect users' privacy. However, implementing DP in practice, especially at Meta scale, has many challenges, including the diversity of interfaces for analysis, size of datasets, expertise required, and integration with other policy requirements and enforcement. In this talk, we describe an approach to private data analysis at Meta that places a set of common privacy primitives in the compute engine (Presto), which are leveraged by different frameworks and services to enforce DP guarantees across our many systems. Examples include automatic query rewriting for interactive data analysis, privacy-preserving ETL pipelines, and web mapping of aggregate statistics. The Presto-based approach helped increase flexibility, minimize changes to existing workflows, and enable robust privacy enforcement and guarantees. This is joint work with Jonathan Hehir (Meta Platforms, Inc.)

Kien Nguyen, Meta

Kien Nguyen currently works as a Research Scientist in the Applied Privacy Tech team at Meta, developing and deploying large-scale privacy-preserving systems in Meta. Kien finished his PhD program in Computer Science at the University of Southern California, under the supervision of Prof. Cyrus Shahabi. Kien is interested in privacy-preserving data analysis, location privacy, marketplaces, and their applications.

Chen-Kuei Lee, Meta

Chen-Kuei Lee is a Software Engineer in Applied Privacy Team at Meta. He works on applying a variety of privacy preserving techniques inside Meta to support data minimization and to reduce re-identification risks.

BibTeX
@conference {296329,
author = {Kien Nguyen and Chen-Kuei Lee},
title = {{Presto-Native} Noisy Aggregations for {Privacy-Preserving} Workflows},
year = {2024},
address = {Santa Clara, CA},
publisher = {USENIX Association},
month = jun
}