Scalable Private Search with Wally

📅 2024-06-10
🏛️ arXiv.org
📈 Citations: 1
Influential: 0
📄 PDF
🤖 AI Summary
To address the challenge of balancing privacy preservation and query efficiency in large-scale private database search, this paper proposes Wally—a scalable system achieving $(varepsilon,delta)$-differential privacy under high concurrency. Wally integrates four key techniques: (i) an adaptive dummy-query attenuation mechanism—where the number of dummy queries decreases with the number of honest users; (ii) Tor-based anonymous routing; (iii) randomized query submission timing; and (iv) lightweight homomorphic encryption. This design ensures that per-query privacy overhead asymptotically approaches zero as system scale increases. To our knowledge, Wally is the first system to enable efficient private search at the scale of ten million database entries and one million concurrent users. Experiments on a 3.2-million-entry dataset show that Wally achieves 7–28× higher QPS than Tiptoe while reducing communication overhead by 6.69–31×.

Technology Category

Application Category

📝 Abstract
This paper presents Wally, a private search system that supports efficient search queries against large databases. When sufficiently many clients are making queries, Wally's performance is significantly better than previous systems while providing a standard privacy guarantee of $(epsilon, delta)$-differential privacy. Specifically, for a database with 3.2 million entries, Wally's queries per second (QPS) is 7-28x higher, and communication is 6.69-31x smaller than Tiptoe, a state-of-the-art private search system. In Wally, each client adds a few fake queries and sends each query via an anonymous network to the server at independently chosen random instants. We also use somewhat homomorphic encryption (SHE) to reduce the communication size. The number of fake queries each client makes depends inversely on the number of clients making queries. Therefore, the overhead of fake queries vanishes as the number of honest clients increases, enabling scalability to millions of queries and large databases.
Problem

Research questions and friction points this paper is trying to address.

Enables efficient private search on large databases
Improves query speed and reduces communication overhead
Scales to millions of queries with differential privacy
Innovation

Methods, ideas, or system contributions that make the work stand out.

Uses fake queries for enhanced privacy
Employs somewhat homomorphic encryption
Scales efficiently with client numbers
🔎 Similar Papers
No similar papers found.
Hilal Asi
Hilal Asi
Apple
Machine LearningTrustworthy AIDifferential Privacy
Fabian Boemer
Fabian Boemer
Apple Inc.
Nicholas Genise
Nicholas Genise
Apple Inc.
M
M. Mughees
Apple Inc.
T
Tabitha Ogilvie
Royal Holloway, University of London
R
Rehan Rishi
Apple Inc.
G
G. Rothblum
Apple Inc.
Kunal Talwar
Kunal Talwar
Apple Inc
Machine LearningDifferential PrivacyAlgorithms
K
Karl Tarbe
Apple Inc.
R
Ruiyu Zhu
Apple Inc.
M
Marco Zuliani
Apple Inc.