Exploiting Metric Structure for Efficient Private Query Release

  title={Exploiting Metric Structure for Efficient Private Query Release},
  author={Zhiyi Huang and Aaron Roth},
We consider the problem of privately answering queries defined on databases which are collections of points belonging to some metric space. We give simple, computationally efficient algorithms for answering distance queries defined over an arbitrary metric. Distance queries are specified by points in the metric space, and ask for the average distance from the query point to the points contained in the database, according to the specified metric. Our algorithms run efficiently in the database… 

Figures from this paper

Efficient Private Query Release via Polynomial Approximation

It is shown that there exists a computationally efficient $\varepsilon$-differentially private mechanism that releases a query class parametrized by additively separable Holder continuous functions, and that the accuracy can be significantly boosted.

Optimal Differentially Private Algorithms for k-Means Clustering

It is proved a matching lower bound that no (ε, δ)-differentially private algorithm can guarantee Wasserstein distance less than Ømega (Φ2) and, thus, the positive result is optimal up to a constant factor.

Differentially Private Data Publishing and Analysis: A Survey

This survey compares the diverse release mechanisms of differentially private data publishing given a variety of input data in terms of query type, the maximum number of queries, efficiency, and accuracy.

Differential Privacy of Mathematical Functions

An abstract model of differential privacy is presented in which a differential privacy problem is modeled as finding a randomized mapping between two metric spaces and the experiments show that the mechanisms have more accurate results than the state of the art mechanisms.

Differentially private data publishing: Non-interactive setting

This chapter present the non-interactive setting in data publishing, including batch queries publishing, contingency table publishing and synthetic dataset publishing, which means all queries are given to the curator at one time.

Technical Questions About Differential Privacy 2 . 1 Efficient Algorithms for Releasing Conjunctions

  • Computer Science
  • 2012
Technical Questions About Differential Privacy 2.1 Efficient Algorithms for Releasing Conjunctions and Reduction Hypothesis Under the -Matrix Mechanism and more.

The Policies of Designing Differentially Private Mechanisms: Utility First vs. Privacy First

This paper realizes that designing a differentially private mechanism can be considered as finding a randomized mapping between two metric spaces and finds that the sensitivity-based methods are those just using the metric about utility to construct mechanisms.

Differential Privacy and Applications

This chapter presents three methods that apply differential privacy to achieve location privacy for LBSs: the geo-indistinguishability method, the synthetic differentially private trajectory Publishing method, and the hierarchical location data publishing method, with an emphasis on the last one.

An Antifolk Theorem for Large Repeated Games

It is argued that in large games (n player games in which unilateral deviations by single players have only a small impact on the utility of other players), many monitoring settings naturally lead to signals that satisfy (ε, γ)-differential privacy for ε and γ tending to zero as the number of players n grows large.

Differentially Private Kernel Support Vector Machines Based on the Exponential and Laplace Hybrid Mechanism

This paper proposes a new differentially private algorithm for the kernel SVMs based on the exponential and Laplace hybrid mechanism named DPKSVMEL and theoretically proves that the DP KSVMEL algorithm satisfies differential privacy.



Fast Private Data Release Algorithms for Sparse Queries

This paper considers the large class of sparse queries, which take non-zero values on only polynomially many universe elements, and gives efficient query release algorithms for this class, in both the interactive and the non-interactive setting.

Answering n{2+o(1)} counting queries with differential privacy is hard

It is proved that if one-way functions exist, then there is no algorithm that takes as input a database db ∈ dbset, and k = ~Ω(n2) arbitrary efficiently computable counting queries, runs in time poly(d, n), and returns an approximate answer to each query, while satisfying differential privacy.

Privately releasing conjunctions and the statistical query barrier

The number of statistical queries necessary and sufficient for this task is equal to the agnostic learning complexity of C in Kearns' statistical query (SQ)model, which isolates the complexity of agnosticLearning in the SQ-model as a new barrier in the design of differentially private algorithms.

Interactive privacy via the median mechanism

The median mechanism is the first privacy mechanism capable of identifying and exploiting correlations among queries in an interactive setting, and an efficient implementation is given, with running time polynomial in the number of queries, the database size, and the domain size.

Iterative Constructions and Private Data Release

New algorithms (and new analyses of existing algorithms) in both the interactive and non-interactive settings are given, and a reduction based on the IDC framework shows that an efficient, private algorithm for computing sufficiently accurate rank-1 matrix approximations would lead to an improved efficient algorithm for releasing private synthetic data for graph cuts.

Faster Algorithms for Privately Releasing Marginals

To the knowledge, this work is the first algorithm capable of privately releasing marginal queries with non-trivial worst-case accuracy guarantees in time substantially smaller than the number of k-way marginal queries, which is dΘ(k) (for k≪d).

Mirror Descent Based Database Privacy

This paper forms a generic IDC framework based on the Mirror Descent algorithm, a popular convex optimization algorithm, and presents two concrete applications, namely, cut queries over a bipartite graph and linear queries over low-rank matrices, and provides significantly tighter error bounds.

A Multiplicative Weights Mechanism for Privacy-Preserving Data Analysis

A new differentially private multiplicative weights mechanism for answering a large number of interactive counting (or linear) queries that arrive online and may be adaptively chosen, and it is shown that when the input database is drawn from a smooth distribution — a distribution that does not place too much weight on any single data item — accuracy remains as above, and the running time becomes poly-logarithmic in the data universe size.

On the complexity of differentially private data release: efficient algorithms and hardness results

Private data analysis in the setting in which a trusted and trustworthy curator releases to the public a "sanitization" of the data set that simultaneously protects the privacy of the individual contributors of data and offers utility to the data analyst is considered.

Practical privacy: the SuLQ framework

This work considers a statistical database in which a trusted administrator introduces noise to the query responses with the goal of maintaining privacy of individual database entries, and modify the privacy analysis to real-valued functions f and arbitrary row types, greatly improving the bounds on noise required for privacy.