The Power of Localization for Efficiently Learning Linear Separators with Noise
- Pranjal Awasthi, M. Balcan, Philip M. Long
- Computer ScienceJournal of the ACM
- 31 July 2013
This work provides the first polynomial-time active learning algorithm for learning linear separators in the presence of malicious noise or adversarial label noise, and achieves a label complexity whose dependence on the error parameter ϵ is polylogarithmic (and thus exponentially better than that of any passive algorithm).
Center-based clustering under perturbation stability
- Pranjal Awasthi, A. Blum, Or Sheffet
- Computer ScienceInformation Processing Letters
- 18 September 2010
Relax, No Need to Round: Integrality of Clustering Formulations
- Pranjal Awasthi, A. Bandeira, M. Charikar, Ravishankar Krishnaswamy, Soledad Villar, Rachel A. Ward
- Computer ScienceInformation Technology Convergence and Services
- 18 August 2014
We study exact recovery conditions for convex relaxations of point cloud clustering problems, focusing on two of the most common optimization problems for unsupervised clustering: k-means and…
Guarantees for Spectral Clustering with Fairness Constraints
- Matthäus Kleindessner, S. Samadi, Pranjal Awasthi, Jamie H. Morgenstern
- Computer ScienceInternational Conference on Machine Learning
- 24 January 2019
This work develops variants of both normalized and unnormalized constrained SC and shows that they help find fairer clusterings on both synthetic and real data and proves that their algorithms can recover this fair clustering with high probability.
The Hardness of Approximation of Euclidean k-Means
- Pranjal Awasthi, M. Charikar, Ravishankar Krishnaswamy, A. Sinop
- Mathematics, Computer ScienceInternational Symposium on Computational Geometry
- 11 February 2015
The first hardness of approximation for the Euclidean $k-means problem is provided via an efficient reduction from the vertex cover problem on triangle-free graphs: given a triangle- free graph, the goal is to choose the fewest number of vertices which are incident on all the edges.
Efficient Learning of Linear Separators under Bounded Noise
- Pranjal Awasthi, M. Balcan, Nika Haghtalab, Ruth Urner
- Computer ScienceAnnual Conference Computational Learning Theory
- 11 March 2015
This work provides the first evidence that one can indeed design algorithms achieving arbitrarily small excess error in polynomial time under this realistic noise model and thus opens up a new and exciting line of research.
Fair k-Center Clustering for Data Summarization
- Matthäus Kleindessner, Pranjal Awasthi, Jamie H. Morgenstern
- Computer ScienceInternational Conference on Machine Learning
- 24 January 2019
This paper provides a simple approximation algorithm for the $k$-center problem under the fairness constraint with running time linear in the size of the data set and $k$.
Improved Spectral-Norm Bounds for Clustering
- Pranjal Awasthi, Or Sheffet
- Computer Science, ChemistryInternational Workshop and International Workshop…
- 14 June 2012
Aiming to unify known results about clustering mixtures of distributions under separation conditions, Kumar and Kannan [1] introduced a deterministic condition for clustering datasets. They showed…
Stability Yields a PTAS for k-Median and k-Means Clustering
- Pranjal Awasthi, A. Blum, Or Sheffet
- Computer Science, MathematicsIEEE Annual Symposium on Foundations of Computer…
- 23 October 2010
Improvements are made to the distance of the clustering found to the target from $O(\delta)$ to $\delta$ when all target clusters are large, and for $k-median the authors improve the ``largeness'' condition needed in the work of Balcan et al. to get exactly $delta-close from O(delta n) to $\Delta n$.
Decision trees for entity identification: approximation algorithms and hardness results
- Venkatesan T. Chakaravarthy, Vinayaka Pandit, Sambuddha Roy, Pranjal Awasthi, M. Mohania
- Computer Science, MathematicsACM SIGACT-SIGMOD-SIGART Symposium on Principles…
- 11 June 2007
A natural greedy algorithm is considered and an approximation guarantee of O(rK • log N) is proved, where N is the number of entities and K is the maximum number of distinct values of an attribute, which shows that it is NP-hard to approximate the problem within a factor of Ω(log N).
...
...