#### Filter Results:

- Full text PDF available (37)

#### Publication Year

2009

2017

- This year (1)
- Last five years (36)

#### Publication Type

#### Co-author

#### Publication Venue

#### Key Phrases

Learn More

- Jelani Nelson, Huy L. Nguyen
- 2013 IEEE 54th Annual Symposium on Foundations of…
- 2013

An oblivious subspace embedding (OSE) given some parameters ε, d is a distribution D over matrices Π ∈ R<sup>m×n</sup> such that for any linear subspace W ⊆ R<sup>n</sup> with dim(W) = d, P<sub>Π~D</sub>(∀x ∈ W ||Πx||<sub>2</sub> ∈ (1 ± ε)||x||<sub>2</sub>) > 2/3. We show… (More)

We present a new data structure for the c-approximate near neighbor problem (ANN) in the Euclidean space. For n points in R d , our algorithm achieves O c (n ρ + d log n) query time and O c (n 1+ρ + d log n) space, where ρ ≤ 7/(8c 2) + O(1/c 3) + o c (1). This is the first improvement over the result by Andoni and Indyk (FOCS 2006) and the first data… (More)

- Yi Li, Huy L. Nguyen, David P. Woodruff
- STOC
- 2014

In the turnstile model of data streams, an underlying vector <i>x</i> ∈ {--<i>m</i>,--<i>m</i>+1,..., <i>m</i>--1,<i>m</i>}<sup><i>n</i></sup> is presented as a long sequence of positive and negative integer updates to its coordinates. A randomized algorithm seeks to approximate a function <i>f</i>(<i>x</i>) with constant probability while only making… (More)

- Yi Li, Huy L. Nguyen, David P. Woodruff
- SODA
- 2014

Sketching is a prominent algorithmic tool for processing large data. In this paper, we study the problem of sketching matrix norms. We consider two sketching models. The first is bilinear sketching, in which there is a distribution over pairs of r × n matrices S and n × s matrices T such that for any fixed n × n matrix A, from S · A · T one can approximate… (More)

- Ankit Garg, Tengyu Ma, Huy L. Nguyen
- NIPS
- 2014

We explore the connection between dimensionality and communication cost in distributed learning problems. Specifically we study the problem of estimating the mean ~ ✓ of an unknown d dimensional gaussian distribution in the distributed setting. In this problem, the samples from the unknown distribution are distributed among m different machines. The goal is… (More)

- Rafael da Ponte Barbosa, Alina Ene, Huy L. Nguyen, Justin Ward
- 2016 IEEE 57th Annual Symposium on Foundations of…
- 2016

A wide variety of problems in machine learning, including exemplar clustering, document summarization, and sensor placement, can be cast as constrained submodular maximization problems. A lot of recent effort has been devoted to developing distributed algorithms for these problems. However, these results suffer from high number of rounds, suboptimal… (More)

- Jelani Nelson, Huy L. Nguyen
- ICALP
- 2014

An oblivious subspace embedding (OSE) for some ε, δ ∈ (0, 1/3) and d ≤ m ≤ n is a distribution D over R m×n such that for any linear subspace W ⊂ R n of dimension d, P Π∼D (∀x ∈ W, (1 − ε)x 2 ≤ Πx 2 ≤ (1 + ε)x 2) ≥ 1 − δ. We prove that any OSE with δ < 1/3 must have m = Ω((d + log(1/δ))/ε 2), which is optimal. Furthermore, if every Π in the support of D is… (More)

We consider the problem of approximate nearest neighbors in high dimensions, when the queries are lines. In this problem, given n points in R d , we want to construct a data structure to support efficiently the following queries: given a line L, report the point p closest to L. This problem generalizes the more familiar nearest neighbor problem. From a… (More)

- Khanh Do Ba, Huy L. Nguyen, Huy N. Nguyen, Ronitt Rubinfeld
- Theory Comput. Syst.
- 2011

We study the problem of estimating the Earth Mover's Distance (EMD) between probability distributions when given access only to samples. We give closeness testers and additive-error estimators over domains in [0, ∆] d , with sample complexities independent of domain size – permitting the testability even of continuous distributions over infinite domains.… (More)

- Alexandr Andoni, Huy L. Nguyen
- SODA
- 2013

We study the question of estimating the eigenvalues of a matrix in the streaming model, addressing a question posed in [Mut05]. We show that the eigenvalue " heavy hitters " of a matrix can be computed in a single pass. In particular, we show that the φ-heavy hitters (in the 1 or 2 norms) can be estimated in space proportional to 1/φ 2. Such a dependence on… (More)