• Publications
  • Influence
Efficient Ranking from Pairwise Comparisons
The ranking of n objects based on pairwise comparisons is a core machine learning problem, arising in recommender systems, ad placement, player ranking, biological applications and others. In manyExpand
  • 156
  • 12
Bayesian Bias Mitigation for Crowdsourcing
Biased labelers are a systemic problem in crowdsourcing, and a comprehensive toolbox for handling their responses is still being developed. A typical crowdsourcing application can be divided intoExpand
  • 105
  • 8
Multiple novel gene-by-environment interactions modify the effect of FTO variants on body mass index
Genetic studies have shown that obesity risk is heritable and that, of the many common variants now associated with body mass index, those in an intron of the fat mass and obesity-associated (FTO)Expand
  • 73
  • 3
Active spectral clustering via iterative uncertainty reduction
Spectral clustering is a widely used method for organizing data that only relies on pairwise similarity measurements. This makes its application to non-vectorial data straight-forward in principle,Expand
  • 44
  • 3
Identifying loci affecting trait variability and detecting interactions in genome-wide association studies
Identification of genetic variants with effects on trait variability can provide insights into the biological mechanisms that control variation and can identify potential interactions. We propose aExpand
  • 20
  • 2
Heavy-Tailed Process Priors for Selective Shrinkage
Heavy-tailed distributions are often used to enhance the robustness of regression and classification methods to outliers in output space. Often, however, we are confronted with "outliers" in inputExpand
  • 12
  • 1
A Comparative Framework for Preconditioned Lasso Algorithms
The Lasso is a cornerstone of modern multivariate data analysis, yet its performance suffers in the common situation in which covariates are correlated. This limitation has led to a growing number ofExpand
  • 11
  • 1
Nonparametric Combinatorial Sequence Models
This work considers biological sequences that exhibit combinatorial structures in their composition: groups of positions of the aligned sequences are "linked" and covary as one unit across sequences.Expand
  • 3
  • 1
Nonparametric Combinatorial Sequence Models
This work considers biological sequences that exhibit combinatorial structures in their composition: groups of positions of the aligned sequences are "linked" and covary as one unit across sequences.Expand
  • 1
Anomaly Detection for Asynchronous and Incomplete Data
Traffic anomalies, node failures, and attacks are common occurrences in today’s computer networks, and identifying them quickly and accurately is important for any large network. Algorithms forExpand