• Publications
  • Influence
Slicing: A New Approach for Privacy Preserving Data Publishing
TLDR
We present a new approach called slicing to privacy-preserving microdata publishing. Expand
  • 306
  • 33
  • PDF
Blockwise coordinate descent procedures for the multi-task lasso, with applications to neural semantic basis discovery
TLDR
We develop a cyclical blockwise coordinate descent algorithm for the multi-task Lasso that efficiently solves problems with thousands of features and tasks. Expand
  • 197
  • 19
  • PDF
Topic-conditioned novelty detection
TLDR
We propose a new approach which addresses this problem in two stages: 1) using a supervised learning algorithm to classify the on-line document stream into pre-defined broad topic categories, and 2) performing topic-conditioned novelty detection for documents in each topic. Expand
  • 241
  • 16
  • PDF
Modified Logistic Regression: An Approximation to SVM and Its Applications in Large-Scale Text Categorization
TLDR
In this paper, we use a modified version of LR to approximate the optimization of SVM by a sequence of unconstrained optimization problems. Expand
  • 148
  • 14
  • PDF
PI3K/Akt signaling in osteosarcoma.
Osteosarcoma (OS) is the most common nonhematologic bone malignancy in children and adolescents. Despite the advances of adjuvant chemotherapy and significant improvement of survival, the prognosisExpand
  • 164
  • 10
Robustness of adaptive filtering methods in a cross-benchmark evaluation
TLDR
This paper reports a cross-benchmark evaluation of regularized logistic regression (LR) and incremental Rocchio for adaptive filtering on Topic Detection and Tracking corpora. Expand
  • 58
  • 9
  • PDF
Effect of Time Spent Outdoors at School on the Development of Myopia Among Children in China: A Randomized Clinical Trial.
IMPORTANCE Myopia has reached epidemic levels in parts of East and Southeast Asia. However, there is no effective intervention to prevent the development of myopia. OBJECTIVE To assess the efficacyExpand
  • 326
  • 8
  • PDF
A scalability analysis of classifiers in text categorization
TLDR
This paper addresses the problem with respect to a set of popular algorithms in text categorization, including Support Vector Machines, k-nearest neighbor, ridge regression, linear least square fit and logistic regression. Expand
  • 200
  • 8
  • PDF
Learning Multiple Related Tasks using Latent Independent Component Analysis
TLDR
We propose a probabilistic model based on Independent Component Analysis for learning multiple related tasks. Expand
  • 125
  • 8
  • PDF
Modeling and Integrating Background Knowledge in Data Anonymization
TLDR
This paper presents a general framework for modeling the adversary's background knowledge using kernel estimation methods. Expand
  • 74
  • 8
  • PDF