• Publications
  • Influence
Intelligible Models for HealthCare: Predicting Pneumonia Risk and Hospital 30-day Readmission
TLDR
We present two case studies where high-performance generalized additive models with pairwise interactions (GA2Ms) are applied to real healthcare problems yielding intelligible models with state-of-the-art accuracy. Expand
  • 718
  • 44
  • PDF
An Unsupervised Aspect-Sentiment Model for Online Reviews
TLDR
We present an unsuper-vised system for extracting aspects and determining sentiment in review text. Expand
  • 481
  • 41
  • PDF
Inferring Strategies for Sentence Ordering in Multidocument News Summarization
TLDR
We propose a methodology for studying the properties of ordering information in the news genre and describe experiments done on a corpus of multiple acceptable orderings we developed for the task. Expand
  • 332
  • 31
  • PDF
A Comparison of Features for Automatic Readability Assessment
TLDR
Several sets of explanatory variables - including shallow, language modeling, POS, syntactic, and discourse features - are compared and evaluated in terms of their impact on predicting the grade level of reading material for primary school students. Expand
  • 192
  • 30
  • PDF
Putting it Simply: a Context-Aware Approach to Lexical Simplification
We present a method for lexical simplification. Simplification rules are learned from a comparable corpus, and the rules are applied in a context-aware fashion to input sentences. Our method isExpand
  • 151
  • 26
  • PDF
Beyond the Stars: Improving Rating Predictions using Review Text Content
TLDR
We propose new ad-hoc and regression-based recommendation measures, that both take into account the textual component of user reviews and use this information to improve user experience in accessing reviews. Expand
  • 396
  • 22
  • PDF
Diagnosis code assignment: models and evaluation metrics
TLDR
We propose novel evaluation metrics, which reflect the distances among gold-standard and predicted codes and their locations in the ICD9 tree. Expand
  • 142
  • 18
  • PDF
Multi-Label Classification of Patient Notes a Case Study on ICD Code Assignment
TLDR
We present Hierarchical Attention-GRU (HA- GRU), a hierarchical approach to tag a document by identifying the sentences relevant for each label. Expand
  • 96
  • 17
  • PDF
Sentence Alignment for Monolingual Comparable Corpora
TLDR
We address the problem of sentence alignment for monolingual corpora, a phenomenon distinct from alignment in parallel corpora. Expand
  • 201
  • 15
  • PDF
Mining a Lexicon of Technical Terms and Lay Equivalents
TLDR
We present a corpus-driven method for building a lexicon of semantically equivalent pairs of technical and lay medical terms. Expand
  • 93
  • 13
  • PDF
...
1
2
3
4
5
...