• Publications
  • Influence
LexRank: Graph-based Lexical Centrality as Salience in Text Summarization
TLDR
We introduce a stochastic graph-based method for computing relative importance of textual units for Natural Language Processing. Expand
  • 2,153
  • 275
  • PDF
TimeML: Robust Specification of Event and Temporal Expressions in Text
TLDR
We provide a description of TimeML, a rich specification language for event and temporal expressions in natural language text, developed in the context of the AQUAINT program on Question Answering Systems. Expand
  • 765
  • 133
  • PDF
Centroid-based summarization of multiple documents
TLDR
We present a multi-document summarizer, MEAD, which generates summaries using cluster centroids produced by a topic detection and tracking system. Expand
  • 1,089
  • 90
  • PDF
Spider: A Large-Scale Human-Labeled Dataset for Complex and Cross-Domain Semantic Parsing and Text-to-SQL Task
TLDR
We present Spider, a large-scale complex and cross-domain semantic parsing and text-to-SQL task so that different complicated SQL queries and databases appear in train and test sets. Expand
  • 169
  • 57
  • PDF
How to Analyze Political Attention with Minimal Assumptions and Costs
TLDR
We describe a topic model for legislative speech, a statistical learning model that uses word choices to infer topical categories covered in a set of speeches and to identify the topic of specific speeches. Expand
  • 458
  • 52
  • PDF
The ACL anthology network corpus
TLDR
We introduce the ACL Anthology Network (AAN), a comprehensive manually curated networked database of citations, collaborations, and summaries in the field of Computational Linguistics. Expand
  • 245
  • 45
  • PDF
TypeSQL: Knowledge-Based Type-Aware Neural Text-to-SQL Generation
TLDR
We propose TYPESQL for text-to-SQL which views the problem as a slot filling task and uses type information to better understand rare entities and numbers in the input. Expand
  • 116
  • 45
  • PDF
Rumor has it: Identifying Misinformation in Microblogs
TLDR
We address the problem of rumor detection in microblogs and explore the effectiveness of 3 categories of features: content based, network-based, and microblog-specific memes for correctly identifying rumors. Expand
  • 610
  • 40
  • PDF
SyntaxSQLNet: Syntax Tree Networks for Complex and Cross-Domain Text-to-SQL Task
TLDR
We propose SyntaxSQLNet, a syntax tree network to address the complex and cross-domain text-to-SQL generation task. Expand
  • 76
  • 39
  • PDF
Centroid-based summarization of multiple documents: sentence extraction utility-based evaluation, and user studies
TLDR
We present a multi-document summarizer, called MEAD, which generates summaries using cluster centroids produced by a topic detection and tracking system. Expand
  • 536
  • 37
  • PDF
...
1
2
3
4
5
...