Language Identification for Creating Language-Specific Twitter Collections
Social media services such as Twitter offer an immense volume of real-world linguistic data. We explore the use of Twitter to obtain authentic user-generated text in low-resource languages such as…
  • 100
  • 15
"Beacons of hope" in decentralized coordination: learning from on-the-ground medical twitterers during the 2010 Haiti earthquake
We examine the public, social media communications of 110 emergency medical response teams and organizations in the immediate aftermath of the January 12, 2010 Haiti earthquake. We found the teams…
  • 99
  • 8
Blogs as a collective war diary
Disaster-related research in human-centered computing has typically focused on the shorter-term, emergency period of a disaster event, whereas effects of some crises are long-term, lasting years.
  • 45
  • 3
CLIP at TREC 2015: Microblog and LiveQA
The Computational Linguistics and Information Processing lab at the University of Maryland participated in two TREC tracks this year. The Microblog Real-Time Filtering and the LiveQA tasks both…
  • 18
  • 2
On Predicting Deletions of Microblog Posts
Among the many classification tasks on Twitter content, predicting whether a tweet will be deleted has to date received relatively little attention. Deletions occur for a variety of reasons, which…
  • 12
  • 2
What Questions Do Journalists Ask on Twitter?
Social media platforms are a major source of information for both the general public and for journalists. Journalists use Twitter and other social media services to gather story ideas, to find…
  • 4
  • 2
Journalists and Twitter: A Multidimensional Quantitative Description of Usage Patterns
We conduct a large scale quantitative comparison of the usage pattern of a microblogging service by journalists, news organizations, and news consumers. Through two statistical tests of eighteen…
  • 7
  • 1
CLIP at TREC 2016: LiveQA and RTS
The Computational Linguistics and Information Processing lab at the University of Maryland participated in two TREC tracks this year. The LiveQA and the Real-Time Summarization tasks both involve…
  • 4
  • 1
EpiViz: A Visual Analytic Tool for Epigenetic Features
We propose EpiViz, a new visualization tool for genomic and epigenetic data. This tool focuses on the correlation between different types of measurements, as well as different views of the same data,…
  • 1
  • 1
Towards minimizing the annotation cost of certified text classification
The common practice of testing a sequence of text classifiers learned on a growing training set, and stopping when a target value of estimated effectiveness is first met, introduces a sequential…
  • 16