• Publications
  • Influence
'Beating the news' with EMBERS: forecasting civil unrest using open source indicators
TLDR
We describe the design, implementation, and evaluation of EMBERS, an automated, 24x7 continuous system for forecasting civil unrest across 10 countries of Latin America using open source indicators such as tweets, news sources, blogs, economic indicators, and other data sources. Expand
  • 186
  • 15
  • PDF
Automatic Discovery of Logical Document Structure
TLDR
The availability of large, heterogeneous repositories of electronic documents is increasing rapidly, and the need for flexible document manipulation tools is growing correspondingly. Expand
  • 45
  • 4
Near-wordless document structure classification
  • K. Summers
  • Computer Science
  • Proceedings of 3rd International Conference on…
  • 14 August 1995
TLDR
This paper proposes an approach to the classification of logical document structures, according to their distance from predefined prototypes. Expand
  • 22
  • 3
Analyzing Civil Unrest through Social Media
TLDR
Mining and analyzing data from social networks such as Twitter can reveal new insights into the causes of civil disturbances, including trigger events and the role of political entrepreneurs and organizations in galvanizing public opinion. Expand
  • 36
  • 2
  • PDF
Forecasting Significant Societal Events Using The Embers Streaming Predictive Analytics System
TLDR
Developed under the Intelligence Advanced Research Project Activity Open Source Indicators program, Early Model Based Event Recognition using Surrogates (EMBERS) is a large-scale big data analytics system for forecasting significant societal events on the basis of continuous, automated analysis of large volumes of publicly available data. Expand
  • 18
  • 2
  • PDF
Combining different classification approaches to improve off-line Arabic handwritten word recognition
TLDR
We address the problem of offline Arabic handwriting recognition of pre-segmented words by combining heterogeneous classification methodologies. Expand
  • 14
  • 2
  • PDF
Using White Space for Automated Document Structuring
TLDR
We present and analyze efficient algorithms for the automated recognition and interpretation of layout structures in electronic documents that use the patterns in the distribution of white space in a document to recognize and interpret its components. Expand
  • 38
  • 1
EMBERS at 4 years: Experiences operating an Open Source Indicators Forecasting System
TLDR
EMBERS is an anticipatory intelligence system forecasting population-level events in multiple countries of Latin America. Expand
  • 17
  • 1
  • PDF
Document image improvement for OCR as a classification problem
  • K. Summers
  • Computer Science, Engineering
  • IS&T/SPIE Electronic Imaging
  • 20 January 2003
TLDR
In support of the goal of automatically selecting methods of enhancing an image to improve the accuracy of OCR on that image, we consider the problem of determining whether to apply each of a set of methods as a supervised classification problem. Expand
  • 12
The EMBERS architecture for streaming predictive analytics
TLDR
Developed under the IARPA Open Source Initiative program, EMBERS (Early Model Based Event Recognition using Surrogates) is a large-scale Big-Data analytics system for forecasting significant societal events, such as civil unrest incidents and disease outbreaks on the basis of continuous, automated analysis of large volumes of publicly available data. Expand
  • 12
  • PDF