• Corpus ID: 67337278

new/s/leak – A Tool for Visual Exploration of Large Text Document Collections in the Journalistic Doman

@inproceedings{Ballweg2016newsleakA,
  title={new/s/leak – A Tool for Visual Exploration of Large Text Document Collections in the Journalistic Doman},
  author={Kathrin Ballweg and Florian Zouhar and Patrick Wilhelmi-Dworski and Tatiana von Landesberger and Uli Fahrer and Alexander Panchenko and Seid Muhie Yimam and Chris Biemann and Michaela Regneri and Heiner Ulrich},
  year={2016}
}
Journalists strive for newsworthy stories for the public. To find those stories they need to explore and read documents from large collections such as the Kissinger Cables. This is very time consuming, since the the text document collections are too large to read them alone – even in a team. Interactive text visualization can support journalists in this endeavor. Several tools exists, but interviews with our collaboration journalists revealed their various drawbacks. Therefore, we develop and… 
Guidance for Multi-Type Entity Graphs from Text Collections
TLDR
A novel DOI for graphs with multiple types of entities is presented and the user can interactively set her interest in entity types and entity relations to be applied to a graph extracted from WikiLeaks PlusD Cablegate documents.
Storyfinder: Personalized Knowledge Base Construction and Management by Browsing the Web
This paper presents Storyfinder, an application which consists of a browser plugin and a web server backend with the goal to highlight and manage the information contained in web pages by combining
Adaptive Approaches to Natural Language Processing in Annotation and Application
TLDR
This thesis embeds an adaptive machine learning model into the application that can continuously learn and update its model from usage data and designs different tools for rapid, adaptive, and personalized annotations.

References

SHOWING 1-10 OF 14 REFERENCES
Overview: The Design, Adoption, and Analysis of a Visual Document Mining Tool for Investigative Journalists
TLDR
The frequently-used language of “exploring” a document collection is both too vague and too narrow to capture how journalists actually used the Overview application, which is presented, an application for the systematic analysis of large document collections based on document clustering, visualization, and tagging.
new/s/leak - Information Extraction and Visualization for Investigative Data Journalists
TLDR
New/s/leak, a novel tool developed for and with the help of journalists, which enables the automatic analysis and discovery of newsworthy stories from large textual datasets, features an intuitive web-based user interface based on network visualization combined with data exploring methods and various search and faceting mechanisms.
Combining Computational Analyses and Interactive Visualization for Document Exploration and Sensemaking in Jigsaw
TLDR
A visual analytics approach that integrates multiple text analysis algorithms with a suite of interactive visualizations to provide a flexible and powerful environment that allows analysts to explore collections of documents while sensemaking.
Networks of Names: Visual Exploration and Semi‐Automatic Tagging of Social Networks from Newspaper Articles
TLDR
A new visual analytics system, which offers interactive exploration and tagging of social networks extracted from newspapers, and improves the seminal definition of DOI, which is found to produce the same “globally interesting” neighbourhoods in the use case, regardless of the query.
VAiRoma: A Visual Analytics System for Making Sense of Places, Times, and Events in Roman History
TLDR
VAiRoma is a visual analytics system that couples state-of-the-art text analysis methods with an intuitive visual interface to help users make sense of events, places, times, and more importantly, the relationships between them and allows users to learn and create new knowledge regarding Roman history in an informed way.
TimeLineCurator: Interactive Authoring of Visual Timelines from Unstructured Text
We present TimeLineCurator, a browser-based authoring tool that automatically extracts event data from temporal references in unstructured text documents using natural language processing and encodes
On Close and Distant Reading in Digital Humanities: A Survey and Future Challenges
TLDR
A taxonomy of applied methods for close and distant reading, and approaches that combine both reading techniques to provide a multifaceted view of the data are provided.
Text Visualization Browser : A Visual Survey of Text Visualization Techniques
TLDR
An interactive visual survey of text visualization techniques that can be used for the purposes of search for related work, introduction to the subfield and gaining insight into research trends are presented.
Speculative Practices: Utilizing InfoVis to Explore Untapped Literary Collections
TLDR
This paper exemplifies how information visualization supports speculative thinking, hypotheses testing, and preliminary interpretation processes as part of literary research and suggests a design space for visualizing literary collections that is defined by their academic and public relevance.
Graphs, Maps, Trees: Abstract Models for a Literary History
"In this groundbreaking book, Franco Moretti argues that literature scholars should stop reading books and start counting, graphing, and mapping them instead. In place of the traditionally selective
...
...