Connecting the dots between news articles

@inproceedings{Shahaf2011ConnectingTD,
  title={Connecting the dots between news articles},
  author={Dafna Shahaf and Carlos Guestrin},
  booktitle={IJCAI 2011},
  year={2011}
}
The process of extracting useful knowledge from large datasets has become one of the most pressing problems in today's society. The problem spans entire sectors, from scientists to intelligence analysts and web users, all of whom are constantly struggling to keep up with the larger and larger amounts of content published every day. With this much data, it is often easy to miss the big picture. In this paper, we investigate methods for automatically connecting the dots - providing a… 
Linking Multimedia Content for Efficient News Browsing
TLDR
An efficient way to generate links between news items in order to obtain an easily navigable graph, and enrich this graph by automatically typing created links is introduced.
Joint Inference of Concepts and Networks of Documents
TLDR
This thesis investigates the problem of finding semantic relationships between documents based on their textual content and describes a simple method to extract memes from the paper text using the DocNETINF algorithm to jointly infer concepts and the document network.
Learning to model relatedness for news recommendation
TLDR
This paper proposes a set of features to characterize relatedness between news articles across four aspects: relevance, novelty, connection clarity, and transition smoothness, and puts forward a learning approach to model relatedness.
Storytelling in entity networks to support intelligence analysts
TLDR
A system to automatically construct stories in entity networks that can help form directed chains of relationships, with support for co-referencing, evidence marshaling, and imposing syntactic constraints on the story generation process is presented.
A framework for intelligence analysis using spatio-temporal storytelling
TLDR
A novel method of spatio-temporal analysis on induced concept graphs that models storylines propagating through spatial regions in a time sequence is proposed and ConceptRank is described as a ranking strategy that differentiates strongly-typed connections from weakly-bound ones.
Helping Intelligence Analysts Make Connections
TLDR
A visual analytics system-- Analyst's Workspace (AW)--that integrates browsing tools with a storytelling algorithm in a large screen display environment that helps analysts systematically construct stories of desired fidelity from document collections and helps marshall evidence as longer stories are constructed.
Discovering Diverse and Salient Threads in Document Collections
TLDR
This work proposes a novel probabilistic technique for modeling and extracting salient structure from large document collections, particularly interested in revealing and exploiting relationships between documents.
Storytelling in heterogeneous Twitter entity network based on hierarchical cluster routing
TLDR
This work presents a system that automatically constructs stories by connecting entities in Twitter datasets using an entity similarity model that combines both traditional entity-related features and social network attributes and a novel story generation algorithm applied on the similarity model.
Mind the Gap:Newsfeed Visualisation with Metro Maps
TLDR
The results of the study support the hypothesis that metro map users demonstrate greater topic recall than users of an equivalent RSS reader, and recommendations for the commercial development of this and similar systems are discussed.
Exploring Event Evolution Patterns at the Atomic Level
TLDR
An unsupervised approach to explore event evolution patterns through extracting atomic event from documents, identifying their co-reference, and measuring their relationship automatically is proposed, and then the event evolution graph is constructed.
...
1
2
3
4
5
...

References

SHOWING 1-10 OF 11 REFERENCES
Connecting the dots between news articles
TLDR
This paper investigates methods for automatically connecting the dots -- providing a structured, easy way to navigate within a new topic and discover hidden connections in the news domain.
Event threading within news topics
TLDR
This work attempts to capture the rich structure of events and their dependencies in a news topic through the authors' event models, and takes into account novel features such as temporal locality of stories for event recognition and time-ordering for capturing dependencies.
Authoritative sources in a hyperlinked environment
TLDR
This work proposes and test an algorithmic formulation of the notion of authority, based on the relationship between a set of relevant authoritative pages and the set of “hub pages” that join them together in the link structure, and has connections to the eigenvectors of certain matrices associated with the link graph.
Maximizing the spread of influence through a social network
TLDR
An analysis framework based on submodular functions shows that a natural greedy strategy obtains a solution that is provably within 63% of optimal for several classes of models, and suggests a general approach for reasoning about the performance guarantees of algorithms for these types of influence problems in social networks.
Discovering evolutionary theme patterns from text: an exploration of temporal text mining
TLDR
This paper studies a particular TTM task -- discovering and summarizing the evolutionary patterns of themes in a text stream and presents general probabilistic methods for solving this problem through discovering latent themes from text and constructing an evolution graph of themes.
A Computational Model of Inferencing in Narrative
TLDR
This paper presents a method for generating causal and intentional inferences, in the form of sequences of events, from a narrative discourse, and defines cognitive criteria of necessitated inferences with regards to these representations, and shows how a partial order planner can determine which inferences are enabled.
Threading Electronic Mail - A Preliminary Study
Learning approaches for detecting and tracking news events
TLDR
The authors extend existing supervised-learning and unsupervised-clustering algorithms to allow document classification based on the information content and temporal aspects of news events to be classified using manually segmented documents.
Generalized best-first search strategies and the optimality of A*
TLDR
It is shown that several known properties of A* retain their form and it is also shown that no optimal algorithm exists, but if the performance tests are confirmed to cases in which the estimates are also consistent, then A* is indeed optimal.
...
1
2
...