Towards Monitoring of Novel Statements in the News

@inproceedings{Frber2016TowardsMO,
  title={Towards Monitoring of Novel Statements in the News},
  author={Michael F{\"a}rber and Achim Rettinger and A. Harth},
  booktitle={ESWC},
  year={2016}
}
In media monitoring users have a clearly defined information need to find so far unknown statements regarding certain entities or relations mentioned in natural-language text. However, commonly used keyword-based search technologies are focused on finding relevant documents and cannot judge the novelty of statements contained in the text. In this work, we propose a new semantic novelty measure that allows to retrieve statements, which are both novel and relevant, from natural-language sentences… 
A Linked Data wrapper for CrunchBase
TLDR
This work describes how to harvest the RDF data to obtain a local copy of the data for further processing and querying that goes beyond the access facilities of the CrunchBase API and describes the cases in which the Linked Data API for CrunchBase and the crawled CrunchBase RDF Data have been used in other works.
Linked Crunchbase: A Linked Data API and RDF Data Set About Innovative Companies
TLDR
This paper presents how to bring Crunchbase to the Web of Data so that its data can be used in the machine-readable RDF format by anyone on the Web and presents the method for crawling RDF data based on this API to build a custom Crunchbase RDF knowledge graph.
Linked Crunchbase: A Linked Data API and RDF Data Set About Innovative Companies
TLDR
This paper presents how to bring Crunchbase to the Web of Data so that its data can be used in the machine-readable RDF format by anyone on the Web, and presents the method for crawling RDF data based on this API to build a custom Crunchbase RDF knowledge graph.

References

SHOWING 1-10 OF 23 REFERENCES
Novelty Detection: The TREC Experience
TLDR
In the TREC novelty track, the task was to highlight sentences containing relevant and new information in a short, topical document stream, and it was found that opinions are much harder to track than events.
Novelty detection based on sentence level patterns
TLDR
A new novelty detection approach based on the identification of sentence level patterns is proposed that significantly outperforms all three baselines in terms of precision at top ranks.
Identifying Relations for Open Information Extraction
TLDR
Two simple syntactic and lexical constraints on binary relations expressed by verbs are introduced in the ReVerb Open IE system, which more than doubles the area under the precision-recall curve relative to previous extractors such as TextRunner and woepos.
Open Language Learning for Information Extraction
Open Information Extraction (IE) systems extract relational tuples from text, without requiring a pre-specified vocabulary, by identifying relation phrases and associated arguments in arbitrary
ClausIE: clause-based open information extraction
TLDR
ClausIE is a novel, clause-based approach to open information extraction, which extracts relations and their arguments from natural language text using a small set of domain-independent lexica, operates sentence by sentence without any post-processing, and requires no training data.
LODifier: Generating Linked Data from Unstructured Text
TLDR
LODifier is an approach that combines deep semantic analysis with named entity recognition, word sense disambiguation and controlled Semantic Web vocabularies in order to extract named entities and relations between them from text and to convert them into an RDF representation which is linked to DBpedia and WordNet.
Newsjunkie: providing personalized newsfeeds via analysis of information novelty
TLDR
Newsjunkie is described, a system that personalizes news for users by identifying the novelty of stories in the context of stories they have already reviewed, and employs novelty-analysis algorithms that represent articles as words and named entities.
X-LiSA: Cross-lingual Semantic Annotation
TLDR
X-LiSA is demonstrated, an infrastructure for cross-lingual semantic annotation, which supports both service-oriented and user-oriented interfaces for annotating text documents and web pages in different languages using resources from Wikipedia and Linked Open Data.
Semantifying Triples from Open Information Extraction Systems
TLDR
This work semantify NELL triples using DBPEDIA to enrich the later with high precision novel facts and present a statistical approach to discover new knowledge.
Efficient Online Novelty Detection in News Streams
TLDR
This paper proposes a new novelty detection algorithm based on the Inverse Document Frequency (IDF) scoring function that enables us to avoid similarity comparisons with previous documents in the text stream, thus leading to faster execution times.
...
1
2
3
...