Entity Linking for Queries by Searching Wikipedia Sentences

@inproceedings{Tan2017EntityLF,
  title={Entity Linking for Queries by Searching Wikipedia Sentences},
  author={Chuanqi Tan and Furu Wei and Pengjie Ren and Weifeng Lv and M. Zhou},
  booktitle={EMNLP},
  year={2017}
}
We present a simple yet effective approach for linking entities in queries. The key idea is to search sentences similar to a query from Wikipedia articles and directly use the human-annotated entities in the similar sentences as candidate entities for the query. Then, we employ a rich set of features, such as link-probability, context-matching, word embeddings, and relatedness among candidate entities as well as their related entities, to rank the candidates under a regression based framework… 
High Quality Candidate Generation and Sequential Graph Attention Network for Entity Linking
TLDR
A Sequential Graph Attention Network (SeqGAT) which combines the advantages of graph and sequence methods and achieves significant improvements over the state-of-the-art methods forEntity Linking.
SMAPH: A Piggyback Approach for Entity-Linking in Web Queries
TLDR
Three versions of SMAPH are proposed that outperform state-of-the-art solutions on the known benchmarks and on the GERDAQ dataset, a novel dataset that is built specifically for this problem via crowd-sourcing and that is made publicly available.
0 SMAPH : A Piggyback Approach for Entity-Linking in Web eries
We study the problem of linking the terms of a web-search query to a semantic representation given by the set of entities (aka concepts) mentioned in it. We introduce SMAPH, a system that performs
Joint Entity Linking with Deep Reinforcement Learning
TLDR
This work converts the global linking into a sequence decision problem and proposes a reinforcement learning model which makes decisions from a global perspective, which outperforms state-of-the-art systems and has better generalization performance.
Neural Cross-Lingual Entity Linking
TLDR
This paper proposes a neural EL model that trains fine-grained similarities and dissimilarities between the query and candidate document from multiple perspectives, combined with convolution and tensor networks and shows that this English-trained system can be applied, in zero-shot learning, to other languages by making surprisingly effective use of multi-lingual embeddings.
Location-Aware Named Entity Disambiguation
TLDR
It is shown that the spatial dimension is crucial in disambiguating named entities and that the location inference is less utilized in many NED systems and a location-aware NED framework is proposed that resolves ambiguities in text with little other contextual cues.
Entity Linking within a Social Media Platform: A Case Study on Yelp
TLDR
A new entity linking problem where both the entity mentions and the target entities are within a same social media platform, where the business mentions in Yelp reviews are linked to their corresponding businesses on the platform.
QUEACO: Borrowing Treasures from Weakly-labeled Behavior Data for Query Attribute Value Extraction
TLDR
A unified query attribute value extraction system in e-commerce search named QUEACO, which involves both two phases of named entity recognition and attribute value normalization and leverages weakly-labeled query-to-attribute behavior data to normalize surface form attribute values from queries into canonical forms from products.
Wikipedia as a Resource for Text Analysis and Retrieval
TLDR
The role of Wikipedia is examined in tasks related to text analysis and retrieval, which include coreference resolution, word sense and entity disambiguation and information extraction.
...
1
2
...

References

SHOWING 1-10 OF 26 REFERENCES
A Piggyback System for Joint Entity Mention Detection and Linking in Web Queries
TLDR
This paper introduces SMAPH-2, a second-order approach that, by piggybacking on a web search engine, alleviates the noise and irregularities that characterize the language of queries and puts queries in a larger context in which it is easier to make sense of them.
Fast and Space-Efficient Entity Linking for Queries
TLDR
This paper proposes a probabilistic model that leverages user-generated information on the web to link queries to entities in a knowledge base and significantly outperforms several state-of-the-art baselines while being able to process queries in sub-millisecond times---at least two orders of magnitude faster than existing systems.
A search based approach to entity recognition: magnetic and IISAS team at ERD challenge
TLDR
This write-up describes Magnetic-IISAS team's approach to the entity recognition in search queries with which they have participated in ERD 2014 challenge and discusses successful as well as unsuccessful attempts to improve the quality of system results.
Robust Disambiguation of Named Entities in Text
TLDR
A robust method for collective disambiguation is presented, by harnessing context from knowledge bases and using a new form of coherence graph that significantly outperforms prior methods in terms of accuracy, with robust behavior across a variety of inputs.
Entity linking by focusing DBpedia candidate entities
TLDR
This paper proposes to use text pre-processing and parameter tuning to "focus" a general-purpose EL system to perform better on different kinds of input text and design a classifier to automatically classify DBpedia Spotlight's output entities as "NIL" or "Not NIL".
Entity linking based on the co-occurrence graph and entity probability
TLDR
This paper describes the system and its components in depth, together with their influence on performance, for the Entity Recognition and Disambiguation Challenge 2014.
The SMAPH system for query entity recognition and disambiguation
TLDR
The SMAPH system implements a pipeline of four main steps that fetches the search results returned by a search engine given the query to be annotated, and a binary SVM classifier is used to decide which entities to keep/discard in order to generate the final annotation set for the query.
Learning to link with wikipedia
TLDR
This paper explains how machine learning can be used to identify significant terms within unstructured text, and enrich it with links to the appropriate Wikipedia articles, and performs very well, with recall and precision of almost 75%.
Design Challenges for Entity Linking
TLDR
This work analyzes differences between several versions of the EL problem and presents a simple yet effective, modular, unsupervised system, called Vinculum, for entity linking, and elucidate key aspects of the system that include mention extraction, candidate generation, entity type prediction, entity coreference, and coherence.
An effective, low-cost measure of semantic relatedness obtained from Wikipedia links
TLDR
This paper describes a new technique for obtaining measures of semantic relatedness that uses Wikipedia to provide structured world knowledge about the terms of interest using the hyperlink structure of Wikipedia rather than its category hierarchy or textual content.
...
1
2
3
...