Reddit Entity Linking Dataset

  title={Reddit Entity Linking Dataset},
  author={Nicholas Botzer and Yifan Ding and Tim Weninger},
  journal={Inf. Process. Manag.},

Figures and Tables from this paper

A Study on Entity Linking Across Domains”:" Which Data is Best for Fine-Tuning?

This paper aims at a system that enables linking mentions to entities from a general-domain KG and a domain-specific KG at the same time, and addresses the questions of which data is best suited for creating and fine-tuning that space, and whether fine- Tuning harms performance on the general domain.

Cross-Domain Neural Entity Linking

The objective is to have a single system that enables simultaneous linking to both the general-domain KB and the domain-specific KB, and this framework is proposed: Cross-Domain Neural Entity Linking (CDNEL).

OpenEL: An Annotated Corpus for Entity Linking and Discourse in Open Domain Dialogue

The results show that dialogue context is extremely beneficial for entity linking in conversations, with Flair + Blink achieving an F1 of 0.61 without discourse context, and the remaining performance gap between the baselines and human performance is demonstrated.

Keynote Talk: What kinds of questions have we been asking? A taxonomy for QA/RC benchmarks

An overview of the current landscape of resources for Question Answering and Reading comprehension is provided, highlighting the current lacunae for future work and a new taxonomy of “skills” targeted by QA/RC datasets is presented.

Building an Icelandic Entity Linking Corpus

This paper describes the approach of using a multilingual entity linking model (mGENRE) in combination with Wikipedia API Search (WAPIS) to label data and compares it to an approach using WAPIS only, finding that the combined method reaches 53.9% coverage on the corpus.

Posthoc Verification and the Fallibility of the Ground Truth

Compared to pre-annotation evaluation, state-of-the-art EL models performed extremely well according to the posthoc evaluation methodology, and predictions from EL models had a similar or higher verification rate than the ground truth.



Crowdsourcing Named Entity Recognition and Entity Linking Corpora

This chapter describes the experience with crowdsourcing a corpus containing named entity annotations and their linking to DBpedia, and defines the methodological framework for crowdsourcing entity annotated corpora, which combines expert-based and paid-for crowdsourcing.

Zero-Shot Entity Linking by Reading Entity Descriptions

It is shown that strong reading comprehension models pre-trained on large unlabeled data can be used to generalize to unseen entities and proposed domain-adaptive pre-training (DAP) is proposed to address the domain shift problem associated with linking unseen entities in a new domain.

Improving Entity Linking by Modeling Latent Relations between Mentions

This work treats relations as latent variables in the neural entity-linking model so that the injected structural bias helps to explain regularities in the training data and achieves the best reported scores on the standard benchmark and substantially outperforms its relation-agnostic version.

Fine-Grained Evaluation for Entity Linking

It is argued that no one definition of the Entity Linking task fits all, and rather a fine-grained categorization of different types of entity mentions and links is proposed.

Fine-Grained Entity Typing for Domain Independent Entity Linking

This work tackles the problem of building robust entity linking models that generalize effectively and do not rely on labeled entity linking data with a specific entity distribution, and models fine-grained entity properties, which can help disambiguate between even closely related entities.

Boosting Entity Linking Performance by Leveraging Unlabeled Documents

The resulting approach rivals fully-supervised state-of-the-art systems on standard test sets and approaches their performance in the very challenging setting: when tested on a test set sampled from the data used to estimate the supervised systems.

End-to-End Neural Entity Linking

This work proposes the first neural end-to-end EL system that jointly discovers and links entities in a text document and shows that it significantly outperforms popular systems on the Gerbil platform when enough training data is available.

Neural Entity Linking: A Survey of Models based on Deep Learning

This work distills a generic architecture of a neural EL system and discusses its components, such as candidate generation, mention-context encoding, and entity ranking, summarizing prominent methods for each of them.

To Link or Not to Link? A Study on End-to-End Tweet Entity Linking

A structural SVM algorithm for entity linking is proposed that jointly optimizes mention detection and entity disambiguation as a single end-to-end task and is able to outperform existing state-of-the art entity linking systems by 15% F1.

Joint Learning of the Embedding of Words and Entities for Named Entity Disambiguation

A novel embedding method specifically designed for NED that jointly maps words and entities into the same continuous vector space and extends the skip-gram model by using two models.