# AllenNLP: A Deep Semantic Natural Language Processing Platform

@article{Gardner2018AllenNLPAD,
title={AllenNLP: A Deep Semantic Natural Language Processing Platform},
author={Matt Gardner and Joel Grus and Mark Neumann and Oyvind Tafjord and Pradeep Dasigi and Nelson F. Liu and Matthew E. Peters and Michael Schmitz and Luke Zettlemoyer},
journal={ArXiv},
year={2018},
volume={abs/1803.07640}
}
• Published 20 March 2018
• Computer Science
• ArXiv
This paper describes AllenNLP, a platform for research on deep learning methods in natural language understanding. [...] It also includes reference implementations of high-quality approaches for both core semantic problems (e.g. semantic role labeling (Palmer et al., 2005)) and language understanding applications (e.g. machine comprehension (Rajpurkar et al., 2016)). AllenNLP is an ongoing open-source effort maintained by engineers and researchers at the Allen Institute for Artificial Intelligence.
## 943 Citations

### DL4NLP 2019 Proceedings of the First NLPL Workshop on Deep Learning for Natural Language Processing

• Computer Science
• 2019
This work argues that the most natural formalization of definition modeling is to treat it as a sequence-to-sequence task, rather than a word-to-sequence task: given an input sequence with a highlighted word, generate a contextually appropriate definition for it.

### NaturalCC: A Toolkit to Naturalize the Source Code Corpus

• Computer Science
• ArXiv
• 2020
NaturalCC is an efficient and extensible toolkit, built upon Fairseq and PyTorch, that bridges the gap between natural language and programming language and facilitates research on big code analysis.

### Transformers: State-of-the-art Natural Language Processing

• Computer Science
• 2019
Transformers is a library for state-of-the-art NLP that makes these developments available to the community by gathering state-of-the-art general-purpose pretrained models under a unified API, together with an ecosystem of libraries, examples, tutorials and scripts targeting many downstream NLP tasks.

### Neural Semantic Parsing with Anonymization for Command Understanding in General-Purpose Service Robots

• Computer Science
• RoboCup
• 2019
This work proposes an approach that leverages neural semantic parsing methods in combination with contextual word embeddings to enable the training of a semantic parser with little data and without domain specific parser engineering.

### HuggingFace's Transformers: State-of-the-art Natural Language Processing

• Computer Science
• ArXiv
• 2019
The Transformers library is an open-source library that consists of carefully engineered state-of-the-art Transformer architectures under a unified API and a curated collection of pretrained models made by and available for the community.

### Probing Natural Language Inference Models through Semantic Fragments

• Computer Science
• AAAI
• 2020
This work proposes the use of semantic fragments—systematically generated datasets that each target a different semantic phenomenon—for probing, and efficiently improving, such capabilities of linguistic models.

### SciBERT: A Pretrained Language Model for Scientific Text

• Computer Science
• EMNLP
• 2019
SciBERT leverages unsupervised pretraining on a large multi-domain corpus of scientific publications to improve performance on downstream scientific NLP tasks and demonstrates statistically significant improvements over BERT.

### Transformers: State-of-the-Art Natural Language Processing

• Computer Science
• EMNLP
• 2020
Transformers is an open-source library that consists of carefully engineered state-of-the-art Transformer architectures under a unified API and a curated collection of pretrained models made by and available for the community.

### GLUE: A Multi-Task Benchmark and Analysis Platform for Natural Language Understanding

• Computer Science
• BlackboxNLP@EMNLP
• 2018
GLUE comprises a benchmark of nine diverse NLU tasks, an auxiliary dataset for probing models for understanding of specific linguistic phenomena, and an online platform for evaluating and comparing models. The benchmark favors models that can represent linguistic knowledge in a way that facilitates sample-efficient learning and effective knowledge transfer across tasks.

### A Data-Centric Framework for Composable NLP Workflows

• Computer Science
• EMNLP
• 2020
This work presents a unified open-source framework that supports fast development of sophisticated NLP workflows in a composable manner, and introduces a uniform data representation to encode heterogeneous results from a wide range of NLP tasks.

## References

### A large annotated corpus for learning natural language inference

• Computer Science
• EMNLP
• 2015
The Stanford Natural Language Inference corpus is introduced, a new, freely available collection of labeled sentence pairs, written by humans doing a novel grounded task based on image captioning, which allows a neural network-based model to perform competitively on natural language inference benchmarks for the first time.

### Deep Semantic Role Labeling: What Works and What's Next

• Computer Science
• ACL
• 2017
We introduce a new deep learning model for semantic role labeling (SRL) that significantly improves the state of the art, along with detailed analyses to reveal its strengths and limitations. We use

### End-to-end learning of semantic role labeling using recurrent neural networks

• Computer Science
• ACL
• 2015
This work proposes using a deep bi-directional recurrent network as an end-to-end system for SRL, which takes only the original text as input features, without using any syntactic knowledge.

### Steven Bird, Ewan Klein and Edward Loper: Natural Language Processing with Python, Analyzing Text with the Natural Language Toolkit

The book is a practical guide to NLP achieving a balance between NLP theory and practical programming skills, and alternates between focusing on natural language, supported by pertinent programming examples, or focusing on the Python programming language while linguistic examples play a supporting role.

### Neural Semantic Parsing with Type Constraints for Semi-Structured Tables

• Computer Science
• EMNLP
• 2017
A new semantic parsing model for answering compositional questions on semi-structured Wikipedia tables achieves state-of-the-art accuracy, and the work shows that type constraints and entity linking are valuable components to incorporate in neural semantic parsers.

### The Stanford CoreNLP Natural Language Processing Toolkit

• Computer Science
• ACL
• 2014
The design and use of the Stanford CoreNLP toolkit is described, an extensible pipeline that provides core natural language analysis, and it is suggested that this follows from a simple, approachable design, straightforward interfaces, the inclusion of robust and good quality analysis components, and not requiring use of a large amount of associated baggage.

### A Decomposable Attention Model for Natural Language Inference

• Computer Science
• EMNLP
• 2016
We propose a simple neural architecture for natural language inference. Our approach uses attention to decompose the problem into subproblems that can be solved separately, thus making it trivially

### ParlAI: A Dialog Research Software Platform

• Computer Science
• EMNLP
• 2017
ParlAI (pronounced “par-lay”), an open-source software platform for dialog research implemented in Python, is introduced. It provides a unified framework for sharing, training and testing dialog models, along with integration of Amazon Mechanical Turk for data collection, human evaluation, and online/reinforcement learning.

### End-to-end Neural Coreference Resolution

• Computer Science
• EMNLP
• 2017
This work introduces the first end-to-end coreference resolution model, which is trained to maximize the marginal likelihood of gold antecedent spans from coreference clusters and is factored to enable aggressive pruning of potential mentions.

### Deep Contextualized Word Representations

• Computer Science
• NAACL
• 2018
A new type of deep contextualized word representation is introduced that models both complex characteristics of word use and how these uses vary across linguistic contexts, allowing downstream models to mix different types of semi-supervision signals.