What do RNN Language Models Learn about Filler-Gap Dependencies?
@inproceedings{Wilcox2018WhatDR, title={What do RNN Language Models Learn about Filler-Gap Dependencies?}, author={E. Wilcox and R. Levy and T. Morita and Richard Futrell}, booktitle={BlackboxNLP@EMNLP}, year={2018} }
RNN language models have achieved state-of-the-art perplexity results and have proven useful in a suite of NLP tasks, but it is as yet unclear what syntactic generalizations they learn. Here we investigate whether state-of-the-art RNN language models represent long-distance filler-gap dependencies and constraints on them. Examining RNN behavior on experimentally controlled sentences designed to expose filler-gap dependencies, we show that RNNs can represent the relationship in multiple… CONTINUE READING
60 Citations
What Don’t RNN Language Models Learn About Filler-Gap Dependencies?
- Computer Science
- SCIL
- 2020
- 7
- Highly Influenced
- PDF
Structural Supervision Improves Learning of Non-Local Grammatical Dependencies
- Computer Science
- NAACL-HLT
- 2019
- 19
- PDF
Representation of Constituents in Neural Language Models: Coordination Phrase as a Case Study
- Computer Science
- EMNLP/IJCNLP
- 2019
- 6
- PDF
Hierarchical Representation in Neural Language Models: Suppression and Recovery of Expectations
- Computer Science
- ACL 2019
- 2019
- 10
- PDF
Assessing the ability of Transformer-based Neural Models to represent structurally unbounded dependencies
- Computer Science
- SCIL
- 2020
- 4
- Highly Influenced
- PDF
Linguistic Knowledge and Transferability of Contextual Representations
- Computer Science
- NAACL-HLT
- 2019
- 227
- PDF
References
SHOWING 1-10 OF 35 REFERENCES
Assessing the Ability of LSTMs to Learn Syntax-Sensitive Dependencies
- Computer Science
- Transactions of the Association for Computational Linguistics
- 2016
- 427
- PDF
RNN Simulations of Grammaticality Judgments on Long-distance Dependencies
- Computer Science
- COLING
- 2018
- 32
Syntactic Islands and Learning Biases: Combining Experimental Syntax and Computational Modeling to Investigate the Language Acquisition Problem
- Computer Science
- 2013
- 64
- PDF
One billion word benchmark for measuring progress in statistical language modeling
- Computer Science
- INTERSPEECH
- 2014
- 729
- PDF
Subjacency Constraints without Universal Grammar: Evidence from Artificial Language Learning and Con
- Computer Science
- 2000
- 43
- PDF