Understanding Learning Dynamics Of Language Models with SVCCA
@inproceedings{Saphra2019UnderstandingLD,
  title={Understanding Learning Dynamics Of Language Models with SVCCA},
  author={Naomi Saphra and A. Lopez},
  booktitle={NAACL-HLT},
  year={2019}
}
Research has shown that neural models implicitly encode linguistic features, but there has been no research showing \emph{how} these encodings arise as the models are trained. We present the first study on the learning dynamics of neural language models, using a simple and flexible analysis method called Singular Vector Canonical Correlation Analysis (SVCCA), which enables us to compare learned representations across time and across models, without the need to evaluate directly on annotated…
37 Citations
Word Interdependence Exposes How LSTMs Compose Representations
- Computer Science, Mathematics
- ArXiv
- 2020
- 3
Emergent linguistic structure in artificial neural networks trained by self-supervision
- Computer Science, Medicine
- Proceedings of the National Academy of Sciences
- 2020
- 25
References
Language Modeling Teaches You More Syntax than Translation Does: Lessons Learned Through Auxiliary Task Analysis
- Computer Science
- BlackboxNLP@EMNLP
- 2018
- 37
Under the Hood: Using Diagnostic Classifiers to Investigate and Improve how Language Models Track Agreement Information
- Computer Science
- BlackboxNLP@EMNLP
- 2018
- 64
Assessing the Ability of LSTMs to Learn Syntax-Sensitive Dependencies
- Computer Science
- Transactions of the Association for Computational Linguistics
- 2016
- 451
- Highly Influential
Representation of Linguistic Form and Function in Recurrent Neural Networks
- Computer Science
- Computational Linguistics
- 2017
- 108
- Highly Influential
Evaluating Layers of Representation in Neural Machine Translation on Part-of-Speech and Semantic Tagging Tasks
- Computer Science
- IJCNLP
- 2017
- 97
- Highly Influential
Encoding of phonology in a recurrent neural model of grounded speech
- Computer Science
- CoNLL
- 2017
- 38
- Highly Influential