How to (Properly) Evaluate Cross-Lingual Word Embeddings: On Strong Baselines, Comparative Analyses, and Some Misconceptions
@inproceedings{Glavas2019HowT, title={How to (Properly) Evaluate Cross-Lingual Word Embeddings: On Strong Baselines, Comparative Analyses, and Some Misconceptions}, author={Goran Glavas and Robert Litschko and Sebastian Ruder and Ivan Vulic}, booktitle={ACL}, year={2019} }
Cross-lingual word embeddings (CLEs) enable multilingual modeling of meaning and facilitate cross-lingual transfer of NLP models. Despite their ubiquitous usage in downstream tasks, recent increasingly popular projection-based CLE models are almost exclusively evaluated on a single task only: bilingual lexicon induction (BLI). Even BLI evaluations vary greatly, hindering our ability to correctly interpret performance and properties of different CLE models. In this work, we make the first step… Expand
91 Citations
Investigating Cross-Lingual Alignment Methods for Contextualized Embeddings with Token-Level Evaluation
- Computer Science
- CoNLL
- 2019
- 12
- PDF
Why Overfitting Isn’t Always Bad: Retrofitting Cross-Lingual Word Embeddings to Dictionaries
- Computer Science
- ACL
- 2020
- 3
- PDF
On the Robustness of Unsupervised and Semi-supervised Cross-lingual Word Embedding Learning
- Computer Science
- LREC
- 2020
- 12
- Highly Influenced
- PDF
Beyond Offline Mapping: Learning Cross Lingual Word Embeddings through Context Anchoring
- Computer Science
- ArXiv
- 2020
- Highly Influenced
- PDF
Lost in Embedding Space: Explaining Cross-Lingual Task Performance with Eigenvalue Divergence
- Computer Science
- ArXiv
- 2020
- 2
Training Effective Neural CLIR by Bridging the Translation Gap
- Computer Science
- SIGIR
- 2020
- Highly Influenced
Evaluating Multilingual Text Encoders for Unsupervised Cross-Lingual Retrieval
- Computer Science
- ECIR
- 2021
- 1
- PDF
References
SHOWING 1-10 OF 67 REFERENCES
A Strong Baseline for Learning Cross-Lingual Word Embeddings from Sentence Alignments
- Computer Science
- EACL
- 2017
- 50
- PDF
Unsupervised Cross-Lingual Information Retrieval Using Monolingual Data Only
- Computer Science
- SIGIR
- 2018
- 37
- PDF
A Survey of Cross-lingual Word Embedding Models
- Computer Science, Mathematics
- J. Artif. Intell. Res.
- 2019
- 218
- PDF
Ten Pairs to Tag - Multilingual POS Tagging via Coarse Mapping between Embeddings
- Computer Science
- HLT-NAACL
- 2016
- 87
- Highly Influential
- PDF
XNLI: Evaluating Cross-lingual Sentence Representations
- Computer Science
- EMNLP
- 2018
- 281
- Highly Influential
- PDF
Monolingual and Cross-Lingual Information Retrieval Models Based on (Bilingual) Word Embeddings
- Computer Science
- SIGIR
- 2015
- 218
- PDF
Massively Multilingual Sentence Embeddings for Zero-Shot Cross-Lingual Transfer and Beyond
- Computer Science
- Transactions of the Association for Computational Linguistics
- 2019
- 287
- PDF