Corpus ID: 11591887

Offline bilingual word vectors, orthogonal transformations and the inverted softmax

@article{Smith2017OfflineBW,
  title={Offline bilingual word vectors, orthogonal transformations and the inverted softmax},
  author={S. L. Smith and David H. P. Turban and S. Hamblin and N. Hammerla},
  journal={ArXiv},
  year={2017},
  volume={abs/1702.03859}
}
Usually bilingual word vectors are trained "online". Mikolov et al. showed they can also be found "offline", whereby two pre-trained embeddings are aligned with a linear transformation, using dictionaries compiled from expert knowledge. In this work, we prove that the linear transformation between two spaces should be orthogonal. This transformation can be obtained using the singular value decomposition. We introduce a novel "inverted softmax" for identifying translation pairs, with which we… Expand
362 Citations
Density Matching for Bilingual Word Embedding
  • 17
  • PDF
Unsupervised Cross-lingual Transfer of Word Embedding Spaces
  • 58
  • PDF
Unsupervised Cross-lingual Word Embeddings Based on Subword Alignment
  • PDF
Towards a Universal Semantic Dictionary
  • PDF
Cross-Lingual Word Embeddings for Turkic Languages
  • 2
  • PDF
Comparing Unsupervised Word Translation Methods Step by Step
  • 7
  • PDF
Bilingual Dictionary Based Neural Machine Translation without Using Parallel Sentences
  • 3
  • PDF
...
1
2
3
4
5
...

References

SHOWING 1-10 OF 22 REFERENCES
BilBOWA: Fast Bilingual Distributed Representations without Word Alignments
  • 334
  • PDF
An Autoencoder Approach to Learning Bilingual Word Representations
  • 291
  • PDF
Learning principled bilingual mappings of word embeddings while preserving monolingual invariance
  • 237
  • Highly Influential
  • PDF
Normalized Word Embedding and Orthogonal Transform for Bilingual Word Translation
  • 279
  • PDF
Exploiting Similarities among Languages for Machine Translation
  • 1,143
  • Highly Influential
  • PDF
Multilingual Distributed Representations without Word Alignment
  • 140
  • PDF
Deep Multilingual Correlation for Improved Word Embeddings
  • 123
  • PDF
Inducing Crosslingual Distributed Representations of Words
  • 330
  • PDF
Bilingual Word Embeddings for Phrase-Based Machine Translation
  • 495
  • PDF
Improving Vector Space Word Representations Using Multilingual Correlation
  • 513
  • Highly Influential
  • PDF
...
1
2
3
...