Monolingual and Cross-Lingual Information Retrieval Models Based on (Bilingual) Word Embeddings

@inproceedings{Vulic2015MonolingualAC,
  title={Monolingual and Cross-Lingual Information Retrieval Models Based on (Bilingual) Word Embeddings},
  author={Ivan Vulic and Marie-Francine Moens},
  booktitle={Proceedings of the 38th International ACM SIGIR Conference on Research and Development in Information Retrieval},
  year={2015}
}
  • Ivan Vulic, Marie-Francine Moens
  • Published 2015
  • Computer Science
  • Proceedings of the 38th International ACM SIGIR Conference on Research and Development in Information Retrieval
We propose a new unified framework for monolingual (MoIR) and cross-lingual information retrieval (CLIR) which relies on the induction of dense real-valued word vectors known as word embeddings (WE) from comparable data. To this end, we make several important contributions: (1) We present a novel word representation learning model called Bilingual Word Embeddings Skip-Gram (BWESG) which is the first model able to learn bilingual word embeddings solely on the basis of document-aligned comparable…
218 Citations
Unsupervised Cross-Lingual Information Retrieval Using Monolingual Data Only
  • 37
Exploring Implicit Semantic Constraints for Bilingual Word Embeddings
Using Communities of Words Derived from Multilingual Word Vectors for Cross-Language Information Retrieval in Indian Languages
  • 5
Using Word Embeddings for Query Translation for Hindi to English Cross Language Information Retrieval
  • 7
On the Role of Seed Lexicons in Learning Bilingual Word Embeddings
  • 85
Cross-Lingual Syntactically Informed Distributed Word Representations
  • 16
Improving Cross-Lingual Word Embeddings by Meeting in the Middle
  • 31
