Improving Vector Space Word Representations Using Multilingual Correlation

  title={Improving Vector Space Word Representations Using Multilingual Correlation},
  author={Manaal Faruqui and Chris Dyer},
The distributional hypothesis of Harris (1954), according to which the meaning of words is evidenced by the contexts they occur in, has motivated several effective techniques for obtaining vector space semantic representations of words using unannotated text corpora. This paper argues that lexico-semantic content should additionally be invariant across languages and proposes a simple technique based on canonical correlation analysis (CCA) for incorporating multilingual evidence into vectors… CONTINUE READING
Highly Influential
This paper has highly influenced 28 other papers. REVIEW HIGHLY INFLUENTIAL CITATIONS
Highly Cited
This paper has 363 citations. REVIEW CITATIONS


Publications citing this paper.
Showing 1-10 of 257 extracted citations

Improving Vector Space Word Representations Via Kernel Canonical Correlation Analysis

ACM Trans. Asian & Low-Resource Lang. Inf. Process. • 2018
View 7 Excerpts
Highly Influenced

Incorporating Word Embedding into Cross-Lingual Topic Modeling

2018 IEEE International Congress on Big Data (BigData Congress) • 2018
View 8 Excerpts
Highly Influenced

364 Citations

Citations per Year
Semantic Scholar estimates that this publication has 364 citations based on the available data.

See our FAQ for additional information.


Publications referenced by this paper.
Showing 1-10 of 43 references

Contextual correlates of semantic similarity

George A. Miller, Walter G. Charles.
Language and Cognitive Processes, 6(1):1–28. • 1991
View 4 Excerpts
Highly Influenced

Similar Papers

Loading similar papers…