Controlled Experiments for Word Embeddings
@article{Wilson2015ControlledEF, title={Controlled Experiments for Word Embeddings}, author={B. Wilson and Adriaan M. J. Schakel}, journal={ArXiv}, year={2015}, volume={abs/1510.02675} }
An experimental approach to studying the properties of word embeddings is proposed. Controlled experiments, achieved through modifications of the training corpus, permit the demonstration of direct relations between word properties and word vector direction and length. The approach is demonstrated using the word2vec CBOW model with experiments that independently vary word frequency and word co-occurrence noise. The experiments reveal that word vector length depends more or less linearly on both… CONTINUE READING
Figures, Tables, and Topics from this paper
15 Citations
Intrinsic Evaluations of Word Embeddings: What Can We Do Better?
- Computer Science
- RepEval@ACL
- 2016
- 62
- PDF
NTUA-SLP at SemEval-2018 Task 3: Tracking Ironic Tweets using Ensembles of Word and Character Level Attentive RNNs
- Computer Science
- SemEval@NAACL-HLT
- 2018
- 25
- PDF
PIC a Different Word: A Simple Model for Lexical Substitution in Context
- Computer Science
- HLT-NAACL
- 2016
- 15
- PDF
SAO2Vec: Development of an algorithm for embedding the subject–action–object (SAO) structure using Doc2Vec
- Computer Science, Medicine
- PloS one
- 2020
- 2
Extended Association Rules in Semantic Vector Spaces for Sentiment Classification
- Computer Science
- WorldCIST
- 2018
- 1
Identifying lexical relationships and entailments with distributional semantics
- Computer Science
- 2017
- PDF
Spherical Regression under Mismatch Corruption with Application to Automated Knowledge Translation
- Computer Science, Mathematics
- 2018
- 12
- PDF
References
SHOWING 1-10 OF 16 REFERENCES
Random Walks on Context Spaces: Towards an Explanation of the Mysteries of Semantic Word Embeddings
- Computer Science
- ArXiv
- 2015
- 50
- PDF
Measuring Word Significance using Distributed Representations of Words
- Computer Science
- ArXiv
- 2015
- 49
- PDF
Distributed Representations of Words and Phrases and their Compositionality
- Computer Science, Mathematics
- NIPS
- 2013
- 20,831
- PDF
Don't count, predict! A systematic comparison of context-counting vs. context-predicting semantic vectors
- Computer Science
- ACL
- 2014
- 1,179
- PDF