Don't Settle for Average, Go for the Max: Fuzzy Sets and Max-Pooled Word Vectors
@article{Zhelezniak2019DontSF, title={Don't Settle for Average, Go for the Max: Fuzzy Sets and Max-Pooled Word Vectors}, author={V. Zhelezniak and A. Savkov and April Shen and Francesco Moramarco and Jack Flann and N. Hammerla}, journal={ArXiv}, year={2019}, volume={abs/1904.13264} }
Recent literature suggests that averaged word vectors followed by simple post-processing outperform many deep learning methods on semantic textual similarity tasks. Furthermore, when averaged word vectors are trained supervised on large corpora of paraphrases, they achieve state-of-the-art results on standard STS benchmarks. Inspired by these insights, we push the limits of word embeddings even further. We propose a novel fuzzy bag-of-words (FBoW) representation for text that contains all the… CONTINUE READING
Supplemental Code
Github Repo
Via Papers with Code
Code for the paper: Don't Settle for Average, Go for the Max: Fuzzy Sets and Max-Pooled Word Vectors, ICLR 2019.
Figures, Tables, and Topics from this paper
17 Citations
Correlation Coefficients and Semantic Textual Similarity
- Computer Science, Mathematics
- NAACL-HLT
- 2019
- 14
- PDF
Structural-Aware Sentence Similarity with Recursive Optimal Transport
- Computer Science, Mathematics
- ArXiv
- 2020
- 1
- PDF
Word Rotator's Distance: Decomposing Vectors Gives Better Representations
- Computer Science
- ArXiv
- 2020
- Highly Influenced
SEAGLE: A Platform for Comparative Evaluation of Semantic Encoders for Information Retrieval
- Computer Science
- EMNLP/IJCNLP
- 2019
- 1
- Highly Influenced
- PDF
Can Embeddings Adequately Represent Medical Terminology? New Large-Scale Medical Term Similarity Datasets Have the Answer!
- Computer Science
- AAAI
- 2020
- 3
- Highly Influenced
- PDF
References
SHOWING 1-10 OF 71 REFERENCES
All-but-the-Top: Simple and Effective Postprocessing for Word Representations
- Computer Science, Mathematics
- ICLR
- 2018
- 100
- PDF
A Simple but Tough-to-Beat Baseline for Sentence Embeddings
- Computer Science
- ICLR
- 2017
- 742
- Highly Influential
Enriching Word Vectors with Subword Information
- Computer Science
- Transactions of the Association for Computational Linguistics
- 2017
- 4,453
- Highly Influential
- PDF
Baseline Needs More Love: On Simple Word-Embedding-Based Models and Associated Pooling Mechanisms
- Computer Science
- ACL
- 2018
- 169
- PDF
Representation learning for very short texts using weighted word embedding aggregation
- Computer Science
- Pattern Recognit. Lett.
- 2016
- 135
- PDF
From Word Embeddings To Document Distances
- Mathematics, Computer Science
- ICML
- 2015
- 1,154
- Highly Influential
- PDF
Unsupervised Random Walk Sentence Embeddings: A Strong but Simple Baseline
- Computer Science
- Rep4NLP@ACL
- 2018
- 35
- PDF
Linguistic Regularities in Continuous Space Word Representations
- Computer Science
- HLT-NAACL
- 2013
- 2,661
- PDF