• Corpus ID: 7187188

Measuring, Predicting and Visualizing Short-Term Change in Word Representation and Usage in VKontakte Social Network

@inproceedings{Stewart2017MeasuringPA,
  title={Measuring, Predicting and Visualizing Short-Term Change in Word Representation and Usage in VKontakte Social Network},
  author={Ian Stewart and Dustin L. Arendt and Eric Bell and Svitlana Volkova},
  booktitle={ICWSM},
  year={2017}
}
Language in social media is extremely dynamic: new words emerge, trend and disappear, while the meaning of existing words can fluctuate over time. Such dynamics are especially notable during a period of crisis. This work addresses several important tasks of measuring, visualizing and predicting short term text representation shift, i.e. the change in a word's contextual semantics, and contrasting such shift with surface level word dynamics, or concept drift, observed in social media streams… 
Knowledge-enhanced temporal word embedding for diachronic semantic change estimation
TLDR
A knowledge-enhanced temporal word embedding model that utilizes ‘word-centric dependency relations’ for capturing context words irrespective of their n-gram position and ‘syntactic patterns for lexical relations” for determining the type of semantic change is built.
ESTEEM: A Novel Framework for Qualitatively Evaluating and Visualizing Spatiotemporal Embeddings in Social Media
TLDR
This paper used state of theart techniques to learn embeddings and developed a visualization to represent dynamically changing relations between words in social media over time and other dimensions, which is the first interactive visualization of streaming text representations learned from social media texts that also allows users to contrast differences across multiple dimensions of the data.
Measure and Evaluation of Semantic Divergence across Two Languages
TLDR
This paper proposes a set of scenarios to characterize semantic divergence across two languages, along with a setup to differentiate them in a bilingual corpus, and concludes that BERT embeddings coupled with a clustering step lead to the best performance on synthetic corpora.
Incremental Word Vectors for Time-Evolving Sentiment Lexicon Induction
TLDR
This paper proposes a methodology for automatically inducing continuously updated sentiment lexicons from Twitter streams by training incremental word sentiment classifiers from time-evolving distributional word vectors and experiments show that the approach allows for successfully tracking of the sentiment of words over time even when drastic change is induced.
Empirical Study of Diachronic Word Embeddings for Scarce Data
TLDR
This paper compares three models to learn diachronic word embeddings on scarce data: incremental updating of a Skip-Gram from Kim et al. (2014), dynamic filtering from Bamler and Mandt (2017), and dynamic Bernoulli embedDings from Rudolph and Blei (2018).
Survey of Computational Approaches to Lexical Semantic Change
TLDR
This article focuses on diachronic conceptual change as an extension of semantic change, and a survey of recent computational techniques to tackle lexical semantic change currently under review.
Capturing Stance Dynamics in Social Media: Open Challenges and Research Directions
TLDR
A critical review in emerging research considering dynamics, exploring different semantic and pragmatic factors that impact linguistic data in general, and stance particularly, is performed and current directions in capturing stance dynamics in social media are discussed.
Room to Glo: A Systematic Comparison of Semantic Change Detection Approaches with Word Embeddings
TLDR
A new evaluation framework for semantic change detection is proposed and it is found that using the whole time series is preferable over only comparing between the first and last time points; independently trained and aligned embeddings perform better than continuously trainedembeddings for long time periods; and that the reference point for comparison matters.
Scalable and Interpretable Semantic Change Detection
TLDR
This work proposes a novel scalable method for word usage-change detection that offers large gains in processing time and significant memory savings while offering the same interpretability and better performance than unscalable methods.
Semantic shift in social networks
TLDR
This work investigates the relationship between community structure and semantic change in 45 communities from the social media website Reddit, and uses distributional methods to quantify lexical semantic change and induce a social network on communities, based on interactions between members.
...
1
2
...

References

SHOWING 1-10 OF 57 REFERENCES
Statistically Significant Detection of Linguistic Change
TLDR
This meta-analysis approach constructs property time series of word usage, and then uses statistically sound change point detection algorithms to identify significant linguistic shifts.
Diffusion of Lexical Change in Social Media
TLDR
Using a latent vector autoregressive model to aggregate across thousands of words, high-level patterns in diffusion of linguistic change over the United States are identified and support for prior arguments that focus on geographical proximity and population size is offered.
An automatic approach to identify word sense changes in text media across timescales
TLDR
An unsupervised and automated method to identify noun sense changes based on rigorous analysis of time-varying text data available in the form of millions of digitized books and millions of tweets posted per day is proposed.
Diachronic Word Embeddings Reveal Statistical Laws of Semantic Change
TLDR
A robust methodology for quantifying semantic change is developed by evaluating word embeddings against known historical changes and it is revealed that words that are more polysemous have higher rates of semantic change.
Ad Hoc Monitoring of Vocabulary Shifts over Time
TLDR
The introduction of the task of ad hoc monitoring of vocabulary shifts over time, the description of an algorithm for tracking shifting vocabularies over time given a small set of seed words, and a systematic evaluation of results over a substantial period of time are introduced.
Cultural Shift or Linguistic Drift? Comparing Two Computational Measures of Semantic Change
TLDR
Two different distributional measures can be used to detect two different types of semantic change, allowing researchers to determine whether changes are more cultural or linguistic in nature, a distinction that is essential for work in the digital humanities and historical linguistics.
Contrasting Public Opinion Dynamics and Emotional Response During Crisis
TLDR
Some of the findings contradict stereotypical misconceptions imposed by media, and are the first to demonstrate contrastive stance variations over time across geo-locations using storyline visualization technique.
Software Framework for Topic Modelling with Large Corpora
TLDR
This work describes a Natural Language Processing software framework which is based on the idea of document streaming, i.e. processing corpora document after document, in a memory independent fashion, and implements several popular algorithms for topical inference, including Latent Semantic Analysis and Latent Dirichlet Allocation in a way that makes them completely independent of the training corpus size.
Adaptive Method for Following Dynamic Topics on Twitter
TLDR
This paper proposes an automatic approach that adapts to emerging aspects of a tracked broad topic over time that achieves over 100% increase in recall relative to the baseline Boolean approach, while maintaining an acceptable precision of 83%.
Concept Drift Awareness in Twitter Streams
TLDR
This paper presents a learning strategy to learn with drift in the occurrence of concepts in Twitter, and proposes three different models: a time-window model, an ensemble-based model and an incremental model.
...
1
2
3
4
5
...