Detecting and Assessing Contextual Change in Diachronic Text Documents using Context Volatility

@article{Kahmann2017DetectingAA,
  title={Detecting and Assessing Contextual Change in Diachronic Text Documents using Context Volatility},
  author={Christian Kahmann and Andreas Niekler and Gerhard Heyer},
  journal={ArXiv},
  year={2017},
  volume={abs/1711.05538}
}
Terms in diachronic text corpora may exhibit a high degree of semantic dynamics that is only partially captured by the common notion of semantic change. The new measure of context volatility that we propose models the degree by which terms change context in a text collection over time. The computation of context volatility for a word relies on the significance-values of its co-occurrent terms and the corresponding co-occurrence ranks in sequential time spans. We define a baseline and present an… 

Figures and Tables from this paper

Short-term Semantic Shifts and their Relation to Frequency Change
TLDR
The evolution of the usage of words that represent concepts in the context of the refugee crisis is traced by analyzing cosine similarities of word vectors over time as well as similarities based on the words’ nearest neighbourhood sets.
Survey of Computational Approaches to Lexical Semantic Change
TLDR
This article focuses on diachronic conceptual change as an extension of semantic change, and a survey of recent computational techniques to tackle lexical semantic change currently under review.
Measuring Context Change to Detect Statements Violating the Overton Window
TLDR
This paper presents a novel NLP approach to identify statements in a collection of newspaper articles that shift the borders of the Overton window at some period of time, and applies it on German newspaper texts detecting extreme statements about the refugee crisis in Germany.
TemporalTeller at SemEval-2020 Task 1: Unsupervised Lexical Semantic Change Detection with Temporal Referencing
TLDR
A unified framework for the common semantic change detection pipelines including preprocessing, learning word embeddings, calculating vector distances and determining threshold is developed and Gamma Quantile Threshold is proposed to distinguish between changed and stable words.
C L ] 1 5 N ov 2 01 8 Survey of Computational Approaches to Diachronic Conceptual Change Detection
Our languages are in constant flux driven by external factors such as cultural, societal and technological changes, as well as by only partially understood internal motivations. Words acquire new
DiaSense at SemEval-2020 Task 1: Modeling Sense Change via Pre-trained BERT Embeddings
TLDR
DiaSense, a system developed for Task 1 ‘Unsupervised Lexical Semantic Change Detection’ of SemEval 2020, uses contextualized word embeddings to model word sense changes.
ILCM - A Virtual Research Infrastructure for Large-Scale Qualitative Data
The iLCM project pursues the development of an integrated research environment for the analysis of structured and unstructured data in a "Software as a Service" architecture (SaaS). The research

References

SHOWING 1-10 OF 29 REFERENCES
Modeling the dynamics of domain specific terminology in diachronic corpora
TLDR
The notion of context volatility as a new measure for detecting semantic change and applies it to key term extraction in a political science case study to identify periods of time that are characterised by intense controversial debates or substantial semantic transformations.
A framework for analyzing semantic change of words across time
TLDR
An exploratory analysis aiming to investigate methods for studying and visualizing changes in word meaning over time, and proposes a framework for exploring semantic change at the lexical level, at the contrastive-pair level, and at the sentiment orientation level.
Ad Hoc Monitoring of Vocabulary Shifts over Time
TLDR
The introduction of the task of ad hoc monitoring of vocabulary shifts over time, the description of an algorithm for tracking shifting vocabularies over time given a small set of seed words, and a systematic evaluation of results over a substantial period of time are introduced.
Time Dynamic Topic Models
TLDR
A linguistically motivated interpretation of topic modeling is given, a state-of-the-art algorithm for extracting latent semantic sets of words from large text corpora is extended, and the rate of change of sets of semantically related words can be interpreted as issue-cycles.
Diachronic Word Embeddings Reveal Statistical Laws of Semantic Change
TLDR
A robust methodology for quantifying semantic change is developed by evaluating word embeddings against known historical changes and it is revealed that words that are more polysemous have higher rates of semantic change.
DiaCollo : On the trail of diachronic collocations
This paper presents DiaCollo, a software tool developed in the context of CLARIN for the efficient extraction, comparison, and interactive visualization of collocations from a diachronic text corpus.
Picturing short-period diachronic phenomena in specialised corpora: a textual terminology description of the dynamics of knowledge in space technologies
This article presents a first description and a proposal for the classification of the evolution phenomena involved in short-term diachrony in the field of space. It is based on the principles of
A Comparison of Co-occurrence and Similarity Measures as Simulations of Context
TLDR
This work presents a linear time complexity approximative algorithm for computing word similarity without any dimensionality reduction and introduces a large-scale evaluation based on two languages and two knowledge sources and discusses the underlying reasons for the relative performance of each measure.
That’s sick dude!: Automatic identification of word sense change across different timescales
TLDR
An unsupervised method to identify noun sense changes based on rigorous analysis of time-varying text data available in the form of millions of digitized books is proposed and can be applied for lexicography, as well as for applications like word sense disambiguation or semantic search.
Towards Tracking Semantic Change by Visual Analytics
TLDR
The aim of this study is to offer a new instrument for investigating the diachronic development of word senses in a way that allows for a better understanding of the nature of semantic change in general.
...
...