• Corpus ID: 4003029

Dynamic Natural Language Processing with Recurrence Quantification Analysis

@article{Dale2018DynamicNL,
  title={Dynamic Natural Language Processing with Recurrence Quantification Analysis},
  author={Rick Dale and Nicholas D. Duran and Moreno I. Coco},
  journal={ArXiv},
  year={2018},
  volume={abs/1803.07136}
}
Writing and reading are dynamic processes. As an author composes a text, a sequence of words is produced. This sequence is one that, the author hopes, causes a revisitation of certain thoughts and ideas in others. These processes of composition and revisitation by readers are ordered in time. This means that text itself can be investigated under the lens of dynamical systems. A common technique for analyzing the behavior of dynamical systems, known as recurrence quantification analysis (RQA… 

Figures and Tables from this paper

Unidimensional and Multidimensional Methods for Recurrence Quantification Analysis with crqa
TLDR
It is shown how such RQA methods can be deployed under a single computational framework in R using a substantially renewed version the authors' crqa 2.0 package, which includes implementations of several recent advances in recurrence-based analysis, among them applications to multivariate data, and improved entropy calculations for categorical data.
An Approach to Aligning Categorical and Continuous Time Series for Studying the Dynamics of Complex Human Behavior
TLDR
A case study suggesting this kind of dynamic analysis holds promise for capturing dynamic coordination across the body, brain and environment in complex performances of this kind is described, and the theoretical implications are discussed.
Beyond frequency counts: Novel conceptual recurrence analysis metrics to index semantic coordination in team communications
TLDR
It is concluded that CRA is sensitive to experimental manipulations in ways consistent with prior findings and that it presents a customizable framework for testing predictions about interpersonal communication patterns and other linguistic exchanges.

References

SHOWING 1-10 OF 45 REFERENCES
Recurrence Quantification Analysis of Processes and Products of Discourse: A Tutorial in R
TLDR
This article introduces Recurrence Quantification Analysis (RQA) as a tool to capture dynamic structure of naturalistic reading and writing performance and presents a step-by-step tutorial on how to run RQA using R.
Orthographic Structuring of Human Speech and Texts: Linguistic Application of Recurrence Quantification Analysis
TLDR
Using poems as a reference standard for judging speech complexity, the technique exhibits language independence, order dependence and freedom from pure statistical characteristics of studied sequences, as well as consistency with easily identifiable texts.
Recurrence Quantification Analysis: A Technique for the Dynamical Analysis of Student Writing
TLDR
The current study examined the degree to which the quality and characteristics of students’ essays could be modeled through dynamic natural language processing analyses, and suggested that dynamic techniques can be used to improve natural languageprocessing assessments of student essays.
Cross-recurrence quantification analysis of categorical and continuous time series: an R package
TLDR
This paper describes the R package crqa, and describes more formally the principles of cross-recurrence, and shows with the current package how to carry out analyses applying them, and compares computational efficiency, and results’ consistency, of crqa R package, with the benchmark MATLAB toolbox crptoolbox.
On the origin of long-range correlations in texts
TLDR
This paper explains how long-range correlations flow from highly structured linguistic levels down to the building blocks of a text (words, letters, etc..) and shows that correlations take form of a bursty sequence of events once the authors approach the semantically relevant topics of the text.
Beyond Word Frequency: Bursts, Lulls, and Scaling in the Temporal Distributions of Words
TLDR
Recurrence patterns of words are well described by a stretched exponential distribution of recurrence times, an empirical scaling that cannot be anticipated from Zipf's law and have implications for other overt manifestations of collective human dynamics.
Natural language processing in an intelligent writing strategy tutoring system
TLDR
The present study extends prior work by including a larger data sample and an expanded set of indices to assess new lexical, syntactic, cohesion, rhetorical, and reading ease indices and finds that the new indices increased accuracy but, more importantly, afford the means to provide more meaningful feedback in the context of a writing tutoring system.
Recurrence plots for the analysis of complex systems
Speech and language processing - an introduction to natural language processing, computational linguistics, and speech recognition
TLDR
This book takes an empirical approach to language processing, based on applying statistical and other machine-learning algorithms to large corpora, to demonstrate how the same algorithm can be used for speech recognition and word-sense disambiguation.
Distributed Representations of Words and Phrases and their Compositionality
TLDR
This paper presents a simple method for finding phrases in text, and shows that learning good vector representations for millions of phrases is possible and describes a simple alternative to the hierarchical softmax called negative sampling.
...
1
2
3
4
5
...