Three Narratives of Sequence Analysis

  title={Three Narratives of Sequence Analysis},
  author={Brendan Halpin},
How do we relate the distance between two sequences, as given by an algorithm such as optimal matching, to sociologically meaningful notions of similarity and dissimilarity? This has been controversial in sequence analysis. Attention must be paid to how the algorithm operates, and to what sort of distances it generates in empirical practice. We can think of algorithms as giving distinct “narratives” of similarity, derived either formally or heuristically from their operation. This paper… 
What matters in differences between life trajectories: a comparative review of sequence dissimilarity measures
The study shows that there is no universally optimal distance index, and that the choice of a measure depends on which aspect the authors want to focus on, and introduces novel ways of measuring dissimilarities that overcome some flaws in existing measures.
Order or chaos? Understanding career mobility using sequence analysis and information-theoretic methods
We examine the careers of a nationally representative US cohort of young adults using sequence analysis and information-theoretic techniques to describe these careers’ structure and how this
Data quality challenges with missing values and mixed types in joint sequence analysis
This paper employs longitudinal sequence data representations, a similarity measure designed for categorical and longitudinal data, together with state-of-the art clustering methodologies reliant on hierarchical algorithms to investigate the impact of missing values in categorical time series sequences on common data analysis tasks.
SADI: Sequence Analysis Tools for Stata
The SADI package provides tools for sequence analysis, which focuses on the similarity and dissimilarity between categorical time series such as life-course trajectories. SADI‘s main components are
Evaluating the Effects of Missing Values and Mixed Data Types on Social Sequence Clustering Using t-SNE Visualization
It is found that the ability to overcome missing data problems is more difficult in the nominal domain than in the binary domain, and the usage of t-distributed stochastic neighborhood embedding is demonstrated to visually guide mitigation of such biases.
Mobility closure in the upper class: assessing time and forms of capital.
  • M. Toft
  • Sociology, Medicine
    The British journal of sociology
  • 2019
The analysis reveals important divisions in the trajectories of Norwegians who reach the upper class; not only are there differences in their upbringing in terms of the availability of different amounts and types of capital but such divisions also seem linked to their own class careers later in adulthood.
Sequential Synchronization Analysis
This chapter posits that as group members interact, their activities can become socially entrained, constituting the group as an entity beyond the individual members.
Social Mobility and Education in Britain
In 1927 Pitirim Sorokin, a former Russian revolutionary exiled by Lenin, who had become Professor of Sociology at the University of Minnesota, published the first major academic work on social
Career pathways for temporary workers: exploring heterogeneous mobility dynamics with sequence analysis.
A typology of trajectories is derived and age, gender, and type of temporary work stand out as important factors shaping subsequent mobility patterns and their relative precariousness in relation to employment stability and wage and earnings levels and growth.
Social Sequence Analysis: Methods and Applications
The promise of social sequence analysis is illustrated with a comparison of whole-sequence comparison methods for sequence analysis and its applications in the social sciences.


Optimal Matching Analysis and Life-Course Data: The Importance of Duration
The optimal matching (OM) algorithm is widely used for sequence analysis in sociology. It has a natural interpretation for discrete-time sequences but is also widely used for life-history data, which
Time Warps, String Edits, and Macromolecules
This paper presents an anthology of algorithms and concepts for sequence comparison, an overview of sequence comparison Joseph B. Kruskal and David Sankoff, and some recent results on the complexity of common-subsequence problems.
Setting Cost in Optimal Matching to Uncover Contemporaneous Socio-Temporal Patterns
This article addresses the question of the effects of cost setting on the kind of temporal patterns optimal matching (OM) can uncover when applied to social science data. It is argued that the
Measuring Resemblance in Sequence Data: An Optimal Matching Analysis of Musicians' Careers
This article introduces a method that measures resemblance between sequences using a simple metric based on the insertions, deletions, and substitutions required to transform one sequence into
Measuring the Agreement between Sequences
The present article proposes a new method to assess distances between sequences of states, belonging to, for instance, event histories. It is based on the number of moves needed to turn one sequence
Time Warp Edit Distance with Stiffness Adjustment for Time Series Matching
  • P. Marteau
  • Computer Science, Medicine
    IEEE Transactions on Pattern Analysis and Machine Intelligence
  • 2009
It is shown that the similarity provided by TWED is a potentially useful metric in time series retrieval applications since it could benefit from the triangular inequality property to speed up the retrieval process while tuning the parameters of the elastic measure.
Is Optimal Matching Suboptimal?
The author identifies a flaw in OM ‘‘indel costs’’ and proposes a solution to this flaw, and conducts an empirical test of traditional OM, the alternative localized OM, and sequence comparison.
A general method applicable to the search for similarities in the amino acid sequence of two proteins.
A computer adaptable method for finding similarities in the amino acid sequences of two proteins has been developed and it is possible to determine whether significant homology exists between the proteins to trace their possible evolutionary development.
Spell Sequences, State Proximities, and Distance Metrics
This work investigates the sensitivity, relative to OM, of several variants of this metric to variations in order, timing, and duration of states, and shows that the behavior of the metric is as intended.
Analyzing and Visualizing State Sequences in R with TraMineR
This article describes the many capabilities offered by the TraMineR toolbox for categorical sequence data. It focuses more specifically on the analysis and rendering of state sequences. Addressed