A comparative review of sequence dissimilarity measures

  title={A comparative review of sequence dissimilarity measures},
  author={Matthias Studer and Gilbert Ritschard},
This is a comparative study of the multiple ways of measuring dissimilarities between state sequences. For sequences describing life courses, such as family life trajectories or professional careers, the important differences between the sequences essentially concern the sequencing (the order in which successive states appear), the timing, and the duration of the spells in the successive states. Even if some distance measures underperform, it has been shown that there is no universally optimal… 
What matters in differences between life trajectories: a comparative review of sequence dissimilarity measures
The study shows that there is no universally optimal distance index, and that the choice of a measure depends on which aspect the authors want to focus on, and introduces novel ways of measuring dissimilarities that overcome some flaws in existing measures.
A comparative sequence analysis of career paths among knowledge workers in a multinational bank
This study examined two metrics for measuring the distance between sequences (Euclid and OMSpell) and creating distance matrices combined with two types of clustering methods (AGNES and PAM) to
Sensitivity of sequence methods in the study of neighborhood change in the United States
It is found that four sequence analysis methods do not provide converging neighborhood sequence typologies, and their behavior varies across MSAs, thus prohibiting meaningful comparisons of similar studies, and it is suggested researchers consider one method (“OMecenter”) as the building block towards designing a meaningful sequence analysis method for neighborhood change research.
La Sequence Analysis come strumento per lo studio delle carriere e della dispersione universitaria
This paper focuses on the potentiality of the sequence analysis as a tool to study and better understand the university students’ career paths. Compared to the cross-sectional studies, the sequence
Visualizing, clustering, and characterizing activity-trip sequences via weighted sequence alignment and functional data analysis
By identifying population subgroups with distinct daily activity-travel patterns and explaining how these patterns vary over one day and depend on user profiles, the weighted sequence alignment approach provides an intuitive and flexible method for extracting and characterizing individuals’ activity- travel behaviors for use in transportation planning.
A Similarity Measure Based on Care Trajectories as Sequences of Sets
Comparing care trajectories helps improve health services. Medicoadministrative databases are useful for automatically reconstructing the patients’ history of care. Care trajectories can be compared
A Similarity Measure Based on Care Trajectories as Sequences of Sets
This work proposes a generalization of the longest common subsequence approach to a more expressive representation of care trajectories as sequences of sets, and proposes to take events’ semantic similarity into account for comparing medical episodes.
Using sequence analysis to visualize and validate model transitions
The conclusion is that SA is a useful tool also in a microsimulation context, in visualizing and validating simulated model transitions when statistically more sophisticated mixture modeling are not applicable.
Coreference Chains Categorization by Sequence Clustering
The diversity of coreference chains is usually tackled by means of global features (length, types and number of referring expressions, distance between them, etc.). In this paper, we propose a novel
Rejoinder: Positivism and Big-game Fishing :a reply to comments
Far better an approximate answer to the right question, which is often vague, than the exact answer to the wrong question, which can always be made precise. " (Tukey 1962) The publication of our


Measuring the Agreement between Sequences
The present article proposes a new method to assess distances between sequences of states, belonging to, for instance, event histories. It is based on the number of moves needed to turn one sequence
Measuring Resemblance in Sequence Data: An Optimal Matching Analysis of Musicians' Careers
This article introduces a method that measures resemblance between sequences using a simple metric based on the insertions, deletions, and substitutions required to transform one sequence into
Discrepancy Analysis of State Sequences
In this article, the authors define a methodological framework for analyzing the relationship between state sequences and covariates. Inspired by the principles of analysis of variance, this approach
Optimal Matching Analysis and Life-Course Data: The Importance of Duration
The optimal matching (OM) algorithm is widely used for sequence analysis in sociology. It has a natural interpretation for discrete-time sequences but is also widely used for life-history data, which
Spell Sequences, State Proximities, and Distance Metrics
This work investigates the sensitivity, relative to OM, of several variants of this metric to variations in order, timing, and duration of states, and shows that the behavior of the metric is as intended.
A Primer on Sequence Methods
This paper considers the technical problem of analyzing sequences of social events, including organizational life cycles, patterns of innovation development, and career tracks of individuals, and considers methods for unique event sequences, proposing the use of multidimensional scaling and illustrating it with an analysis of data on medical organizations.
Optimal Matching Methods for Historical Sequences
common script is standard historical and sociological fare. In the passage from which this quote is drawn, Rude describes a script proceeding from general grievances to triggering events and on to a
A Transition-Oriented Approach to Optimal Matching
This paper introduces a distinction between two sequence types—namely, common ancestors and unfolding processes and presents a new way of coding sequences as an extension to conventional OM analyses and demonstrates its usefulness in simulated and empirical examples.
Reliability of Sequence-Alignment Analysis of Social Processes: Monte Carlo Tests of Clustalg Software
The reliability of alignments in the classification of sequential data is examined and the ClustalG multiple alignment package is used to examine a set of synthetic sequences generated through the use of eight separate generation rules.
Harpoon or Bait? A Comparison of Various Metrics in Fishing for Sequence Patterns
The use of sequence analysis in the social sciences has significantly increased during the last decade or two. Sequence analysis explores and describes trajectories and “fishes for patterns” (Abbott,