# Sequence Similarity

@article{Elzinga2003SequenceS, title={Sequence Similarity}, author={Cees H. Elzinga}, journal={Sociological Methods \& Research}, year={2003}, volume={32}, pages={29 - 3} }

This article reviews objections to optimal-matching (OM) algorithms in sequence analysis and reformulates the concept of sequence similarity in terms of a binary precedence relation. This precedence relation is then used to develop a new quantification of sequence similarity. The new measure is used to reanalyze the life history data that were previously discussed by Dijkstra and Taris (1995). The reanalysis demonstrates the new measure to be superior to the OM algorithm and the alternatives… Expand

#### 102 Citations

Distance, Similarity and Sequence Comparison

- Computer Science
- 2014

This chapter discusses units of distance, admissible transformations and normalization as a method that allows for interpreting the size of distances and similarities and deals with some quite common misunderstandings pertaining to these concepts. Expand

Which Dissimilarity Is to Be Used When Extracting Typologies in Sequence Analysis? A Comparative Study

- Computer Science, Mathematics
- IWANN
- 2013

This paper proposes to use an optimal convex combination of different dissimilarities, which is automatically determined by the clustering procedure and is defined with respect to the within-class variance. Expand

A comparative review of sequence dissimilarity measures

- Geography
- 2014

This is a comparative study of the multiple ways of measuring dissimilarities between state sequences. For sequences describing life courses, such as family life trajectories or professional careers,… Expand

Analyzing Sequence Data

- Computer Science
- 2014

Optimal matching (OM), an invaluable yet underutilized tool in the analysis of sequence data, is discussed and an illustration of its use in the examination of careers of deans at U.S. business schools is provided. Expand

Optimal Matching Analysis and Life-Course Data: The Importance of Duration

- Mathematics
- 2010

The optimal matching (OM) algorithm is widely used for sequence analysis in sociology. It has a natural interpretation for discrete-time sequences but is also widely used for life-history data, which… Expand

New Developments in Sequence Analysis

- Computer Science
- 2010

The technical situation improved with both increasing processor speed and wider availability of software implementations, such as the various implementations of sequence analysis in the Stata package, which enabled more researchers from different disciplines to compare sequences of large numbers of individuals, finding out similarities, quantifying certain characteristics, or grouping certain characteristics. Expand

Quantifying sequential subsumption

- Computer Science
- Theor. Comput. Sci.
- 2019

This paper studies how to quantify subsumption for sequential patterns, gives an axiomatic characterisation of subsumption, and presents one general approach to quantification in terms of set intersection operation over concept extension. Expand

OM Matters: The Interaction Effects between Indel and Substitution Costs

- Computer Science
- 2009

The interaction effect between indel and substitution costs in Optimal Matching Analysis (henceforth OMA), by means of a simulation based on the eight sequences obtained as element permutation of a binary string of length 3, will show that varying the substitution and indel costs produces inconsistent results. Expand

Spell Sequences, State Proximities, and Distance Metrics

- Computer Science
- 2015

This work investigates the sensitivity, relative to OM, of several variants of this metric to variations in order, timing, and duration of states, and shows that the behavior of the metric is as intended. Expand

Three Narratives of Sequence Analysis

- Sociology
- 2014

How do we relate the distance between two sequences, as given by an algorithm such as optimal matching, to sociologically meaningful notions of similarity and dissimilarity? This has been… Expand

#### References

SHOWING 1-10 OF 23 REFERENCES

Measuring Resemblance in Sequence Data: An Optimal Matching Analysis of Musicians' Careers

- Sociology
- American Journal of Sociology
- 1990

This article introduces a method that measures resemblance between sequences using a simple metric based on the insertions, deletions, and substitutions required to transform one sequence into… Expand

A Comment on “Measuring the Agreement between Sequences”

- Computer Science
- 1995

The author discusses the general concept and nature of alignment algorithms for sequence data, and talks about the character and utility of the Dijkstra/Taris algorithm, a particular implementation of the alignment approach to sequence analysis. Expand

Nonoptimal Alignment

- Mathematics
- 2001

An algorithm to measure agreement between sequences as proposed by Dijkstra and Taris is discussed. It is concluded that the “optimal alignment” algorithm does not necessarily produce the optimal… Expand

Measuring the Agreement between Sequences

- Mathematics
- 1995

The present article proposes a new method to assess distances between sequences of states, belonging to, for instance, event histories. It is based on the number of moves needed to turn one sequence… Expand

Sequence Analysis and Optimal Matching Methods in Sociology

- Mathematics
- 2000

The authors review all known studies applying optimal matching or alignment (OM) techniques to social science sequence data. Issues of data, coding, temporality, cost setting/algorithm design, and… Expand

On the complexity of the Extended String-to-String Correction Problem

- Mathematics, Computer Science
- STOC
- 1975

The CELLAR algorithm is presented, and proof that ESSCP, with WI < WC = WD = @@@@, 0 < WS < @ @@@, suitably encoded, is NP-complete is proved. Expand

How to Measure the Agreement between Sequences

- Mathematics
- 2001

Some problems of optimal alignment procedures to measure the agreement between sequences are discussed. Hidden Markov models may be a new approach that is especially suited for grouping sequences… Expand

Optimal Matching Methods for Historical Sequences

- Sociology
- 1986

common script is standard historical and sociological fare. In the passage from which this quote is drawn, Rude describes a script proceeding from general grievances to triggering events and on to a… Expand

Some Comments on “Sequence Analysis and Optimal Matching Methods in Sociology: Review and Prospect”

- Sociology
- 2000

Apres un rappel methodologique de l'analyse sequentielle, l'A. expose les forces et les faiblesses de cette technique. Il emet un jugement critique sur les avis de Abbott et Tsay a propos de… Expand

Reply to Levine and Wu

- Mathematics
- 2000

En reponse aux critiques emises par Levine dans le present numero a propos de la methode de l'assortiment optimal en analyse sequentielle, l'A. plaide en faveur d'une technique encore jeune mais… Expand