Sequence Similarity

@article{Elzinga2003SequenceS,
  title={Sequence Similarity},
  author={Cees H. Elzinga},
  journal={Sociological Methods \& Research},
  year={2003},
  volume={32},
  pages={29 - 3}
}
  • C. Elzinga
  • Published 1 August 2003
  • Mathematics
  • Sociological Methods & Research
This article reviews objections to optimal-matching (OM) algorithms in sequence analysis and reformulates the concept of sequence similarity in terms of a binary precedence relation. This precedence relation is then used to develop a new quantification of sequence similarity. The new measure is used to reanalyze the life history data that were previously discussed by Dijkstra and Taris (1995). The reanalysis demonstrates the new measure to be superior to the OM algorithm and the alternatives… Expand

Tables from this paper

Distance, Similarity and Sequence Comparison
TLDR
This chapter discusses units of distance, admissible transformations and normalization as a method that allows for interpreting the size of distances and similarities and deals with some quite common misunderstandings pertaining to these concepts. Expand
Which Dissimilarity Is to Be Used When Extracting Typologies in Sequence Analysis? A Comparative Study
TLDR
This paper proposes to use an optimal convex combination of different dissimilarities, which is automatically determined by the clustering procedure and is defined with respect to the within-class variance. Expand
A comparative review of sequence dissimilarity measures
This is a comparative study of the multiple ways of measuring dissimilarities between state sequences. For sequences describing life courses, such as family life trajectories or professional careers,Expand
Analyzing Sequence Data
TLDR
Optimal matching (OM), an invaluable yet underutilized tool in the analysis of sequence data, is discussed and an illustration of its use in the examination of careers of deans at U.S. business schools is provided. Expand
Optimal Matching Analysis and Life-Course Data: The Importance of Duration
The optimal matching (OM) algorithm is widely used for sequence analysis in sociology. It has a natural interpretation for discrete-time sequences but is also widely used for life-history data, whichExpand
New Developments in Sequence Analysis
TLDR
The technical situation improved with both increasing processor speed and wider availability of software implementations, such as the various implementations of sequence analysis in the Stata package, which enabled more researchers from different disciplines to compare sequences of large numbers of individuals, finding out similarities, quantifying certain characteristics, or grouping certain characteristics. Expand
Quantifying sequential subsumption
TLDR
This paper studies how to quantify subsumption for sequential patterns, gives an axiomatic characterisation of subsumption, and presents one general approach to quantification in terms of set intersection operation over concept extension. Expand
OM Matters: The Interaction Effects between Indel and Substitution Costs
TLDR
The interaction effect between indel and substitution costs in Optimal Matching Analysis (henceforth OMA), by means of a simulation based on the eight sequences obtained as element permutation of a binary string of length 3, will show that varying the substitution and indel costs produces inconsistent results. Expand
Spell Sequences, State Proximities, and Distance Metrics
TLDR
This work investigates the sensitivity, relative to OM, of several variants of this metric to variations in order, timing, and duration of states, and shows that the behavior of the metric is as intended. Expand
Three Narratives of Sequence Analysis
How do we relate the distance between two sequences, as given by an algorithm such as optimal matching, to sociologically meaningful notions of similarity and dissimilarity? This has beenExpand
...
1
2
3
4
5
...

References

SHOWING 1-10 OF 23 REFERENCES
Measuring Resemblance in Sequence Data: An Optimal Matching Analysis of Musicians' Careers
This article introduces a method that measures resemblance between sequences using a simple metric based on the insertions, deletions, and substitutions required to transform one sequence intoExpand
A Comment on “Measuring the Agreement between Sequences”
TLDR
The author discusses the general concept and nature of alignment algorithms for sequence data, and talks about the character and utility of the Dijkstra/Taris algorithm, a particular implementation of the alignment approach to sequence analysis. Expand
Nonoptimal Alignment
An algorithm to measure agreement between sequences as proposed by Dijkstra and Taris is discussed. It is concluded that the “optimal alignment” algorithm does not necessarily produce the optimalExpand
Measuring the Agreement between Sequences
The present article proposes a new method to assess distances between sequences of states, belonging to, for instance, event histories. It is based on the number of moves needed to turn one sequenceExpand
Sequence Analysis and Optimal Matching Methods in Sociology
The authors review all known studies applying optimal matching or alignment (OM) techniques to social science sequence data. Issues of data, coding, temporality, cost setting/algorithm design, andExpand
On the complexity of the Extended String-to-String Correction Problem
  • R. Wagner
  • Mathematics, Computer Science
  • STOC
  • 1975
TLDR
The CELLAR algorithm is presented, and proof that ESSCP, with WI < WC = WD = @@@@, 0 < WS < @ @@@, suitably encoded, is NP-complete is proved. Expand
How to Measure the Agreement between Sequences
Some problems of optimal alignment procedures to measure the agreement between sequences are discussed. Hidden Markov models may be a new approach that is especially suited for grouping sequencesExpand
Optimal Matching Methods for Historical Sequences
common script is standard historical and sociological fare. In the passage from which this quote is drawn, Rude describes a script proceeding from general grievances to triggering events and on to aExpand
Some Comments on “Sequence Analysis and Optimal Matching Methods in Sociology: Review and Prospect”
Apres un rappel methodologique de l'analyse sequentielle, l'A. expose les forces et les faiblesses de cette technique. Il emet un jugement critique sur les avis de Abbott et Tsay a propos deExpand
Reply to Levine and Wu
En reponse aux critiques emises par Levine dans le present numero a propos de la methode de l'assortiment optimal en analyse sequentielle, l'A. plaide en faveur d'une technique encore jeune maisExpand
...
1
2
3
...