154703
20181203022100.0
doi
10.1177/0049124109342065
0049-1241
ARTICLE
How much does it cost? Optimization of costs in sequence analysis of social science data
SAGE Publications
2009
2009
Journal Articles
One major methodological problem in analysis of sequence data is the determination of costs from which distances between sequences are derived. Although this problem is currently not optimally dealt with in the social sciences, it has some similarity with problems that have been solved in bioinformatics for three decades. In this article, the authors propose an optimization of substitution and deletion/insertion costs based on computational methods. The authors provide an empirical way of determining costs for cases, frequent in the social sciences, in which theory does not clearly promote one cost scheme over another. Using three distinct data sets, the authors tested the distances and cluster solutions produced by the new cost scheme in comparison with solutions based on cost schemes associated with other research strategies. The proposed method performs well compared with other cost-setting strategies, while it alleviates the justification problem of cost schemes.
sequence analysis
optimal matching
trajectories
empirical cost optimization
Optimal Matching Analysis
Acid Substitution Matrices
Cluster-Analysis
Career Patterns
Alignment
Proteins
Work
Constraints
Sociology
Agreement
Gauthier, Jacques-Antoine
Widmer, Eric D.
244404
Bucher, Philipp
113607
Notredame, Cedric
38
1
197-231
Sociological Methods & Research
252244
GR-BUCHER
U11780
oai:infoscience.tind.io:154703
SV
article
182396
EPFL-ARTICLE-154703
EPFL
REVIEWED
PUBLISHED
ARTICLE