2019
DOI: 10.31235/osf.io/v7mj8
|View full text |Cite
Preprint
|
Sign up to set email alerts
|

An Integrated Heuristic for Validation in Sequence Analysis

Abstract: Sequence Analysis is a collection of tools to describe life courses represented as sequences that are increasingly applied in different fields, particularly in demography, sociology, and political sciences. Identifying typologies through cluster analysis, thus disregarding individual sequences’ peculiarities, is the aim of most applications. However, a substantive interpretation of such typology can be questionable when clusters include sequences deviating from the others. We propose an integrated approach to … Show more

Help me understand this report

Search citation statements

Order By: Relevance

Paper Sections

Select...
2
1
1

Citation Types

0
4
0

Year Published

2021
2021
2023
2023

Publication Types

Select...
2
1

Relationship

0
3

Authors

Journals

citations
Cited by 3 publications
(4 citation statements)
references
References 21 publications
0
4
0
Order By: Relevance
“…Note that the ASW does not consider individual sequences that are outliers to all clusters but only those that are equally similar to several of the identified clusters, and therefore might overstate the true cluster structure (Struffolino and Piccarreta 2019).…”
Section: Cluster Identification In Different Data Scenariosmentioning
confidence: 99%
See 1 more Smart Citation
“…Note that the ASW does not consider individual sequences that are outliers to all clusters but only those that are equally similar to several of the identified clusters, and therefore might overstate the true cluster structure (Struffolino and Piccarreta 2019).…”
Section: Cluster Identification In Different Data Scenariosmentioning
confidence: 99%
“…Typically, researchers explore the robustness of sequence typologies by applying different cost specifications when calculating dissimilarities between sequences (Aisenbrey and Fasang 2017), or evaluating cluster solutions with different numbers of clusters (Piccarreta and Struffolino 2019). Recently, studies started to exclude poorly classified sequences from clusters based on low silhouette values when clusters are used as dependent or independent variables in further regression-based analyses (Jalovaara and Fasang 2020).…”
Section: Introductionmentioning
confidence: 99%
“…Using this strategy, I identify four clusters for women and three clusters for men as optimal. A battery of partitioning quality indicators across different clustering solutions are presented in Figure A1 (Studer, 2013;Piccarreta and Struffolino, 2019), including the average Silhouette width (ASW), the point biserial correlation (PBC), and the Hubert's C (HC). Taken together, I argue that women's four-cluster and men's five-cluster solutions using the HAM algorithm in a PAM+Ward procedure have high validity representing the major career typology in Taiwan.…”
Section: Analytical Strategy Variables and Measurementsmentioning
confidence: 99%
“…Using this strategy, I identify four clusters for women and three clusters for men as optimal. A battery of partitioning quality indicators across different clustering solutions are presented in Figure A1 (Piccarreta & Struffolino, 2019;Studer, 2013), including the Average Silhouette Width (ASW), the Point Biserial Correlation (PBC), and the Hubert's C (HC). Taken together, I argue that women's four-cluster and men's five-cluster solutions using the HAM algorithm in a PAM+Ward procedure have high validity representing the major career typology in Taiwan.…”
Section: Analytical Strategy Variables and Measurementsmentioning
confidence: 99%