2011
DOI: 10.1007/978-3-642-19032-2_7
|View full text |Cite
|
Sign up to set email alerts
|

Extracting and Rendering Representative Sequences

Abstract: Abstract. This paper is concerned with the summarization of a set of categorical sequences. More specifically, the problem studied is the determination of the smallest possible number of representative sequences that ensure a given coverage of the whole set, i.e. that have together a given percentage of sequences in their neighbourhood. The proposed heuristic for extracting the representative subset requires as main arguments a pairwise distance matrix, a representativeness criterion and a distance threshold u… Show more

Help me understand this report

Search citation statements

Order By: Relevance

Paper Sections

Select...
1
1
1
1

Citation Types

0
40
0
3

Year Published

2011
2011
2019
2019

Publication Types

Select...
6
3
1

Relationship

4
6

Authors

Journals

citations
Cited by 34 publications
(43 citation statements)
references
References 9 publications
0
40
0
3
Order By: Relevance
“…One example is the extraction of typical patterns from sequence databases, an important objective of sequence analysis. This task, which requires nontrivial heuristic procedures when using pairwise dissimilarities (Gabadinho, Ritschard, Studer, and Müller 2011b), can also be achieved with sequence prediction. Another important and promising application is the analysis of the influence of covariates on the patterns.…”
Section: Resultsmentioning
confidence: 99%
“…One example is the extraction of typical patterns from sequence databases, an important objective of sequence analysis. This task, which requires nontrivial heuristic procedures when using pairwise dissimilarities (Gabadinho, Ritschard, Studer, and Müller 2011b), can also be achieved with sequence prediction. Another important and promising application is the analysis of the influence of covariates on the patterns.…”
Section: Resultsmentioning
confidence: 99%
“…3 If there is less than four countries that means that other countries are presented in less than 15% of cluster affiliations, it here is a + sign it meant that there is more than four countries presented on more than 15% of cluster affiliations. 4 That means that at least 75% of a cluster sequences have a distance to representative sequences which is less than 10% of maximal theoretical distance between sequences within a dataset. Refer to [4] for more details about representative sequences.…”
Section: Clustersmentioning
confidence: 99%
“…Although this is not obvious for any kind of complex objects, displaying index-plots like those used in Figure 3 provides a good solution for state sequences. For a somewhat more synthetic view, we could also consider representative plots (Gabadinho, Ritschard, Studer, and Müller 2011b) that show the minimal set of sequences for each node that would be necessary to ensure a given coverage of the sequences at that node.…”
Section: Tree-structured Analysis Of Sequencesmentioning
confidence: 99%