2005
DOI: 10.1007/11552253_15
|View full text |Cite
|
Sign up to set email alerts
|

Removing Statistical Biases in Unsupervised Sequence Learning

Abstract: Abstract. Unsupervised sequence learning is important to many applications. A learner is presented with unlabeled sequential data, and must discover sequential patterns that characterize the data. Popular approaches to such learning include statistical analysis and frequency based methods. We empirically compare these approaches and find that both approaches suffer from biases toward shorter sequences, and from inability to group together multiple instances of the same pattern. We provide methods to address th… Show more

Help me understand this report

Search citation statements

Order By: Relevance

Paper Sections

Select...
1
1
1
1

Citation Types

0
4
0

Year Published

2007
2007
2007
2007

Publication Types

Select...
1
1

Relationship

1
1

Authors

Journals

citations
Cited by 2 publications
(4 citation statements)
references
References 8 publications
0
4
0
Order By: Relevance
“…A subset of preliminary results from this work appears in [9]. The authors gratefully acknowledge useful discussions with Adele Howe, Paul Cohen, and Ronen Feldman.…”
Section: Acknowledgmentsmentioning
confidence: 93%
“…A subset of preliminary results from this work appears in [9]. The authors gratefully acknowledge useful discussions with Adele Howe, Paul Cohen, and Ronen Feldman.…”
Section: Acknowledgmentsmentioning
confidence: 93%
“…In Horman and Kaminka's work [13] a learner to discover sequential patterns is presented. Also, to overcome the length bias obstacle, they normalize candidate pattern ranks based on their length.…”
Section: Discussionmentioning
confidence: 99%
“…In Horman and Kaminka's work [13] a learner is presented with unlabelled sequential data, and must discover sequential patterns that characterize the data. Also, two popular approaches to such learning are evaluated: frequency-based methods [14] and statistical dependence methods [4].…”
Section: Related Work On Sequence Classificationmentioning
confidence: 99%
See 1 more Smart Citation