ApproxMAP: Approximate Mining of Consensus Sequential Patterns

Kum, Hye‐Chung; Pei, Jian; Wang, Wei; Duncan, Dean

doi:10.1137/1.9781611972733.36

Cited by 73 publications

(40 citation statements)

References 26 publications

Supporting

Mentioning

Contrasting

Order By: Relevance

“…Much of the data in these databases is in the form of sequences and we often seek sequences that occur frequently or are "centrally located" or form a "consensus pattern." An example arises in social welfare data [245,246], where an algorithm for finding approximate sequential consensus patterns, ones that appear frequently in a database, is discussed. A similar problem arises in molecular biology, when we seek to choose sequences that occur frequently or are "centrally located" in a database of molecular sequences.…”

Section: Large Databases and Inferencementioning

confidence: 99%

Computer science and decision theory

Roberts

2008

Ann Oper Res

View full text Add to dashboard Cite

This paper reviews applications in computer science that decision theorists have addressed for years, discusses the requirements posed by these applications that place great strain on decision theory/social science methods, and explores applications in the social and decision sciences of newer decision-theoretic methods developed with computer science applications in mind. The paper deals with the relation between computer science and decision-theoretic methods of consensus, with the relation between computer science and game theory and decisions, and with "algorithmic decision theory."

show abstract

Section: Large Databases and Inferencementioning

confidence: 99%

Computer science and decision theory

Roberts

2008

Ann Oper Res

View full text Add to dashboard Cite

show abstract

“…This combination may generate large sequences. However, mining sequential patterns is inefficient with long sequences and often may not find exact matching of long patterns in the database (Kum et al 2003). Due to these shortcomings, an improved MDSPM technique is required to mine patterns without loosing important information carried by the dimensions.…”

Section: Introductionmentioning

confidence: 99%

Mapping frequent spatio-temporal wind profile patterns using multi-dimensional sequential pattern mining

Yusof

Zurita-Milla

2016

International Journal of Digital Earth

View full text Add to dashboard Cite

Holistic understanding of wind behaviour over space, time and height is essential for harvesting wind energy application. This study presents a novel approach for mapping frequent wind profile patterns using multidimensional sequential pattern mining (MDSPM). This study is illustrated with a time series of 24 years of European Centre for Medium-Range Weather Forecasts European Reanalysis-Interim gridded (0.125°× 0.125°) wind data for the Netherlands every 6 h and at six height levels. The wind data were first transformed into two spatio-temporal sequence databases (for speed and direction, respectively). Then, the Linear time Closed Itemset Miner Sequence algorithm was used to extract the multidimensional sequential patterns, which were then visualized using a 3D wind rose, a circular histogram and a geographical map. These patterns were further analysed to determine their wind shear coefficients and turbulence intensities as well as their spatial overlap with current areas with wind turbines. Our analysis identified four frequent wind profile patterns. One of them highly suitable to harvest wind energy at a height of 128 m and 68.97% of the geographical area covered by this pattern already contains wind turbines. This study shows that the proposed approach is capable of efficiently extracting meaningful patterns from complex spatio-temporal datasets.ARTICLE HISTORY

show abstract

“…It is one of the essential data mining tasks widely used in many applications, including customer purchase pattern analysis and biological data sequences [17][18][19][20][21][22], etc. Many research have been performed to efficient sequential pattern mining, such as [23][24][25], closed and maximal sequential pattern mining [26][27][28][29], constraint-based sequential pattern mining [30][31][32] approximate sequential pattern mining [33], sequential pattern mining in multiple data sources [34], sequential pattern mining in noisy data [35], incremental mining of sequential patterns [36], and time-interval weighted sequential pattern mining [37]. Two of the general sequential mining algorithms are SPADE [24] and PrefixSpan [23], which are more efficient than others in terms of processing time.…”

Section: Introductionmentioning

confidence: 99%