Setsuo Arikawa scite author profile

Abstract. We present a linear-time algorithm to compute the longest common prefix information in suffix arrays. As two applications of our algorithm, we show that our algorithm is crucial to the effective use of block-sorting compression, and we present a linear-time algorithm to simulate the bottom-up traversal of a suffix tree with a suffix array combined with the longest common prefix information.

show abstract

Collage system: a unifying framework for compressed pattern matching

Kida

Matsumoto

Shibata

et al. 2003

Theoretical Computer Science

View full text Add to dashboard Cite

Speeding Up Pattern Matching by Text Compression

Shibata

Kida

Fukamachi

et al. 2000

View full text Add to dashboard Cite

Optimized Substructure Discovery for Semi-structured Data

Abe

Kawasoe

Asai

et al. 2002

View full text Add to dashboard Cite

In this paper, we consider the problem of discovering interesting substructures from a large collection of semi-structured data in the framework of optimized pattern discovery. We model semi-structured data and patterns with labeled ordered trees, and present an efficient algorithm that discovers the best labeled ordered trees that optimize a given statistical measure, such as the information entropy and the classification accuracy, in a collection of semi-structured data. We give theoretical analyses of the computational complexity of the algorithm for patterns with bounded and unbounded size. Experiments show that the algorithm performs well and discovered interesting patterns on real datasets.

show abstract

Multiple pattern matching in LZW compressed text

Kida

Takeda

Shinohara

et al.

View full text Add to dashboard Cite

scite is a Brooklyn-based organization that helps researchers better discover and understand research articles through Smart Citations–citations that display the context of the citation and describe whether the article provides supporting or contrasting evidence. scite is used by students and researchers from around the world and is funded in part by the National Science Foundation and the National Institute on Drug Abuse of the National Institutes of Health.

Contact Info

customersupport@researchsolutions.com

10624 S. Eastern Ave., Ste. A-614

Henderson, NV 89052, USA

This site is protected by reCAPTCHA and the Google Privacy Policy and Terms of Service apply.

Blog Terms and Conditions API Terms Privacy Policy Contact Cookie Preferences Do Not Sell or Share My Personal Information

Made with 💙 for researchers

Part of the Research Solutions Family.

Setsuo Arikawa

Linear-Time Longest-Common-Prefix Computation in Suffix Arrays and Its Applications

Collage system: a unifying framework for compressed pattern matching

Speeding Up Pattern Matching by Text Compression

Optimized Substructure Discovery for Semi-structured Data

Multiple pattern matching in LZW compressed text

Contact Info

Product

Resources

About