2006
DOI: 10.1177/0165551506059229

An efficient approach for sequence matching in large DNA databases

Abstract: In molecular biology, DNA sequence matching is one of the most crucial operations. Since DNA databases contain a huge volume of sequences, fast indexes are essential for efficient processing of DNA sequence matching. In this paper, we first point out the problems of the suffix tree, an index structure widely used for DNA sequence matching, with respect to storage overhead, search performance, and difficulty of seamless integration with a DBMS. Then, we propose a new index structure that resolves such problems. The…

Citations: cited by 4 publications (9 citation statements)
References: 28 publications (62 reference statements)
“…According to the problem analysis outlined in Subsection 3.2, we know that N(Q1,S,1)=M(Q1,S,0) + M(Q1,S,1). Since the lengths of Q1 and λ are 3 and 4, respectively, there are (λ-1)*m=3*3=9 patterns whose distances to Q1 are 1: Q2=c[0,2]g [1,3] [1,3]t. We know that when d is 0, N(P,S,0) can be solved using PAIG [13].…”
Section: Experimental Results Of Theoretical Analysis
Citation type: mentioning
confidence: 99%
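
The count in the statement above, (λ-1)*m = 3*3 = 9, can be checked with a minimal sketch. It assumes that "distance" means Hamming distance (substitutions only) and it ignores the gap constraints such as [0,2] altogether. The pattern `cgt` and the function name `hamming_neighbors` are hypothetical, chosen only for illustration; they are not taken from the cited paper.

```python
DNA_ALPHABET = "acgt"  # alphabet size lambda = 4


def hamming_neighbors(pattern, alphabet=DNA_ALPHABET):
    """Return every string that differs from `pattern` in exactly one position."""
    neighbors = []
    for i, original in enumerate(pattern):
        for symbol in alphabet:
            if symbol != original:
                # Substitute one character and keep the rest unchanged.
                neighbors.append(pattern[:i] + symbol + pattern[i + 1:])
    return neighbors


q1 = "cgt"  # hypothetical length-3 pattern (m = 3), an assumption for this sketch
neighbors = hamming_neighbors(q1)
print(len(neighbors))  # (lambda - 1) * m = 3 * 3 = 9
```

Each of the m positions can be replaced by any of the λ-1 other symbols, which gives the product in the quoted formula.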
“…These experiments thus verify that an instance in the present study can be transformed into many instances of [13], thereby also verifying the correctness of SONG. [3,9]t[2,5]a[4,9]g[1,8]t[2,9]a; P3: g[1,9]t[1,9]a[1,9]g[1,9]t[1,9]a[1,9]g[1,9]t[1,9]a[1,9]g[1,9]t; P4: g[1,5]t[0,6]a[2,7]g[3,9]t[2,5]a[4,9]g[1,8]t[2,9]a[1,9]g[1,9] …”
Section: Experimental Results Of Theoretical Analysis
Citation type: mentioning
confidence: 99%