1999
DOI: 10.1101/gr.9.11.1135
|View full text |Cite
|
Sign up to set email alerts
|

d2_cluster: A Validated Method for Clustering EST and Full-Length cDNA Sequences

Abstract: Several efforts are under way to condense single-read expressed sequence tags (ESTs) and full-length transcript data on a large scale by means of clustering or assembly. One goal of these projects is the construction of gene indices where transcripts are partitioned into index classes (or clusters) such that they are put into the same index class if and only if they represent the same gene. Accurate gene indexing facilitates gene expression studies and inexpensive and early partial gene sequence discovery thro… Show more

Help me understand this report

Search citation statements

Order By: Relevance

Paper Sections

Select...
1
1
1

Citation Types

0
101
0
3

Year Published

2001
2001
2008
2008

Publication Types

Select...
6
3

Relationship

0
9

Authors

Journals

citations
Cited by 183 publications
(104 citation statements)
references
References 33 publications
0
101
0
3
Order By: Relevance
“…High-quality ESTs were first grouped into clusters by using the single-linkage clustering method (26). Two ESTs with Ͼ90% identity over Ͼ100-bp length were grouped into the same cluster.…”
Section: Methodsmentioning
confidence: 99%
“…High-quality ESTs were first grouped into clusters by using the single-linkage clustering method (26). Two ESTs with Ͼ90% identity over Ͼ100-bp length were grouped into the same cluster.…”
Section: Methodsmentioning
confidence: 99%
“…All 5Ј EST reads were treated with software PHRED (20,21) to remove vector sequences and low-quality regions, and then assembled into consensus sequences with software STACKPACK (version 2.1 patch 1) (22,23). The consensus sequences were used as ESTs to search against GenBank with the BLASTX program (24).…”
Section: Est Analysismentioning
confidence: 99%
“…Here, we focus on computational methods, such as newly developed computer programs, because our experimental methods have been published previously (Carninci et al 1996(Carninci et al , 1997Sasaki et al 1998b;Seki et al 1998;Carninci and Hayashizaki 1999;Mizuno et al 1999). There have been many reports on the library assessments Salamov et al 1998) and clustering techniques (Boguski and Schuler 1995;Sutton et al 1995;Schuler et al 1996;Burke et al 1999;Miller et al 1999). Why did we develop new methods for old problems?…”
mentioning
confidence: 99%
“…Many clustering programs have been developed during the last few years. On a specialized computer, these programs can calculate the clustering of several million tag sequences in a few days (Boguski and Schuler 1995;Sutton et al 1995;Schuler et al 1996;Burke et al 1999;Miller et al 1999).…”
mentioning
confidence: 99%