2002
DOI: 10.1101/gr.75202

A Computer-Based Method of Selecting Clones for a Full-Length cDNA Project: Simultaneous Collection of Negligibly Redundant and Variant cDNAs

Abstract: We describe a computer-based method that selects representative clones for full-length sequencing in a full-length cDNA project. Our method classifies end sequences using two kinds of criteria: grouping and clustering. Grouping places together variant cDNAs, family genes, and cDNAs with sequencing errors. Clustering separates those cDNA clones into distinct clusters. The full-length sequences of the clones selected by grouping are determined preferentially, and then the sequences selected by clustering are de…

Cited by 28 publications (14 citation statements)
References 30 publications
“…Lucy was used for vector trimming (Chou and Holmes, 2001); trimmed data were stored in FASTA format using a Perl script. Clones containing cDNAs that were >90% similar over 80 bases or more were classed into the same cluster using the TGICL program (Osato et al, 2002; Pertea et al, 2003). The end sequences in each cluster were aligned using the FASTA homology search software to Uniprot (Apweiler et al, 2004).…”
Section: Data Processing and Assembly (mentioning)
confidence: 99%
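
The threshold quoted in this statement (>90% similar over 80 bases or more) can be illustrated with a small sketch. It does not reproduce TGICL or the method of Osato et al. (2002): the ungapped sliding-window comparison, the single-linkage merging, and the names `similar` and `cluster` are all assumptions made for illustration.

```python
from itertools import combinations

MIN_WINDOW = 80        # "80 bases or more" from the quoted statement
MIN_IDENTITY_PCT = 90  # ">90% similar" from the quoted statement


def similar(a, b, window=MIN_WINDOW, pct=MIN_IDENTITY_PCT):
    """True if some ungapped window of `window` bases is more than `pct`% identical.

    Hypothetical helper: real tools use gapped alignment, not this toy scan.
    """
    for offset in range(-(len(b) - window), len(a) - window + 1):
        # Overlap of a and b when b[0] is placed at a[offset].
        start_a, start_b = max(offset, 0), max(-offset, 0)
        n = min(len(a) - start_a, len(b) - start_b)
        if n < window:
            continue
        flags = [a[start_a + i] == b[start_b + i] for i in range(n)]
        matches = sum(flags[:window])
        if matches * 100 > pct * window:
            return True
        # Slide the window one base at a time along the overlap.
        for i in range(window, n):
            matches += flags[i] - flags[i - window]
            if matches * 100 > pct * window:
                return True
    return False


def cluster(seqs):
    """Single-linkage clusters of clone IDs; `seqs` maps clone ID -> end sequence."""
    parent = {name: name for name in seqs}

    def find(x):
        # Union-find lookup with path halving.
        while parent[x] != x:
            parent[x] = parent[parent[x]]
            x = parent[x]
        return x

    for x, y in combinations(seqs, 2):
        if similar(seqs[x], seqs[y]):
            parent[find(x)] = find(y)

    groups = {}
    for name in seqs:
        groups.setdefault(find(name), []).append(name)
    return sorted(sorted(g) for g in groups.values())
```

Called on a dictionary of clone IDs and end sequences, `cluster` returns sorted lists of IDs. Because merging is single-linkage, two clones can share a cluster through a third clone without passing the threshold against each other directly.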
“…The methods for preferential cloning of cDNA that corresponds to full-length mRNAs with 5′-end-proximal cap structures (Kristiansen and Pandey 2002) have been developed and used in large-scale analyses of transcripts from human (Suzuki et al 2002; Ota et al 2004), mouse (Konno et al 2001; The RIKEN Genome Exploration Research Group Phase II Team and the FANTOM Consortium 2001; Osato et al 2002), fruit fly (Stapleton et al 2002), Arabidopsis thaliana (Seki et al 2002), and rice (The Rice Full-Length cDNA Consortium 2003; Osato et al 2003). Genomic comparisons of Brassica oleracea and Arabidopsis thaliana reveal gene loss, fragmentation, and dispersal after polyploidy (Town et al 2006).…”
Section: Introduction (mentioning)
confidence: 99%
“…The sequence analysis of rice EST clone (J023038D13), which was derived from rice cDNA library (Osato et al, 2002) from developing seeds prepared in pBluescript that was done by using various bioinformatics tools. The DNA sequencing and sequence analysis were described previously (Sikdar and Kim, 2010).…”
Section: DNA Sequence Analysis (mentioning)
confidence: 99%