2003
DOI: 10.1038/ng1285
|View full text |Cite
|
Sign up to set email alerts
|

Complete sequencing and characterization of 21,243 full-length human cDNAs

Abstract: As a base for human transcriptome and functional genomics, we created the "full-length long Japan" (FLJ) collection of sequenced human cDNAs. We determined the entire sequence of 21,243 selected clones and found that 14,490 cDNAs (10,897 clusters) were unique to the FLJ collection. About half of them (5,416) seemed to be protein-coding. Of those, 1,999 clusters had not been predicted by computational methods. The distribution of GC content of nonpredicted cDNAs had a peak at ∼58% compared with a peak at ∼42%fo… Show more

Help me understand this report

Search citation statements

Order By: Relevance

Paper Sections

Select...
1
1
1
1

Citation Types

8
575
0
10

Year Published

2004
2004
2022
2022

Publication Types

Select...
7
2
1

Relationship

1
9

Authors

Journals

citations
Cited by 814 publications
(602 citation statements)
references
References 29 publications
8
575
0
10
Order By: Relevance
“…28 The predicted truncated transcript was based on evidence derived from a study characterizing all human cDNAs in a brain cDNA library. 29 Depending on the annotation considered, the variant, g.22879A > G, is in exon 6 of the short transcript, or in intron 5 of the long transcript. We sought to validate the existence of the predicted truncated TPH 2 transcript isoform experimentally by RT-PCR.…”
Section: Resultsmentioning
confidence: 99%
“…28 The predicted truncated transcript was based on evidence derived from a study characterizing all human cDNAs in a brain cDNA library. 29 Depending on the annotation considered, the variant, g.22879A > G, is in exon 6 of the short transcript, or in intron 5 of the long transcript. We sought to validate the existence of the predicted truncated TPH 2 transcript isoform experimentally by RT-PCR.…”
Section: Resultsmentioning
confidence: 99%
“…B) CGI 1060 (bold font) overlaps with exon 9 (underlined) and promoter GXP_168256 (dashed line) of a non-coding transcript AK024830. The curved arrow indicates the transcription start site (TSS), 'T', for transcript AK024830 (1687 bp), first reported by Ota et al [40]. CGI 1060 harbours seventeen CpGs (1)(2)(3)(4)(5)(6)(7)(8)(9)(10)(11)(12)(13)(14)(15)(16)(17) and seven SNPs indicated with IUB redundancy codes (R: A/G, Y: Methylation levels were determined with bisulphite pyrosequencing and significance of differences between populations, with Mann-Whitney U. Boxplots show location and dispersion of Venda control (n = 28, white) and Venda TB (n = 32, grey) methylation levels.…”
Section: Discussionmentioning
confidence: 99%
“…15). We searched the completely sequenced full-length cDNAs that corresponded to these TSCs and found seven overlapping cDNAs from the FLJ (Ota et al 2004) and MGC (Gerhard et al 2004) cDNA collections in DLD-1 and TIG-3 cells (Supplementary Table 6a). All of the corresponding full-length cDNAs lacked clear open reading frames.…”
Section: Hif-1a Binding Sites In Intergenic Regionsmentioning
confidence: 99%