2004
DOI: 10.1073/pnas.0307971100
|View full text |Cite
|
Sign up to set email alerts
|

Whole-genome shotgun assembly and comparison of human genome assemblies

Abstract: We report a whole-genome shotgun assembly (called WGSA) of the human genome generated at Celera in 2001. The Celeragenerated shotgun data set consisted of 27 million sequencing reads organized in pairs by virtue of end-sequencing 2-kbp, 10-kbp, and 50-kbp inserts from shotgun clone libraries.

Help me understand this report

Search citation statements

Order By: Relevance

Paper Sections

Select...
1
1
1
1

Citation Types

2
113
0
2

Year Published

2004
2004
2017
2017

Publication Types

Select...
5
3

Relationship

0
8

Authors

Journals

citations
Cited by 162 publications
(117 citation statements)
references
References 32 publications
2
113
0
2
Order By: Relevance
“…Next, evidence for accurate long-range ordering and orienting of sequence contigs might naturally emerge from the broader whole-genome scaffolds, with any additional confirmation or refinement being performed, as needed. Of relevance, whole-genome shotgun sequencing projects typically include the generation of insert-end sequence reads from large-insert clones (e.g., BACs, fosmids, and large-insert plasmids) (Mouse Genome Sequencing Consortium 2002; Stein et al 2003;Istrail et al 2004); thus, following incorporation of such reads into a genome-wide sequence assembly, the corresponding clones could be used as substrates for PCR-based confirmation of contig order and orientation. Finally, the types of verification steps performed for BAC-derived sequences (e.g., comparisons to reference sequences and experimentally-derived restriction maps) could be scaled for use with whole-genome sequence assemblies, either globally or in a more targeted fashion.…”
Section: Discussionmentioning
confidence: 99%
“…Next, evidence for accurate long-range ordering and orienting of sequence contigs might naturally emerge from the broader whole-genome scaffolds, with any additional confirmation or refinement being performed, as needed. Of relevance, whole-genome shotgun sequencing projects typically include the generation of insert-end sequence reads from large-insert clones (e.g., BACs, fosmids, and large-insert plasmids) (Mouse Genome Sequencing Consortium 2002; Stein et al 2003;Istrail et al 2004); thus, following incorporation of such reads into a genome-wide sequence assembly, the corresponding clones could be used as substrates for PCR-based confirmation of contig order and orientation. Finally, the types of verification steps performed for BAC-derived sequences (e.g., comparisons to reference sequences and experimentally-derived restriction maps) could be scaled for use with whole-genome sequence assemblies, either globally or in a more targeted fashion.…”
Section: Discussionmentioning
confidence: 99%
“…To generate more SNPs and obtain validation information, shotgun sequencing of DNA from whole-genome libraries and flow-sorted chromosomes was performed 31 , augmented by analysis of sequence traces produced by Applied Biosystems 97,98 , and information on 1.6 million SNPs genotyped by Perlegen Sciences 77 , including 425,000 not in dbSNP when released (Supplementary Table 7). The HapMap Project contributed about 6 million new SNPs to dbSNP.…”
Section: Methodsmentioning
confidence: 99%
“…The Celera sequences represent unconnected scaffolds grouped by chromosome. We also retrieved the Celera whole genome shotgun assembly (WGSA) sequences from GenBank (accessions AADD01000001-AADD01211493, Istrail et al, 2004). All sequences in fasta format were downloaded onto our local bioinformatics server for analyses.…”
Section: Sources For Genomic Sequencesmentioning
confidence: 99%
“…2 shows two examples of newly identified polymorphic Alu insertions. From the 236 polymorphic insertions identified in CHGS, we identified 190 from the Celera whole genome shotgun assembly (WGSA) sequences that were obtained exclusively using the Celera proprietary whole genome shotgun sequences (Istrail et al, 2004). The remaining 46 CHGS polymorphic elements were likely from the public human BAC sequences that may have not been used for the assembly of PHGS.…”
Section: Identification Of Alu Insertion Polymorphismsmentioning
confidence: 99%
See 1 more Smart Citation