hgvs: A Python package for manipulating sequence variants using HGVS nomenclature: 2018 Update

Wang, Meng; Callenberg, Keith M.; Dalgleish, Raymond; Fedtsov, Alexandre; Fox, Naomi; Freeman, Peter; Jacobs, Kevin B.; Kaleta, Piotr; McMurry, Andrew; Prlić, Andreas; Rajaraman, Veena; Hart, Reece K.

doi:10.1002/humu.23615

Cited by 20 publications

(16 citation statements)

References 22 publications

Supporting

Mentioning

Contrasting

Order By: Relevance

“…Variants were preferentially selected for further analysis and validation if they met all of the following criteria: (i) minor allele frequency < 0.01 in the 1000 Genomes Project database (http://www.internationalgenome.org/), Exome Aggregation Consortium database (ExAC, http:// exac.broadinstitute.org/), and Genome Aggregation database (gnomAD, http://gnomad.broadinstitute.org/); (ii) occurrence in exon regions or canonical splicing sites that affected RNA splicing; (iii) potential functional effects of nonsynonymous single nucleotide variants were predicted to be damaging or deleterious using multiple lines of computational prediction; (iv) candidate gene variants related to ophthalmic hereditary disease, especially for inherited retinal disease; (v) other reported potential pathogenic variants that did not met the above criteria (e.g., high minor allele frequency variants, deep-intronic variants, and synonymous single nucleotide variants). Variant nomenclature complied with the recommendations of the Human Genome Variation Society (HGVS, http://www.hgvs.org/) [24]. Variant annotation complied with the guidelines of the American College of Medical Genetics (ACMG, https://www.acmg.net/) [25,26].…”

Section: In Silico Analysesmentioning

confidence: 99%

Whole exome sequencing of a family revealed a novel variant in the CHM gene, c.22delG p.(Glu8Serfs*4), which co-segregated with choroideremia

Dan

Lei

et al. 2020

Bioscience Reports

View full text Add to dashboard Cite

Choroideremia is a complex form of blindness-causing retinal degeneration. The aim of the present study was to investigate the pathogenic variant and molecular etiology associated with choroideremia in a Chinese family. All available family members underwent detailed ophthalmological examinations. Whole exome sequencing, bioinformatics analysis, Sanger sequencing, and co-segregation analysis of family members were used to validate sequencing data and confirm the presence of the disease-causing gene variant. The proband was diagnosed with choroideremia on the basis of clinical manifestations. Whole exome sequencing showed that the proband had a hemizygous variant in the CHM gene, c.22delG p. (Glu8Serfs*4), which was confirmed by Sanger sequencing and found to co-segregate with choroideremia. The variant was classified as likely pathogenic and has not previously been described. These results expand the spectrum of variants in the CHM gene, thus potentially enriching the understanding of the molecular basis of choroideremia. Moreover, they may provide insight for future choroideremia diagnosis and gene therapy.

show abstract

Section: In Silico Analysesmentioning

confidence: 99%

Whole exome sequencing of a family revealed a novel variant in the CHM gene, c.22delG p.(Glu8Serfs*4), which co-segregated with choroideremia

Dan

Lei

et al. 2020

Bioscience Reports

View full text Add to dashboard Cite

show abstract

“…The nomenclature used for the variants was in compliance with the recommendations of the Human Genomic Variation Society ([HGVS], http://www.hgvs.org) (Wang et al, 2018). Sequence alignments were performed using the Torrent Suite (Li & Durbin, 2010).…”

Section: In Silico Analysesmentioning

confidence: 99%

Application of targeted exome and whole‐exome sequencing for Chinese families with Stargardt disease

Dan

Huang

Xing

et al. 2019

Annals of Human Genetics

View full text Add to dashboard Cite

Objective: The aim of this study was to investigate pathogenic variants and molecular etiologies of Stargardt disease (STGD) in a cohort of Chinese families. Materials and Methods:A cohort of 12 unrelated STGD families diagnosed on the basis of clinical manifestations underwent analysis by targeted exome or whole-exome sequencing. Bioinformatics analysis, Sanger sequencing, and cosegregation analysis of available family members were used to validate sequencing data and confirm the presence of disease-causing genes.Results: Using targeted exome and whole-exome sequencing, we found that eight families had disease-causing variants in the ABCA4 gene, one family had only one heterozygous variant in the ABCA4 gene, and the remaining three families have not been identified with any disease-causing variants for STGD. We identified 15 variants in the ABCA4 gene; of these, five variants have not been previously described for STGD. Conclusion:The findings in this study expand the data regarding the frequency and spectrum of variants in the ABCA4 gene, thus potentially enriching our understanding of the molecular basis of STGD. Moreover, they constitute clues for future STGD diagnosis and therapy.

show abstract

“…Furthermore, dependencies on remote services create risks for privacy, reproducibility, and overall system availability. These were the problems for which we developed SeqRepo in 2016 as a component for the hgvs Python package (Wang et al, 2018). Using SeqRepo increases validation and variant projection throughput by nearly 50-fold relative to remote sequence access.…”

Section: Introductionmentioning

confidence: 99%

SeqRepo: A system for managing local collections biological sequences

Hart

Prlić

2020

Preprint

Self Cite

View full text Add to dashboard Cite

MotivationAccess to biological sequence data, such as genome, transcript, or protein sequence, is at the core of many bioinformatics analysis workflows. The National Center for Biotechnology Information (NCBI), Ensembl, and other sequence database maintainers provide methods to access sequences through network connections. For many users, the convenience and currency of remotely managed data are compelling, and the network latency is non-consequential. However, for high-throughput and clinical applications, local sequence collections are essential for performance, stability, privacy, and reproducibility.ResultsHere we describe SeqRepo, a novel system for building a local, high-performance, non-redundant collection of biological sequences. SeqRepo enables clients to use primary database identifiers and several digests to identify sequences and sequence alises. SeqRepo provides a native Python interface and a REST interface, which can run locally and enables access from other programming languages. SeqRepo also provides an alternative REST interface based on the GA4GH refget protocol.SeqRepo provides fast random access to sequence slices. We provide results that demonstrate that a local SeqRepo sequence collection yields significant performance benefits of up to 1300-fold over remote sequence collections. In our use case for a variant validation and normalization pipeline, SeqRepo improved throughput 50-fold relative to use with remote sequences. SeqRepo may be used with any species or sequence type. Regular snapshots of Human sequence collections are available.It is often convenient or necessary to use a computed digest as a sequence identifier. For example, a digest-based identifier may be used to refer to proprietary reference genomes or segments of a graph genome, for which conventional identifiers will not be available. Here we also introduce a convention for the application of the SHA-512 hashing algorithm with Base64 encoding to generate URL-safe identifiers. This convention, sha512t24u, combines a fast digest mechanism with a space-efficient representation that can be used for any object. Our report includes an analysis of timing and collision probabilities for sha512t24u. SeqRepo enables clients to use sha512t24u as identifiers, thereby seamlessly integrating public and private sequence sets.AvailabilitySeqRepo is released under the Apache License 2.0 and is available on github and PyPi. Docker images and database snapshots are also available. See https://github.com/biocommons/biocommons.seqrepo.

show abstract

hgvs: A Python package for manipulating sequence variants using HGVS nomenclature: 2018 Update

Cited by 20 publications

References 22 publications

Whole exome sequencing of a family revealed a novel variant in the CHM gene, c.22delG p.(Glu8Serfs*4), which co-segregated with choroideremia

Whole exome sequencing of a family revealed a novel variant in the CHM gene, c.22delG p.(Glu8Serfs*4), which co-segregated with choroideremia

Application of targeted exome and whole‐exome sequencing for Chinese families with Stargardt disease

SeqRepo: A system for managing local collections biological sequences

Contact Info

Product

Resources

About