DNA sequencing by synthesis (SBS) on a solid surface during polymerase reaction offers a paradigm to decipher DNA sequences. We report here the construction of such a DNA sequencing system using molecular engineering approaches. In this approach, four nucleotides (A, C, G, T) are modified as reversible terminators by attaching a cleavable fluorophore to the base and capping the 3-OH group with a small chemically reversible moiety so that they are still recognized by DNA polymerase as substrates. We found that an allyl moiety can be used successfully as a linker to tether a fluorophore to 3-O-allyl-modified nucleotides, forming chemically cleavable fluorescent nucleotide reversible terminators, 3-O-allyldNTPs-allyl-fluorophore, for application in SBS. The fluorophore and the 3-O-allyl group on a DNA extension product, which is generated by incorporating 3-O-allyl-dNTPs-allyl-fluorophore in a polymerase reaction, are removed simultaneously in 30 s by Pdcatalyzed deallylation in aqueous buffer solution. This one-step dual-deallylation reaction thus allows the reinitiation of the polymerase reaction and increases the SBS efficiency. DNA templates consisting of homopolymer regions were accurately sequenced by using this class of fluorescent nucleotide analogues on a DNA chip and a four-color fluorescent scanner. DNA chip DNA sequencing is driving genomics research and discovery. The completion of the Human Genome Project has set the stage for screening genetic mutations to identify disease genes on a genome-wide scale (1). Accurate high-throughput DNA sequencing methods are needed to explore the complete human genome sequence for applications in clinical medicine and health care. To overcome the limitations of the current electrophoresis-based sequencing technology (2-5), a variety of new DNA-sequencing methods have been investigated. Such approaches include sequencing by hybridization (6), mass spectrometry-based sequencing (7-9), sequence-specific detection of single-stranded DNA using engineered nanopores (10), and sequencing by ligation (11). More recently, DNA sequencing by synthesis (SBS) approaches such as pyrosequencing (12), sequencing of single DNA molecules (13), and polymerase colonies (14) have been widely explored.The concept of DNA SBS was revealed in 1988 with an attempt to sequence DNA by detecting the pyrophosphate group that is generated when a nucleotide is incorporated in a DNA polymerase reaction (15). Pyrosequencing, which was developed based on this concept and an enzymatic cascade, has been explored for genome sequencing (16). However, there are inherent difficulties in this method for determining the number of incorporated nucleotides in homopolymeric regions of the template. Additionally, each of the four nucleotides needs to be added and detected separately, which increases the overall detection time. The accumulation of undegraded nucleotides and other components could also lower the accuracy of the method when sequencing a long DNA template. It is thus desirable to have a simple method t...
DNA sequencing by synthesis (SBS) on a solid surface during polymerase reaction can decipher many sequences in parallel. We report here a DNA sequencing method that is a hybrid between the Sanger dideoxynucleotide terminating reaction and SBS. In this approach, four nucleotides, modified as reversible terminators by capping the 3-OH with a small reversible moiety so that they are still recognized by DNA polymerase as substrates, are combined with four cleavable fluorescent dideoxynucleotides to perform SBS. The ratio of the two sets of nucleotides is adjusted as the extension cycles proceed. Sequences are determined by the unique fluorescence emission of each fluorophore on the DNA products terminated by ddNTPs. On removing the 3-OH capping group from the DNA products generated by incorporating the 3-O-modified dNTPs and the fluorophore from the DNA products terminated with the ddNTPs, the polymerase reaction reinitiates to continue the sequence determination. By using an azidomethyl group as a chemically reversible capping moiety in the 3-O-modified dNTPs, and an azido-based cleavable linker to attach the fluorophores to the ddNTPs, we synthesized four 3-O-azidomethyl-dNTPs and four ddNTP-azidolinker-fluorophores for the hybrid SBS. After sequence determination by fluorescence imaging, the 3-O-azidomethyl group and the fluorophore attached to the DNA extension product via the azidolinker are efficiently removed by using Tris(2-carboxyethyl)phosphine in aqueous solution that is compatible with DNA. Various DNA templates, including those with homopolymer regions, were accurately sequenced with a read length of >30 bases by using this hybrid SBS method on a chip and a four-color fluorescence scanner.sequencing by synthesis ͉ DNA chip T he completion of the Human Genome Project (1) was a monumental achievement in biological science. The engine behind this project was the Sanger sequencing method (2), which is still the gold standard in genome research. The prolonged success of the Sanger sequencing method is because of its efficiency and fidelity in producing dideoxy-terminated DNA products that can be separated electrophoretically and detected by fluorescence (3-5). However, a challenge in the use of electrophoresis for DNA separation is the difficulty in achieving high throughput and the complexity involved in the automation, although some level of increased parallelization may be achieved by using miniaturization (6).To overcome the limitations of the Sanger sequencing technology, a variety of new methods have been investigated. Such approaches include sequencing by hybridization (7), mass spectrometry sequencing (8, 9), sequencing by nanopores (10), and sequencing by ligation (11). More recently, DNA sequencing by synthesis (SBS) approaches such as pyrosequencing (12), sequencing of single DNA molecules (13,14), and polymerase colonies (15) have been widely explored. Previously, we reported the development of a general strategy to rationally design cleavable fluorescent nucleotide reversible terminators (NRTs) ...
; R6G, 6-carboxyrhodamine-6G). Each nucleotide analogue consists of a different fluorophore attached to the 5 position of the pyrimidines and the 7 position of the purines through a photocleavable 2-nitrobenzyl linker. After verifying that these nucleotides could be successfully incorporated into a growing DNA strand in a solutionphase polymerase reaction and the fluorophore could be cleaved using laser irradiation (Ϸ355 nm) in 10 sec, we then performed an SBS reaction on a chip that contains a self-priming DNA template covalently immobilized by using 1,3-dipolar azide-alkyne cycloaddition. The DNA template was produced by PCR, using an azidolabeled primer, and the self-priming moiety was attached to the immobilized DNA template by enzymatic ligation. Each cycle of SBS consists of the incorporation of the photocleavable fluorescent nucleotide into the DNA, detection of the fluorescent signal, and photocleavage of the fluorophore. The entire process was repeated to identify 12 continuous bases in the DNA template. These results demonstrate that photocleavable fluorescent nucleotide analogues can be incorporated accurately into a growing DNA strand during a polymerase reaction in solution and on a chip. Moreover, all four fluorophores can be detected and then efficiently cleaved using near-UV irradiation, thereby allowing continuous identification of the DNA template sequence. Optimization of the steps involved in this SBS approach will further increase the read-length. 2-nitrobenzyl linkerNA sequencing is a fundamental tool for biological science. The completion of the Human Genome Project has set the stage for screening genetic mutations to identify disease genes on a genome-wide scale (1). Accurate high-throughput DNA sequencing methods are needed to explore the complete human genome sequence for applications in clinical medicine and health care. Recent studies have indicated that an important route for identifying functional elements in the human genome involves sequencing the genomes of many species representing a wide sampling of the evolutionary tree (2). To overcome the limitations of the current electrophoresis-based sequencing technology (3-5), a variety of new DNA sequencing methods have been investigated. Such approaches include sequencing by hybridization (6), MS-based sequencing (7-9), and sequence-specific detection of single-stranded DNA by using engineered nanopores (10). More recently, DNA sequencing by synthesis (SBS) approaches such as pyrosequencing (11), sequencing of single DNA molecules (12), and polymerase colonies (13) have been widely explored.The concept of DNA SBS was revealed in 1988 (14). This approach involves detection of the identity of each nucleotide immediately after its incorporation into a growing strand of DNA in a polymerase reaction. Thus far, no complete success has been reported in using such a system to sequence DNA unambiguously. We proposed an SBS approach using photocleavable fluorescent nucleotide analogues on a surface in 2000 (15). In this approach, modified nucle...
In the past 50 years, Cannabis sativa (C. sativa) has gone from a substance essentially prohibited worldwide to one that is gaining acceptance both culturally and legally in many countries for medicinal and recreational use. As additional jurisdictions legalize Cannabis products and the variety and complexity of these products surpass the classical dried plant material, appropriate methods for measuring the biologically active constituents is paramount to ensure safety and regulatory compliance. While there are numerous active compounds in C. sativa the primary cannabinoids of regulatory and safety concern are (-)-Δ⁹-tetrahydrocannabinol (THC), cannabidiol (CBD), and their respective acidic forms THCA-A and CBDA. Using the US Food and Drug Administration (FDA) bioanalytical method validation guidelines we developed a sensitive, selective, and accurate method for the simultaneous analysis CBD, CBDA, THC, and THCA-A in oils and THC & CBD in more complex matrices. This HPLC-MS/MS method was simple and reliable using standard sample dilution and homogenization, an isocratic chromatographic separation, and a triple quadrupole mass spectrometer. The lower limit of quantification (LLOQ) for analytes was 0.195 ng/mL over a 0.195–50.0 ng/mL range of quantification with a coefficient of correlation of >0.99. Average intra-day and inter-day accuracies were 94.2–112.7% and 97.2–110.9%, respectively. This method was used to quantify CBD, CBDA, THC, and THCA-A in 40 commercial hemp products representing a variety of matrices including oils, plant materials, and creams/cosmetics. All products tested met the federal regulatory restrictions on THC content in Canada (<10 μg/g) except two, with concentrations of 337 and 10.01 μg/g. With respect to CBD, the majority of analyzed products contained low CBD levels and a CBD: CBDA ratio of <1.0. In contrast, one product contained 8,410 μg/g CBD and a CBD: CBDA ratio of >1,000 (an oil-based product). Overall, the method proved amenable to the analysis of various commercial products including oils, creams, and plant material and may be diagnostically indicative of adulteration with non-hemp C. sativa, specialized hemp cultivars, or unique manufacturing methods.
When using stochastic gradient descent (SGD) to solve large-scale machine learning problems, a common practice of data processing is to shuffle the training data, partition the data across multiple threads/machines if needed, and then perform several epochs of training on the re-shuffled (either locally or globally) data. The above procedure makes the instances used to compute the gradients no longer independently sampled from the training data set, which contradicts with the basic assumptions of conventional convergence analysis of SGD. Then does the distributed SGD method have desirable convergence properties in this practical situation? In this paper, we give answers to this question. First, we give a mathematical formulation for the practical data processing procedure in distributed machine learning, which we call (data partition with) global/local shuffling. We observe that global shuffling is equivalent to without-replacement sampling if the shuffling operations are independent. We prove that SGD with global shuffling has convergence guarantee in both convex and non-convex cases. An interesting finding is that, the non-convex tasks like deep learning are more suitable to apply shuffling comparing to the convex tasks. Second, we conduct the convergence analysis for SGD with local shuffling. The convergence rate for local shuffling is slower than that for global shuffling, since it will lose some information if there's no communication between partitioned data. Finally, we consider the situation when the permutation after shuffling is not uniformly distributed (We call it insufficient shuffling), and discuss the condition under which this insufficiency will not influence the convergence rate. Our theoretical results provide important insights to large-scale machine learning, especially in the selection of data processing methods in order to achieve faster convergence and good speedup. Our theoretical findings are verified by extensive experiments on logistic regression and deep neural networks.
scite is a Brooklyn-based organization that helps researchers better discover and understand research articles through Smart Citations–citations that display the context of the citation and describe whether the article provides supporting or contrasting evidence. scite is used by students and researchers from around the world and is funded in part by the National Science Foundation and the National Institute on Drug Abuse of the National Institutes of Health.
customersupport@researchsolutions.com
10624 S. Eastern Ave., Ste. A-614
Henderson, NV 89052, USA
This site is protected by reCAPTCHA and the Google Privacy Policy and Terms of Service apply.
Copyright © 2024 scite LLC. All rights reserved.
Made with 💙 for researchers
Part of the Research Solutions Family.