Cutting of primers from reads is an important step of processing targeted amplicon-based next generation sequencing data. Existing tools are adapted for cutting of one or several primer/adapter sequences from reads and removing all of their occurrences. Also most of the existing tools use kmers and may cut only part of primers or primers with studied sequence of gene. Because of this, use of such programs leads to incorrect trimming, reduction of coverage, and increase in the number of false-positive and/or false-negative results. We have developed a new tool named cutPrimers for accurate cutting of any number of primers from reads. Using sequencing reads that were obtained during study of BRCA1/2 genes, we compared it with cutadapt, AlienTrimmer, and BBDuk. All of them trimmed reads in such a way that coverage of at least two amplicons decreased to unacceptable level (<30 reads) comparing with reads trimmed with cutPrimers. At the same time, Trimmomatic and AlienTrimmer cut all occurrences of primer sequences, so the length of the remaining reads was less than prospective.
There are strong genetic components to cardiorespiratory fitness and its response to exercise training. It would be useful to understand the differences in the genomic profile of highly trained endurance athletes of world class caliber and sedentary controls. An international consortium (GAMES) was established in order to compare elite endurance athletes and ethnicity-matched controls in a case-control study design. Genome-wide association studies were undertaken on two cohorts of elite endurance athletes and controls (GENATHLETE and Japanese endurance runners), from which a panel of 45 promising markers was identified. These markers were tested for replication in seven additional cohorts of endurance athletes and controls: from Australia, Ethiopia, Japan, Kenya, Poland, Russia and Spain. The study is based on a total of 1520 endurance athletes (835 who took part in endurance events in World Championships and/or Olympic Games) and 2760 controls. We hypothesized that world-class athletes are likely to be characterized by an even higher concentration of endurance performance alleles and we performed separate analyses on this subsample. The meta-analysis of all available studies revealed one statistically significant marker (rs558129 at GALNTL6 locus, p = 0.0002), even after correcting for multiple testing. As shown by the low heterogeneity index (I2 = 0), all eight cohorts showed the same direction of association with rs558129, even though p-values varied across the individual studies. In summary, this study did not identify a panel of genomic variants common to these elite endurance athlete groups. Since GAMES was underpowered to identify alleles with small effect sizes, some of the suggestive leads identified should be explored in expanded comparisons of world-class endurance athletes and sedentary controls and in tightly controlled exercise training studies. Such studies have the potential to illuminate the biology not only of world class endurance performance but also of compromised cardiac functions and cardiometabolic diseases.
scite is a Brooklyn-based organization that helps researchers better discover and understand research articles through Smart Citations–citations that display the context of the citation and describe whether the article provides supporting or contrasting evidence. scite is used by students and researchers from around the world and is funded in part by the National Science Foundation and the National Institute on Drug Abuse of the National Institutes of Health.