2010
DOI: 10.1515/jib-2010-111
|View full text |Cite
|
Sign up to set email alerts
|

Identifying the impact of G-Quadruplexes on Affymetrix 3′ Arrays using Cloud Computing

Abstract: SummaryA tetramer quadruplex structure is formed by four parallel strands of DNA/ RNA containing runs of guanine. These quadruplexes are able to form because guanine can Hoogsteen hydrogen bond to other guanines, and a tetrad of guanines can form a stable arrangement. Recently we have discovered that probes on Affymetrix GeneChips that contain runs of guanine do not measure gene expression reliably. We associate this finding with the likelihood that quadruplexes are forming on the surface of GeneChips.In order… Show more

Help me understand this report

Search citation statements

Order By: Relevance

Paper Sections

Select...
2
1
1
1

Citation Types

0
6
0

Year Published

2011
2011
2020
2020

Publication Types

Select...
6
1
1

Relationship

2
6

Authors

Journals

citations
Cited by 13 publications
(6 citation statements)
references
References 8 publications
0
6
0
Order By: Relevance
“…Sequence-specific motifs are an issue in microarray data [18,35] and have also been shown to affect RNA-seq data [38] as well as RNA primers [9], resulting in sequencespecific deviations in the distribution of mapped reads to a reference genome [27,15]. Furthermore GC content effects have been demonstrated in both Microarray and RNA-seq data [26].…”
Section: Methodsmentioning
confidence: 99%
See 1 more Smart Citation
“…Sequence-specific motifs are an issue in microarray data [18,35] and have also been shown to affect RNA-seq data [38] as well as RNA primers [9], resulting in sequencespecific deviations in the distribution of mapped reads to a reference genome [27,15]. Furthermore GC content effects have been demonstrated in both Microarray and RNA-seq data [26].…”
Section: Methodsmentioning
confidence: 99%
“…The effect of extremes of GC content in sequencing data (as well as microarray data) has been discussed in numerous studies [6,18], and we therefore also investigate the effect of the mean GC content of reads within the exonḡe and the GC content of the 4-mer motif itself gm. In order to partition reads by mean GC content (which we will discuss later) we also define binned GC content ranges (30-40%, 40-50%, 50-60% and 60-70%) forḡe as follows:…”
Section: Phase II -Motif Correlations Analysismentioning
confidence: 99%
“…In this work, some of the experiments which use the Human GeneChip called HG-U133A were downloaded and analyzed. The analysis, carried out using the R statistical language, was to determine whether runs of guanine in the probe sequences (runs of 4 or more 'G's) were producing a significant bias in the gene expression data [14,39].…”
Section: Understanding the Microarray Datamentioning
confidence: 99%
“…The cloud has become an appealing alternative high-performance computing platform for ad-hoc analytics since it offers on-demand computing and storage resources, along with scalability and low maintenance costs [4], [5], [6], [7]. This has led to a variety of research for supporting analytics in computational biology and bioinformatics on the cloud (for example, [8], [9], [10] and [11]). …”
Section: Introductionmentioning
confidence: 99%