2015
DOI: 10.1038/nmeth.3249
|View full text |Cite
|
Sign up to set email alerts
|

Targeted exploration and analysis of large cross-platform human transcriptomic compendia

Abstract: We present SEEK (http://seek.princeton.edu), a query-based search engine across very large transcriptomic data collections, including thousands of human data sets from almost 50 microarray and next-generation sequencing platforms. SEEK uses a novel query-level cross-validation-based algorithm to automatically prioritize data sets relevant to the query and a robust search approach to identify query-coregulated genes, pathways, and processes. SEEK provides cross-platform handling, multi-gene query search, iterat… Show more

Help me understand this report

Search citation statements

Order By: Relevance

Paper Sections

Select...
3
1
1

Citation Types

2
138
0
2

Year Published

2016
2016
2021
2021

Publication Types

Select...
6
2
1

Relationship

1
8

Authors

Journals

citations
Cited by 145 publications
(142 citation statements)
references
References 33 publications
2
138
0
2
Order By: Relevance
“…To test the relevance of these findings in clinical disease sets, we conducted a search-based exploration of expression compendia analysis (SEEK) of human datasets across the Gene Expression Omnibus and Cancer Genome Atlas (TCGA) (Zhu et al, 2015) (Figure 3A). The 227 signature had markedly higher co-expression scores (i.e., dataset weight) across the SEEK compendia as compared to random queries of matched size, with such scores highly significant (at p ≤ 0.001) in over 87% of the datasets.…”
Section: Resultsmentioning
confidence: 99%
“…To test the relevance of these findings in clinical disease sets, we conducted a search-based exploration of expression compendia analysis (SEEK) of human datasets across the Gene Expression Omnibus and Cancer Genome Atlas (TCGA) (Zhu et al, 2015) (Figure 3A). The 227 signature had markedly higher co-expression scores (i.e., dataset weight) across the SEEK compendia as compared to random queries of matched size, with such scores highly significant (at p ≤ 0.001) in over 87% of the datasets.…”
Section: Resultsmentioning
confidence: 99%
“…Although CoREST, mSin3 and HDAC1/2 were expressed in HEK293T cell based on microarray data of HEK293T cell (One of ENCODE transcriptome data41; GEO accession: GSE1580542), their expression was not that high. In addition, CoREST is known to be negatively co-expressed with REST from SEEK database43. Thus, endogenous expression level of each protein in HEK293 cells should affect the detection of REST-interacting proteins.…”
Section: Discussionmentioning
confidence: 99%
“…A few web resources, including GEO (Barrett et al 2013), ArrayExpress (Kolesnikov et al 2015), GeneNetwork (Chesler et al 2004), and Bgee (Bastian et al 2008) amongst others, have created repositories of such expression data for curation, reuse, and integration. Several tools, such as GeneMANIA (Warde-Farley et al 2010), GIANT , SEEK (Zhu et al 2015), GeneFriends (van Dam et al 2015), WeGET (Szklarczyk et al 2016), COXPRESdb (Obayashi et al 2019), WGCNA (Langfelder and Horvath 2008), and CLIC , are able to assign putative new functions to genes by means of correlations or co-expression networks. At their core, these methods rely on the concept of guilt-byassociation -that transcripts or proteins exhibiting similar expression patterns tend to be functionally related (Eisen et al 1998).…”
Section: Introductionmentioning
confidence: 99%
“…The key polarity of interactions is often lost among gene products and linked modules (Warde-Farley et al 2010;Greene et al 2015;van Dam et al 2015;Zhu et al 2015;Li et al 2017). Gene set analyses, such as gene set enrichment analysis (GSEA) (Subramanian et al 2005), have been developed to identify processes or modules that are affected by certain genetic or environmental perturbations (Khatri et al 2012).…”
Section: Introductionmentioning
confidence: 99%