2008
DOI: 10.1186/1471-2105-9-334
|View full text |Cite
|
Sign up to set email alerts
|

Performing statistical analyses on quantitative data in Taverna workflows: An example using R and maxdBrowse to identify differentially-expressed genes from microarray data

Abstract: Background: There has been a dramatic increase in the amount of quantitative data derived from the measurement of changes at different levels of biological complexity during the post-genomic era. However, there are a number of issues associated with the use of computational tools employed for the analysis of such data. For example, computational tools such as R and MATLAB require prior knowledge of their programming languages in order to implement statistical analyses on data. Combining two or more tools in an… Show more

Help me understand this report

Search citation statements

Order By: Relevance

Paper Sections

Select...
1
1
1
1

Citation Types

0
33
0

Year Published

2010
2010
2018
2018

Publication Types

Select...
5
3
1

Relationship

0
9

Authors

Journals

citations
Cited by 42 publications
(33 citation statements)
references
References 21 publications
0
33
0
Order By: Relevance
“…2: On the left, (1) a workflow for the identification of differential genes generated as an image using Taverna. [17] On the right, (2) Same workflow represented as an interactive graph using web technologies in a mobile device.…”
Section: Discussionmentioning
confidence: 99%
“…2: On the left, (1) a workflow for the identification of differential genes generated as an image using Taverna. [17] On the right, (2) Same workflow represented as an interactive graph using web technologies in a mobile device.…”
Section: Discussionmentioning
confidence: 99%
“…Some of these data workflow systems are presented in [18,12,19]. These systems have been successfully used in a variety of data intensive scenarios like analyzing data from the Southern California Earthquake Center [20], data from biological domains like post genomic research [21], analysis of proteins and peptides from tandem mass spectrometry data [22], cancer research [23], meteorological phenomena [24] or used in the German grid platform [25]. In these scenarios, the systems accessed and processed petabytes of data, and we are convinced that the approach they use is the most suitable for managing the large amounts of data present in the LOD cloud.…”
Section: Implementation Of Sparql-dqp and Welldesigned Patterns Optimmentioning
confidence: 99%
“…As of Taverna version 2.3.0 these services, as well as a variety of other life science Web services, are all grouped in the Service Catalog, a module that lists all the services available in the BioCatalogue project (31). In addition, Taverna supports the local or remote execution of RShell code (32,33), providing access to a variety of statistical functions, both for general purpose and for bioinformatics, such as those included in the Bioconductor package (34). The incorporation of R also benefits performance, as there is a framework for parallel execution of algorithms in R (35).…”
Section: Theoretical Backgroundmentioning
confidence: 99%