2020
DOI: 10.1371/journal.pone.0242933
|View full text |Cite
|
Sign up to set email alerts
|

OpenStats: A robust and scalable software package for reproducible analysis of high-throughput phenotypic data

Abstract: Reproducibility in the statistical analyses of data from high-throughput phenotyping screens requires a robust and reliable analysis foundation that allows modelling of different possible statistical scenarios. Regular challenges are scalability and extensibility of the analysis software. In this manuscript, we describe OpenStats, a freely available software package that addresses these challenges. We show the performance of the software in a high-throughput phenomic pipeline in the International Mouse Phenoty… Show more

Help me understand this report
View preprint versions

Search citation statements

Order By: Relevance

Paper Sections

Select...
3
1
1

Citation Types

1
11
0

Year Published

2021
2021
2024
2024

Publication Types

Select...
7
2

Relationship

4
5

Authors

Journals

citations
Cited by 15 publications
(12 citation statements)
references
References 37 publications
1
11
0
Order By: Relevance
“…To do this, we used a subset of the genes for which we had continuous measurements (Supp Table 3) to calculate the Euclidean distance from the individual gene-trait associations. The effect size is calculated using the Cohen method (Haselimashhadi et al 2020). Our results based on the extensive IMPC data set are consistent with previous observations.…”
Section: Detecting Pleiotropysupporting
confidence: 88%
“…To do this, we used a subset of the genes for which we had continuous measurements (Supp Table 3) to calculate the Euclidean distance from the individual gene-trait associations. The effect size is calculated using the Cohen method (Haselimashhadi et al 2020). Our results based on the extensive IMPC data set are consistent with previous observations.…”
Section: Detecting Pleiotropysupporting
confidence: 88%
“…In line with [3], the sexual dimorphism effect is tested for all 22 haematology traits, independently for WT mice from each of the 11 centres, corresponding to the same mouse strain and metadata group split. We used a windowed linear mixed model described in [26,27] and implemented in the software R [28], packages OpenStats and SmoothWin [29,30]. As in [3], Sex and Body Weight in the fixed effect terms and Batch (the date when the test is performed on mice) in the random effect term.…”
Section: Resultsmentioning
confidence: 99%
“…We then pruned the resulting data to only include ‘Successful’ analyses, and removed experiments that included the skull. To generate the Gpatch1 boxplot, we obtained raw data using from IMPC’s ‘statistical-raw-data’ SOLR database for Gpatch1 , and analyzed the data in the same manner as IMPC, using the ‘OpenStats’ R package (version 1.0.2), using the method = ‘MM’ and MM_BodyWeightIncluded = TRUE arguments ( Haselimashhadi et al, 2020 ). Finally, mouse genes were converted to their human syntenic counterparts using Ensembl’s ‘hsapiens_gene_ensembl’ and ‘mmusculus_gene_ensembl’ datasets through biomaRt.…”
Section: Methodsmentioning
confidence: 99%