2019 IEEE EMBS International Conference on Biomedical &Amp; Health Informatics (BHI) 2019
DOI: 10.1109/bhi.2019.8834661
|View full text |Cite
|
Sign up to set email alerts
|

Identifying Appropriate Probabilistic Models for Sparse Discrete Omics Data

Help me understand this report

Search citation statements

Order By: Relevance

Paper Sections

Select...
3
2

Citation Types

0
16
0

Year Published

2020
2020
2023
2023

Publication Types

Select...
5
1
1

Relationship

1
6

Authors

Journals

citations
Cited by 14 publications
(16 citation statements)
references
References 18 publications
0
16
0
Order By: Relevance
“…To identify the silvicultural, ecological and environmental variables that best explained variations in sapling density and occurrence of tolerant hardwood species (American beech and sugar maple saplings), we compared a series of zeroinflated negative binomial (ZINB) models (function "zeroinfl", package "pscl" in R) Zeileis et al 2008). Several authors recommend using ZINB models for over-dispersed count data (e.g., sapling density) with extra zeros (Thomas et al 2018;Aldirawi et al 2019;Wang et al 2020). In this case, the ZINB distribution is more appropriate for modeling count data compared to the commonly used Poisson distribution (Aldirawi et al 2019).…”
Section: Statistical Analysesmentioning
confidence: 99%
See 1 more Smart Citation
“…To identify the silvicultural, ecological and environmental variables that best explained variations in sapling density and occurrence of tolerant hardwood species (American beech and sugar maple saplings), we compared a series of zeroinflated negative binomial (ZINB) models (function "zeroinfl", package "pscl" in R) Zeileis et al 2008). Several authors recommend using ZINB models for over-dispersed count data (e.g., sapling density) with extra zeros (Thomas et al 2018;Aldirawi et al 2019;Wang et al 2020). In this case, the ZINB distribution is more appropriate for modeling count data compared to the commonly used Poisson distribution (Aldirawi et al 2019).…”
Section: Statistical Analysesmentioning
confidence: 99%
“…Several authors recommend using ZINB models for over-dispersed count data (e.g., sapling density) with extra zeros (Thomas et al 2018;Aldirawi et al 2019;Wang et al 2020). In this case, the ZINB distribution is more appropriate for modeling count data compared to the commonly used Poisson distribution (Aldirawi et al 2019). In addition, we used ZINB models because they can simultaneously predict the density and the occurrence probability of saplings, while modeling nonlinear effects of silvicultural, environmental, and ecological factors.…”
Section: Statistical Analysesmentioning
confidence: 99%
“…With appropriate probabilistic models identified successfully, we can improve the power of the statistical test significantly. Recently, Aldirawi, Yang, and Metwally (2019) proposed a statistical procedure for identifying the most appropriate discrete probabilistic models for zero-inflated or hurdle models based on the p-value of the discrete Kolmogorov-Smirnov (KS) test. The same procedure could be used for more general zero-inflated or hurdle models, including the ones with continuous baseline distributions.…”
Section: Introductionmentioning
confidence: 99%
“…More specifically, the goal is to test if the sample X = {X 1 , X 2 , .., X n } comes from a discrete or mixed distribution with cumulative distribution function (CDF) F θ (x) where the parameter(s) θ is unknown. Algorithm 1, which is regenerated from Aldirawi, Yang, and Metwally (2019), provides our procedures in details.…”
Section: Introductionmentioning
confidence: 99%
“…For example, quite often people do not have dental caries (Preisser and others , 2016), and the majority of a population does not make a hospital visit during a given year (Gurmu, 1997). In omics data, either because of technological reasons related to sequencing or due to some biologoical reasons, counts are often very sparse (Risso and others , 2018; Van den Berge and others , 2018; Aldirawi and others , 2019). Such count data with excess number of zeros are frequently modeled using the zero-inflated Poisson (ZIP) or the zero-inflated negative binomial (ZINB) distributions (Lambert, 1992; Greene, 1994).…”
Section: Introductionmentioning
confidence: 99%