2005
DOI: 10.1007/s10651-005-6817-1
|View full text |Cite
|
Sign up to set email alerts
|

Modelling skewed data with many zeros: A simple approach combining ordinary and logistic regression

Abstract: We discuss a method for analyzing data that are positively skewed and contain a substantial proportion of zeros. Such data commonly arise in ecological applications, when the focus is on the abundance of a species. The form of the distribution is then due to the patchy nature of the environment and/or the inherent heterogeneity of the species. The method can be used whenever we wish to model the data as a response variable in terms of one or more explanatory variables. The analysis consists of three stages. Th… Show more

Help me understand this report

Search citation statements

Order By: Relevance

Paper Sections

Select...
1
1
1
1

Citation Types

1
177
1
6

Year Published

2007
2007
2020
2020

Publication Types

Select...
9
1

Relationship

0
10

Authors

Journals

citations
Cited by 268 publications
(185 citation statements)
references
References 16 publications
(28 reference statements)
1
177
1
6
Order By: Relevance
“…Building stable predictive models with well distributed residuals proved impossible in the case of Aporrectodea longa (Ude, 1885) because there were a large number of zero counts (absences). We therefore followed the approach recommended by Fletcher et al, (2005) by splitting the modelling (still using REML) into (i) a binary regression step (predicting the probability of A. longa presence in a given ESL for each treatment), followed by (ii) an ordinary regression analysis in which we predicted the densities and biomasses in treatments using only those ESLs in which !1 A. longa individual was found. The first step used the logistic probability model and second step used the normal probability model.…”
Section: Discussionmentioning
confidence: 99%
“…Building stable predictive models with well distributed residuals proved impossible in the case of Aporrectodea longa (Ude, 1885) because there were a large number of zero counts (absences). We therefore followed the approach recommended by Fletcher et al, (2005) by splitting the modelling (still using REML) into (i) a binary regression step (predicting the probability of A. longa presence in a given ESL for each treatment), followed by (ii) an ordinary regression analysis in which we predicted the densities and biomasses in treatments using only those ESLs in which !1 A. longa individual was found. The first step used the logistic probability model and second step used the normal probability model.…”
Section: Discussionmentioning
confidence: 99%
“…Because of a zero-inflated distribution [28], a two-part approach [29,30] was taken to analysing seed set data. First, we used a logistic regression to model the probability of setting seeds (yes or no), with the same predictors as for stigma closure, followed by a LR test.…”
Section: Methods (A) Study Organismsmentioning
confidence: 99%
“…Approaches that model the presence or absence of reproductive structures are valid in such cases (Faddy 1998;Kuhnert et al 2005). One approach is the use of a binary logistic regression model, which takes into consideration the presence or absence of a unit (Welsh et al 1996;Fletcher et al 2005). For our analysis, we had to pool observations at higher levels (i.e.…”
Section: Low Levels Of Flowering -Impacts On Gene Flow and The Abilitmentioning
confidence: 99%