2009
DOI: 10.1111/j.1541-0420.2008.01116.x
|View full text |Cite
|
Sign up to set email alerts
|

Presence‐Only Data and the EM Algorithm

Abstract: Summary In ecological modeling of the habitat of a species, it can be prohibitively expensive to determine species absence. Presence-only data consist of a sample of locations with observed presences and a separate group of locations sampled from the full landscape, with unknown presences. We propose an expectation–maximization algorithm to estimate the underlying presence–absence logistic model for presence-only data. This algorithm can be used with any off-the-shelf logistic model. For models with stepwise f… Show more

Help me understand this report

Search citation statements

Order By: Relevance

Paper Sections

Select...
2
1
1
1

Citation Types

3
355
0
1

Year Published

2011
2011
2021
2021

Publication Types

Select...
6
3

Relationship

0
9

Authors

Journals

citations
Cited by 262 publications
(359 citation statements)
references
References 17 publications
3
355
0
1
Order By: Relevance
“…This approach is similar to density estimation [4]. When unlabeled data are available, a strategy to solve one-class classification problems is to use EM-like algorithms to iteratively predict the negative examples and learn the classifier [28,17,26]. In [9], Denis show that function classes learnable under the statistical query model are also learnable from positive and unlabeled examples if each positive example is left unlabeled with a constant probability.…”
Section: One-class Classificationmentioning
confidence: 99%
“…This approach is similar to density estimation [4]. When unlabeled data are available, a strategy to solve one-class classification problems is to use EM-like algorithms to iteratively predict the negative examples and learn the classifier [28,17,26]. In [9], Denis show that function classes learnable under the statistical query model are also learnable from positive and unlabeled examples if each positive example is left unlabeled with a constant probability.…”
Section: One-class Classificationmentioning
confidence: 99%
“…Automated random sampling of pseudo-absence/background samples from a set of pixels within the boundaries of Kenya [41], where these species have not been detected [42] was used to maximize predictivity. Pseudoabsence of the probability of presence which would otherwise be confined to presence only [43]. Each species was assumed to have the same probability of being anywhere on the landscape, hence every pixel or environment on the landscape had the same probability of being tagged as "background" in geographic and environmental space.…”
Section: Ecological Niche Modellingmentioning
confidence: 99%
“…The modern distributional data used in such modelling should include not only presences but also absences (Phillips & Dudik 2008;Phillips et al 2009;Soberón & Nakamura 2009;Ward et al 2009;Lobo et al 2010, but see Phillips & Elith (2010) for using presence data only) if probabilities of occurrence are to be estimated (cf. Argáez et al 2005).…”
Section: Strengths and Weaknessesmentioning
confidence: 99%