Improving the precision of estimates of the frequency of rare events

dc.contributor.author Dixon, Philip
dc.contributor.author Ellison, Aaron
dc.contributor.author Gotelli, Nicholas
dc.contributor.department Statistics
dc.date 2018-02-17T11:34:14.000
dc.date.accessioned 2020-07-02T06:57:58Z
dc.date.available 2020-07-02T06:57:58Z
dc.date.issued 2005-01-01
dc.description.abstract <p>The probability of a rare event is usually estimated directly as the number of times the event occurs divided by the total sample size. Unfortunately, the precision of this estimate is low. For typical sample sizes of <em>N</em> < 100 in ecological studies, the coefficient of variation (cv) of this estimate of the probability of a rare event can exceed 300%. Sample sizes on the order of 10<sup>3</sup>–10<sup>4</sup> observations are needed to reduce the cv to below 10%. If it is impractical or impossible to increase the sample size, auxiliary data can be used to improve the precision of the estimate. We describe four approaches for using auxiliary data to improve the precision of estimates of the probability of a rare event: (1) Bayesian analysis that includes prior information about the probability; (2) stratification that incorporates information on the heterogeneity in the population; (3) regression models that account for information correlated with the probability; and (4) inclusion of aggregated data collected at larger spatial or temporal scales. These approaches are illustrated using data on the probability of capture of vespulid wasps by the insectivorous plant Darlingtonia californica. All four methods increase the precision of the estimate relative to the simple frequency-based estimate (absolute precision = 1.26, relative precision [cv] = 70%): stratification (absolute precision = 1.10, cv = 62%); regression models (absolute precision = 1.59, cv = 55%); Bayesian analysis with an informative prior probability distribution (absolute precision = 4.28, cv = 47%); and using temporally aggregated data (absolute precision = 6.75, cv = 36%). When informative auxiliary data is available, we recommend including it when estimating the probability of rare events.</p>
dc.description.comments <p>This is an article from <em>Ecology</em> 86 (2005): 1114, doi:<a href="http://dx.doi.org/10.1890/04-0601" target="_blank">10.1890/04-0601</a>. Posted with permission.</p>
dc.format.mimetype application/pdf
dc.identifier archive/lib.dr.iastate.edu/stat_las_pubs/62/
dc.identifier.articleid 1060
dc.identifier.contextkey 8059031
dc.identifier.s3bucket isulib-bepress-aws-west
dc.identifier.submissionpath stat_las_pubs/62
dc.identifier.uri https://dr.lib.iastate.edu/handle/20.500.12876/90662
dc.language.iso 2005
dc.source.bitstream archive/lib.dr.iastate.edu/stat_las_pubs/62/2005_Dixon_ImprovingPrecision.pdf|||Sat Jan 15 01:18:10 UTC 2022
dc.source.uri 10.1890/04-0601
dc.subject.disciplines Statistics and Probability
dc.subject.keywords aggregation
dc.subject.keywords Bayesian inference
dc.subject.keywords coefficient of variation
dc.subject.keywords estimators
dc.subject.keywords precision
dc.subject.keywords rare events
dc.subject.keywords regression
dc.subject.keywords sampling
dc.subject.keywords stratification
dc.title Improving the precision of estimates of the frequency of rare events
dc.type article
dc.type.genre article
dspace.entity.type Publication
relation.isAuthorOfPublication 7b3eb8d2-a569-4aba-87a1-5d9c2d99fade
relation.isOrgUnitOfPublication 264904d9-9e66-4169-8e11-034e537ddbca
File
Original bundle
Now showing 1 - 1 of 1
No Thumbnail Available
Name:
2005_Dixon_ImprovingPrecision.pdf
Size:
320.53 KB
Format:
Adobe Portable Document Format
Description:
Collections