Abstract
The problem of estimating the probability mass of the support of a distribution not observed in random sampling is considered in the case where the distribution is discrete. An example of a situation in which the problem arises is that of species sampling: suppose that one wishes to determine the species of fish native to a body of water and that, after repeated sampling, one identifies a certain number of species. The problem is to estimate the proportion of the fish population belonging to the unobserved species. Since it is a rare event, ideas from large deviation theory play a role in answering the question. The result depends on the underlying distribution, which is unknown in general. Methods similar to nonparametric bootstrapping are therefore used to prove a limit theorem and obtain a confidence interval for the rate function.
Original language | English (US) |
---|---|
Pages (from-to) | 91-105 |
Number of pages | 15 |
Journal | Journal of Statistical Planning and Inference |
Volume | 91 |
Issue number | 1 |
DOIs | |
State | Published - Nov 1 2000 |
Externally published | Yes |
Keywords
- Bootstrapping
- Large deviations
- Primary 62E20
- Secondary 60F10
- Unobserved support
ASJC Scopus subject areas
- Statistics and Probability
- Statistics, Probability and Uncertainty
- Applied Mathematics