Strong Interest in fish/wildlife
Examples?
\[ y \in \{0,1\} \]
\[ 1 = \text{occur} \\ 0 = \text{not occur} \]
\[ \hat{p} = \frac{1}{n}\sum_{i=1}^{n}y_{i} \]
\[ \hat{\sigma}^2_{p} = \left(\frac{N-n}{N}\right)\frac{\hat{p}(1-\hat{p})}{n-1} \]
How does sample grid effect \(\hat{p}\) and \(\hat{\sigma}^2_{p}\)?
How does sample grid effect \(\hat{p}\) and \(\hat{\sigma}^2_{p}\)?
How does sample grid effect \(\hat{p}\) and \(\hat{\sigma}^2_{p}\)?
Does this make sense?
Soooooo useful!
\[ \hat{\sigma}_{p} = \sqrt\frac{\hat{p}(1-\hat{p})}{n-1} \]
Lets visualize a few standard deviations of p
Should I include the additional/ancillary surveys as data?
Should I report the cells with occurrences at all?
In small groups: Discuss and then tell me what to do and explain.
What if?
sites occu
1 1 1
2 2 1
3 3 0
4 4 0
5 5 0
6 6 1
Sampling Distribution of \(\hat{p}\)
Estimator Bias = -0.00005
Distribution Var = 0.00084
SRS of n = 200 with 5% ancillary and occupied sites
Estimator Bias = 0.02382
Distribution Var = 0.00073
SRS of n = 200 with 10% ancillary and occupied sites
Estimator Bias = 0.04536
Distribution Var = 0.00069
Should I include the ancillary surveys as data?
Should I report the cells with occurrences at all?
Instead of adding sites w/o clear sampling probabilities, we make these explicit.
Identify sites that are important and increase their probability of being selected for sampling
1/2 sites with 3x the probability of being sampled versus other half
sample
function has an argument prob
\[ \hat{p} = \frac{1}{n}\sum_{i=1}^{n}y_i \]
Estimator Bias = 0.13753
Generalized Unequal Probability Estimator
Thompson Ch. 6.3
\[ \hat{\theta} = \frac{\sum_{i} y_i / \pi_i}{\sum_{i}1 / \pi_i} \]
Estimator Bias = -0.03147
“not precisely unbiased” - Thompson