STA258 Lecture 10

Estimator
Confidence Intervals
Example:
- $σ_{1}^{2} = σ_{2}^{2} = σ^{2}$
- $S_{p}^{2} = \frac{(n_{1} - 1) S_{2} + (n_{2} - 1) S_{2}^{2}}{n_{1} + n_{2} - 2}$
- $C I = ({\bar{Y}}_{1} - {\bar{Y}}_{2}) \pm t_{(\frac{α}{2}, ?)} S_{p} \sqrt{\frac{1}{n_{1}} + \frac{1}{n_{2}}}$
- Mutual funds purchases.
- Can investors do better by buying mutual funds directly rather than through brokers?
- We have a random sample of net annual returns through brokers and directly.
- Find a $95 %$ CI for $μ_{1} - μ_{2}$
- $μ_{1}$ is direct
- $μ_{2}$ is broker
- Assume $σ_{1}^{2} = σ_{2}^{2}$ and annual returns are normal.
- We get that $n_{1} = 50 = n_{2}$
- ${\bar{x}}_{1} = 6.63$
- ${\bar{x}}_{2} = 3.72$
- $s_{1}^{2} = 37.49$
- $s_{2}^{2} = 43.34$
- Pooled Variance
  - $S_{p}^{2} = \frac{(49) (37.49) + (49) (43.34)}{50 + 50 - 2}$
- $ν = 50 + 50 - 2 = 98$
- $t_{(\frac{α}{2}; 98)} = 1.984$
- $C I = (6.63 - 3.72) \pm 1.984 \sqrt{40.42 (50^{- 1} + 50^{- 1})}$
- $C I = 2.91 \pm 2.52$
- $μ_{1} = μ_{d} =$ direct
- $μ_{2} = μ_{b} =$ broker
- Since $0$ is not included, this means the difference is not negligible.
- Because our difference is positive, it means that $μ_{d} > μ_{b}$ . So direct is better.
- If $μ_{d} = μ_{b} ⟹ μ_{d} - μ_{b} = 0$
- Since $μ_{d} - μ_{b} > 0 ⟹ μ_{d} > μ_{b}$ so that's why it's better.
- How sure can we be sure that direct is better than broker?
- $\frac{95}{100}$ times we will have a better return directly. But how strongly can we be confident in this? #tk something about hypothesis testing.
- Since our interval is almost including $0$ . We need to worry about it:
  - If we have $(0.00000001, 5.43)$ , it's almost $0$
  - If we have $(3, 5.43)$ , it's pretty far away from $0$
  - $(0.39, 5.43)$ isn't as close to $0$ as the first example, but it's almost there.
- Suppose $σ_{1} \neq σ_{2}$
  - $C I = ({\bar{Y}}_{1} - {\bar{Y}}_{2}) \pm t_{(\frac{α}{2}; ν)} \sqrt{\frac{S_{1}^{2}}{n_{1}} + \frac{S_{2}^{2}}{n_{2}}}$
  - We don't pool it because we need to account for different variance.
  - $ν = \frac{{(\frac{S_{1}^{2}}{n_{1}} + \frac{S_{2}^{2}}{n_{2}})}^{2}}{(\frac{1}{n_{1} - 1}) {(\frac{S_{1}^{2}}{n_{1}})}^{2} + (\frac{1}{n_{2} - 1}) {(\frac{S_{2}^{2}}{n_{2}})}^{2}}$
  - This is the Welch method. Easier to do with R.
  - Otherwise, you can do:
    - $min (n_{1} - 1, n_{2} - 1)$
    - This will widen the T Distribution, and will give a conservative answer for your CI.
Example:
- A company sells edu materials.
- A consultant has a $3^{rd}$ grade glass of $n = 21$ for $8$ weeks.
  - This is a treatment group
- We need a control group, so we have $m = 23$ $3^{rd}$ grade students without the materials.
- At the end, both groups are given a DRP test to measure.
- $\begin{matrix} Treatment & Control \\ 24 & 61 & 59 & 46 & | & 42 & 44 & 46 & 37 \\ \dots \end{matrix}$
- Find a $95 %$ CI for the mean improvement.
- $C I = ({\bar{Y}}_{T} - {\bar{Y}}_{C}) \pm t^{⋆} \sqrt{\frac{S_{T}^{2}}{n_{T}} + \frac{S_{C}^{2}}{n_{C}}}$
- Conservative approach $⟹ ν = 20$
- $1 - α = 0.95 ⟹ \frac{α}{2} = 0.025$
- $t_{(0.025; 20)} = 2.086$
- $C I = (51.48 - 41.52) \pm 2.086 (4.31)$
  - $= (0.87, 18.95)$
- The lower bound of mean improvement is barely anything. The higher bound is really effective.
- But otherwise, $0 \notin C I$ , so it is effective.
- Since $0.87$ is pretty close to $0$ , how strongly can we know if it would include it or not?
- We also need to know what the test is, does $0.87$ mean $0.87$ marks, or percent? Could it be rounded down?
Not caring about domain knowledge is very dangerous:
- You could end up with a case like the memes about "What they did in the study showing red meat causes cancer" and it's just a video of someone launching cannonballs in a catapult.
- Outcomes are valid, but interpretations matters in application of it.
Assumption in the above $3^{rd}$ grader thing is important too. That the population is normal. We'd need to do a Normal Q-Q Plot. To see if it's normal.
- We don't have evidence for normality by the chart, but we don't have evidence against it.
Example:
- Conservationsist example
- $12$ unlogged forests
- $9, 8$
- $\begin{matrix} Unlogged: & 22 & 18 & 22 & 20 & 15 & 21 & 13 & 13 & 19 & 13 & 19 & 15 \end{matrix}$
- $\begin{matrix} Logged: & 17 & 4 & 18 & 13 & 18 & 15 & 15 & 10 & 12 \end{matrix}$
- Create a $99 %$ CI on difference in mean
- $C I = ({\bar{Y}}_{U} - {\bar{Y}}_{L}) \pm t_{(\frac{α}{2}; ν)} \sqrt{\frac{S_{1}^{2}}{n} + \frac{S_{2}^{2}}{m}}$
- ${\bar{Y}}_{1} - {\bar{Y}}_{2} = 3.83$
- $S_{1} = 3.52$
- $S_{2} = 4.5$
- $n = n_{1} = 12$
- $m = n_{2} = 9$
- $\sqrt{\frac{S_{1}^{2}}{n} + \frac{S_{2}^{2}}{m}} = 1.81$
- $ν = min (12 - 1, 9 - 1) = 8$
- $C I = 3.83 \pm t_{(\frac{α}{2}; ν)} (1.81)$
- $t^{⋆} = 3.35$
- $C I = 3.83 \pm (3.35) (1.81)$
- $= 3.83 - (3.35) (1.81) = - 2.2335$
- $= 3.83 + (3.35) (1.81) = 9.8935$
- Does logging have an impact?
  - $0 \in C I$ so it doesn't
Paired Samples
- Compare two tires
- $\begin{matrix} Auto & 1 & 2 & 3 & 4 & 5 \\ A & 10.6 & 9.8 & 12.3 & 9.7 & 8.8 \\ B & 10.2 & 9.4 & 11.8 & 9.1 & 8.3 \end{matrix}$
- Do the data provide evidence in mean wear for tire types $A$ and $B$
- Mounted on the same car, but when we get the difference, it's independent.
- First find the difference.
- $\begin{matrix} Auto & 1 & 2 & 3 & 4 & 5 \\ A & 10.6 & 9.8 & 12.3 & 9.7 & 8.8 \\ B & 10.2 & 9.4 & 11.8 & 9.1 & 8.3 \\ 0.4 & 0.4 & 0.5 & 0.6 & 0.5 & \bar{X} = 0.48 \end{matrix}$
- $\bar{d} = 0.48$
- $S_{d} = 0.0837$
- $C I = 0.48 \pm (2.776) \frac{0.0837}{\sqrt{5}}$
- $0.48 \pm 0.1039$
- The interval is positive and doesn't contain $0$ so there is a difference between these two tires.
Comparing two proportions
- $C I$
  - We use $Z$ critical values
- $n = 1178$
- Notes whether mother has epidural, do the babies still nurse after $6$ months
- $p_{1} =$ those who had an epidural and are nursing at $6$ months
- $p_{2} =$ those who didn't who are nursing at $6$ months
- $\begin{matrix} Nursing at 6 months \\ Epidural & Y & N \\ Y & 206 & 190 \\ N & 498 & 284 \end{matrix}$
- First find the Point Estimators for two proportions.
- ${\hat{P}}_{1} =$
  - $= \frac{206}{206 + 190} = \frac{103}{198} = 0.52020202020202$
- ${\hat{P}}_{2} =$
  - $= \frac{498}{498 + 284} = \frac{249}{391} = 0.636828644501279$
- Without doing a confidence interval, it seems we already have a pretty big difference.
- $SE [X] = \sqrt{\frac{(0.52) (1 - 0.52)}{396} + \frac{(0.64) (1 - 0.64)}{782}} = 0.0304126977809148$
- $C I = (0.52 - 0.64) \pm (1.96) (0.0304126977809148)$
- Since $0$ is not in the interval, and we have a negative difference. It means that epidural means the baby will nurse likelier than another.
- There's an abundance of women who haven't had an epidural. Why are there different sample sizes, make one sample size match the other.
- The impact is $n_{2}$
- The numbers aren't balanced. They're getting weighed by the sample sizes.
- Extreme sample:
  - Control group is 1000 people
  - What would happen to the Confidence Intervals?
  - It would be smaller, but we don't have balance in the dataset regarding those who had an epidural.
- We are $95 %$ confident that percent of nursing is mothers at 6 months for those who had an epidural are between $0 %$ and $17 %$ less than those who didn't.
Example:
- Smoking
- We have $244$ smokers who want to stop to receive nicotine patches
- $245$ receive both a path and the anti-depressant
- After a year $40$ in the nicotine group had abstained
- $87$ in the patch plus drug group have.
- Give a $99 %$ CI for the difference, $t - c$ in the proportion of smokers who quit.
- Find the Point Estimator for proportions
- ${\hat{P}}_{1} =$
  - $= \frac{40}{244} = 0.163934426229508$
- ${\hat{P}}_{2} =$
  - $= \frac{87}{245} = \frac{87}{245} = 0.355102040816327$
- Already we see that there are a lot more who quit in the treatment group.
- $1 - α = 0.99$
- $α = 0.01$
- $\frac{α}{2} = \frac{0.01}{2} = 0.005$
- $SE [X] = \sqrt{\frac{(0.16) (1 - 0.16)}{244} + \frac{(0.36) (1 - 0.36)}{245}} = 0.0386164192461763$
- $Z_{\frac{α}{2}} = 2.575829$
- $(0.36 - 0.16) + (2.575829) (0.0386164192461763) = 0.299469292570459$
- $(0.36 - 0.16) - (2.575829) (0.0386164192461763) = 0.100530707429541$
- There is a benefit. Because $0 \notin C$ , if $0 \in C$ then the difference is $0$ and there aren't benefits.
- You don't remove someone from another group if someone withdrew from the first.
- If you remove someone, then you need to start the experiment again from day 0.
  - You don't extend it to day 60 for the initial
  - You don't have the effects through 10 days with the new guy if you add one.
Comparing two population variances
- $\frac{\frac{S_{1}^{2}}{σ_{1}^{2}}}{\frac{S_{2}^{2}}{σ_{2}^{2}}} \sim F_{(n_{1} - 1, n_{2} - 2)}$
- $P (F_{(n_{1} - 1, n_{2} - 1; 1 - \frac{α}{2})} \leq \frac{σ_{1}^{2} S_{1}^{2}}{σ_{2}^{2} S_{2}^{2}} \leq F_{(n_{1} - 1, n_{2} - 2; α)}) = 1 - α$
- Compare the ratio of $\frac{σ_{1}^{2}}{σ_{2}^{2}}$
- $P (\frac{S_{1}^{2}}{S_{2}^{2}} \frac{1}{F_{(n_{1} - 1, n_{2} - 1; \frac{α}{2})}} \leq \frac{σ_{1}^{2}}{σ_{2}^{2}} \leq \frac{S_{1}^{2}}{S_{2}^{2}} \frac{1}{F_{(n_{1} - 1, n_{2} - 1; 1 - \frac{α}{2})}})$
- We know that $F_{(n_{1}, n_{2})} = \frac{1}{F_{(n_{2}, n_{1})}}$
- $P (\frac{S_{1}^{2}}{S_{2}^{2}} \frac{1}{F_{(n_{2} - 1, n_{1} - 1; 1 - \frac{α}{2})}} \leq \frac{σ_{1}^{2}}{σ_{2}^{2}} \leq \frac{S_{1}^{2}}{S_{2}^{2}} F_{(n_{2} - 1, n_{1} - 1; \frac{α}{2})}) = 1 - α$
- Example:
  - Managerial success
  - Index based on length of time in the org, and level within the term.
  - Compare group 1, high volume of interactions with people.
  - Group 2, rarely interact with people outside their work unit.
  - $μ_{G_{1}} = 65.3$
  - $μ_{G_{2}} = 49.5$
  - Is there a significant difference?
    - Already, yes.
    - However, we don't know about the index, so we don't know if $15$ points is or isn't really significant.
  - $(\frac{{6.61}^{2}}{{9.33}^{2} (3.09)}, \frac{{6.61}^{2}}{{9.33}^{2} (3.29)})$
  - $(0.16, 1.68)$
  - There is not a significant variance based on our $C I$ . Because $1 \in C I$
  - No evidence to show that the population variances of managerial success index is different.
  - $\frac{σ_{1}^{2}}{σ_{2}^{2}} = 1 ⟺ σ_{1}^{2} = σ_{2}^{2}$