STA258 Lecture 12
-
Lecture Summary
- Central objective: Introduce the framework of Hypothesis Testing, evaluating decision rules through error probabilities, and establishing connections between test statistics, confidence intervals, and p-values.
- Key concepts:
- Hypothesis Testing: Contrasts a null hypothesis (H₀) against an alternative hypothesis (H₁), differentiating between simple (singleton) and composite hypotheses.
- Type I Error and Type II Error: α (Type I) is the probability of rejecting a true H₀, while β (Type II) is the probability of failing to reject a false H₀, demonstrating a seesaw trade-off where researchers generally prioritize minimizing α.
- Critical Region: The subset of the sample space that triggers the rejection of H₀, mathematically driven by bounding α below a given threshold.
- P-value: The probability of observing a test statistic at least as extreme as the one computed, assuming H₀ is true, acting as a continuous metric for evidence against the null.
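The error probabilities and the p-value defined above can be computed directly for a concrete test. A minimal sketch, assuming an illustrative one-sided z-test (the values μ₁ = 1, σ = 1, n = 25, and the observed statistic 2.1 are made up, not from the notes):

```python
import math

def normal_cdf(x: float) -> float:
    """Standard normal CDF via the error function."""
    return 0.5 * (1.0 + math.erf(x / math.sqrt(2.0)))

# Assumed setup (illustrative): one-sided z-test of H0: mu = 0 vs
# H1: mu = 1 with known sigma = 1 and n = 25, rejecting H0 when the
# z statistic is at least z_crit.
n, sigma, mu1 = 25, 1.0, 1.0
z_crit = 1.6449  # ~95th percentile of N(0, 1), so alpha ~ 0.05

# Type I error: P(reject | H0 true) = P(Z >= z_crit), Z ~ N(0, 1).
alpha = 1.0 - normal_cdf(z_crit)

# Type II error: under H1 the z statistic is N(mu1*sqrt(n)/sigma, 1),
# so beta = P(z statistic < z_crit | H1 true).
beta = normal_cdf(z_crit - mu1 * math.sqrt(n) / sigma)

# P-value for a hypothetical observed statistic z_obs = 2.1: the
# probability, under H0, of a statistic at least this extreme.
p_value = 1.0 - normal_cdf(2.1)

print(f"alpha ~ {alpha:.4f}, beta ~ {beta:.4f}, p-value ~ {p_value:.4f}")
```

Note how small β is here: with n = 25 the null and alternative sampling distributions barely overlap, so this example is a best case; shrinking n or the gap between hypotheses inflates β.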
- Connections:
- Two-sided hypothesis tests inherently link to Confidence Intervals; failing to reject H₀ at level α is mathematically equivalent to the null parameter value falling inside the (1 − α) confidence interval for the mean.
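This duality can be checked numerically. A minimal sketch, assuming a known-σ two-sided z-test at roughly the 5% level (μ₀ = 10, σ = 2, n = 30, and the sample means are made-up illustrative numbers):

```python
import math

def two_sided_z_test_and_ci(xbar, mu0, sigma, n, z=1.96):
    """Two-sided z-test at alpha ~ 0.05 and the matching ~95% CI.

    Returns (reject, mu0_inside_ci); the duality says these are
    always opposite decisions.
    """
    se = sigma / math.sqrt(n)
    z_stat = (xbar - mu0) / se
    reject = abs(z_stat) > z
    ci = (xbar - z * se, xbar + z * se)
    inside = ci[0] <= mu0 <= ci[1]
    return reject, inside

# Hypothetical sample means; the equivalence holds for any inputs.
for xbar in (9.5, 10.0, 10.8):
    reject, inside = two_sided_z_test_and_ci(xbar, mu0=10.0, sigma=2.0, n=30)
    print(xbar, reject, inside)
    assert reject != inside  # fail to reject  <=>  mu0 in the CI
```

The assertion inside the loop is the duality itself: the test rejects exactly when μ₀ falls outside the interval, because both decisions compare the same standardized distance to the same critical value.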
-
Practice Questions
- Remember/Understand:
- What is the fundamental difference between a simple hypothesis and a composite hypothesis?
- Define a Type I error and a Type II error in plain language.
- How is the significance level α related to the critical region?
- Apply/Analyze:
- If a researcher decreases the significance level α while the sample size remains constant, what is the expected impact on the probability of a Type II error (β)?
- Given a binomial model, construct a rejection region to test H₀ against H₁ while ensuring that α stays below a specified cap.
- Explain why a two-sided hypothesis test's decision boundary aligns perfectly with the boundaries of a confidence interval.
- Evaluate/Create:
- Critically evaluate the standard practice of rigidly adhering to a fixed α as a rejection threshold; what issues arise when a p-value falls just below versus just above the cutoff in applied fields like finance?
- Propose a scenario where a Type II error is vastly more dangerous than a Type I error, and justify how you would adjust the testing framework to accommodate this priority.
-
Challenging Concepts
- Type I Error vs Type II Error (The Seesaw Effect):
- Why it's challenging: Balancing the minimization of both errors mathematically requires understanding that constraining one tail probability directly widens the non-rejection region, inherently inflating the complementary error risk under the alternative distribution.
- Study strategy: Draw overlapping probability density curves (one for H₀, one for H₁). Shade the areas representing α and β to visually trace how moving the critical value line shifts the balance between the two regions.
- Translating binomial parameters into optimal critical regions:
- Why it's challenging: Unlike continuous z or t tests, discrete distributions like the binomial do not allow α to be matched exactly (e.g., exactly 0.05), requiring analysts to map out probability tables and manually minimize β while keeping α below a strict cap.
- Study strategy: Re-calculate the binomial example from the notes entirely by hand, calculating α and β for every possible valid critical region to solidify the mechanical process of optimization.
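The table-mapping process described above can also be mechanized as a check on hand calculations. A sketch under assumed numbers (n = 20, H₀: p = 0.5, H₁: p = 0.7, upper-tail regions {X ≥ c}; not necessarily the exact example from the notes): it finds the largest upper-tail region whose α stays under the cap, then reads off β under the alternative.

```python
from math import comb

def binom_pmf(k: int, n: int, p: float) -> float:
    """P(X = k) for X ~ Binomial(n, p)."""
    return comb(n, k) * p**k * (1 - p) ** (n - k)

# Assumed illustrative setup: X ~ Binomial(20, p), testing
# H0: p = 0.5 vs H1: p = 0.7 with rejection regions {X >= c}.
n, p0, p1, cap = 20, 0.5, 0.7, 0.05

# Discreteness means no c gives alpha exactly 0.05, so scan for the
# smallest c (largest region) whose alpha stays at or below the cap.
for c in range(n + 1):
    alpha = sum(binom_pmf(k, n, p0) for k in range(c, n + 1))
    if alpha <= cap:
        beta = sum(binom_pmf(k, n, p1) for k in range(0, c))
        print(f"c = {c}: alpha = {alpha:.4f}, beta = {beta:.4f}")
        break
```

With these assumed numbers the scan stops at c = 15, where α ≈ 0.021 (well under the cap, because the jump from c = 14 overshoots 0.05) while β is large, illustrating the seesaw effect in a discrete setting.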
-
Action Plan
- Immediate review actions:
- Practice and application:
- Deep dive study:
- Verification and integration: