Estimating Parameters and Determining Sample Sizes: Confidence Intervals for Proportions and Means

Study Guide - Smart Notes

Tailored notes based on your materials, expanded with key definitions, examples, and context.

Estimating Parameters and Determining Sample Sizes

Introduction

This chapter covers statistical methods for estimating population parameters, specifically proportions and means, using sample data. It also explains how to determine the sample size required for reliable estimation. The focus is on constructing and interpreting confidence intervals, understanding margin of error, and selecting appropriate distributions for inference.

Estimating a Population Proportion

Point Estimate

A point estimate is a single value used to estimate a population parameter. The sample proportion \( \hat{p} \) is the best point estimate of the population proportion \( p \).

Unbiased Estimator: \( \hat{p} \) is unbiased and most consistent for estimating \( p \).
Example: If 43% of 1487 adults in a poll have Facebook pages, the best point estimate of \( p \) is 0.43.

Definition of point estimate Unbiased estimator explanation

Confidence Interval

A confidence interval (CI) is a range of values used to estimate the true value of a population parameter. The confidence level (e.g., 95%) is the probability that the interval contains the parameter, assuming repeated sampling.

Correct Interpretation: "We are 95% confident that the interval contains the true value of \( p \)."
Process Success Rate: Over many samples, 95% of constructed intervals will contain the true \( p \).

Confidence intervals from multiple samples

Critical Values

A critical value is a number separating significant sample statistics from those that are not. For confidence intervals, the critical value \( z_{\alpha/2} \) is a z-score at the border of an area \( \alpha/2 \) in the right tail of the standard normal distribution.

For a 95% confidence level, \( \alpha = 0.05 \), so \( \alpha/2 = 0.025 \).
\( z_{\alpha/2} = 1.96 \) for 95% confidence.

Definition of critical value z_alpha/2 Normal distribution with critical values Normal distribution with confidence level and critical values Table of common critical values

Margin of Error

The margin of error (E) is the maximum likely difference between the sample proportion \( \hat{p} \) and the population proportion \( p \). It is calculated as:

Formula:
\( \hat{q} = 1 - \hat{p} \)

Margin of error explanation Margin of error formula for proportions

Confidence Interval for Population Proportion

To estimate \( p \), construct the confidence interval:

Formula:
Requirements: Simple random sample, binomial conditions, at least 5 successes and 5 failures.
Round interval limits to three significant digits.

Confidence interval formula for proportions

Procedure for Constructing a Confidence Interval for p

Verify requirements (random sample, binomial conditions, minimum successes/failures).
Find critical value \( z_{\alpha/2} \) for desired confidence level.
Calculate margin of error .
Compute interval limits: and .

Procedure step 2 and 3 for confidence interval Procedure step 4 for confidence interval

Example: Constructing a Confidence Interval

Given: 43% of 1487 adults have Facebook pages. Find the margin of error and 95% confidence interval.

Margin of error:
Confidence interval: →

Statdisk output for confidence interval Manual calculation of margin of error Manual calculation of confidence interval

Finding Point Estimate and Margin of Error from a Confidence Interval

Point estimate:
Margin of error:

Finding point estimate from interval Finding margin of error from interval

Determining Sample Size for Estimating a Population Proportion

To achieve a desired margin of error E at a given confidence level, use:

With estimate \( \hat{p} \):
Without estimate:
Round up to next whole number.

Sample size formula notation Sample size requirements Sample size formula for proportions

Example: Sample Size Calculation

With prior estimate \( \hat{p} = 0.80 \), \( \hat{q} = 0.20 \), \( z_{\alpha/2} = 1.96 \), \( E = 0.03 \):
Without prior estimate:

Sample size calculation with estimate Sample size calculation without estimate

Estimating a Population Mean

Point Estimate and Confidence Interval for Mean

The main goal is to use a sample mean \( \bar{x} \) to infer the population mean \( \mu \). Confidence intervals are constructed differently depending on whether the population standard deviation \( \sigma \) is known or unknown.

Sample data for confidence interval

Confidence Interval for Mean with Known σ

Margin of error:
Confidence interval:
Example: For \( \bar{x} = 30.9 \), \( \sigma = 2.9 \), \( n = 15 \), \( z_{\alpha/2} = 1.96 \):
Interval: (rounded to one decimal place)

Normality check for sample data Calculation of margin of error for mean Calculation of confidence interval for mean

Confidence Interval for Mean with Unknown σ

Use Student t distribution with degrees of freedom \( df = n - 1 \).
Margin of error:
Confidence interval:
Round limits to one more decimal place than original data.

Confidence interval formula for mean with unknown sigma

Student t Distribution

The Student t distribution is used when \( \sigma \) is unknown. It is similar to the normal distribution but has greater variability for small samples.

Mean of t distribution is 0.
Standard deviation varies with sample size and is greater than 1.
As sample size increases, t distribution approaches normal distribution.

Comparison of t and normal distributions

Procedure for Constructing a Confidence Interval for µ

Verify requirements: random sample, normal population or n > 30.
With unknown \( \sigma \), use t distribution and find critical value \( t_{\alpha/2} \).
Calculate margin of error .
Compute interval limits: and .

Procedure for confidence interval for mean

Finding Sample Size for Estimating a Population Mean

Required sample size:
If \( \sigma \) is unknown, estimate using range rule:
Round up to next whole number.

Sample size formula for mean

Example: IQ Scores of Statistics Students

Assume \( \sigma = 15 \), \( z_{\alpha/2} = 1.96 \), \( E = 3 \):
Interpretation: At least 97 students are needed for 95% confidence that the sample mean is within 3 points of the population mean.

Choosing an Appropriate Distribution

Student t vs. Normal (z) Distribution

Use z distribution when \( \sigma \) is known and sample size is large.
Use t distribution when \( \sigma \) is unknown or sample size is small.

Comparison of t and normal distributions

Summary Table: Common Critical Values

Confidence level	α	Critical Value, zα/2
90%	0.10	1.645
95%	0.05	1.96
99%	0.01	2.575

Table of common critical values

Key Terms and Notation

p: Population proportion
\( \hat{p} \): Sample proportion
n: Number of sample values
E: Margin of error
zα/2: Critical value for z distribution
tα/2: Critical value for t distribution
\( \bar{x} \): Sample mean
σ: Population standard deviation
s: Sample standard deviation