Midterm 2 Study Guide: Probability, Distributions, and Hypothesis Testing

Study Guide - Smart Notes

Tailored notes based on your materials, expanded with key definitions, examples, and context.

Probability Distributions

Identifying Probability Distributions

A probability distribution describes how probabilities are distributed over the values of a random variable. To determine if a distribution is a probability distribution:

All probabilities must be between 0 and 1.
The sum of all probabilities must equal 1.
Each outcome must be mutually exclusive.

Example: For a random variable X with possible values 1, 2, 3 and probabilities 0.2, 0.5, 0.3, check that 0.2 + 0.5 + 0.3 = 1.

Binomial Distribution

Mean and Variance of Binomial Distribution

The binomial distribution models the number of successes in a fixed number of independent trials, each with the same probability of success.

Mean:
Variance:

Example: For n = 10 trials and p = 0.3, mean is , variance is .

Identifying Binomial Experiments

An experiment is binomial if:

There are a fixed number of trials (n).
Each trial has two possible outcomes (success or failure).
The probability of success (p) is constant for each trial.
Trials are independent.

Example: Flipping a coin 5 times and counting heads.

Normal Distribution

Standard Normal Distribution and Areas

The standard normal distribution is a normal distribution with mean 0 and standard deviation 1. Areas under the curve represent probabilities.

To find the area to the left of a z-score, use standard normal tables.
To find the area between two z-scores, subtract the area to the left of the lower z from the area to the left of the higher z.

Example: Area between z = -1 and z = 1 is approximately 0.6826.

Finding Probabilities for Normal Random Variables

For a normal random variable X with mean and standard deviation :

Convert X to a z-score:
Use z-tables to find probabilities.

Example: If , , find probability X < 60: .

Percentiles and Z-Scores

The z-score corresponding to a percentile is found using z-tables or inverse normal functions.

Percentile is the area to the left of the z-score.
Find z for a given percentile using tables or calculator.

Example: The 90th percentile corresponds to z ≈ 1.28.

Comparing Values Using Z-Scores

Z-scores allow comparison of values from different distributions:

Higher z-score = value is further above the mean.
Lower z-score = value is further below the mean.

Example: Compare test scores from different classes by converting to z-scores.

Sampling Distributions and Central Limit Theorem (CLT)

Mean and Standard Deviation of Sampling Distribution

The sampling distribution of the sample mean has:

Mean:
Standard deviation:

Example: If , , , then .

Finding Probabilities for Samples (z and t statistics)

Use z or t statistics to find probabilities for sample means:

Use z if population standard deviation is known and sample size is large.
Use t if population standard deviation is unknown and sample size is small.

Confidence Intervals

Margin of Error

The margin of error for a confidence interval is:

(for known )
(for unknown )

Example: For , , , .

Confidence Interval for Population Mean

A confidence interval estimates the population mean:

For known :
For unknown :

Example: , , , , interval is .

Sample Size Calculation

To find the required sample size for a given margin of error:

Example: , , , (round up to 97).

Hypothesis Testing

Normal vs. t Distribution

Choose the appropriate distribution:

Normal (z): Population standard deviation known, large sample size.
t: Population standard deviation unknown, small sample size.

Conditions for Hypothesis Tests

Each test has specific conditions:

1-sample z: Random sample, normal population or n > 30, known.
1-sample t: Random sample, normal population or n > 30, unknown.
2-sample z: Two independent samples, known.
2 independent t: Two independent samples, unknown.
2 dependent t: Paired samples.

Type I and Type II Errors

Errors in hypothesis testing:

Type I error: Rejecting a true null hypothesis (false positive).
Type II error: Failing to reject a false null hypothesis (false negative).

Example: Type I: Concluding a drug works when it does not. Type II: Concluding a drug does not work when it does.

One-Tailed vs. Two-Tailed Tests

Determine the direction of the test:

One-tailed: Tests for an effect in one direction (e.g., greater than).
Two-tailed: Tests for an effect in both directions (e.g., not equal).

Example: , is two-tailed.

Rejecting or Failing to Reject the Null Hypothesis

Decisions are based on p-value and significance level ():

If p-value < , reject .
If p-value > , fail to reject .

Alternatively, use confidence intervals:

If the confidence interval does not contain the hypothesized value, reject .

Critical Values and Rejection Regions

Critical values define the boundaries of the rejection region:

For z-tests, use or .
For t-tests, use or .

Example: For , two-tailed z-test, critical values are .

Dependent vs. Independent Samples

Samples are:

Independent: No relationship between samples.
Dependent: Samples are paired or matched (e.g., before and after measurements).

Testing Claims: One Sample Hypothesis Test

Steps for a one-sample hypothesis test:

State hypotheses (, ).
Choose significance level ().
Calculate test statistic (z or t).
Find p-value or compare to critical value.
Make decision: reject or fail to reject .
Interpret the result in context.

Example: Test if the mean weight is 150 lbs using sample data.

Summary Table: Hypothesis Test Types

Test Type	Sample(s)	Population Std. Dev. Known?	Distribution
1-sample z	1	Yes	Normal (z)
1-sample t	1	No	t
2-sample z	2	Yes	Normal (z)
2 independent t	2	No	t
2 dependent t	2 (paired)	No	t

Additional info: This table summarizes the main types of hypothesis tests covered in the study guide.