Skip to main content
Back

Unit 2 Exam Review – Step-by-Step Statistics Guidance

Study Guide - Smart Notes

Tailored notes based on your materials, expanded with key definitions, examples, and context.

Q1. Repair bills: Outlier Detection and Interpretation

Background

Topic: Descriptive Statistics & Outlier Detection

This question tests your understanding of how to identify outliers using summary statistics (such as quartiles and the interquartile range) and how to interpret the meaning of a value in the context of the data distribution.

Key Terms and Formulas

  • Quartiles (Q1, Q3): Values that divide the data into quarters.

  • Interquartile Range (IQR):

  • Outlier Rule: A value is an outlier if it is below or above .

Step-by-Step Guidance

  1. Calculate the IQR using the given quartiles: .

  2. Compute the lower and upper bounds for outliers:

  3. Compare the minimum and maximum values to these bounds to determine if any are outliers.

Try solving on your own before revealing the answer!

Q2. Salary conversions: Linear Transformations of Data

Background

Topic: Linear Transformations of Random Variables

This question examines how summary statistics (mean, median, standard deviation, IQR) change when data are transformed by multiplying and/or adding constants.

Key Terms and Formulas

  • Linear Transformation:

  • Effect on Mean/Median:

  • Effect on Standard Deviation/IQR: (adding does not affect SD or IQR)

  • Z-score:

Step-by-Step Guidance

  1. For each statistic, apply the transformation: multiply by 8 and add 200 (where appropriate).

  2. Remember: Only the mean and median are affected by addition; SD and IQR are only affected by multiplication.

  3. For the z-score, recall that linear transformations do not change z-scores if all data are transformed the same way.

Try solving on your own before revealing the answer!

Q3. Subaru costs: Regression Model Interpretation

Background

Topic: Linear Regression and Residual Analysis

This question asks you to interpret the fit and meaning of a regression model, including the slope and residuals, and to use the model for prediction.

Key Terms and Formulas

  • Regression Equation:

  • Coefficient of Determination (): Measures the proportion of variance explained by the model.

  • Residual:

Step-by-Step Guidance

  1. Examine the value to assess how well the model explains price variation.

  2. Interpret the slope: For each additional mile, the price decreases by the slope value.

  3. To predict the price for 42,000 miles, substitute into the regression equation:

  4. Calculate the product and subtract from 15,327 to get the predicted price.

Residual plot for Subaru regression model

Try solving on your own before revealing the answer!

Q4. Penicillin assimilation: Re-expression and Modeling

Background

Topic: Data Transformation and Nonlinear Regression

This question tests your ability to straighten a nonlinear relationship (e.g., exponential decay) by re-expressing data, and then use the model for prediction.

Key Terms and Formulas

  • Exponential Decay Model: or

  • Re-expression: Taking the logarithm of the concentration to linearize the relationship with time.

Step-by-Step Guidance

  1. Take the logarithm of the concentration values to linearize the data.

  2. Fit a linear regression model to the transformed data (log concentration vs. time).

  3. Use the regression equation to predict the log concentration at 8 hours, then exponentiate to return to the original scale.

Try solving on your own before revealing the answer!

Q5. Maple trees: Scatterplots, Residuals, and Model Selection

Background

Topic: Regression Diagnostics and Model Improvement

This question asks you to interpret scatterplots and residual plots, assess model appropriateness, and compare linear and transformed models.

Key Terms and Formulas

  • Scatterplot: Visualizes the relationship between two variables.

  • Residual Plot: Shows the pattern of residuals to assess model fit.

  • Log Transformation: Used to linearize nonlinear relationships.

  • Regression Equation (log model):

Step-by-Step Guidance

  1. Describe the association in the scatterplot (direction, form, strength, outliers).

  2. Examine the residual plot for patterns; randomness suggests a good fit.

  3. Discuss whether the linear model is appropriate based on the residuals.

  4. For the log-transformed model, explain why the fit is improved (e.g., higher , more random residuals).

  5. To predict the diameter at age 50, calculate and substitute into the regression equation, then solve for diameter.

Scatterplot and residuals for linear model of maple tree diameter Regression output and residuals for log-transformed model of maple tree diameter

Try solving on your own before revealing the answer!

Q6. Traffic accidents: Probability and Independence

Background

Topic: Probability Rules and Independence

This question tests your ability to use basic probability rules (addition, multiplication, complements) and to assess independence between events.

Key Terms and Formulas

  • Probability of Union:

  • Probability of Neither:

  • Independence: if A and B are independent

Step-by-Step Guidance

  1. Calculate using the given probabilities.

  2. Find the probability of neither event by taking the complement.

  3. Check if to assess independence.

Try solving on your own before revealing the answer!

Q7. SAT prep: Conditional Probability

Background

Topic: Conditional Probability and Bayes' Theorem

This question asks you to find the probability that a student did not take an SAT prep course, given that they were admitted to their first choice college.

Key Terms and Formulas

  • Conditional Probability:

  • Bayes' Theorem:

Step-by-Step Guidance

  1. Define the relevant events (e.g., took prep, admitted).

  2. Calculate the probability of being admitted for both groups (prep and non-prep).

  3. Use the law of total probability to find .

  4. Apply Bayes' theorem to find the probability that a student did not take the prep course, given admission.

Try solving on your own before revealing the answer!

Q8. Bowling: Mean and Standard Deviation of Differences

Background

Topic: Mean and Standard Deviation of Sums/Differences of Random Variables

This question tests your understanding of how to find the mean and standard deviation of the difference between two independent random variables.

Key Terms and Formulas

  • Mean of Difference:

  • SD of Difference (independent):

Step-by-Step Guidance

  1. Subtract the mean women's score from the mean men's score to find the expected difference.

  2. Calculate the standard deviation of the difference using the formula for independent variables.

  3. State the assumption of independence between men's and women's scores.

Try solving on your own before revealing the answer!

Q9. Smoking: Probability Models and Binomial Distribution

Background

Topic: Binomial Probability Model

This question asks you to apply the binomial model to a real-world context, including expected value, standard deviation, and probability calculations.

Key Terms and Formulas

  • Geometric Distribution (for part b):

  • Binomial Model:

  • Mean:

  • Standard Deviation:

Step-by-Step Guidance

  1. For part b, use the expected value formula for the geometric distribution.

  2. For part c, specify the binomial model and calculate mean and SD using the given parameters.

  3. For part d, use the binomial probability formula to find .

Try solving on your own before revealing the answer!

Q10. Strange dice: Probability Model, Mean, and SD

Background

Topic: Probability Distributions for Discrete Random Variables

This question asks you to construct a probability model for the sum of two non-standard dice, and to calculate the mean and standard deviation of the total.

Key Terms and Formulas

  • Probability Model: List all possible outcomes and their probabilities.

  • Mean (Expected Value):

  • Standard Deviation:

Step-by-Step Guidance

  1. List all possible sums and their probabilities by considering all combinations of Die A and Die B outcomes.

  2. Calculate the expected value (mean) using the probability model.

  3. Calculate the standard deviation using the formula for discrete random variables.

Try solving on your own before revealing the answer!

Q11. Still strange dice: Choosing the Best Die

Background

Topic: Probability and Strategy

This question asks you to use your probability model from the previous question to decide which die gives you a better chance of winning in a head-to-head competition.

Key Terms and Formulas

  • Probability of Winning: Calculate and based on the probability model.

Step-by-Step Guidance

  1. For each possible outcome, compare the values of Die A and Die B to determine which is higher.

  2. Sum the probabilities for all outcomes where one die beats the other.

  3. Choose the die with the higher probability of winning and explain your reasoning.

Try solving on your own before revealing the answer!

Pearson Logo

Study Prep