Skip to main content
Ch. 10 - Correlation and Regression
Triola - Elementary Statistics 14th Edition
Triola14th EditionElementary StatisticsISBN: 9780137366446Not the one you use?Change textbook
Chapter 10, Problem 10.5.18

Sum of Squares Criterion In addition to the value of another measurement used to assess the quality of a model is the sum of squares of the residuals. Recall from Section 10-2 that a residual is (the difference between an observed y value and the value predicted from the model). Better models have smaller sums of squares. Refer to the U.S. population data in Table 10-7.
a. Find the sum of squares of the residuals resulting from the linear model.

Verified step by step guidance
1
Step 1: Understand the concept of residuals. A residual is the difference between the observed value (y) and the predicted value (ŷ) from the model. Mathematically, residuals are calculated as: e=y-ŷ.
Step 2: Refer to the U.S. population data in Table 10-7.a. Identify the observed values (y) and the predicted values (ŷ) from the linear model provided in the table.
Step 3: For each data point, calculate the residual by subtracting the predicted value (ŷ) from the observed value (y). Use the formula: e=y-ŷ.
Step 4: Square each residual to eliminate negative values and emphasize larger deviations. Use the formula: e2.
Step 5: Sum all the squared residuals to find the sum of squares of the residuals. Use the formula: e2. This value represents the total deviation of the observed data from the model predictions.

Verified video answer for a similar problem:

This video solution was recommended by our tutors as helpful for the problem above.
Video duration:
3m
Was this helpful?

Key Concepts

Here are the essential concepts you must grasp in order to answer the question correctly.

Residuals

Residuals are the differences between observed values and the values predicted by a statistical model. In regression analysis, they indicate how well the model fits the data; smaller residuals suggest a better fit. Understanding residuals is crucial for evaluating model performance and diagnosing potential issues in the model.

Sum of Squares of Residuals (SSR)

The Sum of Squares of Residuals (SSR) quantifies the total deviation of the observed values from the predicted values in a regression model. It is calculated by squaring each residual and summing these squared values. A lower SSR indicates a better-fitting model, as it reflects less unexplained variability in the data.
Recommended video:
Guided course
03:28
Mean & Standard Deviation of Binomial Distribution

Model Quality Assessment

Model quality assessment involves evaluating how well a statistical model represents the data it is intended to explain. This can be done using various metrics, including the sum of squares of residuals, R-squared values, and other diagnostic tools. A thorough assessment helps in selecting the most appropriate model for the data and ensuring reliable predictions.
Recommended video:
Guided course
04:24
Types of Data
Related Practice
Textbook Question

Finding the Best Model

In Exercises 5–16, construct a scatterplot and identify the mathematical model that best fits the given data. Assume that the model is to be used only for the scope of the given data, and consider only linear, quadratic, logarithmic, exponential, and power models.

Detecting Fraud Leading digits of check amounts are often analyzed for the purpose of detecting fraud. The accompanying table lists frequencies of leading digits from checks written by the author (an honest guy).

26
views
Textbook Question

Finding the Best Model

In Exercises 5–16, construct a scatterplot and identify the mathematical model that best fits the given data. Assume that the model is to be used only for the scope of the given data, and consider only linear, quadratic, logarithmic, exponential, and power models.

Sunspot Numbers Listed below in order by row are annual sunspot numbers beginning with 1980. Is the best model a good model? Carefully examine the scatterplot and identify the pattern of the points. Which of the models fits that pattern?

[IMAGE]

176
views
Textbook Question

Interpreting a Graph The accompanying graph plots the numbers of points scored in each Super Bowl from the first Super Bowl in 1967 (coded as year 1) to the last Super Bowl at the time of this writing. The graph of the quadratic equation that best fits the data is also shown in red. What feature of the graph justifies the value of R^2 = 0.205 for the quadratic model?

154
views
Textbook Question

Finding the Best Model

In Exercises 5–16, construct a scatterplot and identify the mathematical model that best fits the given data. Assume that the model is to be used only for the scope of the given data, and consider only linear, quadratic, logarithmic, exponential, and power models.

Population Growth Here are the values of the world population (billions) beginning with the year 2000:

22
views
Textbook Question

Testing Hypotheses About Regression Coefficients If the coefficient has a nonzero value, then it is helpful in predicting the value of the response variable. If it is not helpful in predicting the value of the response variable and can be eliminated from the regression equation. To test the claim that use the test statistic Critical values or P-values can be found using the t distribution with degrees of freedom, where k is the number of predictor variables and n is the number of observations in the sample. The standard error is often provided by software. For example, see the accompanying StatCrunch display for Example 1, which shows that (found in the column with the heading of “Std. Err.” and the row corresponding to the first predictor variable of height). Use the sample data in Data Set 1 “Body Data” and the StatCrunch display to test the claim that Also test the claim that What do the results imply about the regression equation?


" style="" width="600">

198
views
Textbook Question

Finding the Best Model

In Exercises 5–16, construct a scatterplot and identify the mathematical model that best fits the given data. Assume that the model is to be used only for the scope of the given data, and consider only linear, quadratic, logarithmic, exponential, and power models.

Earthquakes Listed below are earthquake depths (km) and magnitudes (Richter scale) of different earthquakes. Find the best model and then predict the magnitude for the last earthquake with a depth of 3.78 km. Is the predicted value close to the actual magnitude of 7.1?

24
views