Problem 10.4.17
Testing Hypotheses About Regression Coefficients If the coefficient has a nonzero value, then it is helpful in predicting the value of the response variable. If it is not helpful in predicting the value of the response variable and can be eliminated from the regression equation. To test the claim that use the test statistic Critical values or P-values can be found using the t distribution with degrees of freedom, where k is the number of predictor variables and n is the number of observations in the sample. The standard error is often provided by software. For example, see the accompanying StatCrunch display for Example 1, which shows that (found in the column with the heading of “Std. Err.” and the row corresponding to the first predictor variable of height). Use the sample data in Data Set 1 “Body Data” and the StatCrunch display to test the claim that Also test the claim that What do the results imply about the regression equation?
Problem 10.5.14
Finding the Best Model
In Exercises 5–16, construct a scatterplot and identify the mathematical model that best fits the given data. Assume that the model is to be used only for the scope of the given data, and consider only linear, quadratic, logarithmic, exponential, and power models.
Sunspot Numbers Listed below in order by row are annual sunspot numbers beginning with 1980. Is the best model a good model? Carefully examine the scatterplot and identify the pattern of the points. Which of the models fits that pattern?
[IMAGE]
Problem 10.5.18
Sum of Squares Criterion In addition to the value of another measurement used to assess the quality of a model is the sum of squares of the residuals. Recall from Section 10-2 that a residual is (the difference between an observed y value and the value predicted from the model). Better models have smaller sums of squares. Refer to the U.S. population data in Table 10-7.
a. Find the sum of squares of the residuals resulting from the linear model.
Problem 10.5.4
Interpreting a Graph The accompanying graph plots the numbers of points scored in each Super Bowl from the first Super Bowl in 1967 (coded as year 1) to the last Super Bowl at the time of this writing. The graph of the quadratic equation that best fits the data is also shown in red. What feature of the graph justifies the value of R^2 = 0.205 for the quadratic model?
Problem 10.5.6
Finding the Best Model
In Exercises 5–16, construct a scatterplot and identify the mathematical model that best fits the given data. Assume that the model is to be used only for the scope of the given data, and consider only linear, quadratic, logarithmic, exponential, and power models.
Dirt Cheap The Cherry Hill Construction company in Branford, CT sells screened topsoil by the “yard,” which is actually a cubic yard. Let the variable x be the length (yd) of each side of a cube of screened topsoil. The table below lists the values of x along with the corresponding cost (dollars).
Problem 10.5.8
Finding the Best Model
In Exercises 5–16, construct a scatterplot and identify the mathematical model that best fits the given data. Assume that the model is to be used only for the scope of the given data, and consider only linear, quadratic, logarithmic, exponential, and power models.
Sound Intensity The table lists intensities of sounds as multiples of a basic reference sound. A scale similar to the decibel scale is used to measure the sound intensity.
Problem 10.5.10
Finding the Best Model
In Exercises 5–16, construct a scatterplot and identify the mathematical model that best fits the given data. Assume that the model is to be used only for the scope of the given data, and consider only linear, quadratic, logarithmic, exponential, and power models.
Deaths from Motor Vehicle Crashes Listed below are the numbers of deaths in the United States resulting from motor vehicle crashes. Use the best model to find the projected number of such deaths for the year 2025.
Problem 10.5.18c
Sum of Squares Criterion In addition to the value of another measurement used to assess the quality of a model is the sum of squares of the residuals. Recall from Section 10-2 that a residual is (the difference between an observed y value and the value predicted from the model). Better models have smaller sums of squares. Refer to the U.S. population data in Table 10-7.
c. Verify that according to the sum of squares criterion, the quadratic model is better than the linear model.
Problem 10.5.13
Finding the Best Model
In Exercises 5–16, construct a scatterplot and identify the mathematical model that best fits the given data. Assume that the model is to be used only for the scope of the given data, and consider only linear, quadratic, logarithmic, exponential, and power models.
Stock Market Listed below in order by row are the annual high values of the Dow Jones Industrial Average for each year beginning with 2000. Find the best model and then predict the value for the last year listed. Is the predicted value close to the actual value of 26,828.4?
Problem 10.5.15
Finding the Best Model
In Exercises 5–16, construct a scatterplot and identify the mathematical model that best fits the given data. Assume that the model is to be used only for the scope of the given data, and consider only linear, quadratic, logarithmic, exponential, and power models.
Earthquakes Listed below are earthquake depths (km) and magnitudes (Richter scale) of different earthquakes. Find the best model and then predict the magnitude for the last earthquake with a depth of 3.78 km. Is the predicted value close to the actual magnitude of 7.1?
Problem 10.5.16
Finding the Best Model
In Exercises 5–16, construct a scatterplot and identify the mathematical model that best fits the given data. Assume that the model is to be used only for the scope of the given data, and consider only linear, quadratic, logarithmic, exponential, and power models.
Global Warming Listed below are mean annual temperatures (°C) of the earth for each decade, beginning with the decade of the 1880s. Find the best model and then predict the value for 2090–2099. Comment on the result.
Problem 10.5.17
Moore’s Law In 1965, Intel cofounder Gordon Moore initiated what has since become known as Moore’s law: The number of transistors per square inch on integrated circuits will double approximately every 18 months. In the table below, the first row lists different years and the second row lists the number of transistors (in thousands) for different years.
Ignoring the listed data and assuming that Moore’s law is correct and transistors per square inch double every 18 months, which mathematical model best describes this law: linear, quadratic, logarithmic, exponential, power? What specific function describes Moore’s law?
Which mathematical model best fits the listed sample data?
Compare the results from parts (a) and (b). Does Moore’s law appear to be working reasonably well?
Problem 10.5.7
Finding the Best Model
In Exercises 5–16, construct a scatterplot and identify the mathematical model that best fits the given data. Assume that the model is to be used only for the scope of the given data, and consider only linear, quadratic, logarithmic, exponential, and power models.
CD Yields The table lists the value y (in dollars) of $1000 deposited in a certificate of deposit at Bank of New York (based on rates currently in effect).
Problem 10.5.5
Finding the Best Model
In Exercises 5–16, construct a scatterplot and identify the mathematical model that best fits the given data. Assume that the model is to be used only for the scope of the given data, and consider only linear, quadratic, logarithmic, exponential, and power models.
Landing on the Moon When the Apollo spacecraft landed on the Moon, the rocket engine would typically cut off at about 1.3 meters above the surface so that hot gases and dust and other surface materials would not cause damage. The landing module was in freefall starting at about 1 meter above the surface. The table below lists the time t (seconds) after being dropped and the distance d (meters) travelled by an object dropped near the surface of the Moon.
Problem 10.5.2
Super Bowl and R^2 Let x represent years coded as 1,1,3,... for years starting in 1980, and let y represent the numbers of points scored in each annual Super Bowl beginning in 1980. Using the data from 1980 to the last Super Bowl at the time of this writing, we obtain the following values of R^2 for the different models: linear: 0.008; quadratic: 0.023; logarithmic: 0.0004; exponential: 0.027; power: 0.007. Based on these results, which model is best? Is the best model a good model? What do the results suggest about predicting the number of points scored in a future Super Bowl game?
Problem 10.5.9
Finding the Best Model
In Exercises 5–16, construct a scatterplot and identify the mathematical model that best fits the given data. Assume that the model is to be used only for the scope of the given data, and consider only linear, quadratic, logarithmic, exponential, and power models.
Population Growth Here are the values of the world population (billions) beginning with the year 2000:
Problem 10.5.11
Finding the Best Model
In Exercises 5–16, construct a scatterplot and identify the mathematical model that best fits the given data. Assume that the model is to be used only for the scope of the given data, and consider only linear, quadratic, logarithmic, exponential, and power models.
Richter Scale The table lists different amounts (metric tons) of the explosive TNT and the corresponding value measured on the Richter scale resulting from explosions of the TNT.
Problem 10.5.12
Finding the Best Model
In Exercises 5–16, construct a scatterplot and identify the mathematical model that best fits the given data. Assume that the model is to be used only for the scope of the given data, and consider only linear, quadratic, logarithmic, exponential, and power models.
Detecting Fraud Leading digits of check amounts are often analyzed for the purpose of detecting fraud. The accompanying table lists frequencies of leading digits from checks written by the author (an honest guy).
Problem 12.CR.2
Comparing Two Means Treating the data as samples from larger populations, test the claim that there is a significant difference between the mean of presidents and the mean of popes.
Problem 13
Testing for a Linear Correlation
In Exercises 13–28, construct a scatterplot, and find the value of the linear correlation coefficient r. Also find the P-value or the critical values of r from Table A-6. Use a significance level of α = 0.05. Determine whether there is sufficient evidence to support a claim of a linear correlation between the two variables. (Save your work because the same data sets will be used in Section 10-2 exercises.)
Powerball Jackpots and Tickets Sold Listed below are the same data from Table 10-1 in the Chapter Problem, but an additional pair of values has been added in the last column. Is there sufficient evidence to conclude that there is a linear correlation between lottery jackpot amounts and numbers of tickets sold? Comment on the effect of the added pair of values in the last column. Compare the results to those obtained in Example 4.
[IMAGE]
Ch. 10 - Correlation and Regression
