Skip to main content
Back

Introduction to Statistics: Concepts, Data, and Types

Study Guide - Smart Notes

Tailored notes based on your materials, expanded with key definitions, examples, and context.

Chapter 1: Introduction to Statistics

Overview of Statistics

Statistics is a foundational discipline in data analysis, focusing on the collection, organization, analysis, and interpretation of data to inform decision-making. This chapter introduces the basic concepts and terminology essential for understanding statistics.

What is Data?

Data refers to a collection of information obtained from observations, counts, measurements, or responses. It forms the basis for statistical analysis and can be quantitative or qualitative.

  • Definition: Data is a list of information coming from observations, counts, measurements, or responses.

  • Examples:

    • "1 in 10 Americans believe the arts unify their community."

    • "2 in 5 Americans have changed an opinion or perception based on an arts experience."

    • "21% of 8–11 year-olds have a social media profile."

What is Statistics?

Statistics is the science of collecting, organizing, analyzing, and interpreting data to make decisions. It provides tools for understanding and drawing conclusions from data.

  • Definition: Statistics is the science of collecting, organizing, analyzing, and interpreting data to make decisions.

  • Applications: Used in fields such as business, healthcare, social sciences, and government to inform policy and practice.

Data Sets: Populations and Samples

Population

A population is the complete collection of all outcomes, responses, measurements, or counts that are of interest in a statistical study.

  • Definition: The entire group of individuals or items under consideration.

  • Example: All employees in the United States.

Sample

A sample is a subset, or part, of the population. Samples are used to make inferences about the population when it is impractical to collect data from every member.

  • Definition: A portion of the population selected for analysis.

  • Example: 834 employees surveyed out of all U.S. employees.

Example: Identifying Data Sets

In a recent survey, 834 employees in the United States were asked if they thought their jobs were highly stressful. Of the 834 respondents, 517 said yes.

  • Population: All employees in the United States.

  • Sample: The 834 employees who responded to the survey.

  • Sample Data Set: 517 "yes" responses and 317 "no" responses.

Parameters and Statistics

Parameter

A parameter is a numerical description of a population characteristic.

  • Definition: A value that describes a characteristic of the entire population.

    • Example: The average age of all people in the United States.

Statistic

A statistic is a numerical description of a sample characteristic.

  • Definition: A value that describes a characteristic of a sample.

  • Example: The average age of people from a sample of three states.

Examples: Distinguishing Parameters and Statistics

  • Sample Statistic: In the United States, a survey of about 9,400 individuals aged 15 and over found that such individuals spent an average of 5.19 hours per day engaged in leisure and sports activities. (Based on a subset of the population.)

  • Population Parameter: The freshman class at a university has an average SAT math score of 514. (Based on the entire freshman class.)

  • Sample Statistic: In a random check of several hundred retail stores, the Food and Drug Administration found that 34% of the stores were not storing fish at the proper temperature. (Based on a subset of all stores.)

Branches of Statistics

Branch

Description

Examples

Descriptive Statistics

Involves the organization, summarization, and display of data.

Tables, charts, averages

Inferential Statistics

Uses sample data to draw conclusions about a population.

Making predictions, testing hypotheses

Example: Descriptive and Inferential Statistics

  • Descriptive: A study of 1,502 U.S. adults found that 18% of adults from households earning less than $30,000 annually do not use the Internet.

  • Inferential: A possible inference is that the Internet has been made less accessible to lower-income households.

  • Descriptive: A study of 1,000 U.S. 401(k) retirement plan participants found that 32% do not know how many years their retirement savings might last.

  • Inferential: A possible inference is that the amount of money a person needs for retirement is difficult to determine.

Summary Table: Key Terms

Term

Definition

Example

Population

Entire group under study

All U.S. employees

Sample

Subset of the population

834 surveyed employees

Parameter

Numerical description of a population

Average SAT score of all freshmen

Statistic

Numerical description of a sample

Average SAT score from a sample

Descriptive Statistics

Summarizes and displays data

Percentage of Internet non-users

Inferential Statistics

Draws conclusions about populations

Inferring accessibility issues

Key Formulas

  • Sample Mean:

  • Population Mean:

Additional info: These notes expand on the brief points in the slides to provide full definitions, examples, and context for each concept, suitable for introductory statistics students.

Pearson Logo

Study Prep