Skip to main content
Back

Introduction to Statistics: Data, Variables, and Models

Study Guide - Smart Notes

Tailored notes based on your materials, expanded with key definitions, examples, and context.

Chapter 1: Introduction to Statistics

Section 1.1: What Is Statistics?

Statistics is the science of collecting, analyzing, interpreting, and presenting data. It helps us make sense of information in a world where data is everywhere and often varies.

  • Definition of Data: Any collection of numbers, characters, images, or other items that provide information about something.

  • Variation in Data: Data from surveys and experiments can produce a wide variety of outcomes.

  • Purpose of Statistics: To help us understand and interpret data, especially when it varies.

Example: Data Uses in Social Media

  • Platforms like Facebook collect data such as age, gender, education, and interests.

  • This data is used to personalize content and advertisements.

  • Companies value this information for targeted marketing.

Example: Texting While Driving

  • Question: Is texting while driving dangerous?

  • Observation: Texting has increased, but driving fatalities have decreased in recent years.

  • Study: University of Utah measured reaction times of sober, drunk, and texting drivers in simulated emergencies. Result: Texting drivers had the slowest reaction times.

  • Statistics helps us evaluate such questions using data and evidence.

Learning Outcomes in Statistics

  • Design and analyze experiments.

  • Interpret data and communicate results.

  • Identify deficiencies in conclusions from articles.

  • Become a more informed citizen.

Section 1.2: Data

Organizing Data

Proper organization and presentation of data are crucial for understanding and analysis.

  • Raw data can be difficult to interpret without organization.

  • Effective data presentation (tables, charts) makes interpretation easier.

The Five “W”s and One “H” of Data

  • Who: Describe the individuals or objects studied.

  • What: Determine what is being measured (variables).

  • When: When was the data collected?

  • Where: Where was the data collected?

  • Why: What was the purpose of the study?

  • How: How was the data collected?

Types of Individuals in Data

  • Respondents: Individuals who answer surveys (e.g., customers at Amazon).

  • Subjects/Participants: People in experiments (e.g., patients receiving medication).

  • Experimental Units: Non-human objects in experiments (e.g., rats in a maze).

  • Records: Rows in a database (e.g., purchase records).

Sample and Population

  • Population: The entire group of interest.

  • Sample: A subset of the population used to make inferences about the whole.

  • The sample should be representative of the population.

Think, Show, and Tell

  • Think: Decide what information you need.

  • Show: Display data professionally and accurately.

  • Tell: Interpret and communicate conclusions from the data.

Example: Identifying the Who

  • Consumer Reports evaluated tablets from various manufacturers.

  • Population: All tablets currently for sale.

  • Sample: 16 tablets tested.

  • Who: The 16 tablets tested represent all similar tablets by those manufacturers.

Section 1.3: Variables

Categorical Variables

Categorical variables indicate group membership and are also called nominal or qualitative variables.

  • Examples: Favorite color, country of birth, area code.

  • Drawback: Difficult to analyze with mathematical computations.

Quantitative Variables

Quantitative variables have measured numerical values with units, representing amounts or degrees.

  • Examples: Ounces, dollars, degrees Fahrenheit.

Categorical or Quantitative?

  • Some variables, like age, can be either categorical (e.g., child, teen, adult) or quantitative (numerical age).

  • How a variable is used or grouped determines its type.

Identifier Variables

Identifier variables uniquely identify individuals but do not describe them.

  • Examples: Login ID, customer number, transaction number, Social Security number.

Ordinal Variables

Ordinal variables report order without natural units. They can sometimes be treated as quantitative by assigning ranks.

  • Examples: Likert scale (Strongly Disagree, Disagree, Agree, Strongly Agree), Olympic medals (Gold, Silver, Bronze).

  • Rank Assignment: 1 = Strongly Disagree, 2 = Disagree, 3 = Agree, 4 = Strongly Agree.

Example: Identifying Variables in a Tablet Study

  • Manufacturer: Categorical

  • Price: Quantitative ($)

  • Battery life: Quantitative (hours)

  • Operating system: Categorical

  • Overall quality score: Quantitative (no units)

  • Memory card reader: Categorical

  • Why: To help consumers choose a good tablet.

Section 1.4: Models

Statistical Models

Models are simplified representations of reality used to understand and predict phenomena.

  • Examples: Model airplanes in wind tunnels, Kepler's Laws in astronomy.

  • Models help us test hypotheses and make predictions based on data.

What Can Go Wrong?

  • Do not label a variable as categorical or quantitative without considering the context.

  • Do not assume a variable is quantitative just because its values are numbers.

  • Always be skeptical and critically evaluate data and variables.

Table: Types of Variables

Type

Description

Examples

Categorical (Nominal)

Groups or categories with no inherent order

Color, country, area code

Ordinal

Ordered categories without fixed units

Likert scale, Olympic medals

Quantitative

Numerical values with units

Age, price, battery life

Identifier

Unique codes for individuals

Login ID, customer number

Additional info: The above notes expand on brief slide points to provide full academic context, definitions, and examples for each concept, as would be expected in a college-level statistics course.

Pearson Logo

Study Prep