BackChapter 1: Introduction to Statistics – Structured Study Notes
Study Guide - Smart Notes
Tailored notes based on your materials, expanded with key definitions, examples, and context.
1.1 Statistical and Critical Thinking
The Core Goal of Statistics
Statistics is the science of planning studies and experiments, obtaining data, organizing, summarizing, presenting, analyzing, interpreting, and drawing conclusions based on that data. Because populations are typically too large to measure completely, we use samples to infer information about the whole.
Population: The complete collection of all measurements or data being considered.
Census: The collection of data from every member of the population.
Sample: A subset of members selected from a population.
Variable: A characteristic, number, or quantity that can be measured or counted for individuals in the study.
Descriptive vs. Inferential Statistics
Branch | Focus | Example |
|---|---|---|
Descriptive Statistics | Organizing, summarizing, and presenting raw data. | "34 out of 50 (68%) sampled students said they would return a dropped $100 bill." |
Inferential Statistics | Taking sample results, extending them to the population, and measuring reliability. | "We are 95% confident that the true proportion of all college students who would return the money is between 64% and 72%." |
The Statistical Study Process
Prepare
Context: What do the data represent? What is the study's goal?
Source of Data: Is the source unbiased?
Sampling Method: Was the sample collected objectively? Beware of voluntary response samples, which are prone to bias.
Analyze
Graph and explore data distributions.
Identify outliers, missing data, or high nonresponse rates.
Apply technological tools for calculations.
Conclude
Statistical Significance: The result is highly unlikely to occur by random chance (commonly, less than 5% probability).
Practical Significance: The effect is large enough to be meaningful in the real world.
Math Review: Working with Percentages
Finding a Percentage Value: Convert the percentage to a fraction and multiply by the base amount.
Decimal to Percentage: Multiply the decimal by 100%.
Fraction to Percentage: Divide numerator by denominator, then multiply by 100%.
Percentage to Decimal: Divide by 100.
1.2 Types of Data
Parameters vs. Statistics
Parameter: A numerical measurement describing a characteristic of an entire population.
Statistic: A numerical measurement describing a characteristic of a sample.
Tip: Population → Parameter; Sample → Statistic.
Data Classifications: Qualitative vs. Quantitative
Qualitative (Categorical) Data: Names, labels, or attributes based on categories. Examples: Eye color, gender, zip codes.
Quantitative (Numerical) Data: Numbers representing counts or measurements.
Discrete: Countable values (e.g., number of pets).
Continuous: Measurable values on a continuous scale (e.g., height, temperature).
The Four Levels of Measurement
Property | Nominal | Ordinal | Interval | Ratio |
|---|---|---|---|---|
Provides Categories / Labels | Yes | Yes | Yes | Yes |
Has a Meaningful Order | No | Yes | Yes | Yes |
Differences Can Be Measured | No | No | Yes | Yes |
Contains a True Zero Point | No | No | No | Yes |
Nominal: Categories only (e.g., gender, nation of origin).
Ordinal: Categories with a meaningful order, but differences are not meaningful (e.g., letter grades).
Interval: Ordered, differences are meaningful, but no true zero (e.g., temperature in Celsius or Fahrenheit).
Ratio: Ordered, differences and ratios are meaningful, true zero exists (e.g., height, weight).
1.3 Collecting Sample Data
Observational Studies vs. Experimental Studies
Observational Study: Observing and measuring characteristics without modifying subjects.
Experiment: Applying a treatment and observing its effects. Well-designed experiments help establish causation.
Types of Observational Studies
Cross-Sectional Study: Data collected at a single point in time.
Retrospective (Case-Control) Study: Data collected from past records.
Prospective (Cohort) Study: Data collected forward in time from groups sharing common factors.
Pillars of Rigorous Experimental Design
Replication: Repeating the experiment on a large sample to distinguish treatment effects from random variation.
Blinding: Subjects do not know if they receive the treatment or placebo. Double-Blind: Neither subjects nor experimenters know group assignments.
Randomization: Assigning subjects to groups by chance to ensure comparability.
Sampling Methodologies
Simple Random Sample (SRS): Every possible sample of size n has an equal chance of being selected.
Systematic Sampling: Select a starting point, then every kth element.
Convenience Sampling: Use data that are easiest to obtain (high risk of bias).
Stratified Sampling: Divide population into subgroups (strata) and randomly sample from each.
Cluster Sampling: Divide population into clusters, randomly select clusters, and sample all members within chosen clusters.
Multistage Sampling: Combine multiple sampling methods in stages.
Key Distinction: Stratified sampling selects some members from all groups; cluster sampling selects all members from some groups.
Practice Checkpoints: Concept Applications
Scenario: A research company contacts 1,050 adults in California and asks their preferred daily mode of transportation.
Population: All adults in California.
Sample: The 1,050 adults contacted.
Variable: Preferred daily mode of transportation.
Level of Measurement: Nominal (categories without order).
Sampling Method Identification:
Every 15th chip: Systematic Sampling
Randomly select 25 students from each grade: Stratified Sampling
First 10 people on sidewalk: Convenience Sampling
Survey all households in 4 randomly selected neighborhoods: Cluster Sampling
Summary Table: Levels of Measurement
Level | Order? | Meaningful Differences? | True Zero? | Examples |
|---|---|---|---|---|
Nominal | No | No | No | Gender, Eye Color |
Ordinal | Yes | No | No | Letter Grades, Rankings |
Interval | Yes | Yes | No | Temperature (°F), Years |
Ratio | Yes | Yes | Yes | Height, Weight, Age |
Additional info: These notes expand on the original material by providing structured definitions, examples, and summary tables for clarity and exam preparation.