Skip to main content
Back

Foundations of Statistics: Data Types, Probability, and Counting Principles

Study Guide - Smart Notes

Tailored notes based on your materials, expanded with key definitions, examples, and context.

Introduction to Statistics

Types of Data

Understanding the types of data is fundamental in statistics, as it determines the appropriate methods for analysis.

  • Qualitative Data: Consists of attributes, labels, or non-numerical entries. Examples include eye color, brand names, or types of cuisine.

  • Quantitative Data: Consists of numbers that are measurements or counts. Examples include height, weight, or the number of students in a class.

Population, Sample, Parameter, and Statistic

Statistics involves studying populations and samples to draw meaningful conclusions.

  • Population: The collection of all outcomes, responses, measurements, or counts that are of interest.

  • Sample: A subset or part of a population.

  • Parameter: A numerical description of a population characteristic.

  • Statistic: A numerical description of a sample characteristic.

Branches of Statistics

  • Descriptive Statistics: Involves the organization, summarization, and display of data.

  • Inferential Statistics: Involves using a sample to draw conclusions about a population.

Designing a Statistical Study

Conducting a statistical study involves several key steps:

  1. Identify the variables of interest and the population of the study.

  2. Develop a detailed plan for collecting data. If using a sample, ensure it is representative of the population.

  3. Collect the data.

  4. Describe the data using descriptive statistics techniques.

  5. Interpret the data and make decisions about the population using inferential statistics.

  6. Identify any possible errors.

Methods of Data Collection

  • Observational Study: The researcher observes and measures characteristics of interest but does not influence the subjects.

  • Experiment: A treatment is applied to part of the population (treatment group), and responses are observed. A control group does not receive the treatment. The responses are compared.

  • Simulation: Uses a mathematical or physical model to reproduce the conditions of a situation or process.

  • Survey: An investigation of one or more characteristics of a population, often by asking people questions.

Key Elements of Well-Designed Experiments

  • Control: Minimizing the effects of variables other than the treatment.

  • Randomization: Randomly assigning subjects to different treatment groups.

  • Replication: Repeating the experiment to confirm results.

Important Experimental Terms

  • Confounding Variable: Occurs when an experiment cannot distinguish between the effects of different factors on a variable.

  • Blinding: Subjects do not know whether they are receiving a treatment or a placebo.

  • Double-Blind Experiment: Neither the experimenter nor the subjects know who receives the treatment or placebo until after data collection.

  • Completely Randomized Design: Subjects are assigned to treatment groups through random selection.

Probability

Basic Probability Concepts

  • Probability Experiment: An action or trial through which specific results are obtained.

  • Outcome: The result of a single trial in a probability experiment.

  • Sample Space (S): The set of all possible outcomes of a probability experiment.

  • Event (E): A subset of the sample space; the outcomes you are interested in.

Types of Events

  • Independent Events: The occurrence of one event does not affect the probability of the other.

  • Dependent Events: The occurrence of one event affects the probability of the other.

Basic Probability Formula

  • Probability of an Event:

  • Probability Range:

Types of Probability

  • Theoretical Probability: Based on equally likely outcomes (e.g., rolling a fair die: ).

  • Empirical Probability: Based on observed data (e.g., survey results).

Counting Principles

1. Fundamental Counting Principle (FCP)

Used to determine the number of possible outcomes when there are multiple steps.

  • Multiply the number of choices at each step.

  • Example: 3 shirts × 2 pants = 6 outfits.

2. Factorials

  • Definition: The product of all positive integers up to n.

3. Permutations (Order Matters)

  • Used when arranging items or assigning roles.

  • All items:

  • r items from n:

  • With Duplicates: (where p, q, r are counts of repeated items)

  • Key Idea: Order matters in permutations.

4. Combinations (Order Does NOT Matter)

  • Used when selecting groups or committees.

  • Formula:

  • Key Idea: Order does not matter in combinations.

Probability Rules

Addition Rule (OR Probability)

  • Mutually Exclusive Events: Cannot happen together.

  • Not Mutually Exclusive: Can happen together.

  • Key Idea: Subtract the overlap if events can happen together.

Multiplication Rule (AND Probability)

  • Independent Events:

  • Dependent Events:

  • Example: Probability of flipping a head and rolling a 3:

Conditional Probability

  • Formula:

  • Meaning: Probability of B given A has already occurred.

Complement Rule (NOT)

  • Formula:

  • Use: Sometimes easier than calculating directly.

Probability Using Counting

  1. Count total outcomes.

  2. Count favorable outcomes.

  3. Divide.

  • Use permutations if order matters; combinations if order does not matter.

Odds

  • Odds in favor:

  • Odds against:

  • Convert Odds to Probability: If odds are a:b, then

Quick Comparison Table

Concept

Use When

FCP

Multiple steps

Permutations

Order matters

Combinations

Order doesn’t matter

Addition Rule

OR

Multiplication Rule

AND

Conditional Probability

Depends on previous event

Complement Rule

NOT

Normal Probability Distributions

Normal Distribution

A normal distribution is a continuous probability distribution for a random variable X. Its graph is called the normal curve.

  • The normal curve approaches but never touches the x-axis as it extends farther from the mean.

  • The center of the curve is where it curves downward; to the left and right, it curves upward. The points where the curve changes from upward to downward are called inflection points.

Standard Normal Distribution

  • A normal distribution with a mean of 0 and a standard deviation of 1.

Exam Preparation Tips

  • Does order matter? If yes, use permutations; if no, use combinations.

  • Is the question about "OR" or "AND"? Use the addition rule for OR, multiplication rule for AND.

  • Are events dependent? If yes, use conditional probability; if no, use simple multiplication.

  • Is there an easier way using "NOT"? Use the complement rule.

Memory Tricks

  • Permutation: Position matters

  • Combination: Choose groups

  • AND: Multiply

  • OR: Add (subtract overlap if needed)

Pearson Logo

Study Prep