BackOrganizing and Displaying Data in Statistics: Tables and Graphs
Study Guide - Smart Notes
Tailored notes based on your materials, expanded with key definitions, examples, and context.
Organizing Qualitative Data
Introduction
Qualitative data, also known as categorical data, consists of non-numeric information that can be organized into categories. Proper organization of qualitative data is essential for meaningful analysis and interpretation. This section covers methods for organizing qualitative data using tables and graphical representations.
Organizing Qualitative Data in Tables
Frequency Distribution: Lists each category of data and the number of occurrences for each category.
Relative Frequency: The proportion (or percent) of observations within a category, calculated as:
Relative Frequency Distribution: Lists each category of data together with its relative frequency.
Example: Frequency Distribution Table
A survey asked individuals about their favorite day of the week. The frequency distribution and relative frequency distribution can be constructed from the responses.
Day | Frequency | Relative Frequency |
|---|---|---|
Friday | 8 | 0.4 |
Saturday | 12 | 0.6 |
Additional info: Table values inferred for illustration.
Constructing Bar Graphs
Bar Graph: Constructed by labeling each category of data on either the horizontal or vertical axis and drawing rectangles for each category. The height of each rectangle represents the category's frequency or relative frequency.
Example: Frequency and Relative Frequency Bar Graph
Bar graphs can be used to display the frequency and relative frequency of survey responses, such as the best day of the week or other categorical variables.
Pareto Charts
Pareto Chart: A bar graph in which bars are drawn in decreasing order of frequency or relative frequency.
Example: Pareto Chart
Construct a Pareto chart of survey data to highlight the most common categories.
Pie Charts
Pie Chart: A circle divided into sectors, with each sector representing a category of data. The area of each sector is proportional to the frequency of the category.
Example: Drawing a Pie Chart
Pie charts can be used to visually represent the proportion of responses for each category, such as favorite day of the week.
Side-by-Side Bar Graphs
Side-by-Side Bar Graph: Used to compare instances of a categorical variable across two or more groups. Each group is represented by a set of bars for each category.
Example: Children Under 18 Living with One Parent
Age Group | Father Only | Mother Only |
|---|---|---|
0-5 years | 0.2 | 0.3 |
6-11 years | 0.25 | 0.35 |
Additional info: Table values inferred for illustration.
Organizing Quantitative Data
Introduction
Quantitative data consists of numeric values and can be either discrete or continuous. Organizing quantitative data involves grouping values into classes and displaying them using tables and graphs.
Organizing Discrete Data in Tables
Discrete Data: Data that can take on only specific values, often counts.
Frequency Distribution: Lists each value and the number of occurrences.
Relative Frequency Distribution: Lists each value and its relative frequency.
Example: Frequency Distribution of Siblings
Number of Siblings | Frequency | Relative Frequency |
|---|---|---|
0 | 2 | 0.2 |
1 | 4 | 0.4 |
2 | 3 | 0.3 |
3 | 1 | 0.1 |
Additional info: Table values inferred for illustration.
Constructing Histograms of Discrete Data
Histogram: Constructed by drawing rectangles for each class of data. The height of each rectangle is the frequency or relative frequency of the class. The width of each rectangle is the same and the rectangles touch each other.
Example: Histogram of Siblings
Draw a histogram using the frequency distribution of the number of siblings.
Organizing Continuous Data in Tables
Continuous Data: Data that can take on any value within a range.
Classes: Categories into which data are grouped, often using intervals of numbers.
Class Limits:
Lower Class Limit: Smallest value within the class.
Upper Class Limit: Largest value within the class.
Example: Educational Attainment by Age
Age | Percent with High School Diploma | Percent with Some College | Percent with Associate's Degree | Percent with Bachelor's Degree | Percent with Master's Degree |
|---|---|---|---|---|---|
25-34 | 44.5 | 23.3 | 13.4 | 10.3 | 9.8 |
35-54 | 45.8 | 21.9 | 15.7 | 10.8 | 5.8 |
55 and older | 39.7 | 18.9 | 15.2 | 10.8 | 9.2 |
Constructing Histograms of Continuous Data
Histogram: Used for continuous data, with each rectangle representing a class interval. The height is the frequency or relative frequency, and the rectangles touch each other.
Example: Unemployment Rates by State
State | Unemployment Rate (%) |
|---|---|
Nebraska | 1.9 |
Idaho | 2.4 |
Georgia | 3.2 |
Michigan | 3.7 |
Additional info: Table values are a sample from the full dataset.
Dot Plots
Dot Plot: Each observation is placed horizontally in increasing order, with a dot above each value for each occurrence.
Example: Dot Plot of Siblings
Draw a dot plot for the number of siblings in a class survey.
Identifying the Shape of a Distribution
Uniform Distribution: Frequency of each value is evenly spread across the variable's values.
Bell-Shaped Distribution: Highest frequency occurs in the middle, with frequencies tailing off to the left and right.
Skewed Right: Tail to the right of the peak is longer than the tail to the left.
Skewed Left: Tail to the left of the peak is longer than the tail to the right.
Example: Identifying Distribution Shape
Use a histogram to determine if the data is uniform, bell-shaped, skewed right, or skewed left.
Stem-and-Leaf Plots
Introduction
A stem-and-leaf plot is a graphical method for displaying quantitative data. It shows individual data values and their distribution.
Constructing a Stem-and-Leaf Plot
Step 1: Treat the integer portion of the number as the stem and the decimal portion as the leaf.
Step 2: Write the stems vertically in ascending order, then draw a vertical line to the right of the stems.
Step 3: Write the leaves corresponding to each stem.
Step 4: Within each stem, arrange the leaves in ascending order.
Example: Stem-and-Leaf Plot of Unemployment Data
Stem | Leaves |
|---|---|
1 | 9 |
2 | 1, 4, 6, 8 |
3 | 2, 7 |
4 | 1 |
5 | 4 |
Additional info: Table values inferred for illustration.
Summary Table: Graph Types and Their Uses
Graph Type | Data Type | Main Use |
|---|---|---|
Bar Graph | Qualitative | Compare categories |
Pareto Chart | Qualitative | Highlight most frequent categories |
Pie Chart | Qualitative | Show proportions of categories |
Histogram | Quantitative | Show distribution of data |
Dot Plot | Quantitative | Show individual data points |
Stem-and-Leaf Plot | Quantitative | Show distribution and individual values |
Side-by-Side Bar Graph | Qualitative/Quantitative | Compare groups |
Key Formulas
Relative Frequency:
Graph Comparisons
Bar Graphs are preferred when comparing categories.
Pareto Charts are used to emphasize the most common categories.
Pie Charts are best for showing proportions of a whole.
Histograms are used for quantitative data to show distribution.
Dot Plots and Stem-and-Leaf Plots are useful for small datasets to show individual values
Additional info: Academic context and examples added for completeness and clarity.