Statistics and Probability
Here you’ll learn how to find the median of grouped data. In this case, you find the median by reading off a graph or through calculation. Here’s the procedure for both methods.
Reading off a Graph
To find the median in a set of grouped data, you need to draw the graph representing the relative cumulative frequency. This is how you do it:
Rule
How to Draw the Graph Representing the Relative Cumulative Frequency
- 1.
- If you don’t have a table containing the intervals, you have to group your data into intervals. Unless specified by the exercise, you can choose the sizes of the intervals yourself. Make sure to choose intervals in such a way that they reveal the information you’re looking for. You’ll know your intervals are good if you can say “yes” when you ask yourself “are my intervals reasonable?”
- 2.
- As you’re determining the relative cumulative frequency, your values will be decimal numbers between 0 and 1, or a percentage between % and %.
- 3.
- The first point of the graph is decided in a different way than the rest of the points! Its value is equal to the lowest value in the first interval and its value is %.
- 4.
- Then, mark all the points in a coordinate system and draw lines between the points.
For grouped data, you can assume the values are evenly distributed inside each interval. That’s why you can find the median of grouped data by reading off the graph when the relative cumulative frequency is known.
Rule
Median of Grouped Data: Graphical
You find the median where the relative cumulative frequency passes , or %.
Through Calculation
When you need to find the median through calculation, you have to find the expression for a line between two points. These points are taken from the interval where the relative cumulative frequency passes %, and the interval before it. Here’s how to do it:
Rule
Median of Grouped Data: Calculation
- 1.
- To find the expression for the line, , you have to decide which points to use to find and .
- 2.
- For the values, use the smallest and greatest values of the interval that passes % or . You can call these values and respectively.
- 3.
- Use the relative cumulative frequency of the interval that passes % as and the relative cumulative frequency of the interval before as .
- 4.
- You’ve now found two points: and
- 5.
- Calculate using the formula for the slope of a straight line: .
- 6.
- Calculate by using the formula . Insert the value you found for and the and coordinates of a point on the line (it’s smart to use one of the points we found above). Solve the equation for .
- 7.
- Insert the values you’ve found for and into the expression for the straight line .
- 8.
- Set for a decimal number or % for a percentage, and solve for .
- 9.
- This value of is your median.
Example 1
Below you can see a table showing the relative cumulative frequency of the weight of newborn children.
Weight | Relative Cumulative Frequency |
| % |
| % |
| % |
| % |
| % |
| % |
| % |
From the table, you can see that the relative cumulative frequency passes % in the interval . That means you have , , and . That gives us the points and .
Calculate by using these values in the formula
Calculate by inserting the point and the value of into :
You then get the expression for the line:
Set it equal to (as our numbers are percentages) and solve for :
You’ve found the median of the given birth weights to be kg.