1.6.2 - Histograms

Histogram Section

If there are many data points and we would like to see the distribution of the data, we can represent the data by a frequency histogram or a relative frequency histogram.

A histogram looks similar to a bar chart but it is for quantitative data. To create a histogram, the data need to be grouped into class intervals. Then create a tally to show the frequency (or relative frequency) of the data into each interval. The relative frequency is the frequency in a particular class divided by the total number of observations. The bars are as wide as the class interval and as tall as the frequency (or relative frequency).

Histogram Example

Jessica weighs herself every Saturday for the past 30 weeks. The table below shows her recorded weights in pounds.

135

137

136

137

138

139

140

139

137

140

142

146

148

145

139

140

142

143

144

143

141

139

137

138

139

136

133

134

132

132

Create a histogram of her weight.


Answer

For histograms, we usually want to have from 5 to 20 intervals. Since the data range is from 132 to 148, it is convenient to have a class of width 2 since that will give us 9 intervals.

  • 131.5-133.5
  • 133.5-135.5
  • 135.5-137.5
  • 137.5-139.5
  • 139.5-141.5
  • 141.5-143.5
  • 143.5-145.5
  • 145.5-147.5
  • 147.5-149.5

The reason that we choose the end points as .5 is to avoid confusion whether the end point belongs to the interval to its left or the interval to its right. An alternative is to specify the endpoint convention. For example, Minitab includes the left end point and excludes the right end point.

Having the intervals, one can construct the frequency table and then draw the frequency histogram or get the relative frequency histogram to construct the relative frequency histogram. The following histogram is produced by Minitab when we specify the midpoints for the definition of intervals according to the intervals chosen above.

Histogram of Jessica's weights with the midpoints of the intervals.

 

If we do not specify the midpoint for the definition of intervals, Minitab will default to choose another set of class intervals resulting in the following histogram. According to the include left and exclude right endpoint convention, the observation 133 is included in the class 133-135.

Histrogram of Jessica's weight.

 

Note that different choices of class intervals will result in different histograms. Relative frequency histograms are constructed in much the same way as a frequency histogram except that the vertical axis represents the relative frequency instead of the frequency. For the purpose of visually comparing the distribution of two data sets, it is better to use relative frequency rather than a frequency histogram since the same vertical scale is used for all relative frequency--from 0 to 1. 

Minitab 18

Minitab®

Minitab: Histograms Section

How to create a histogram in Minitab:

  1. Click Graph>Histogram
  2. Choose Simple.
  3. Enter the column with your variable
  4. Click OK.