Don't get fancy! People sometimes add features to graphs that
don't help to convey their information. For example,
3-dimensional bar charts like the one shown in Figure 4 are usually not as effective as their
two-dimensional counterparts.
Here is another way that fanciness can lead to
trouble. Instead of plain bars, it is tempting to substitute
meaningful images. For example,
Figure 5 presents
the iMac data using pictures of computers. The heights of the
pictures accurately represent the number of buyers, yet
Figure 5 is misleading because the viewer's attention
will be captured by areas. This can exaggerate the size
differences between the groups. In terms of percentages, the
ratio of previous Macintosh owners to previous Windows owners
is about 6 to 1. But the ratio of the two areas in
Figure 5 is about 35 to 1. A biased person wishing to
hide the fact that many Windows owners purchased iMacs would
be tempted to use
Figure 5 instead of
Figure 2!
Edward Tufte coined
the term
lie factor to refer to the ratio of the size of the
effect shown in a graph to the size of the effect shown in the
data. He suggests that lie factors greater than 1.05 or less
than 0.95 produce unacceptable distortion.
Another distortion in bar charts results from setting the
baseline to a value other than zero. The baseline is the
bottom of the Y-axis, representing the least number of cases
that could have occurred in a category. Normally, this number
should be zero.
Figure 6 shows the iMac data with
a baseline of 50. Once again, the difference in areas suggests
a different story than the true differences in
percentages. The number of Windows-switchers seems minuscule
compared to its true value of 12%.
Finally, we note that it is a serious mistake to use a line
graph when the X-axis contains merely qualitative variables. A
line graph is essentially a bar graph with the tops of the bars
represented by points joined by lines (the rest of the bar is
suppressed).
Figure 7 inappropriately shows a line
graph of the card game data from Yahoo. The drawback to
Figure 7 is that it gives the false impression that the
games are naturally ordered in a numerical way.