-

Basic Skills and Principles: Statistics and Graphing

Presenting Data
Choosing the Values to Graph

Students can also have difficulty with graphing because they are unsure exactly what data to graph. Let's examine an example to show you what I mean.

Let's assume that a researcher is attempting to determine the effects of pesticide pollution on the hatching success of fish eggs. He has noticed that fish eggs in streams near agricultural fields fare poorly, while those is undisturbed areas hatch successfully. The researcher sets up a lab experiment in which ten groups of fish eggs are allowed to develop in unmanipulated stream water, and ten groups of eggs develop in stream water with the addition of pesticide. The proportion of eggs hatching in each group is tallied (Table 3), and descriptive statistics for the two groups compared.

Statistic
No Pesticide
Pesticide
Mean proportion eggs hatching
0.87
0.59
Standard deviation
0.07
0.09

Group
Proportion eggs hatching
No Pesticide
Pesticide
1
0.80
0.50
2
0.76
0.45
3
0.81
0.68
4
0.90
0.77
5
0.95
0.64
6
0.84
0.60
7
0.88
0.54
8
0.99
0.57
9
0.86
0.62
10
0.93
0.57

Table 3. Effects of pesticide pollution on
hatching success of fish eggs.

Now that we have the data, how do we graph it? First, we determine that a bar graph is most appropriate, as the independent variable is the two groups, and this is categorical, not continuous. Second, we must determine what to graph. If we graph all of the values, we get a graph like the one below.

Figure 12. Effects of pesticide pollution on
hatching success of fish eggs.

But what does this graph tell you? There are 20 bars, 2 categories, and lots of different group numbers of mix things up. If we take a step back and look at the study, what we're really interested in doing is comparing the hatching success in groups exposed to pesticide and the groups not exposed to pesticide. So what we really want to do is to graph the mean of the two groups and compare them to one another. If we do that, we get a graph that looks like:

Figure 13. Effects of pesticide pollution on
hatching success of fish eggs.

Now while this graph is useful in that it readily allows us to compare the means of the two groups, it doesn't provide us with any indication of the spread of the data in each group. Knowing the spread of the data can be important in making conclusions, so you need to provide the reader with this information. This is done by adding "error bars" to data points or bars on a graph that indicate the standard deviation of the data. In this example, we would add the standard deviation error bar to the bar for each group. In bar and line charts, error bars are drawn both above and below the mean to a distance equal to their value.

Figure 14. Effects of pesticide pollution on
hatching success of fish eggs.

Realize that error bars are only appropriate when graphing a group's mean. If you are graphing individual data points, you cannot add error bars as there is only one data point per point on the graph (hence, no mean or standard deviation). Error bars are useful in that they allow you to see the overlap in the data points from two groups. Remembering that all of the data points in a distribution are within two standard deviations of the mean, you can visually estimate the data's distribution by doubling the height of the error bar (because it's only drawn to one standard deviation) for each group and seeing the degree to which their distributions overlap. As shown in the examples below, including error bars can make a big difference in your interpretation of a graph. Although the means of the two groups are identical in each of the graphs, the two groups in the first figure appear to be more different from one another than the two in the second figure due to the smaller error bars and lesser degree of overlap.

Figure 15. The role of error bars in estimating differences in treatment groups

Creating Graphs
You may choose to make your graphs by hand or with the assistance of a computer software application. In either case, you need to make sure your choice of graphs, the data to graph, and the formatting are all appropriate. There are several software applications that enable you to create graphs (e.g., Microsoft Excel) that are available in the campus computer labs. There are also some web sites that allow you to create graphs online, and they are listed below. Providing directions for graph preparation in each would be prohibitively long, so you will need to become familiar with their operation on your own if you opt to use them.

StatCrunch: http://www.statcrunch.com/

National Center for Education Statistics: http://www.nces.ed.gov/nceskids/graphing/