Calculation of Grades

In all my classes, grades are calculated based on the class mean and standard deviation, obtained from a statistical analysis of the data. I think the best way to explain how this is done is by using a fictitious example. Suppose that two rather large groups of students were given a test, and the following results were obtained:


Test Score   0  1  2  3  4  5  6  7  8  91011121314151617181920212223242526272829303132333435363738394041424344454647484950 51525354555657585960616263646566676869707172737475767778798081828384858687888990919293949596979899100
# Students Group 1 11122222334445567789910111213141516171819202122232425262728293030313132323333333333 33333232313130302928272625242322212019181716151413121110998776554443322222111
# Students Group 2 1111111111222223334455677891011131416171921222426283032333537383941414242434242 414139383735333230282624222119171614131110987765544333222221111111111

Thus, for example, 10 of the students in group 2 received a raw score of 27 on this particular test. For these data, a graph of number of students receiving each score vs. scores (a histogram) would look like this:


Bell Curve

Note that the green line represents group 1 and the red line represents group 2. Notice that the peaks, the highest points, of both lines are at the same place – both correspond to a score of 50 points. I think most people are familiar with how to calculate a mean (an average) of a group of numbers: add up the values and divide by the number of samples. In this case, if we add up the scores received and divide by the total number of students in each of the two groups, it turns out that the means (represented by “X”) of the two groups are the same; both are 50 points (represented by the black vertical line in the center of the graph).

However, these two graphs are not the same shape, and that also is significant. In group 2 (the red line) there are fewer students on the “ends:” the numbers of students receiving very low and very high scores are lower/less than the numbers of students in group 1 (the green line). Conversely, there are more students in group 2 clustered more closely to the center, the mean, of the group than there are for group 1. In statistics, the concept of “standard deviation” (represented by “s”) is a measure of how spread out vs. clustered a group of data are. In this case, the standard deviation for group 1 is 18.32 points, while the standard deviation for group 2 is 15.12 points, and since this latter number is smaller (the standard deviation is less) than the former, that indicates the scores received by the students in group 2 are more closely clustered (less spread out) than those of group 1.

In statistics, the numbers 18.32 for group 1 and 15.12 for group 2 are referred to as “one standard deviation unit.” Thus, for group 1, we could refer to a score of 68.32 (= 50 + 18.32) as being one standard deviation unit above the mean (“X + s”) and 31.68 (= 50 – 18.32) as being one standard deviation unit below the mean (“X – s”). Similarly, for group 2, a score of 65.12 is +1 standard deviation unit from the mean and 34.88 is –1 standard deviation unit from the mean. These points are indicated on the above graph by the vertical lines with arrows pointing to them, and the length of each arrow is one standard deviation unit (s).

Those statistical calculations are used to determine grades. The mean (X) is used as the dividing point between “B” and “C” grades, while X + s is used as the dividing point between “A” and “B” grades, X – s is used as the dividing point between “C” and “D” grades, and X – 2s is used as the dividing point between “D” and “F” grades as indicated in the following graph:


Bell Curve

Thus, for a student in group 1 (green line), a raw score of 66 points would be less than X + s (68.32) so that person would receive a letter grade of “B,” while for a student in group 2 (red line), a raw score of 66 points would be greater than X + s (65.12), so that person would receive a letter grade of “A.”

In statistics, there is also something called “z-scores,” which is a way of converting the X, X + s, and X – s points to make them easier to understand. For example, to come up with statistical numbers that are more easily understood at a glance, I can create z-scores by adding 8 to each of those numbers. Thus, X, which, because it is the mean, has a deviation of 0 from the mean, becomes X + 8 = 8, while X + s, which, thus, has a deviation from the mean of +1, becomes X + s + 8 = 9, and X – s, which has a deviation of –1 standard deviation unit, becomes X – s + 8 = 7. This quickly and neatly converts the –1, 0, and +1 “break points” to a 10-point scale, such that X + s (the dividing point between A and B) becomes 9.0, X (the dividing point between B and C) becomes 8.0, X – s (the dividing point between C and D) becomes 7.0, and X – 2s (the dividing point between D and F) becomes 6.0.

Thus, our sample 66 raw points, above, would be equivalent to a z-score of 8.87 for group 1 or 9.06 for group 2. Again, keep in mind that since the scores of the students in group 2 are more closely clustered, thus a score of 66, as compared to the rest of them, is, relatively-speaking, more “out on a limb” – that person really did do that much better, compared with his/her classmates, whereas the person in group 1 who also got a 66 was closer to the “center of the pack” for that class.

This grading system does have several important implications of which you need to be aware and which you need to understand. First, average is average, no matter where it falls. Consider the following sample grades:


Student Assignment 1 Assignment 2 Assignment 3 Assignment 4 Totals
 PtsZGr.PtsZGr.PtsZGr.PtsZGr.PtsZGr.Rank%
John456.73D1478.91B399.00B1949.83A4259.51A285.0%
Sally499.68A1197.51C327.91C1568.56B3568.08B371.2%
Kim488.94B1318.11B266.98D1438.13B3487.92C469.6%
Bill478.21B1237.71C378.69B1056.86D3127.18C562.4%
Mary488.94B17410.26A459.93A1739.13A4409.81A188.0%
Max Poss50 200 50 200 500 
No. of Students5 5 5 5 5
Mean47.40 138.80 35.80 154.20 376.20
% of Totl.95% 69% 72% 77% 75%
Std Dev1.36 20.04 6.43 29.96 48.53
X + s 48.76 158.84 42.23 184.16 424.73
X – s 46.04 118.76 29.37 124.24 327.67
X – 2s 44.69 98.71 22.94 94.29 279.14

Notice that for the first assignment, a raw score of 47.40 out of 50 possible (95%) is “average” – the dividing line between B and C! However, on the second assignment, a raw score of 138.80 out of 200 (only 69%) is “average.” Interestingly, in the past, some students have, apparently, thought that I should randomly, pick and choose and change grading systems based on whatever they perceive as being “best” for them on any particular assignment. Thus, in this example, they would say the first assignment should be graded one way (using, say, a 90-80-70 system), and the second assignment should be graded a totally different way (using mean and standard deviation). The fallacy in that, however, is that it is impossible to compare the proverbial “apples and oranges,” and without consistency in grading policy, there would be no good, fair way to assign grades at the end of the quarter. Look again at the example: while all of these fictitious students did very well on the first assignment and the class average is very high, resulting in a “strange-looking” curve, by the end of the quarter (when it “counts”) the overall class average is only 75%, and if I were to grade these students on a straight 90-80-70 scale, instead, there would be no A grades, the B would be a low C, and both of the people who had Cs with the curve would get Ds, instead. So, . . I am going to consistently use ONE grading system for the whole quarter, and in every class so far, using a curve based on the class mean and standard deviation, as I’ve just described, has been to the students’ advantage by the end of the quarter, even if one assignment early-on looks “strange.”

For each of the assignments you will be asked to do, there is a sample of the grading criteria for that assignment on the Web page that presents the assignment. Consulting those grading criteria as you prepare the work you will submit should help you to do a better job on the assignment. As those assignments are graded, the points you earned will be added to a file which you can access online to check your grades. I might add that, while the “regular” Web pages for this course are available to everyone, the grade list is not. The actual grades file is housed in an offline directory that is not accessible by the Web server, and the grades, themselves, are encoded. The file is only accessible via a Perl script which first checks to verify that the request is coming from a registered student and shows that student only his/her own grades.


Copyright © 2006 by J. Stein Carter. All rights reserved.
This page has been accessed Counter times since 20 Sep 2006.