Biometrics and Statistical Analysis of Data


What Is Biometrics?

In this lab exercise, you will learn more about using metric system measurements for height and weight, how to read lab equipment and interpolate digits, and how to calculate averages and standard deviations to analyze data. Also, in this lab, humans (Homo sapiens) will be used as an example to illustrate Darwin’s concept of intraspecific variation.

In biology, as in other sciences, gathering numerical data to test one’s hypothesis and subsequently performing a statistical analysis on those data are of utmost importance when interpreting the data and drawing any conclusions from them. Due to a variety of factors, despite the most careful observations, there will always be some variation in the data collected, hence the necessity for a statistical analysis of those data. Biometrics is the application of statistical methodology to analyze biological data.


Interpolation of Data

read clock read scale
In collecting data, it is important to know how to correctly read the equipment being used. This frequently involves interpolation to obtain the last digit of the data. Interpolation is “reading between the lines” — for example, if you’re looking at a clock that only has 5-min. markings on it and you read a time of 8:53, you are interpolating the “3” by estimating how far between the “0” and the “5” the minute hand of the clock is. Similarly, when reading the scale on a piece of scientific apparatus, it is also necessary to interpolate, to read between the lines. For example, in the illustration, above, if the numbered divisions represent grams, then the marked divisions in between represent tenths of a gram. This scale must, then, be read to the one-hundredths of a gram by envisioning ten divisions in the white space in between the tenth-gram markings. Also, in biology, as in other sciences, the metric system is used. Thus, we measure an organism’s weight in grams or kilograms and its length or height in centimeters or meters.


Statistical Analysis

To evaluate these numbers, it is necessary to employ several statistical concepts. The mean or average (X) of a set of data is a measure of “central tendency” of a group of numbers, such that the total of the deviations of the numbers above the mean is equal to the total of the deviations of the numbers below the mean. For example, for the numbers 1, 3, 5, 7, and 9, the mean is 5, so the deviations of each of the numbers from that mean are –4, –2, 0, 2, and 4, respectively. Note that the absolute values of |2 + 4| and |(–4) + (–2)| are equal. Further, note that the sum of the deviations around a mean should always be 0. The mean is the total of the values divided by the number of data points. This is expressed mathematically as: X = (ΣXi)/N. Σ means sum, Xi means all the individual values, and N means the number of items. The closer the mean of a group of numbers is to the true value, the more accurate that mean and group of numbers are.

Another concept that is sometimes used is that of the median, which is the data point above and below which one-half of the data points lie. That means that if there is an odd number of data points, the median is the number that’s in the “middle” of the list, just by counting in from both ends. If there is an even number of data points, the median is the average of the middle two. For example, for the numbers 2, 6, 7, 14, and 56, the median is 7. For the numbers 2, 6, 7, 9, 14, and 56, the median is (7 + 9)/2 = 8.

The mean is preferred over the median as a measure of central tendency in a group of data, but there might be some situations where the median would be a better indicator. If a distribution is symmetrical, the mean and median should be about the same, but if a distribution is skewed, then the median might be a better measure to use than mean. For example, if a statistician was looking at family income in an area where four families had incomes of under $20,000 while one family had an income of over $1,000,000, then median would be a better indicator of “typical” family income in that community. The median is less sensitive to extremes in the data than the mean. For example, as pointed out above, the mean of the numbers 1, 3, 5, 7, and 9 is 5, and so is the median. However, for the numbers 1, 3, 5, 7, and 34, the median is still 5, but the mean is 10.

One other concept that is only used occasionally is that of mode. The mode is the number that occurs with the greatest frequency. For example, if 2 students get a score of 50 on a test, 3 students get 80, and 1 student gets a 90, then the mode is 80 — the most students got that score (by the way, since the middle score would be one of the 80s, that is also the median, and the mean of those numbers would be 71.67). However, if you are collecting data on some experiment which requires that you weigh something three times, and you get three entirely different weights, the concept of mode really doesn’t mean much.

When analyzing data, it is also useful to determine how spread-out, how dispersed, those data are. One indication of this is the range of the data, which is equal to the highest number (the maximum) minus the lowest number (the minimum). This can be expressed as range = Xmax – Xmin.

The standard deviation, s, is one of the most commonly-used measures of the dispersion of the data, in other words, a measure of how far from the mean the data are scattered. Thus, the smaller the standard deviation is, the more precise, the closer to agreement with each other, the data are. In many cases, if the standard deviation is as large as or greater than the mean, that would indicate that the experimenter needs to re-examine his/her experimental technique! If the means of two groups of data are not farther apart from each other than the standard deviation of each group, then one cannot draw the conclusion that there is a statistically-significant difference between the two groups (to really be sure, one should do a “t-test” on the data). Standard deviation is expressed mathematically as
standard deviation formula.
In other words, first subtract the mean from each of the data points to get the deviation of each number. Then, square each of those deviations (that “gets rid of” the negative signs). Next, add up all those squared deviations and divide by the number of data points to get an “average”. Finally, calculate the square root of that “average.”

For example, for the numbers 1, 3, 5, 7, and 9 from above (remember, we said the average is 5):
Number Xi – X deviation2
1 1 – 5 = –4 –42 = 16
3 3 – 5 = –2 –22 = 4
5 5 – 5 = 0 02 = 0
7 7 – 5 = 2 22 = 4
9 9 – 5 = 4 42 = 16
Σ = 25   Σ = 40
25 ÷ 5 = X = 5   40 ÷ 5 = 8
    s = √8 = 2.828

Initially (i. e., for this lab), you should practice doing these calculations “by hand” so that you understand what these numbers represent and how to do the calculations. Once you have mastered and understand these calculations, they can easily be done on a calculator or computer. Since so many people use the mean and standard deviation to analyze data, most calculators and spreadsheet software [@avg() and @std() work in most spreadsheet programs I’ve used] have built-in functions to do those calculations.

height histogram weight histogram

To more easily visualize statistical data, often a histogram is constructed. A histogram is a bar graph in which the X-axis represents the range of possible values divided into discrete categories, and the Y-axis represents the number of individuals who “fit into” each category (frequency of individuals observed at each value). Given a large-enough sample size, the histograms for weight and height for adult humans should look like “bell” curves, with fewer people in the highest and lowest weight/height categories, and more people in the middle categories.


How to Collect Your Data

For this lab, you should work in groups of 5 to 7 students.

  1. Please do not form a group of less than 5 people. Including at least 5 people will both increase the accuracy of your numbers and calculations and give you adequate practice doing these calculations. If you do not have enough people, split up and join other groups. While a group of more than 7 people would increase the accuracy of your data, that would also increase the number of required calculations.

Get your lab notebook set up to gather data.

    notebook chart
  1. Set up a chart similar to this in your lab notebook, and record the names and ages of all the students in your group.

Use the medical balance to determine your height.

    medical balance
  1. With the help of a lab partner from your group, use the medical balance in the biology lab to determine your height in centimeters (to the nearest 0.1 cm) and your weight in kilograms (to the nearest 0.01 kg). You may wish to remove your shoes to obtain a more accurate height measurement. Make sure you obtain readings with the correct number of decimal places, and make sure that you record your data (and those of your group members) directly into your lab notebook.
  2. bar straight on head
    height = 160.3 cm
  3. To determine your height, raise the height “bar” on the medical balance to approximately the height of your head, then stand on the balance. Someone else should adjust the height bar (up or down) until its arm sits flat on your head (make sure it is pointing straight sideways and not slightly up or down).
  4. height = 160.3 cm
  5. Read your height in the middle of the bar where the top piece slides into the bottom piece, and make sure to use the metric scale. For example, the height shown in these photos is 160.3 cm (not 5 ft 3⅛ in!). Also, remember to read your height to the nearest 0.1 cm, and remember to record your data in your lab notebook.

Use the medical balance to determine your weight.

    lower beam at 0 upper beam at 0
  1. When obtaining your weight, it is important to notice that the beams on the medical balance have two scales (metric and English) and two sets of notches intermixed. Begin with the weighs on both beams set at 0.

  2. lower beam in 40 kg notch lower beam in 100 lb notch
  3. First, adjust the weight on the lower beam. You need to make sure that the weight is in a notch for one of the metric system numbers, not one of the notches for an English system number (notice the difference, here between the 40-kg and 100-lb notches). Adjust the weight so that it is in the last metric notch that’s “too light.”

  4. Then, carefully slide the weight on the top beam over to adjust the balance such that the needle swings the same amount up and down. Do not wait for the needle to stop swinging because friction may cause it to stop somewhere else.
  5. upper beam at 14.15 kg
  6. Also, remember to read your weight to the nearest 0.01 kg. This balance is at 14.15 kg, so added to the 40 kg from the bottom beam, that person’s total weight would be 54.15 kg.
  7. Remember to write all the measurements for everyone in your group in your lab notebook.

Determine your pulse.

    look for tendon
  1. Locate the tendon that runs just to the “thumb-side” of the middle of the wrist.
  2. feel for pulse
  3. Use your same hand (right-right or left-left) as your “patient.” Support the person’s hand/wrist on the palm of your hand, and reach up and around with your fingers, such that your fingers line up along, and to the outside (thumb-side) of the tendon. (Do not use your thumb because you have a “pulse point” in the end of your thumb.) Gently, but firmly, press down with your fingers to feel the pulse in the radial artery.
  4. stopwatch
  5. Use a stopwatch to time for 30 sec. as you count the number of pulse beats you feel with your fingers. Multiply that number by two to determine the person’s pulse in beats per minute (BPM).
  6. Do this three times — obtain three separate pulse readings — and average the readings to calculate the person’s average pulse.

Determine your blood pressure.

    Pressure = 140, no blood flow
    with pressure of 140, no blood flow
    Pressure = 120, flow when heart beats
    with pressure of 120, flow when beating
    Pressure = 140, flow normally
    with pressure of 80, normal flow
  1. Like barometric pressure, blood pressure is designated in terms of how tall of a column (in mm) of mercury (Hg) that much pressure could support. Thus, the units used are “mm Hg.”
  2. A blood pressure reading consists of two numbers. The first, higher number is called the systolic pressure, and represents the pressure on the blood while the heart is actively contracting (and therefore putting enough pressure on the blood that it is able to overcome the resistance of the cuff and flow under it). The sounds you will hear at that point are the sounds of the brachial artery slapping shut as the heart relaxes and ceases to put pressure on the blood. Thus, you will need to listen (and watch the sphygmomanometer dial) for when you first hear a “beating” sound.
  3. The second, lower number is called the diastolic pressure, and represents the residual pressure in the artery while the heart is relaxed (in between a beat). At that point, the pressure in the cuff is low enough that the blood can easily flow underneath. Thus you will need to listen (and watch the dial) for when the sound becomes muffled.
  4. BP cuff
  5. You will be using a sphygmomanometer attached to a blood-pressure cuff to determine blood pressure. Become familiar with this equipment and the proper way to put the cuff on a person’s arm. Learn which is the inside and which is the outside, which is the top and which is the bottom, and in what “configuration” it is to be placed on someone’s arm.
  6. sphygmomanometer
  7. First, closely examine (and draw) the dial of the sphygmomanometer. Notice what divisions are marked on the dial, and what each of those divisions represents.
  8. bulb
  9. Also, examine and try out the bulb of the sphygmomanometer, and practice turning the screw that controls the valve (remember “righty-tighty, lefty-loosey”).
  10. sphygmomanometer
  11. The cuff should be wrapped snugly around the person’s upper arm, a little above the elbow. Many cuffs have a label indicating which area of the cuff should be lined up with the center-front of the person’s arm. Depending on where it is most visible, the sphygmomanometer may be clipped onto the cuff, as shown here, or removed and placed somewhere nearby.
  12. stethoscope
  13. You will also need to use a stethoscope to hear the sounds of the blood flowing past the cuff. Note that with no cuff on the person’s arm (or the cuff in place but totally deflated, you will not hear a sound because the blood flow is unimpeded.
  14. Before you insert the earpieces of the stethoscope into your ears, you need to insure that they are clean. Since these stethoscopes are used by numerous students, you need to squirt some 70% alcohol onto a Kimwipe® and use that to clean the earpieces before and after you use them.
  15. sphygmomanometer
  16. The bell of the stethoscope is placed slightly under the bottom edge of the cuff, above the elbow, on roughly the center front of the arm. It may be held in place with a finger or two, not your thumb (again, because of the pulse point in your thumb, you might hear your own pulse, instead). Also, remember that with the cuff deflated and the blood flowing freely through the brachial artery, you won’t hear anything.

  17. There is actually, a correct way to insert the earpieces into your ears. Notice that the tips do not face straight toward each other, but rather are “slanted” or “angled” in one direction. stethoscope earpieces Because your ear canals are angled forward (toward your face), the correct way to insert the stethoscope earpieces is also facing forward (toward your face) to enable you to better hear the sound. Be careful not to hit the bell of the stethoscope on anything while the earpieces are inserted in your ears — the noise will be VERY loud!
  18. Close the valve, but not so tightly that it gets stuck. Pump up the cuff — unless the person knows (s)he has hypertension (high blood pressure), going to about 140 mm Hg or so should be sufficient for most people.
  19. Then, open the valve slightly to slowly let air out. As the pressure drops, watch the dial of the sphygmomanometer and listen for any sounds. When you first hear sounds, make a mental note of that reading, and as the pressure continues to drop, make a mental note of when the sounds become muffled. Open the valve to TOTALLY RELEASE THE PRESSURE and let the person’s arm “rest.” Record the systolic and diastolic pressure numbers in your lab notebook.
  20. Repeat this twice more so you have three sets of numbers. Average the three systolic readings and average the three diastolic readings to calculate the person’s average blood pressure.

Submit your data online.

  1. Go to the (Biometrics Data Web page and enter the requested data, including your name or initials, sex, age (to the nearest 0.5 yr), height in centimeters (to the nearest 0.1 cm), weight in kilograms (to the nearest 0.01 kg), your three pulse readings, and your three blood pressure readings on that page. Note: that page contains JavaScript code that is checking to see if the right number of decimal places were entered, so if a message box pops up, READ IT and do what it is asking you to do. When everyone has entered his/her data, use the link to the class data at the bottom of that page to view and print a copy of the class results.

How to Analyze Your Data

  1. Calculate what percentage of your group are males, and what percentage are females.
    As a reminder,
    # males + # females = total #
    # males ÷ total # × 100 = % males
    # females ÷ total # × 100 = % females
    Remember to record all your calculations and final numbers directly into your lab notebook. Do not use a separate sheet of “scratch paper” first, then recopy your numbers.
  2. Calculate the mean age of your group. As a reminder, add all the ages, then divide by the number of people. Round your answer to the nearest 0.1 years. For example, if you wanted to find the mean (average) of the ages 19.0, 17.5, 22.5, 30.0, and 21.5, the total of those numbers would be 110.5, so since there are 5 numbers, the mean would be 110.5 ÷ 5 = 22.1 years.
  3. Calculate the standard deviation for the age of your group. As explained above, for these 5 numbers, the calculations would look like,
    Number Xi – X deviation2
    19.0 19.0 – 22.1 = –3.1 –3.12 =   9.61
    17.5 17.5 – 22.1 = –4.6 –4.62 = 21.16
    22.5 22.5 – 22.1 =  0.4   0.42 =   0.16
    30.0 30.0 – 22.1 =  7.9   7.92 = 62.41
    21.5 21.5 – 22.1 = –0.6 –0.62 =   0.36
    Σ = 110.5   Σ = 93.70
    110.5 ÷ 5 = X = 22.1   93.70 ÷ 5 = 18.74
        s = √18.74 = 4.3
    and thus, the mean ± standard deviation for this group would be expressed as 22.1 ± 4.3 years.
  4. Similarly, calculate the means and standard deviations for the heights and weights of your group.
  5. For pulse, calculate the group average two different ways — based on all three individual numbers for each person (thus averaging a total of 15 to 21 numbers) and based on each person’s average pulse (thus averaging a total of 5 to 7 numbers) — and compare to see if it makes a difference in the results.
  6. For pulse, calculate the group standard deviation based on each person’s average pulse. For blood pressure, (remember, systolic and diastolic numbers must be calculated separately) calculate the group mean and standard deviation based on each person’s “average” blood pressure.
  7. While histograms really work better with larger sets of data, to save time while simultaneously introducing you to how they are constructed, you are asked to plot histograms of your group’s data as follows.
    1. First, as explained in the protocol, make a list of 2.5-year-span age categories, such as:
      under-15
      15.0-17.4
      17.5-19.9
      20.0-22.4
      22.5-24.9
      25.0-27.4
      27.5-29.9
      30.0-32.4
      32.5-34.9
      35.0-37.4
      37.5-39.9
      40-and-over,
      including enough categories to cover the ages of all group members. Then determine how many people’s ages fall into each of those categories. For example, for the ages in the above example, this would look like:
          AGE       NUMBER
      17.5-19.9      2
      20.0-22.4      1
      22.5-24.9      1
      25.0-27.4      0
      27.5-29.9      0
      30.0-32.4      1
    2. Secondly, make a similar list of height by 5 cm categories (for example: 135.0-139.9, 140.0-144.9, etc.), including enough of those categories to include the heights of all group members. Then, determine how many people’s heights fall into each of those categories.
    3. Thirdly, make a similar list of weight by 10 kg categories (for example: 40.00-49.99, 50.00-59.99, etc.), and determine how many people’s weights fall into each of those categories.
    4. Similarly, list average pulse in 5 BPM categories (for example, 75.0-79.9 BPM), and average systolic and diastolic blood pressures (list separately) in 5 mm Hg categories. Determine how many of your group members fall into each of those categories.
    5. histogram example
    6. Set up a histogram (graph) for each set of numbers (age, height, etc.). Each “block” on the X-axis should represent one category (for example, on the height histogram, one of the units on the X-axis would represent the 150.0-154.9 cm group, the next would represent the 155.0-159.9 cm group, etc.). The Y-axis represents the number of people in each category (for example, if 12 people were in the 60.00-69.99 kg category, that bar would be 12 blocks tall). For each category listed on the X-axis, draw a bar the appropriate height to represent the number of people in that category. Refer to the graphing protocol and graphing Web page for information on proper graphing technique, and make sure to properly title your graph and label the axes.
    7. histogram example
    8. For each graph except the one for sex distribution, on the X-axis, indicate where the mean would be located. Also calculate and indicate the position of the mean + one standard deviation unit and the position of the mean – one standard deviation unit (for example, if X = 5.00 and s = 0.20, those two points would be at 5.20 and 4.80, respectively), as well as the mean ± two standard deviation units (which would be 5.40 and 4.60 for this example).
    9. You should end up with histograms for sex, age, height, weight, pulse, systolic, and diastolic blood pressure distributions for your group (a total of 7 graphs).
  8. After you print out the class data, compare your histograms with those for the class data. How are they similar? Different? Are the means in the same places? How similar or dissimilar are the standard deviations? Do any of the histograms form a bell curve?
  9. Based on your statistical analysis of the data, do there appear to any statistically-significant differences in either age, height, weight, pulse, systolic, or diastolic blood pressure when comparing males vs. females? Do there appear to be any statistically-significant differences when comparing people who are under 25 with those who are 20 and over? For example, if you have the following two groups of data:
    Group Mean Standard Deviation Mean +
    Std Dev
    Mean –
    Std Dev
    Group 1 25a 3 28 22a
    Group 2 22b 4 26b 18
    those groups of data would not be different from each other. While there is a “fancy” statistical calculation called a t-test that could be done to determine whether these groups are different or not, for our purposes, since
    25a – 3 = 22a (higher average – its standard deviation), and
    22b + 4 = 26b (lower average + its standard deviation),
    and those numbers are overlapping (22a = 22b and 26b > 25a),
    it can be concluded that there is not a statistically-significant difference between these numbers.
    If you were doing an experiment and obtained these data for your control and experimental groups, you would conclude that there is no difference between the groups, the factor being tested in/on the experimental group had no effect as compared to the control group. Thus, when stating your conclusions, it is important to cite the actual numbers upon which those conclusions are based.
  10. Before looking at the data, which would you have expected to be more different between the under/over 25 groups: weight, height, pulse, or blood pressure? Why? Do the actual data support or refute this hypothesis?
  11. Complete the statistics practice problems in the lab protocol. Show all your work in your lab notebook.

Things to Include in Your Notebook

Make sure you have all of the following in your lab notebook:


Copyright © 2010 by J. Stein Carter. All rights reserved.
Based on printed protocol Copyright © 2000 J. L. Stein Carter.
Chickadee photograph Copyright © by David B. Fankhauser
This page has been accessed Counter times since 23 Jun 2011.