Why you may want to know this
The normal distribution is foundational knowledge for many statistics used in research, such as standard deviation, so understanding it opens the door to reading and understanding massage research articles that use those statistics.
As we often mention at POEM, words have power. The word "normal" can have strong connotations in everyday language, and can be used as implicit or explicit disapproval criticism against people who don't conform to norms of society.
In scientific usage, however, "normal" is not nearly so loaded a word. While it has the same denotations (dictionary meanings) of "typical, usual, or close to an average, according to a benchmark or standard", it doesn't carry any connotations (ideas) of positive or negative simply for being unusual.
What usual and unusual mean will vary, according to the situation. Generally, few people in the total population are extreme in many physical measurements; most are pretty close to a typical value in respect to most measurable physical qualities, which is what we'll deal with most as MTs. So, in that sense, most people are pretty "normal", and we'll remember to be aware of and sensitive to the needs of those who aren't.
For example, consider as a physical measurement the birth weight of all healthy babies born at term in the developed world. In this group, there will be a few big babies, weighing 8½ (8.5) to 9 pounds or more. The baby on the left in the following picture, born in the UK, weighed 14 lb, 7 oz (14.44 lb) at birth.

Source: http://img.dailymail.co.uk/i/pix/2007/08_01/nicholson1NTI1008_468x650.jpg
There will also be a few small babies, who weigh 6 to 6½ (6.5) pounds or less.

Source: http://latimesblogs.latimes.com/.m/photos/uncategorized/2009/03/18/premie.jpg
Unless some sort of problem such as gestational diabetes or premature delivery occurs, most of these babies born at term in developed nations tend to weigh about 7 to 8½ (8.5) pounds.
Let's imagine that we are keeping track of the babies born in one small region, and that 10 babies are born, with the following birth weights:
|
Baby |
Birth weight (in pounds) |
|
Baby 1 |
7.2 |
|
Baby 2 |
9.6 |
|
Baby 3 |
7.5 |
|
Baby 4 |
7.7 |
|
Baby 5 |
7.4 |
|
Baby 6 |
8.0 |
|
Baby 7 |
5.9 |
|
Baby 8 |
7.6 |
|
Baby 9 |
6.1 |
|
Baby 10 |
8.9 |
Let's graph the data from our (imaginary) observation of birth weights.
Number of babies in this group that weigh less than 5.5 lbs: 0
Number of babies in this group that weigh 5.5 lbs-6.9 lbs: 2
(Baby 7, Baby 9)
Number of babies in this group that weigh 7.0 lbs-8.4 lbs: 6
(Baby 1, Baby 3, Baby 4, Baby 5, Baby 6, Baby 8)
Number of babies in this group that weigh 8.5 lbs-9.9 lbs: 2
(Baby 2, Baby 10)
Number of babies in this group that weigh more than 10 lbs: 0
We'll make a column chart of this data, where the x-axis (horizontal) is the birth weight range in pounds, and the y-axis (vertical) is the number of babies with that birth weight.
There is a pattern emerging in that data--most of the babies' birth weights tend to be in the middle of the range--fewer babies are either extremely large or extremely small at birth.
This much larger sample of Norwegian births between the years 1992 and 1998 was graphed in the same way. The x-axis (horizontal) is still the birth weight range, and the y-axis (vertical) is the number of babies with that birth weight. Since Norway uses the metric system, however, they report their birth weight data in kilograms (kg), so the x-axis is labeled in kg, rather than in pounds.
To compare that data to ours, then, we need to know how to convert between kg and pounds.
1.0 kg = 2.2 lb
2.0 kg = 4.4 lb
2.3 kg = 5.0 lb
2.7 kg = 6.0 lb
3.0 kg = 6.6 lb
3.2 kg = 7.0 lb
3.6 kg = 8.0 lb
4.0 kg = 8.8 lb
4.1 kg = 9.0 lb
4.5 kg = 10.0 lb
5.0 kg = 11.0 lb
6.0 kg = 13.0 lb
Even though this dataset is very, very much larger than our 10 observations--several of these birth weights in the mid-range are represented by 30,000 babies or more--we still see the same pattern we saw in our data: lots of babies have a middle-of-the-road birth weight, and the more extreme (the further from average) the birthweight, the fewer babies who have that weight.
This is another graph of birth weights, generated by the National Institutes of Health in the United States. Like the Norwegian graph, the weights in the middle of the range represent 30,000 or more babies born with those weights.
On this graph, the x-axis represents weights in grams (gm here, although usually abbreviated g), so the numbers along the horizontal axis are 1000 times larger than the numbers on the Norwegian graph in kg. No matter what the concept or term that we call it by is, however (gram vs. kilogram vs. pound), the physical referent--how heavy the newborn is--remains constant.
There is a lot of extra information on this graph that we won't be using, so don't worry about anything that seems unclear at this point, such as what "Residual" may mean. I'm talking only about the gray columns and the blue curve drawn over it, although you may recognize mean from our previous discussion (
available by clicking here), and you may also recognize SD as "standard deviation", a statistical measurement that we are now laying the ground for discussing.
The smooth curve drawn connecting the values of these columns is the bell curve, which gets its name from the perceived resemblance to the outline of a bell. A normal distribution of a characteristic in a defined population or group describes a bell curve when graphed in this way.
The relatively few very small and very large babies are the small quantities shown at the extreme left and right sides of the graph (forming the small “tail” at either end). The higher number of 7 to 8½ pound babies make up the big “bump” or curve at the center of the graph.
The awesome thing about the normal distribution is how often it occurs naturally. Remember that our first dataset was made up up imaginary values. I chose those values carefully, to set it up to lead us smoothly into the rest of the discussion.
However, the next two datasets were real, natural data--nothing imaginary required for them. The fact that this distribution is found so often in so many different situations in the natural world allows us to draw connections that we can develop new knowledge out of.
Since data values for many natural phenomena tend to form this normal distribution—with most of the numbers in the middle and a few extreme values at either end—when not subjected to some purposeful manipulation (such as a massage treatment), this effect can be used as a baseline for measuring the distribution of data after such a treatment to see whether it differs significantly from the way the data was distributed before the treatment.
Recognition of the power of this technique lies at the heart of some of the most useful and powerful methods in mathematics and science. And over in Journal Club, you can see from the following presentation slides how Moyer and his team use this as part of their method to determine whether or not massage significantly reduces cortisol (it doesn't).
