We now have five different values that describe how far away each point is from the mean. That seems to be a good start in describing the spread of the data. But the whole point of calculating variance was to get one number that describes the dataset. We don’t want to report five values — we want to combine those into one descriptive statistic.
To do this, we’ll take the average of those five numbers. By adding those numbers together and dividing by
5, we’ll end up with a single number that describes the average distance between our data points and the mean.
Note that we’re not quite done yet — our final answer is going to look a bit strange here. There’s a small problem that we’ll fix in the next exercise.
Sum the five variables
difference_five and store the result in the variable
5 and store the result in the variable named
Think about the answer. Do you think it accurately captures the spread of your data?