# Calculating standard deviation step by step

## Introduction

In this article, we'll learn how to calculate standard deviation "by hand".
Interestingly, in the real world no statistician would ever calculate standard deviation by hand. The calculations involved are somewhat complex, and the risk of making a mistake is high. Also, calculating by hand is slow. Very slow. This is why statisticians rely on spreadsheets and computer programs to crunch their numbers.
So what's the point of this article? Why are we taking time to learn a process statisticians don't actually use? The answer is that learning to do the calculations by hand will give us insight into how standard deviation really works. This insight is valuable. Instead of viewing standard deviation as some magical number our spreadsheet or computer program gives us, we'll be able to explain where that number comes from.

## Overview of how to calculate standard deviation

The formula for standard deviation (SD) is
$\Large\text{SD} = \sqrt{\dfrac{\sum\limits_{}^{}{{\lvert x-\mu\rvert^2}}}{N}}$
where $\sum$ means "sum of", $x$ is a value in the data set, $\mu$ is the mean of the data set, and $N$ is the number of data points in the population.
The standard deviation formula may look confusing, but it will make sense after we break it down. In the coming sections, we'll walk through a step-by-step interactive example. Here's a quick preview of the steps we're about to follow:
Step 1: Find the mean.
Step 2: For each data point, find the square of its distance to the mean.
Step 3: Sum the values from Step 2.
Step 4: Divide by the number of data points.
Step 5: Take the square root.

## An important note

The formula above is for finding the standard deviation of a population. If you're dealing with a sample, you'll want to use a slightly different formula (below), which uses $n-1$ instead of $N$. The point of this article, however, is to familiarize you with the the process of computing standard deviation, which is basically the same no matter which formula you use.
$\text{SD}_\text{sample} = \sqrt{\dfrac{\sum\limits_{}^{}{{\lvert x-\bar{x}\rvert^2}}}{n-1}}$
That's a great question, but it is difficult to answer succinctly. We have a lot of videos and simulations on this topic—it's fairly complex and quite interesting.
If you want to learn about the distinction between population and sample standard deviation, and why they're not calculated the same way, you should head to the lesson on sample variance and standard deviation.
Onward!

## Step-by-step interactive example for calculating standard deviation

First, we need a data set to work with. Let's pick something small so we don't get overwhelmed by the number of data points. Here's a good one:
$6, 2, 3, 1$

### Step 1: Finding $\goldD{\mu}$ in $\sqrt{\dfrac{\sum\limits_{}^{}{{\lvert x-\goldD{\mu}\rvert^2}}}{N}}$

In this step, we find the mean of the data set, which is represented by the variable $\mu$.
Fill in the blank.
$\mu =$
$\mu = \dfrac{6+2 + 3 + 1}{4} = \dfrac{12}{4} = \blueD3$

### Step 2: Finding $\goldD{\lvert x - \mu \rvert^2}$ in $\sqrt{\dfrac{\sum\limits_{}^{}{\goldD{{\lvert x-\mu}\rvert^2}}}{N}}$

In this step, we find the distance from each data point to the mean (i.e., the deviations) and square each of those distances.
For example, the first data point is $6$ and the mean is $3$, so the distance between them is $3$. Squaring this distance gives us $9$.
Complete the table below.
Data point $x$Square of the distance from the mean $\lvert x - \mu \rvert^2$
$6$$9$
$2$
$3$
$1$
Data point $x$Distance from the mean squared $\lvert x - \mu \rvert^2$
$6$$\lvert6-\blueD{3}\rvert^2 = 3^2 = 9$
$2$$\lvert2-\blueD{3}\rvert^2 = 1^2 = 1$
$3$$\lvert3-\blueD{3}\rvert^2 = 0^2 = 0$
$1$$\lvert1-\blueD{3}\rvert^2 = 2^2 = 4$

### Step 3: Finding $\goldD{\sum\lvert x - \mu \rvert^2}$ in $\sqrt{\dfrac{\goldD{\sum\limits_{}^{}{{\lvert x-\mu}\rvert^2}}}{N}}$

The symbol $\sum$ means "sum", so in this step we add up the four values we found in Step 2.
Fill in the blank.
$\sum\lvert x - \mu \rvert^2 =$
Add up all of the squared distances from the data points to the mean from Step 2:
$\sum\lvert x - \mu \rvert^2 = 9 + 1 + 0 + 4 = 14$

### Step 4: Finding $\goldD{\dfrac{\sum\lvert x - \mu \rvert^2}{N}}$ in $\sqrt{\goldD{\dfrac{\sum\limits_{}^{}{{\lvert x-\mu}\rvert^2}}{N}}}$

In this step, we divide our result from Step 3 by the variable $N$, which is the number of data points.
Fill in the blank.
$\dfrac{\sum\lvert x - \mu \rvert^2}{N} =$
Divide the sum from Step 3 by the number of data points $( N =4)$:
$\dfrac{\sum\lvert x - \mu \rvert^2}{N} =\dfrac{{14}}4 = {3.5}$

### Step 5: Finding the standard deviation $\sqrt{\dfrac{\sum\limits_{}^{}{{\lvert x-\mu\rvert^2}}}{N}}$

We're almost finished! Just take the square root of the answer from Step 4 and we're done.
Fill in the blank.
$\text{SD} = \sqrt{\dfrac{\sum\limits_{}^{}{{\lvert x-\mu\rvert^2}}}{N}} \approx$
Take the square root of the number we found in Step 4:
$\sqrt{\dfrac{\sum\limits_{}^{}{{\lvert x-\mu\rvert^2}}}{N}} = \sqrt{{3.5}} \approx 1.87$
The standard deviation is $1.87$.
Yes! We did it! We successfully calculated the standard deviation of a small data set.

### Summary of what we did

We broke down the formula into five steps:
Step 1: Find the mean $\mu$.
$\mu = \dfrac{6+2 + 3 + 1}{4} = \dfrac{12}{4} = \blueD3$
Step 2: Find the square of the distance from each data point to the mean $\lvert x-\mu\rvert^2$.
$x$$\lvert x - \mu \rvert^2$
$6$$\lvert6-\blueD{3}\rvert^2 = 3^2 = 9$
$2$$\lvert2-\blueD{3}\rvert^2 = 1^2 = 1$
$3$$\lvert3-\blueD{3}\rvert^2 = 0^2 = 0$
$1$$\lvert1-\blueD{3}\rvert^2 = 2^2 = 4$
Steps 3, 4, and 5:
\begin{aligned} \text{SD} &= \sqrt{\dfrac{\sum\limits_{}^{}{{\lvert x-\mu\rvert^2}}}{N}}\\\\\\\\ &= \sqrt{\dfrac{9 + 1 + 0 + 4}{4}} \\\\\\\\ &= \sqrt{\dfrac{{14}}{4}} ~~~~~~~~\small \text{Sum the squares of the distances (Step 3).} \\\\\\\\ &= \sqrt{{3.5}} ~~~~~~~~\small \text{Divide by the number of data points (Step 4).} \\\\\\\\ &\approx 1.87 ~~~~~~~~\small \text{Take the square root (Step 5).} \end{aligned}

## Try it yourself

Here's a reminder of the formula:
$\Large\text{SD} = \sqrt{\dfrac{\sum\limits_{}^{}{{\lvert x-\mu\rvert^2}}}{N}}$
And here's a data set:
$1, 4, 7, 2,6$
Find the standard deviation of the data set.
$\text{SD}=$

### Find the mean

$\mu = \dfrac{1 + 4 + 7 + 2 + 6}{5} = \dfrac{20}{5} = \blueD4$

### Find the square of the distances from each of the data points to the mean

$x$$\lvert x - \mu \rvert^2$
$1$$\lvert1-\blueD{4}\rvert^2 = 3^2 = 9$
$4$$\lvert4-\blueD{4}\rvert^2 = 0^2 = 0$
$7$$\lvert7-\blueD{4}\rvert^2 = 3^2 = 9$
$2$$\lvert2-\blueD{4}\rvert^2 = 2^2 = 4$
$6$$\lvert6-\blueD{4}\rvert^2 = 2^2 = 4$

### Apply the formula

\begin{aligned} \text{SD} &= \sqrt{\dfrac{\sum\limits_{}^{}{{\lvert x-\mu\rvert^2}}}{N}} \\\\\\\\ &= \sqrt{\dfrac{9 + 0 + 9 + 4 + 4}{5}} \\\\\\\\ &= \sqrt{\dfrac{26}{5}}\\\\\\\\ &= \sqrt{5.2}\\\\\\\\ &\approx 2.28\end{aligned}

The standard deviation is approximately $2.28$.