**Linear Algebra**

# Guide on standard error in statistics

*schedule*Mar 5, 2023

*toc*Table of Contents

*expand_more*

**interactive map of data science**

# The expected value and variance of the sample mean

Suppose that a random sample of $n$ observations is taken from some population with mean $\mu_X$ and variance $\sigma_X^2$. Each random variable $X_1,X_2,…,X_n$ of the random sample will have the same distribution (iid). If such is the case, then the sample mean, $\bar{X}$, will have mean and variance as given below:

Note the following points:

these results hold true regardless of the underlying probability distribution of $X$.

only the expected value and variance of the sample mean $\bar{X}$ can be calculated - not the distribution of $\bar{X}$.

The expected value of the sample mean is:

The variance of the sample mean is:

# Standard error

The standard error is defined as the standard deviation of an estimator or statistic.

# Standard error of the sample mean

What is the standard error of the sample mean?

When we try to estimate the population mean using the sample mean, we know that the sample mean won't exactly equal the population mean. In order to know how good the estimate is, we use a statistic called the standard error of the sample mean.

A high value for the standard error means that there is high uncertainty in our estimate, whereas a low standard error means that we can be certain our estimate isn't off by much.

Recall from theorem 5 that the variance of the sample mean $\bar{X}$ is:

The standard error of the sample mean $\bar{X}$ is the standard deviation of $\bar{X}$:

As we can see, the standard error decreases as the sample size $n$ increases. This should be intuitive because the more data we use to compute the sample mean, the more precise the sample mean is in estimating the population mean. As the sample size tends to infinity, the standard error approaches zero.