An Introduction to the Normal Distribution

Video Statistics and Information

Video
Captions Word Cloud
Reddit Comments
Captions
Let's look at an introduction to the normal distribution, also known as the Gaussian distribution. The normal distribution is an extremely important continuous probability distribution. It arises very very frequently in theory and practice. Here I've plotted out what is approximately the distribution of the height of adult Canadian males. And this, what we see here, is a normal distribution. This is our variable down here and f(x) is what we call the probability density function, which gives the height of the curve at point x. Here's the equation for the probability density function of the normal distribution. We're not going to be working with this directly very often, but there's a few points we should note. First of all, if the random variable X has a normal distribution, then it can take on any finite value. There are two parameters for this distribution. Mu is the mean of the distribution. And the mean can take on any finite value as well. Sigma is the standard deviation and sigma squared, of course, is the variance. Sigma has to be some positive value and of course sigma squared would have to be some positive value as well. Here's a normal distribution plotted out. Mu, the mean, is also right smack dab in the middle of the distribution. The distribution is symmetric about mu, so mu represents the mean of the probability distribution, and it's also the median of the probability distribution. No let's try and get some perspective for what sigma represents. Here I've plotted in one standard deviation below the mean to one standard deviation above the mean. And as we'll see a little bit later on this area between one standard deviation below to one standard deviation above is approximately 0.68. 68% of the area lies within one standard deviation of the mean Here I've plotted in mu minus two standard deviations to mu plus two standard deviations. And approximately 95% percent of the area lies within two standard deviations of the mean. And approximately 99.7% of the area lies within three standard deviations of the mean. Here's another plot to give us a little perspective on what sigma represents. Over here this red line represents a normal distribution with a standard deviation of sigma_2. And this white curve represents a normal distribution with a standard deviation of sigma_1. But sigma_2 is double sigma_1, so the standard deviation on the red curve is double the standard deviation of the white curve. And we can see that when the standard deviation is greater there is more area in the tails and a lower peak. There are an infinite number of different normal distributions corresponding to all the different possible values of mu and sigma. If X is a random variable that has a normal distribution with mean mu and variance sigma squared, we write this as the random variable X is distributed normally with a mean of mu and a variance of sigma squared. X is distributed normally, a big N, first term is the mean, second term is the variance. Be a bit careful here, as different sources can have a different term in here. Sometimes people have the second term representing the standard deviation. I have the second term representing the variance. So just be a bit careful with that. By definition, the standard normal distribution is a normal distribution with a mean of zero and a variance of one. We often represent random variables that have the standard normal distribution with the letter Z. And yes, I am Canadian. And this, we might say, if Z has a standard normal distribution, we might write this as Z is distributed normally with a mean of 0 and a variance of 1. That being mu, that being sigma squared. And if the variance is 1, of course sigma is 1 as well. As we will soon see, we very often need to find areas under the standard normal curve. Probabilities are simply areas under the curve, and very often the question of interest involves finding a probability. Finding probabilities and percentiles for the normal distribution requires integrating the probability density function. There isn't a closed-form solution and it must be integrated numerically. Fortunately for us ,we're going to use software or a standard normal table to actually find these values in practice.
Info
Channel: jbstatistics
Views: 642,256
Rating: undefined out of 5
Keywords: normal distribution, normal, Gaussian distribution, gaussian, continuous random variables, continuous, random variables, probability distributions, probability density function, pdf, introductory statistics, introductory, statistics, jbstatistics, jb statistics, 8msl, 8 minute stats lectures, intro stats videos, intro stats help, stats help, stats tutor, jeremy balka, AP statistics
Id: iYiOVISWXS4
Channel Id: undefined
Length: 5min 27sec (327 seconds)
Published: Sun Dec 23 2012
Related Videos
Note
Please note that this website is currently a work in progress! Lots of interesting data and statistics to come.