P-value in statistics: Understanding the p-value and what it tells us - Statistics Help

Video Statistics and Information

Video

Captions Word Cloud

Reddit Comments

Captions

One of the most important concepts in statistics is the meaning of the P-value. Whenever we use Excel or other computer packages to analyse data, one of the key outputs is the p-value or sig. In formal terms, The p-value is the probability that, IF the null hypothesis were true, sampling variation would produce an estimate that is further away from the hypothesised value than our data estimate. In less formal terms, The p-value tells us how likely it is to get a result like this if the Null Hypothesis is true. We will now go through this step-by-step with an example. Helen sells Choconutties. Recently she has received complaints that the choconutties have fewer peanuts in them. than they are supposed to. The packet says that each 200g packet of choconutties contains 70g of peanuts or more. Helen can't open up all the packets to check as then she wouldn't be able to sell any. So she decides to use a statistical test on a sample of the packets. The null hypothesis, often called H "nought" is the thing we're trying to provide evidence against. For Helen, the null hypothesis is that the choconutties are as they should be. The mean or average weight of peanuts in the packet is 70 grams. The alternative hypothesis called H1 or HA is what we're trying to prove. The customers had complained that the weight of peanuts is less than what it should be. So the alternative hypothesis is that the average rate of peanuts is less than 70 grams. Helen decides to use a significance level of 0.05 if the P-value is lower than this, she will reject the null hypothesis Having decided on her hypotheses and on the significance level Helen takes a random sample of 20 packets of Choco-nutties from her current stock of 400 packets. she melts down the Choco-nutties and weighs the peanuts from each packet. If all of the values were lower than 70 grams with a mean of 30 grams for instance, it will be quite obvious that the bars did not have the required number of peanuts. It is very unlikely that you'll get 20 packets with a mean of 30 grams if the overall mean of all the packets in the population is 70 grams Conversely, if all the values of the 20 packets were much higher than 70 grams, it would be obvious that there were enough peanuts and that there was nothing to complain about. However, in this case the 20 packets contain the following weights of peanuts and the mean is 68.7 grams. This caused Helen to ask herself: "Does this provide enough evidence that the bars are short of peanuts or could this result just be from luck?" She asks her brother to use Excel to find the p-value for this data, comparing with the mean of 70 grams. The P value is 0.18 Judging from the data that we have, there is an 18 percent chance of getting a mean as low as this or lower if there is nothing wrong with the bars. That is, if the null hypothesis is true and the mean weight of nuts is 70 grams or more. This P value of 0.18 does not provide enough evidence to reject the null hypothesis. In this case helen does not have evidence to say that the bars are short of peanuts. This is a relief! The smaller the p-value is, the less likely it is that the result we got was simply a result of luck. If the P value had turned out to be very small we then would say that the result was significantly different from 70 grams. In general we start by saying that the null hypothesis is true. We take a sample and get a statistic. We work out how likely it is to get a statistic like this, if the null hypothesis is true. This is the p-value. If the P value is really really small, then our original idea must have been wrong, so we reject the null hypothesis. P is low, Null must go. A small P value indicates a significant result. the smaller the p-value is the more evidence we have that the null hypothesis is probably wrong. If the P-value is large, then our original idea is probably correct. we do not reject the null hypothesis. This is called a nonsignificant result. The P-value tells us whether we have evidence from the sample that there is an effect in the population. a P-value less than 0.05 means that we have evidence of an effect. A P-value of more than 0.05 means that there is no evidence of an effect. Sometimes a significance level different from 0.05 is used, but 0.05 is the most common one. This video uses plain language to get difficult ideas across. Some terminology might be viewed as incorrect by a rigorous statistician.

Info

Channel: Dr Nic's Maths and Stats

Views: 1,396,409

Rating: undefined out of 5

Keywords: p-value, statistics, educational, humor, math, AtMyPace, Stats, tutorial, Mathematics, stats, Help

Id: eyknGvncKLw

Channel Id: undefined

Length: 4min 42sec (282 seconds)

Published: Mon Oct 31 2011