Question 1

Starting at 4:22, why do you need to estimate the sample standard deviation when you already have it(.5)? He goes on to say that you put a hat on it to show that you estimated the population standard deviation by using the sample but why does the sigma have a hat for population estimate and have an x bar for sample? Is the notation correct on that section?

Accepted Answer

Don't forget, we don't really care about the st.dv. of the sampl, we care about it's relationship to the population. So we have to take measures that involve the actual population. You must first see the video "standard error of the mean" to get this one.

Question 2

Why are you not using a t-distribution to find the probability of getting the sample result?  I know that when the sample size is large (n = 100), a t-distribution is essentially the same as a normal distribution, but I think this lesson can be misleading when we are taught to use a t-distribution in the common case when the population standard deviation is not known and we are estimating it from the sample.

Accepted Answer

The t-test is more conservative, if the sample size is small. I think you would opt for the more conservative test, knowing that with a larger sample size, there is essentially no difference between t and z. In general, when comparing two means, the t-test is used. Note from the results given above by ericp, that the conclusion from either test is the same. The two groups differ significantly. In scientific reports, p-value is reported to 2 decimal places. So using either the z or t test, you would report a significant difference "with p < .01".

Question 3

SHouldn't it be the other way around when calculating the Z value?

(1.05-1.2)/0.05 instead of (1.2-1.05)/0.05?

My professor always told me to do it that way. The final conclusion doesn't change in this case though, but just wanted to make sure if that's the proper way.

Accepted Answer

since normal probability distribution (bell curve) is symmetric around the mean, it doesnt matter. It gives same result in terms of area under curve, thats why prof. wanted to make it less complex in saying that. But if we were dealing with a non symmetric prob. distr. like F distr, then it would matter. 
hope that helps.

Question 4

Is it valid to assume the sample SD is close to the population SD? Even if the sample size is high, the rats in the sample have been injected, how do we know that doesn't affect the sample SD?

Accepted Answer

It is an assumption you are making, justified by the fact that your Ho is that the drug has no effect, and that the populations (drug vs. no drug) will actually be identical. If the drug has no effect, then the standard deviation of drug and no drug rats should be the same. It is an assumption, justified with some logic, but not proven.

In a research paper, this would be recognized as a weakness, but an unavoidable one, because it is impossible to know the true standard deviation of either population - you only know the samples.

Question 5

I don't understand where Sal got 99.7%... can anyone explain? (8:50)

Accepted Answer

He mentioned this a couple of videos ago, but he is using the empirical rule, which states that, for a normal distribution, 99,7% of all values lie within 3 standard deviations of the mean. Similarly, 68,27% lies within 1 standard deviation and 95,45% within 2. See: http://en.wikipedia.org/wiki/68-95-99.7_rule

Question 6

Shouldn't we say that the alternative hypothesis is just μ<1.2s and not in both directions?

Accepted Answer

That's an important question.  In the end, it gets down to the reason that you are conducting the experiment.  In this case, the null hypothesis is that the drug doesn't have an effect on response time, so you want to measure both tails.  If your null hypothesis was that the drug doesn't have a *negative* effect on response time, then you would only measure one tail.

Question 7

How do you calculate the critical value? I cant find an explaination for it in your video list. Thank you!

Accepted Answer

short answer:  Critical values are generally chosen or looked up in a table (based on a chosen alpha).

longer answer:
--------------------
In this video there was no critical value set for this experiment.  In the last seconds of the video, Sal briefly mentions a p-value of 5% (0.05), which would have a critical of value of z = (+/-) 1.96. Since the experiment produced a z-score of 3, which is more extreme than 1.96, we reject the null hypothesis.

Generally, one would chose an alpha (a percentage) which represents the "tolerance level for making a mistake.*"   Then the corresponding critical value can be looked up from a table.  [* the "mistake" being to incorrectly reject the null hypothesis.  In other words, we made the error of claiming that the experiment had an effect when it did not.]

The critical value is the cut-off point that corresponds to that alpha;  any value beyond the critical value is less than alpha(%) likely to occur by chance.

see the wikipedia page for a z-tables and how to read them
http://en.wikipedia.org/wiki/Standard_normal_table

note that for an alpha of 5%, in a cumulative table, you would first divide your alpha in half for a two-tailed test, then subtract that from 1.  That is the value you are looking for in the table.  So we get 1 - (.05/2) = 1 - .025 = 0.9750
We find 0.9750 in our table, look at the row: 1.9; look at the column: 0.06;  add the two together to get the corresponding z-score:  1.96.

Question 8

If we assume that the null hypothesis is true, then why do we assume that the sample mean is 1.2 sec? We already know that it's 1.05 sec.

Accepted Answer

Because that  _*is*_  the null hypothesis (_*H0*_).

What we are testing is how likely we are to have seen the data, under the assumption that _*H0*_ is true. Null hypothesis testing follows a somewhat backward seeming logic, but this is apparently pretty standard in mathematics. 
1) We calculate how probable it is that we would have seen the observed data if _*H0*_ is true. 
2) We then either reject _*H0*_ (or fail to reject it) depending on how often we are willing to wrongly reject _*H0*_ (this is the Type I error rate).
3) If we reject _*H0*_ then we _provisionally_ conclude that our alternate hypothesis could be true ...

Question 9

Sal said: "Assuming the Null Hyphothesis was true, if the probability of getting the result from the sample is very small, then we reject the Null!"
But how could this be? Because if this probability is very small, it means that the Null is indeed true as a common sense. This is somewhat counter intuitive and I really get confused!!
Someone please, speak the kind of language where newbie like me can understand ?? I dont see any relationship here in this?? It seems that he took for granted that we already understood sth...but i actually many of us dont! So plze point out the logic that you make here related to the null, the z score, and the rejection. thank you!

Accepted Answer

I think that your comment shows you came to the proper realization. Maybe this will help clarify or solidify the ideas, or in case others don't fully see it as you did:

First: There is some population (of rats injected with this drug), and this population has a mean, µ.  We don't know the value of µ, but we can use a hypothesis test to gives us some information about it (if we knew µ, we wouldn't have to do a hypothesis test at all).

Second: We form a null hypothesis, in this case it is Ho: µ = 1.2. The 1.2 is just some value of interest to which we want to compare. In this case, it is the "status quo", the value of µ for rats NOT injected with the drug.

Third: We calculate our test statistic (the z-score) and p-value, _assuming that Ho is correct_. That is, we are assuming that the rats injected with the drug have the same value of µ as the rats not injected with the drug. It's important to remember that this is an assumption, and it may not accurately reflect reality. The _data values_ will follow the real value of µ, because that is reality. So if µ=1, then the values will tend to group around 1. If µ=1.2, then the values will tend to group around 1.2.

Then, if the null hypothesis is _wrong_, then the data will tend to group at a point that is _not_ the value in the null hypothesis (1.2), and then our p-value will wind up being very small. If the null hypothesis is correct, or close to being correct, then the p-value will be larger, because the data values will group around the value we hypothesized.

Course: Statistics and probability > Unit 12

Hypothesis testing and p-values

Want to join the conversation?

Video transcript