1 00:00:00,025 --> 00:00:10,025 [MUSIC] 2 00:00:11,807 --> 00:00:14,402 In traditional hypothesis testing, 3 00:00:14,402 --> 00:00:19,347 one has the option to perform z-test or a t-test, and the question is, 4 00:00:19,347 --> 00:00:24,062 under what circumstances should one perform a z-test or a t-test? 5 00:00:24,062 --> 00:00:30,403 Well, the answer is rather simple, if one is aware of the populations 6 00:00:30,403 --> 00:00:35,380 standard deviation or variance, we use the z-test. 7 00:00:35,380 --> 00:00:41,170 And that is when we are comparing the sample mean to a hypothetical or 8 00:00:41,170 --> 00:00:42,770 a population mean. 9 00:00:42,770 --> 00:00:48,230 And if the population standard deviation is not known and 10 00:00:48,230 --> 00:00:51,670 we're comparing the sample mean against the population mean within unknown 11 00:00:51,670 --> 00:00:56,040 standard deviation, then we use the t-test. 12 00:00:56,040 --> 00:01:03,560 Now there are four scenarios in which we perform these tests. 13 00:01:03,560 --> 00:01:08,350 First scenario is where we are comparing a sample mean to a population 14 00:01:08,350 --> 00:01:12,890 mean and the population standard deviation is known, 15 00:01:12,890 --> 00:01:16,960 in that particular case, we use a z-test. 16 00:01:16,960 --> 00:01:21,880 And in cases where we are comparing a sample mean to a population mean with 17 00:01:21,880 --> 00:01:25,110 an unknown standard deviation, we use the t-test. 18 00:01:25,110 --> 00:01:28,510 Now this I covered earlier in the last slide. 19 00:01:28,510 --> 00:01:33,110 The new thing here is that when we compare the means of two independent samples, that 20 00:01:33,110 --> 00:01:39,370 is comparing the means of two independent samples with unequal variances. 21 00:01:39,370 --> 00:01:44,995 If we are faced with this kind of a question, we use a t-test. 22 00:01:44,995 --> 00:01:49,790 Again, if we are comparing the means of two independent samples with 23 00:01:49,790 --> 00:01:52,790 equal variances, we still use a t-test. 24 00:01:52,790 --> 00:01:55,460 The underlying theory is that when you're using a z-test, 25 00:01:55,460 --> 00:01:58,350 you're basing your results on normal distribution, and 26 00:01:58,350 --> 00:02:04,195 when you are deploying t-test, you're basing your results on t-distribution. 27 00:02:04,195 --> 00:02:09,396 And the process of hypothesis testing could be 28 00:02:09,396 --> 00:02:15,272 made rather simple by looking at these thresholds. 29 00:02:15,272 --> 00:02:17,811 If you are comparing the means and 30 00:02:17,811 --> 00:02:22,889 in the particular case you are looking at the null being that the two 31 00:02:22,889 --> 00:02:27,710 averages are the same against two averages not being the same. 32 00:02:27,710 --> 00:02:33,704 Then you're using a two-tailed test, and in that particular case you're looking for 33 00:02:33,704 --> 00:02:39,215 a t-statistic or z-statistics of 1.96, the absolute value of 1.96. 34 00:02:39,215 --> 00:02:43,630 If that were to be the case, you reject the null hypothesis. 35 00:02:43,630 --> 00:02:47,160 That is, you're conducting a two-tailed test. 36 00:02:47,160 --> 00:02:51,440 You can be using normal distribution or a t-distribution, and 37 00:02:51,440 --> 00:02:58,720 you get the calculated z or t-statistics of greater than absolute value of 1.96. 38 00:02:58,720 --> 00:03:00,166 And the expected p-value, 39 00:03:00,166 --> 00:03:04,574 the probability of that happening would be less than 0.05 and you reject the null. 40 00:03:04,574 --> 00:03:08,800 The null being that the two means are equal. 41 00:03:08,800 --> 00:03:14,670 In the case of one-tailed test, where you're testing whether the mean or 42 00:03:14,670 --> 00:03:18,380 average of one entity is greater or less than the other, 43 00:03:18,380 --> 00:03:23,310 here the absolute value for z or t-statistic is 1.64, and 44 00:03:23,310 --> 00:03:26,320 the probability would still be less than 0.05. 45 00:03:26,320 --> 00:03:29,510 If that were to be the case, you reject the null. 46 00:03:29,510 --> 00:03:35,450 If the calculated value for z or t-statistic is less than 1.96, you fail 47 00:03:35,450 --> 00:03:41,010 to reject the null hypothesis, and the null being the two averages are the same. 48 00:03:41,010 --> 00:03:42,708 In case of a one-tailed test and the calculated value for z or 49 00:03:42,708 --> 00:03:44,705 t-statistic is less than 1.64, you fail to reject the the null and 50 00:03:44,705 --> 00:03:46,368 the null could be that one value or the average is less than or 51 00:03:46,368 --> 00:03:47,725 equal to the other or is greater than the other.