1 00:00:05,660 --> 00:00:09,270 In this video, I will illustrate how to state 2 00:00:09,270 --> 00:00:10,860 your hypothesis when you're 3 00:00:10,860 --> 00:00:14,100 comparing the averages between two entity. 4 00:00:14,100 --> 00:00:17,040 We will use basketball as an example, 5 00:00:17,040 --> 00:00:20,370 using Michael Jordan and Wilt Chamberlain, 6 00:00:20,370 --> 00:00:22,620 the two highest scorers in 7 00:00:22,620 --> 00:00:25,680 the history of basketball, as examples. 8 00:00:25,680 --> 00:00:27,480 If you were to recall, 9 00:00:27,480 --> 00:00:30,735 you would know that Michael Jordan, on average, 10 00:00:30,735 --> 00:00:34,140 scored 30.12 point in each game, 11 00:00:34,140 --> 00:00:37,440 and Chamberlain averages around 30.06 points per game. 12 00:00:37,440 --> 00:00:38,880 If you were to compare 13 00:00:38,880 --> 00:00:43,140 the two averages between Michael Jordan and Chamberlain, 14 00:00:43,140 --> 00:00:45,560 and even though they are very similar looking numbers, 15 00:00:45,560 --> 00:00:49,265 we need to set up a statistical hypothesis testing. 16 00:00:49,265 --> 00:00:50,900 We are interested in comparing 17 00:00:50,900 --> 00:00:54,645 the average points scored by the two basketball players, 18 00:00:54,645 --> 00:00:56,540 and the comparison of means or 19 00:00:56,540 --> 00:00:59,585 averages is available in three flavors. 20 00:00:59,585 --> 00:01:01,400 First, we can assume that 21 00:01:01,400 --> 00:01:05,570 the average points per game scored by the two players, 22 00:01:05,570 --> 00:01:08,315 Jordan and Chamberlain, are the same. 23 00:01:08,315 --> 00:01:10,370 That is, the difference between 24 00:01:10,370 --> 00:01:12,905 their mean scores is zero. 25 00:01:12,905 --> 00:01:14,495 If their averages are the same, 26 00:01:14,495 --> 00:01:17,990 average of one minus average of other should be zero, 27 00:01:17,990 --> 00:01:20,165 this becomes a null hypothesis. 28 00:01:20,165 --> 00:01:23,450 Let's say if Mu_j 29 00:01:23,450 --> 00:01:25,790 represents the average points 30 00:01:25,790 --> 00:01:27,730 per game scored by Michael Jordan, 31 00:01:27,730 --> 00:01:31,865 and Mu_c represents the average points 32 00:01:31,865 --> 00:01:35,135 per game scored by Wilt Chamberlain, 33 00:01:35,135 --> 00:01:40,515 we can state the null hypothesis to be Mu_j equal Mu_c, 34 00:01:40,515 --> 00:01:43,005 that is the average scored by 35 00:01:43,005 --> 00:01:46,440 Jordan and average scored by Chamberlain are the same. 36 00:01:46,440 --> 00:01:49,775 The alternative hypothesis would be that no, 37 00:01:49,775 --> 00:01:52,480 these averages are not the same, they are different. 38 00:01:52,480 --> 00:01:55,790 The alternative hypothesis H_a 39 00:01:55,790 --> 00:02:00,335 compared to null hypothesis H_0 or o. 40 00:02:00,335 --> 00:02:04,310 The alternative is that the averages are not the same, 41 00:02:04,310 --> 00:02:06,790 their average scores are different. 42 00:02:06,790 --> 00:02:09,480 Now, the other option, the second option, 43 00:02:09,480 --> 00:02:13,460 is to assume that Jordan scored better or higher. 44 00:02:13,460 --> 00:02:16,520 In that case, our null hypothesis is 45 00:02:16,520 --> 00:02:19,160 the average score by 46 00:02:19,160 --> 00:02:21,410 Michael Jordan is greater than or 47 00:02:21,410 --> 00:02:24,350 equal to the average score by Wilt Chamberlain. 48 00:02:24,350 --> 00:02:25,820 In this particular case, 49 00:02:25,820 --> 00:02:29,630 the alternative hypothesis would be different. 50 00:02:29,630 --> 00:02:31,070 It wouldn't be not equal to, 51 00:02:31,070 --> 00:02:33,175 but it would be less than. 52 00:02:33,175 --> 00:02:36,305 The alternative would be that the average scored by 53 00:02:36,305 --> 00:02:40,260 Jordan is less than that by Wilt Chamberlain. 54 00:02:40,260 --> 00:02:41,810 By the same account, 55 00:02:41,810 --> 00:02:44,210 our third option will be 56 00:02:44,210 --> 00:02:46,940 that the null hypothesis is that in fact Jordan scored 57 00:02:46,940 --> 00:02:49,440 less than Wilt Chamberlain and 58 00:02:49,440 --> 00:02:52,250 the alternative hypothesis will be the reverse of it, 59 00:02:52,250 --> 00:02:55,160 saying that no, Michael Jordan scored 60 00:02:55,160 --> 00:02:58,580 higher than Wilt Chamberlain. 61 00:02:58,580 --> 00:03:00,750 In a nutshell, we have three options. 62 00:03:00,750 --> 00:03:02,180 We can say the averages are the 63 00:03:02,180 --> 00:03:03,620 same and the null would be, 64 00:03:03,620 --> 00:03:05,795 no, they are not the same, not equal. 65 00:03:05,795 --> 00:03:08,150 We can say the average is less than, 66 00:03:08,150 --> 00:03:10,400 the null is that Jordan's average 67 00:03:10,400 --> 00:03:11,650 is higher than Chamberlain, 68 00:03:11,650 --> 00:03:13,820 and the alternative would be the reverse of it. 69 00:03:13,820 --> 00:03:16,055 The third option is to say that 70 00:03:16,055 --> 00:03:18,230 the average scored by 71 00:03:18,230 --> 00:03:20,960 Jordan is less than average scored by Chamberlain, 72 00:03:20,960 --> 00:03:23,885 and the reverse of it will be the alternative hypothesis. 73 00:03:23,885 --> 00:03:28,800 So, three ways of defining a hypothesis.