Non Parametric Method
becomes $\theta < \frac{1}{2}$ and $\theta > \frac{1}{2}$, respectively. If a sample value equals $\mu_0$ (which can happen when we deal with rounded data even though the population is continuous), we simply discard it.

In order to perform a sign test when the sample size is very small, we use the table of binomial probabilities; when the sample is large, we use the normal approximation to the binomial distribution with

Mean $= n\theta$, variance $= n\theta(1-\theta)$, S.E. $= \sqrt{n\theta(1-\theta)}$.

The test statistic in this case is

$$Z = \frac{X - n\theta}{\sqrt{n\theta(1-\theta)}}$$

with $\theta = \frac{1}{2}$ and $X$ the number of plus signs.

Example 3 : The following are the measurements of breaking strength of a certain kind of 2-inch cotton ribbon in pounds:

163 165 160 189 161 171 158 151 169 162
163 139 172 165 148 166 172 163 187 173

Use the sign test to test the null hypothesis $\mu = 160$ against the alternative hypothesis $\mu > 160$ at the 0.05 level of significance.

Solution :
1. Null Hypothesis: $H_0 : \mu = 160$. Alternative Hypothesis: $H_1 : \mu > 160$ ⇒ It is a case of one-tailed test.
2. Level of significance: $\alpha = 0.05$.
3. Test statistic: $X$ = the observed number of plus signs. Replacing each value exceeding 160 with a plus sign, each value less than 160 with a minus sign, and discarding the value which equals 160, we get:

+ + + + + - - + + + - + + - + + + + +

Here $n$ = the total number of plus and minus signs = 19 and $x$ = the total number of plus signs = 15. It is a case of binomial distribution. From the binomial table given at the end, we find $P(X \ge 15) = 0.0095$ for $\theta = \frac{1}{2}$ and $n = 19$.
[For $\theta = 0.5$, $P(X \ge 15) = P(X = 15) + P(X = 16) + P(X = 17) + P(X = 18) + P(X = 19) = 0.0074 + 0.0018 + 0.0003 + 0.0000 + 0.0000 = 0.0095$]
4. Decision: Since the P-value (= 0.0095) is less than 0.05, the null hypothesis must be rejected, and we conclude that the mean breaking strength of the given kind of ribbon exceeds 160 pounds.
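The sign-test arithmetic described above can be checked with a short script. This is only a minimal sketch using the Python standard library; the function names (`sign_counts`, `binom_upper_tail`, `sign_test_z`) are illustrative choices, not part of the text. It counts the signs for the ribbon data, computes the exact binomial tail $P(X \ge x)$ with $\theta = \frac{1}{2}$, and evaluates the normal-approximation statistic $Z$ for an illustrative large-sample case ($x = 16$, $n = 40$).

```python
from math import comb, sqrt

def sign_counts(values, mu0):
    """Return (plus, minus) counts; values equal to mu0 are discarded."""
    plus = sum(1 for v in values if v > mu0)
    minus = sum(1 for v in values if v < mu0)
    return plus, minus

def binom_upper_tail(x, n, theta=0.5):
    """Exact P(X >= x) for X ~ Binomial(n, theta)."""
    return sum(comb(n, k) * theta**k * (1 - theta)**(n - k)
               for k in range(x, n + 1))

def sign_test_z(x, n, theta=0.5):
    """Normal approximation: Z = (x - n*theta) / sqrt(n*theta*(1 - theta))."""
    return (x - n * theta) / sqrt(n * theta * (1 - theta))

# Breaking strengths (pounds) from the ribbon example, tested against mu0 = 160.
strengths = [163, 165, 160, 189, 161, 171, 158, 151, 169, 162,
             163, 139, 172, 165, 148, 166, 172, 163, 187, 173]

plus, minus = sign_counts(strengths, 160)
n = plus + minus                       # the single value equal to 160 is dropped
p_value = binom_upper_tail(plus, n)    # one-tailed, upper tail

print(plus, n, round(p_value, 4))      # 15 19 0.0096
print(round(sign_test_z(16, 40), 2))   # -1.26
```

The exact tail probability is about 0.0096; the tabulated sum 0.0095 differs in the last digit only because each table entry is rounded to four decimals before adding.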
Example 4 : The following data (in tons) are the amounts of sulfur oxides emitted by a large industrial plant in 40 days:

24 15 200 29191822 2S 9 7 2 17 6 m4 MW 15S 23 Mm 6 M9 238 16 19 10 23 18 31013 0

Use the sign test to test the null hypothesis $\mu = 21.5$ against the alternative hypothesis $\mu < 21.5$ at the 0.01 level of significance.

Solution :
1. Setting up the Hypotheses: Null Hypothesis: $H_0 : \mu = 21.5$. Alternative Hypothesis: $H_1 : \mu < 21.5$ ⇒ It is a case of one-tailed test.
2. Test Statistic: Replacing each value exceeding 21.5 with a plus sign, each value less than 21.5 with a minus sign, and discarding any value which equals 21.5, we get 16 plus signs and 24 minus signs.
$X$ = the number of plus signs = 16.
$n$ = total number of plus and minus signs = 16 + 24 = 40.
As the sample size $n = 40$ is large, we shall use the normal approximation to the binomial distribution. The test statistic is

$$Z = \frac{X - n\theta}{\sqrt{n\theta(1-\theta)}}$$

where $\theta = \frac{1}{2} = 0.5$ and $n$ = sample size. With $n = 40$ and $\theta = 0.5$,

$$Z = \frac{16 - 40(0.5)}{\sqrt{40(0.5)(0.5)}} = \frac{-4}{3.16} = -1.26, \qquad |Z| = 1.26.$$

3. Level of significance: Here $\alpha = 0.01$.
4. Critical value: The critical value $|Z_\alpha|$ for $\alpha = 0.01$ for a one-tailed test is 2.33.
5. Decision: Since $|Z| < |Z_\alpha|$, as 1.26 < 2.33, we accept the null hypothesis.

EXERCISE - 17.1

1. Use the sign test to see if there is a difference between the number of days until collection of an account receivable before and after a collection policy. Use 0.05 significance level.
Before : 30 28 34 35 40 42 33 38 34 45 28 27 25 41 36
After : 2 34) 29" 33 328 478 49 240 242) 937 44027) 33) 302 382 36)
[Ans. There is no significant difference before and after new collection policy in the accounts receivable.]
2. The following data show employees' rates of defective work before and after a change in the wage incentive plan. Compare the two sets of data given below to see if the change lowered the defective units produced. Use the 0.10 level of significance.
Before : 8 7 6 9 7 10 8 6 5 8 10 8
Wee oo 6 Ob 6 GD
3.
Use the sign test on the data given below to determine whether there is a statistical increase in the values produced by treatment B over those produced by treatment A:
Subject : 1 2 3 4 5 6 7 8 9 10
Treatment A : 46ee 4\ge037)009200028)0043)00 42, 51128). 27)
Treatment B : 52 43 37 32 31 39 44 53 26 31
Use 0.05 level of significance.
4. When is the sign test used? The scores under two conditions X and Y obtained by the respondents are given below:
X : 122 6 8 6 4 8 You. 7 2 Wh S$) 2
Apply the sign test and comment on your findings at 0.05 level of significance.
5. A company claims that if its product is added to an automobile's gasoline tank, the distance travelled in kilometres per litre will improve. To test the claim, 15 different automobiles are chosen and the distances with and without the additive are measured; the results are shown below. Assuming that the driving conditions are the same, determine whether there is a difference due to the additive at significance levels of (a) 0.05 and (b) 0.01.
With additive : 17.3 14.1 9.8 12.5 7.8 12.2 14.3 11.7 13.8 16.0 24.8 11.2 12.8 14.0 12.1
Without additive : 15.7 13.6 10.2 12.3 7.4 11.1 13.4 12.0 13.1 15.7 14.4 11.5 12.0 13.6 11.4
[Ans. (a) There is a difference at the 0.05 level of significance (b) There is no difference at the 0.01 level of significance.]
6. The following data represent the number of hours that a rechargeable hedge trimmer operates before a recharge is required.
1.5 2.2 0.9 1.3 2.0 1.6 1.8 1.5 2.6 1.2 hs
Use the sign test to test the hypothesis at 0.05 level of significance that this particular trimmer operates, on the average, 1.8 hours before requiring a recharge.
[Hint: $H_0 : \mu = 1.8$, $H_1 : \mu \ne 1.8$; we have - + - - + - - + - -, so $n = 10$ and $x = 3$. Apply the binomial table.]
[Ans. Accept the null hypothesis $H_0 : \mu = 1.8$]
7. On 12 visits to a doctor's clinic, a patient had to wait (in minutes) as under: 17; 32; 25; 15; 28; 25; 20; 12; 35; 20; 26; 24 before being seen by the doctor.
Use the sign test with $\alpha = 0.05$ to test the doctor's claim that, on the average, his patients do not wait more than 20 minutes before being examined by him.
[Ans. $H_0 : \mu = 20$ is accepted]

17.9 RANK SUM TESTS

Rank sum tests are a whole family of tests. We shall concentrate only on the following two members of this family:
1. Mann-Whitney U Test
2. Kruskal-Wallis H Test.
The Mann-Whitney test is used when there are only two populations, whereas the Kruskal-Wallis test is employed when more than two populations are involved. The use of these tests will enable us to determine whether independent samples have been drawn from the same population, or from different populations having the same distribution.

17.10 MANN-WHITNEY U-TEST

It is a non-parametric method used to determine whether two independent samples have been drawn from populations with the same distribution. This test is also known as the U-Test. It enables us to test the null hypothesis that both population medians are equal (or that the two samples are drawn from a single population). It requires the two samples to be independent samples of observations measured at least at an ordinal level, i.e., we can at least say, of any two observations, which is greater. This method does not require the assumption that the differences between the two samples are normally distributed.

This method helps us to determine whether the two samples have come from identical populations. If it is true that the samples have come from the same population, it is reasonable to assume that the medians of the ranks assigned to the values of the two samples are more or less the same. The alternative hypothesis $H_1$ would be that the medians of the populations are not equal. In this case, most of the smaller ranks will go to the values of one sample, while most of the higher ranks will go to the other sample.

The test involves the calculation of a statistic, usually called U, whose distribution under the null hypothesis is known.
In the case of small samples (i.e., when the sample size is less than 8), the distribution is tabulated, but for sample sizes above 8 there is a good approximation using the normal distribution.
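The ranking idea behind the U statistic can be sketched as follows. This is a hedged illustration, not the text's own procedure: it assumes the common convention $U_1 = R_1 - n_1(n_1+1)/2$ and $U_2 = n_1 n_2 - U_1$, where $R_1$ is the sum of the pooled ranks of the first sample (some books define U with the roles reversed). The helper names and the two small samples are hypothetical, chosen only so the ranks are easy to follow.

```python
def pooled_ranks(values):
    """Rank values from 1..N; tied values share the average of their ranks."""
    order = sorted(range(len(values)), key=lambda i: values[i])
    ranks = [0.0] * len(values)
    i = 0
    while i < len(order):
        j = i
        # Extend j over the run of values tied with position i.
        while j + 1 < len(order) and values[order[j + 1]] == values[order[i]]:
            j += 1
        avg = (i + j) / 2 + 1          # average of 1-based positions i+1 .. j+1
        for k in range(i, j + 1):
            ranks[order[k]] = avg
        i = j + 1
    return ranks

def mann_whitney_u(sample1, sample2):
    """Return (R1, U1, U2) under the convention U1 = R1 - n1(n1+1)/2."""
    n1, n2 = len(sample1), len(sample2)
    ranks = pooled_ranks(list(sample1) + list(sample2))
    r1 = sum(ranks[:n1])               # rank sum of the first sample
    u1 = r1 - n1 * (n1 + 1) / 2
    u2 = n1 * n2 - u1
    return r1, u1, u2

# Hypothetical samples (not from the text).
a = [1.1, 2.3, 3.0, 4.8]
b = [2.9, 5.2, 6.1, 7.4, 8.0]
r1, u1, u2 = mann_whitney_u(a, b)
print(r1, u1, u2)                      # 12.0 2.0 18.0
```

Here most of the small ranks fall to the first sample, so $U_1$ is far from $n_1 n_2 / 2 = 10$, the value expected when the two samples come from the same population.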