0% found this document useful (0 votes)
19 views

Sampling Theory

The document discusses key concepts in statistical inference including estimation, hypothesis testing, and sampling distributions. Estimation involves using samples to draw conclusions about populations and comes in two forms: point and interval estimation. Hypothesis testing involves a null and alternative hypothesis and calculating type I and type II errors. The standard normal distribution is commonly used for hypothesis testing involving means, proportions, and differences between groups.

Uploaded by

francis Magoba
Copyright
© © All Rights Reserved
Available Formats
Download as PDF, TXT or read online on Scribd
0% found this document useful (0 votes)
19 views

Sampling Theory

The document discusses key concepts in statistical inference including estimation, hypothesis testing, and sampling distributions. Estimation involves using samples to draw conclusions about populations and comes in two forms: point and interval estimation. Hypothesis testing involves a null and alternative hypothesis and calculating type I and type II errors. The standard normal distribution is commonly used for hypothesis testing involving means, proportions, and differences between groups.

Uploaded by

francis Magoba
Copyright
© © All Rights Reserved
Available Formats
Download as PDF, TXT or read online on Scribd
You are on page 1/ 7

SAMPLING DISTRIBUTION/THEORY

Statistical inference includes;


1) Estimation
2) Hypothesis testing
3) Probability density estimation
Estimation

Estimation is a branch of inferential statistics that deals with generalization procedures.


It is the process of drawing an inference about a population parameter based on a sample drawn
from the population.

We have two types of estimation;


a) Point estimation:- this is an estimation that uses a single value to summarize the
characteristics of a population such as mean, standard deviation, correlation coefficient
b) Interval estimation is where a set or range of values is used to summarize the characteristics
of a population parameter.
It is through interval estimation that we come up with confidence interval (CI).
𝐶𝑜𝑛𝑓𝑖𝑑𝑒𝑛𝑐𝑒 𝑖𝑛𝑡𝑒𝑟𝑣𝑎𝑙 = 𝑃𝑜𝑝𝑢𝑙𝑎𝑡𝑖𝑜𝑛 𝑝𝑎𝑟𝑎𝑚𝑒𝑡𝑒𝑟 ± 𝑐𝑟𝑖𝑡𝑖𝑐𝑎𝑙 𝑣𝑎𝑙𝑢𝑒 𝑥 𝑠𝑡𝑎𝑛𝑑𝑎𝑟𝑑 𝑒𝑟𝑟𝑜𝑟
𝐶𝐼 = 𝜇 ± 𝑍𝛼 𝑥 𝑠𝑥̅ – Confidence interval of mean
Where 𝜇 𝑖𝑠 𝑡ℎ𝑒 𝑝𝑜𝑝𝑢𝑝𝑎𝑙𝑎𝑡𝑖𝑜𝑛 𝑚𝑒𝑎𝑛 𝑎𝑛𝑑 𝑠𝑥̅ 𝑖𝑠 𝑡ℎ𝑒 𝑠𝑡𝑎𝑛𝑑𝑎𝑟𝑑 𝑒𝑟𝑟𝑜𝑟 𝑜𝑓 𝑡ℎ𝑒 𝑚𝑒𝑎𝑛 𝑔𝑖𝑣𝑒𝑛 𝑎𝑠
𝑠
𝑠𝑥̅ = 𝑤ℎ𝑒𝑟𝑒 𝑠 𝑖𝑠 𝑡ℎ𝑒 𝑠𝑎𝑚𝑝𝑙𝑒 𝑠𝑡𝑎𝑛𝑑𝑎𝑟𝑑 𝑑𝑒𝑣𝑖𝑎𝑡𝑖𝑜𝑛.
√𝑛
𝐶𝐼 = 𝑝 ± 𝑍𝛼 𝑥 𝑠𝑝 − 𝑐𝑜𝑛𝑓𝑖𝑑𝑒𝑛𝑐𝑒 𝑖𝑛𝑡𝑒𝑟𝑣𝑎𝑙 𝑓𝑜𝑟 𝑝𝑟𝑜𝑝𝑜𝑟𝑡𝑖𝑜𝑛
The standard error refers to the standard deviation of a sampling distribution

Properties of a good estimator;


(i) Unbiased:- if the mean of all sample means 𝑋̅ of all possible random samples of size
n, drawn from a population, N, equals to population parameter 𝜇.
𝑖. 𝑒 𝐸(𝑥̅ ) = 𝜇
(ii) Consistency
𝑎𝑠 𝑛 → 𝑁, ̅̅̅̅̅̅̅̅
𝑥→𝜇
As the sample size increases the accuracy of the estimator also increases.
(iii) Efficiency:- an estimator is said to be efficient than another if in repeated sampling,
its variance decreases.
(iv) Sufficiency:- an estimator is said to be efficient if it uses all the information in the
sample in estimating the required population parameter.

Hypothesis testing

- A statistical hypothesis is sometimes called confirmatory data analysis is a hypothesis that is


testable on the basis of observing a process that is modeled via a set of random variables.
- Statistical hypothesis is an assumption about a population parameter which may be true or not
true.
- Hypothesis is the proposition about the presumed relation among natural phenomena. Or
simply just defined as a testable belief/opinion/assumption.

There are two types of hypothesis;

(i) Null Hypothesis(𝐻0 )


- 𝐼𝑡 𝑖𝑠 𝑎 𝑠𝑡𝑎𝑡𝑒𝑚𝑒𝑛𝑡 𝑤ℎ𝑖𝑐ℎ 𝑟𝑒𝑝𝑟𝑒𝑠𝑒𝑛𝑡𝑠 𝑡ℎ𝑒 𝑡ℎ𝑒𝑜𝑟𝑦 𝑡ℎ𝑎𝑡 ℎ𝑎𝑠 𝑏𝑒𝑒𝑛 𝑝𝑢𝑡 𝑓𝑜𝑟𝑤𝑎𝑟𝑑, 𝑒𝑖𝑡ℎ𝑒𝑟 𝑏𝑒𝑐𝑎𝑢𝑠𝑒 𝑖𝑡 𝑖𝑠𝑏𝑒𝑙𝑖𝑒𝑣𝑒𝑑 𝑡𝑜 𝑏𝑒 𝑡𝑟𝑢𝑒 𝑜
- It can simply be defined as a statement of no change.
- It refers to all those values about a population parameter that are assumed to be true.
(ii) Alternative Hypothesis(𝐻1 )
- 𝐼𝑡 𝑖𝑠 𝑎 𝑠𝑡𝑎𝑡𝑒𝑚𝑒𝑛𝑡 𝑜𝑓 𝑤ℎ𝑎𝑡 𝑎 𝑠𝑡𝑎𝑡𝑖𝑠𝑡𝑖𝑐𝑎𝑙 ℎ𝑦𝑝𝑜𝑡ℎ𝑒𝑠𝑖𝑠 𝑖𝑠 𝑠𝑒𝑡 𝑢𝑝 𝑡𝑜 𝑒𝑠𝑡𝑎𝑏𝑙𝑖𝑠ℎ.
- It can simply be defined as a set of all those values about a parameter that are not specified
in the null hypothesis
Errors
1) Type I error is the error committed when one rejects a correct hypothesis given as alpha
(𝛼)
2) Type II error is the committed when one accepts a wrong hypothesis given as beta (𝛽)

Note
1) Simple hypothesis is the hypothesis that specifies the population distribution completely
sometimes referred to as a two tail test or non-directional test or a two sided test but a
composite hypothesis does not specify the population distribution completely sometimes
referred to as one tail test or directional test or a one sided test.
2) Parametric tests such as Z – tests (normal standard tests), t – tests and ANOVA are used in
data comprising specific measurements i.e quantitative data whereas non-parametric tests
such as chi square tests are used on non-parametric tests i.e these are tests that simply
compare frequencies of occurrences. These tests are particularly used when one wishes to
test a theory about qualitative observations e.g the type of company or quality of product.
3) Level of significance (𝛼) is the probability of rejecting a true null hypothesis
𝛼 = Pr(𝑅𝑒𝑗𝑒𝑐𝑡 𝐻0 \𝐻0 𝑖𝑠 𝑐𝑜𝑟𝑟𝑒𝑐𝑡).
The critical region refers to the region where the null hypothesis is rejected.

Steps in any hypothesis testing problem


1) State the hypothesis
2) Decide on a level of significance α.
3) Choose appropriate statistical test and compute the observed test statistic.
4) Using the distribution statistical test and α, determine the rejection region(s) (RR).
5) Conclusion: If the observed test statistic falls in the RR, reject H0 and conclude that based
on the sample information, we are (1 − α)100% confident that H1 is true. Otherwise,
conclude that there is not sufficient evidence to reject H0. In all the applied problems,
interpret the meaning of your decision.
6) State any assumptions you made in testing the given hypothesis.
7) Compute the p-value from the null distribution of the test statistic and interpret it.
STANDARD NORMAL (Z) DISTRIBUTION TESTS

Hypothesis Standard error Testing procedure


testing for
Mean 𝑠𝑡𝑎𝑛𝑑𝑎𝑟𝑑 𝑒𝑟𝑟𝑜𝑟 𝑜𝑓 𝑡ℎ𝑒 𝑚𝑒𝑎𝑛 𝑥−𝜇
𝑧=
𝑠𝑡𝑎𝑛𝑑𝑎𝑟𝑑 𝑑𝑒𝑣𝑖𝑎𝑡𝑖𝑜𝑛 𝑠𝑥̅
= = 𝑠𝑥̅ 𝑠
√𝑠𝑎𝑚𝑝𝑙𝑒 𝑠𝑖𝑧𝑒 𝑤ℎ𝑒𝑟𝑒 𝑠𝑥̅ =
𝑠 √𝑛
=
√𝑛
Proportion 𝑝𝑞 𝑝0 − 𝑝1
𝑆𝑡𝑎𝑛𝑑𝑎𝑟𝑑 𝑒𝑟𝑟𝑜𝑟 𝑜𝑓 𝑝𝑟𝑜𝑝𝑜𝑟𝑡𝑖𝑜𝑛 = 𝑠𝑝 = √ 𝑧=
𝑛 𝑠𝑝
𝑝 𝑖𝑠 𝑝𝑟𝑜𝑏𝑎𝑏𝑖𝑙𝑖𝑡𝑦 𝑜𝑓 𝑠𝑢𝑐𝑐𝑒𝑠𝑠 𝑤ℎ𝑒𝑟𝑒 𝑝0 𝑖𝑠 𝑝𝑜𝑝𝑢𝑙𝑎𝑡𝑖𝑜𝑛 𝑝𝑟𝑜𝑝𝑜𝑟𝑡𝑖𝑜𝑛
𝑎𝑛𝑑 𝑞 𝑖𝑠 𝑓𝑎𝑖𝑙𝑢𝑟𝑒 𝑖. 𝑒 𝑞 = 1 − 𝑝 𝑎𝑛𝑑𝑝1 𝑖𝑠 𝑠𝑎𝑚𝑝𝑙𝑒 𝑝𝑟𝑜𝑝𝑜𝑟𝑡𝑖𝑜𝑛

Difference 𝑥̅1 − 𝑥̅ 2
𝑠12 𝑠22 𝑧=
between 𝑠𝑥̅ = √ + 𝑠𝑥̅
means 𝑛1 𝑛2
Difference 𝑝1 − 𝑝2
𝑝𝑞 𝑝𝑞 𝑧=
between 𝑠𝑝 = √ + 𝑠𝑝
proportions 𝑛1 𝑛2

𝑪𝒓𝒊𝒕𝒊𝒄𝒂𝒍 𝒗𝒂𝒍𝒖𝒆𝒔 𝒐𝒓 𝒕𝒂𝒃𝒍𝒆 𝒗𝒂𝒍𝒖𝒆𝒔 𝒐𝒇 𝒂 𝒔𝒕𝒂𝒏𝒅𝒂𝒓𝒅 𝒏𝒐𝒓𝒎𝒂𝒍 𝒅𝒊𝒔𝒕𝒓𝒊𝒃𝒖𝒕𝒊𝒐𝒏

𝑙𝑒𝑣𝑒𝑙 𝑜𝑓 𝑠𝑖𝑔𝑛𝑖𝑓𝑖𝑐𝑎𝑛𝑐𝑒 𝑐𝑜𝑛𝑓𝑖𝑑𝑒𝑛𝑐𝑒 𝑙𝑒𝑣𝑒𝑙 𝑡𝑤𝑜 𝑡𝑎𝑖𝑙 𝑜𝑛𝑒 𝑡𝑎𝑖𝑙


5% 95% 1.96 1.64
1% 99% 2.58% 2.33%
Note that when the value of n is less than 30 to minimize the error we carry out a t – test.

Example
The management of a local health club claims that its members lose on the average 15 pounds or
more within the first 3 months after joining the club. To check this claim, a consumer agency took
a random sample of 45 members of this health club and found that they lost an average of 13.8
pounds within the first 3 months of membership, with a standard deviation of 4.2 pounds.
(a) Find the p-value for this test.
(b) Based on the p-value in (a), would you reject the null hypothesis at α = 0.01?
Solution
(a) Let μ be the true mean weight loss in pounds within the first 3 months
Then we have to test the hypothesis
H0 : μ = 15 versus Ha : μ < 15
Here n = 45, x = 13.8, and s = 4.2. Because n = 45 > 30, we can use normal
approximation.
Hence, the test statistic is
𝑥̅ − 𝜇 13.8 − 15
𝑧= = = −1.9166 ≅ −1.92 𝑎𝑛𝑑 𝑡ℎ𝑒 𝑝 𝑣𝑎𝑙𝑢𝑒 = 𝑝(𝑧 < −1.92)
𝑠𝑥̅ 4.2
√45
= 0.0274
Thus, we can use an α as small as 0.0274 and still reject H0.
(b) No. Because the p-value = 0.0274 is greater than α = 0.01, one cannot reject H0.

Example
It is claimed that sports-car owners drive on the average 18,000 miles per year. A consumer firm
believes that the average mileage is probably lower. To check, the consumer firm obtained
information from 40 randomly selected sports-car owners that resulted in a sample mean of
17,463 miles with a sample standard deviation of 1348 miles. What can we conclude about this
claim? Use α = 0.01.
Solution
Let μ be the true population mean. We can formulate the hypotheses as
H0 : μ = 18,000 versus H1 : μ < 18,000.
The observed test statistic (for n ≥ 30) is

𝑥̅ − 𝜇 17463 − 18000
𝑧= = = −2.52
𝑠𝑥̅ 1348
√40
Rejection region is {z < −0.01} = {z < −2.33}.
Decision: Because z = −2.52 is less than −2.33, the null hypothesis is rejected at α = 0.01. There
insufficient evidence to conclude that the mean mileage on sport cars is less than 18,000 miles
per year.

Example
A manufacturer of sodas uses machines to dispense it into bottles at an average amount of 1196
litres per minute. The management went ahead to acquire a new machine which was tested on a
sample of 32 hourly production, producing at an average of 1234 litres per minute with a standard
deviation of 101. The production manager is convinced that the new machine
(i) Does not affect production at 5% level of significance
(ii) Is better than the old machine at 1% level of significance

Solution
𝑆𝑡𝑎𝑛𝑑𝑎𝑟𝑑 𝑑𝑒𝑣𝑖𝑎𝑡𝑖𝑜𝑛 101
𝑆𝑥̅ = = = 17.8544
√𝑅𝑎𝑛𝑑𝑜𝑚 𝑠𝑎𝑚𝑝𝑙𝑒 𝑠𝑝𝑎𝑐𝑒 √32

𝑥̅ − 𝜇 1234 − 1196
𝑍𝑐 = = = 2.1283
𝑆𝑥̅ 17.8544
(i) 𝐻0 : 𝑥̅ = 𝜇 𝑣𝑒𝑟𝑠𝑢𝑠 𝐻1 : 𝑥̅ ≠ 𝜇
𝑍𝑐 = 2.1283
𝑎𝑛𝑑 𝑍𝑡 = 1.96
𝑆𝑖𝑛𝑐𝑒 𝑡ℎ𝑒 𝑐𝑎𝑙𝑐𝑢𝑙𝑎𝑡𝑒𝑑 𝑣𝑎𝑙𝑢𝑒 𝑖𝑠 𝑛𝑜𝑡 𝑙𝑒𝑠𝑠 𝑡ℎ𝑎𝑛 𝑡ℎ𝑒 𝑡𝑎𝑏𝑢𝑙𝑎𝑡𝑒𝑑 𝑣𝑎𝑙𝑢𝑒 𝑤𝑒 𝑟𝑒𝑗𝑒𝑐𝑡 𝐻0
𝑇ℎ𝑖𝑠 𝑖𝑚𝑝𝑙𝑖𝑒𝑠 𝑡ℎ𝑒 𝑛𝑒𝑤 𝑚𝑎𝑐ℎ𝑖𝑛𝑒 𝑎𝑓𝑓𝑒𝑐𝑡𝑠 𝑝𝑟𝑜𝑑𝑢𝑐𝑡𝑖𝑜𝑛
(ii) 𝐻0 : 𝑥̅ = 𝜇 𝑣𝑒𝑟𝑠𝑢𝑠 𝐻1 : 𝑥̅ > 𝜇
𝑍𝑐 = 2.1283
𝑎𝑛𝑑 𝑍𝑡 = 2,33
𝑆𝑖𝑛𝑐𝑒 𝑡ℎ𝑒 𝑐𝑎𝑙𝑐𝑢𝑙𝑎𝑡𝑒𝑑 𝑣𝑎𝑙𝑢𝑒 𝑖𝑠 𝑙𝑒𝑠𝑠 𝑡ℎ𝑎𝑛 𝑡ℎ𝑒 𝑡𝑎𝑏𝑢𝑙𝑎𝑡𝑒𝑑 𝑣𝑎𝑙𝑢𝑒 𝑤𝑒 𝑎𝑐𝑐𝑒𝑝𝑡 𝐻0
𝑇ℎ𝑖𝑠 𝑖𝑚𝑝𝑙𝑖𝑒𝑠 𝑡ℎ𝑎𝑡 𝑡ℎ𝑒𝑟𝑒 𝑖𝑠 𝑛𝑜 𝑠𝑖𝑔𝑛𝑖𝑓𝑖𝑐𝑎𝑛𝑡 𝑑𝑖𝑓𝑓𝑒𝑟𝑒𝑛𝑐𝑒 𝑖𝑛 𝑡ℎ𝑒 𝑚𝑎𝑐ℎ𝑖𝑛𝑒𝑠

Example
In a frequently traveled stretch of the I-75 highway, where the posted speed is 70 mph, it is
thought that people travel on the average of at least 75 mph. To check this claim, the following
radar measurements of the speeds (in mph) are obtained for 10 vehicles traveling on this stretch
of the interstate highway: 66, 74, 79, 80, 69, 77, 78, 65, 79 and 81
Do the data provide sufficient evidence to indicate that the mean speed at which people travel on
this stretch of highway is at most 75 mph? Test the appropriate hypothesis using α = 0.01. Draw
Solution
We need to test H0 : μ = 75 vs. Ha : μ > 75

(𝑪𝒐𝒎𝒑𝒍𝒆𝒕𝒆 𝒕𝒉𝒊𝒔 𝒒𝒖𝒆𝒔𝒕𝒊𝒐𝒏)

Example
Current statistics show that 12% of the population is believed to be infected by the COVID-19
virus. In a random sample of 80 persons screened for the virus while on transit 9 tested positive of
the virus. Do you think at 5% level of significance the current statistics claim is true?

Solution
𝐻0 : 𝑝0 = 𝑝1 𝑣𝑒𝑟𝑠𝑢𝑠 𝐻1 : 𝑝0 ≠ 𝑝1

𝐿𝑒𝑡 𝑝0 = 𝑃𝑜𝑝𝑢𝑙𝑎𝑡𝑖𝑜𝑛 𝑝𝑟𝑜𝑝𝑜𝑟𝑡𝑖𝑜𝑛 𝑜𝑓 𝑡ℎ𝑜𝑠𝑒 𝑖𝑛𝑓𝑒𝑐𝑡𝑒𝑑 = 0.12


9
𝑎𝑛𝑑 𝑝1 = 𝑠𝑎𝑚𝑝𝑙𝑒 𝑝𝑟𝑜𝑝𝑜𝑟𝑡𝑖𝑜𝑛 = = 0.1125
80
𝑝𝑞 0.12 𝑥 0.88
𝑆𝑡𝑎𝑛𝑑𝑎𝑟𝑑 𝑒𝑟𝑜𝑜𝑟 𝑜𝑓 𝑡ℎ𝑒 𝑝𝑟𝑜𝑝𝑜𝑟𝑡𝑖𝑜𝑛 = 𝑆𝑝 = √ =√ = 0.036
𝑛 80
𝑝0 − 𝑝1 0.12 − 0.1125
𝑍𝑐 = = = 0.2083
𝑆𝑝 0.036
𝑎𝑛𝑑 𝑍𝑡 = 1.96
𝑆𝑖𝑛𝑐𝑒 𝑡ℎ𝑒 𝑐𝑎𝑙𝑐𝑢𝑙𝑎𝑡𝑒𝑑 𝑣𝑎𝑙𝑢𝑒 𝑖𝑠 𝑙𝑒𝑠𝑠 𝑡ℎ𝑎𝑛 𝑡ℎ𝑒 𝑡𝑎𝑏𝑢𝑙𝑎𝑡𝑒𝑑 𝑣𝑎𝑙𝑢𝑒 𝑤𝑒 𝑎𝑐𝑐𝑒𝑝𝑡 𝐻0
𝑇ℎ𝑖𝑠 𝑖𝑚𝑝𝑙𝑖𝑒𝑠 𝑡ℎ𝑎𝑡 𝑡ℎ𝑒 𝑐𝑙𝑎𝑖𝑚 𝑖𝑠 𝑡𝑟𝑢𝑒 𝑡ℎ𝑎𝑡 12% 𝑜𝑓 𝑡ℎ𝑒 𝑝𝑜𝑝𝑢𝑙𝑎𝑡𝑖𝑜𝑛 𝑖𝑠 𝑖𝑛𝑓𝑒𝑐𝑡𝑒𝑑.

Example
Research has shown that out of 85 school going children aged below 10 years 60 were found to be
going to school before the introduction of universal free primary education, but after the
introduction of the scheme it has been found out that 37 pupils out of 53 go to school. Is the scheme
effective at 99% confidence?
Solution
𝐻0 : 𝑝2 = 𝑝1 𝑣𝑒𝑟𝑠𝑢𝑠 𝐻1 : 𝑝2 ≠ 𝑝1

60
𝐿𝑒𝑡 𝑝1 = 𝑝𝑟𝑜𝑝𝑜𝑟𝑡𝑖𝑜𝑛 𝑏𝑒𝑓𝑜𝑟𝑒 𝑡ℎ𝑒 𝑖𝑛𝑡𝑟𝑜𝑑𝑢𝑐𝑡𝑖𝑜𝑛 𝑜𝑓 𝑠𝑐ℎ𝑒𝑚𝑒 = = 0.7059
85
37
𝑎𝑛𝑑 𝑝2 = 𝑝𝑟𝑜𝑝𝑜𝑟𝑡𝑖𝑜𝑛 𝑎𝑓𝑡𝑒𝑟 𝑡ℎ𝑒 𝑖𝑛𝑡𝑟𝑜𝑑𝑢𝑐𝑡𝑖𝑜𝑛 𝑜𝑓 𝑠𝑐ℎ𝑒𝑚𝑒 = = 0.69811
53
60 + 37 97
𝐿𝑒𝑡 𝑝 = 𝑝𝑟𝑜𝑝𝑜𝑟𝑡𝑖𝑜𝑛 𝑜𝑓 𝑡ℎ𝑜𝑠𝑒 𝑤ℎ𝑜 𝑔𝑜 𝑡𝑜 𝑠𝑐ℎ𝑜𝑜𝑙 = = = 0.7029
85 + 53 138
𝑤ℎ𝑖𝑐ℎ 𝑓𝑜𝑙𝑙𝑜𝑤𝑠 𝑡ℎ𝑎𝑡 𝑞 = 1 − 𝑝 = 1 − 0.7029 = 0.2971
𝑝𝑞 𝑝𝑞
𝑆𝑡𝑎𝑛𝑑𝑎𝑟𝑑 𝑒𝑟𝑜𝑜𝑟 𝑜𝑓 𝑡ℎ𝑒 𝑝𝑟𝑜𝑝𝑜𝑟𝑡𝑖𝑜𝑛𝑠 = 𝑠𝑝 = 𝑆𝑝1−𝑝2 = √ +
𝑛1 𝑛2

0.7029𝑥0.2971 0.7029𝑥0.2971
𝑇ℎ𝑒𝑟𝑒𝑓𝑜𝑟𝑒 𝑠𝑝 = √ + = 0.07998
85 53

𝑝2 −𝑝1 0.69811−0.7059
𝑍𝑐 = = = 0.0974
𝑆𝑝 0.07998
𝑎𝑛𝑑 𝑍𝑡 = 2.58
𝑆𝑖𝑛𝑐𝑒 𝑡ℎ𝑒 𝑐𝑎𝑙𝑐𝑢𝑙𝑎𝑡𝑒𝑑 𝑣𝑎𝑙𝑢𝑒 𝑖𝑠 𝑙𝑒𝑠𝑠 𝑡ℎ𝑎𝑛 𝑡ℎ𝑒 𝑡𝑎𝑏𝑢𝑙𝑎𝑡𝑒𝑑 𝑣𝑎𝑙𝑢𝑒 𝑤𝑒 𝑎𝑐𝑐𝑒𝑝𝑡 𝐻0
𝑇ℎ𝑖𝑠 𝑖𝑚𝑝𝑙𝑖𝑒𝑠 𝑡ℎ𝑎𝑡 𝑡ℎ𝑒 𝑠𝑐ℎ𝑒𝑚𝑒 𝑖𝑠 𝑛𝑜𝑡 𝑒𝑓𝑓𝑒𝑐𝑡𝑖𝑣𝑒 𝑖. 𝑒 𝑡ℎ𝑒𝑟𝑒 𝑖𝑠 𝑛𝑜 𝑑𝑖𝑓𝑓𝑒𝑟𝑒𝑛𝑐𝑒

Example
The following information was obtained from the accounts department in a sugar processing plant
in relation to the average salary paid the workers in different departments in the plant.
Department Production human resource
Sample size 45 32
Average salary ($) 950 1000
Standard deviation ($) 102 85
At 99% confidence, determine if there is any significant difference in the average pay in different
departments in the same plant.
Solution
𝐻0 : 𝑥̅1 = 𝑥̅ 2 𝑣𝑒𝑟𝑠𝑢𝑠 𝐻1 : 𝑥̅1 ≠ 𝑥̅ 2

𝑠12 𝑠22
𝑆𝑡𝑎𝑛𝑑𝑎𝑟𝑑 𝑒𝑟𝑜𝑜𝑟 𝑜𝑓 𝑡ℎ𝑒 𝑡𝑤𝑜 𝑚𝑒𝑎𝑛𝑠 = 𝑠𝑥̅ = 𝑆𝑥̅1−𝑥̅2 =√ +
𝑛1 𝑛2

1022 852
𝑇ℎ𝑒𝑟𝑒𝑓𝑜𝑟𝑒 𝑠𝑥̅ = √ + = 21.3771
45 32

𝑥̅ 2 − 𝑥̅1 1000 − 950


𝑍𝑐 = = = 2.339
𝑠𝑥̅ 21.3771
𝑎𝑛𝑑 𝑍𝑡 = 2.58
𝑆𝑖𝑛𝑐𝑒 𝑡ℎ𝑒 𝑐𝑎𝑙𝑐𝑢𝑙𝑎𝑡𝑒𝑑 𝑣𝑎𝑙𝑢𝑒 𝑖𝑠 𝑙𝑒𝑠𝑠 𝑡ℎ𝑎𝑛 𝑡ℎ𝑒 𝑡𝑎𝑏𝑢𝑙𝑎𝑡𝑒𝑑 𝑣𝑎𝑙𝑢𝑒 𝑤𝑒 𝑎𝑐𝑐𝑒𝑝𝑡 𝐻0
𝑇ℎ𝑖𝑠 𝑖𝑚𝑝𝑙𝑖𝑒𝑠 𝑡ℎ𝑎𝑡 𝑡ℎ𝑒𝑟𝑒 𝑖𝑠 𝑛𝑜 𝑠𝑖𝑔𝑛𝑖𝑓𝑖𝑐𝑎𝑛𝑡 𝑑𝑖𝑓𝑓𝑒𝑟𝑒𝑛𝑐𝑒 𝑖𝑛 𝑡ℎ𝑒 𝑎𝑣𝑒𝑟𝑎𝑔𝑒 𝑝𝑎𝑦.

𝒕 − 𝒔𝒕𝒂𝒕𝒊𝒔𝒕𝒊𝒄𝒂𝒍 𝒕𝒆𝒔𝒕

It is the equivalent of standard normal (Z)test applied when n < 30


𝑖. 𝑒 𝑡ℎ𝑒 𝑎𝑟𝑡ℎ𝑚𝑒𝑡𝑖𝑐 𝑚𝑒𝑎𝑛 𝑜𝑓 𝑠𝑚𝑎𝑙𝑙 𝑠𝑎𝑚𝑝𝑙𝑒𝑠 𝑎𝑟𝑒 𝑛𝑜𝑡 𝑛𝑜𝑟𝑚𝑎𝑙𝑙𝑦 𝑑𝑖𝑠𝑡𝑟𝑖𝑏𝑢𝑡𝑒𝑑 ℎ𝑒𝑛𝑐𝑒 𝑓𝑜𝑟 𝑛
< 30
𝑥̅ − 𝜇
𝑤𝑒 ℎ𝑎𝑣𝑒 𝑡 = 𝑠
⁄ 𝑛

Features of a t – distribution
(i) It is unimodal and symmetrical at point 0
(ii) It is flatter than a normal distribution i.e the area near the the tails is greater than the normal
distribution
(iii)As 𝑛 → ∞, the t – distribution approaches the normal distribution
Note: we need to carry out t – statistical test for mean and difference between means

You might also like

pFad - Phonifier reborn

Pfad - The Proxy pFad of © 2024 Garber Painting. All rights reserved.

Note: This service is not intended for secure transactions such as banking, social media, email, or purchasing. Use at your own risk. We assume no liability whatsoever for broken pages.


Alternative Proxies:

Alternative Proxy

pFad Proxy

pFad v3 Proxy

pFad v4 Proxy