FormulaSheet FinalExam
FormulaSheet FinalExam
Estimating One Population Mean When the Population Standard Deviation is Unknown
Under the confidence level 1 − 𝛼, the confidence interval for the population mean is
∗
𝑠
𝑥̅ ± 𝑡𝑛−1 𝑆𝐸(𝑥̅ ), , 𝑆𝐸(𝑥̅ ) =
√𝑛
∗
𝑡𝑛−1 𝑖𝑠 𝑐𝑟𝑖𝑡𝑖𝑐𝑎𝑙 𝑣𝑎𝑙𝑢𝑒 𝑤𝑖𝑡ℎ 𝑑𝑒𝑔𝑟𝑒𝑒𝑠 𝑜𝑓 𝑓𝑟𝑒𝑒𝑑𝑜𝑚 𝑛 − 1 𝑢𝑛𝑑𝑒𝑟 𝑡ℎ𝑒 𝑐𝑜𝑛𝑓𝑖𝑑𝑒𝑛𝑐𝑒 𝑙𝑒𝑣𝑒𝑙 1 − 𝛼.
Testing One Population Mean when the Population Standard Deviation is Unknown
𝑥̅ − 𝜇
𝑇𝑒𝑠𝑡 𝑠𝑡𝑎𝑡𝑖𝑠𝑡𝑖𝑐 𝑡= , 𝑛 − 1 𝑖𝑠 𝑑𝑒𝑔𝑟𝑒𝑒 𝑜𝑓 𝑓𝑟𝑒𝑒𝑑𝑜𝑚
𝑠/√𝑛
Under the confidence level 1 − 𝛼, 𝑤ℎ𝑒𝑛 𝜎12 ≠ 𝜎22 the confidence interval of 𝜇1 − 𝜇2 is
𝑠12 𝑠22 (𝑠12 /𝑛1 + 𝑠22 /𝑛2 )2 𝑠12 𝑠22
(𝑥̅1 − 𝑥̅2 ) ± 𝑡𝛼/2 √( + ), 𝑑𝑓 = 2 , 𝑆𝐸(𝑥̅ 1 − 𝑥̅2 ) = √( + )
𝑛1 𝑛2 (𝑠1 /𝑛1 )2 (𝑠22 /𝑛2 )2 𝑛1 𝑛2
+
𝑛1 − 1 𝑛2 − 1
Under the confidence level 1 − 𝛼, 𝑤ℎ𝑒𝑛 𝜎12 = 𝜎22 the confidence interval of 𝜇1 − 𝜇2 is
1 1 (𝑛1 − 1)𝑠12 + (𝑛2 − 1)𝑠22
(𝑥̅1 − 𝑥̅2 ) ± 𝑡𝛼/2 √𝑠𝑝2 ( + ), 𝑑𝑓 = 𝑛1 + 𝑛2 − 2, 𝑠𝑝2 =
𝑛1 𝑛2 𝑛1 + 𝑛2 − 2
1 1
𝑆𝐸𝑝𝑜𝑜𝑙𝑒𝑑 (𝑥̅ − 𝑥̅2 ) = 𝑠𝑝 √ +
1 𝑛1 𝑛2
Testing the Difference between Two Means Based on Independent Samples
Test statistic for 𝜇1 − 𝜇2 𝑤ℎ𝑒𝑛 𝜎12 ≠ 𝜎22
𝑠𝐷 ̅) = 𝑠𝑑
𝜇 = 𝑥̅𝐷 ± 𝑡𝛼/2 , 𝑑𝑓 = 𝑛 − 1 𝑖𝑠 𝑑𝑒𝑔𝑟𝑒𝑒𝑠 𝑜𝑓 𝑓𝑟𝑒𝑒𝑑𝑜𝑚, 𝑆𝐸(𝑑
√𝑛𝐷 √𝑛
𝐻0 : 𝜇𝑑 = 0
𝐻1 : 𝜇𝑑 ≠ 0, 𝑜𝑟 𝐻1 : 𝜇𝑑 > 0, 𝑜𝑟 𝐻1 : 𝜇𝑑 < 0
𝑥̅𝐷 − 𝜇𝐷
𝑇𝑒𝑠𝑡 𝑠𝑡𝑎𝑡𝑖𝑠𝑡𝑖𝑐 𝑡= , 𝑑𝑓 = 𝑛 − 1 𝑖𝑠 𝑑𝑒𝑔𝑟𝑒𝑒𝑠 𝑜𝑓 𝑓𝑟𝑒𝑒𝑑𝑜𝑚
𝑠𝐷 /√𝑛𝐷
𝐻0 : 𝜇1 = 𝜇2 = ⋯ = 𝜇𝑘
𝑀𝑆𝑇
𝑇𝑒𝑠𝑡 𝑠𝑡𝑎𝑡𝑖𝑠𝑡𝑖𝑐 𝐹𝑘−1,𝑁−𝑘 = , 𝑟𝑒𝑗𝑒𝑐𝑡𝑖𝑛𝑔 𝑡ℎ𝑒 𝑛𝑢𝑙𝑙 ℎ𝑦𝑝𝑜𝑡ℎ𝑒𝑠𝑖𝑠 𝑤ℎ𝑒𝑛 𝐹𝑘−1,𝑁−𝑘 𝑖𝑠 𝑡𝑜𝑜 𝑙𝑎𝑟𝑔𝑒.
𝑀𝑆𝐸
𝑆𝑆𝑇 𝑆𝑆𝐸
𝑀𝑆𝑇 = , 𝑀𝑆𝐸 =
𝑘−1 𝑁−𝑘
𝑥𝑖𝑗 represents the ith level of the first factor and the jth level of the second factor. First factor A has a levels and
second factor B has b levels.
𝑏 𝑎
𝑆𝑆𝐸
𝑀𝑒𝑎𝑛 𝑆𝑞𝑢𝑟𝑒 𝑓𝑜𝑟 𝐸𝑟𝑟𝑜𝑟 𝑀𝑆𝐸 = , 𝑤ℎ𝑒𝑟𝑒 𝑁 = 𝑎 × 𝐵
𝑁 − (𝑎 + 𝑏 − 1)
Test statistic
𝑀𝑆𝐴
𝐹𝑎−1,𝑁−(𝑎+𝑏−1) =
𝑀𝑆𝐸
𝑀𝑆𝐵
𝐹𝑏−1,𝑁−(𝑎+𝑏−1) =
𝑀𝑆𝐸
(𝑂𝑏𝑠 − 𝐸𝑥𝑝)2
𝐶ℎ𝑖 − 𝑆𝑞𝑢𝑎𝑟𝑒 𝑆𝑡𝑎𝑡𝑖𝑠𝑡𝑖𝑐, 𝜒2 = ∑
𝐸𝑥𝑝
2
(𝑂𝑏𝑠 − 𝐸𝑥𝑝)2
𝐶ℎ𝑖 − 𝑆𝑞𝑢𝑎𝑟𝑒 𝑆𝑡𝑎𝑡𝑖𝑠𝑡𝑖𝑐, 𝜒 =∑
𝐸𝑥𝑝
𝐷𝑒𝑔𝑟𝑒𝑒𝑠 𝑜𝑓 𝐹𝑟𝑒𝑒𝑑𝑜𝑚 = (𝑅 − 1)(𝐶 − 1),
Linear Regression
𝑠𝑦
𝑃𝑟𝑒𝑑𝑖𝑐𝑡𝑒𝑑 𝑠𝑡𝑟𝑎𝑖𝑔ℎ𝑡 𝑙𝑖𝑛𝑒 𝑦̂ = 𝑏0 + 𝑏1 𝑥, 𝑏1 = 𝑟 , 𝑏0 = 𝑦̅ − 𝑏1 𝑥̅
𝑠𝑥
𝑅𝑒𝑠𝑖𝑑𝑢𝑎𝑙 𝑒 = 𝑦 − 𝑦̂
∑ 𝑒2
𝑆𝑡𝑎𝑛𝑑𝑎𝑟𝑑 𝑑𝑒𝑣𝑖𝑎𝑡𝑖𝑜𝑛 𝑜𝑓 𝑟𝑒𝑠𝑖𝑑𝑢𝑎𝑙 𝑠𝑒 = √
𝑛−2
𝛼
𝑡𝛼,𝑣 𝑖𝑠 𝑡 − 𝑐𝑟𝑖𝑡𝑖𝑐𝑎𝑙 𝑣𝑎𝑙𝑢𝑒 𝑤𝑖𝑡ℎ 𝑠𝑖𝑔𝑛𝑖𝑓𝑖𝑐𝑎𝑛𝑐𝑒 𝑙𝑒𝑣𝑒 𝑎𝑛𝑑 𝑑𝑒𝑔𝑟𝑒𝑒𝑠 𝑜𝑓 𝑓𝑟𝑒𝑒𝑑𝑜𝑚 𝑣 = 𝑛 − 2
2 2
𝐻0 : 𝜌 = 0
𝐻1 : 𝜌 ≠ 0
Test Statistic for 𝜌
𝑛−2
𝑡 = 𝑟√ , 𝑑𝑓 = 𝑣 = 𝑛 − 2
1 − 𝑟2
𝑠𝑒2
∗
𝑦̂𝑣 ± 𝑡𝑛−2 𝑆𝐸(𝜇̂ 𝑣 ), 𝑆𝐸(𝜇̂ 𝑣 ) = √𝑆𝐸 2 (𝑏1 )(𝑥𝑣 − 𝑥̅ )2 +
𝑛
𝑠𝑒2
𝑦̂𝑣 ± ∗
𝑡𝑛−2 𝑆𝐸(𝑦̂𝑣 ), 𝑆𝐸(𝑦̂𝑣 ) = √𝑆𝐸 2 (𝑏1 )(𝑥𝑣 − 𝑥̅ )2 + + 𝑠𝑒2
𝑛
𝑺𝑺𝑬
𝒔𝑬 = √
𝒏−𝒌−𝟏
Coefficient of determination
𝑆𝑆𝐸
𝑅2 = 1 −
∑(𝑦𝑖 − 𝑦̅)2
𝑆𝑆𝐸/(𝑛 − 𝑘 − 1) 𝑀𝑆𝐸
𝐴𝑑𝑗𝑢𝑠𝑡𝑒𝑑 𝑅 2 = 1 − =1− 2
∑(𝑦𝑖 − 𝑦̅)2 /(𝑛 − 1) 𝑠𝑦
The meaning of ANOVA table is as below:
𝑆𝑆𝑅
𝑀𝑒𝑎𝑛 𝑆𝑞𝑢𝑎𝑟𝑒, 𝑟𝑒𝑔𝑟𝑒𝑠𝑠𝑖𝑜𝑛; (𝑒𝑥𝑝𝑙𝑎𝑖𝑛𝑒𝑑) = 𝑀𝑆𝑅 =
𝑘
𝑆𝑆𝐸
𝑀𝑒𝑎𝑛 𝑆𝑞𝑢𝑎𝑟𝑒, 𝑒𝑟𝑟𝑜𝑟𝑠; (𝑟𝑒𝑠𝑖𝑑𝑢𝑎𝑙𝑠, 𝑢𝑛𝑒𝑥𝑝𝑙𝑎𝑖𝑛𝑒𝑑) = 𝑀𝑆𝐸 =
𝑛−𝑘−1
𝑀𝑆𝑅
𝐹=
𝑀𝑆𝐸
𝑆𝑆𝑅 𝐸𝑥𝑝𝑙𝑎𝑖𝑛𝑒𝑑
𝑅2 = =
𝑆𝑆𝑇𝑜𝑡𝑎𝑙 𝑇𝑜𝑡𝑎𝑙
2 𝑆𝑆𝐸/(𝑛 − 𝑘 − 1)
𝑅𝑎𝑑𝑗 =1−
𝑆𝑆𝑇/(𝑛 − 1)
T.DIST.2T(x, degree of freedom) gives the probability of the two tails outside of
the interval (-x, x).
F.INV(probability, df1, df2) gives the critical value given the probability of left
side of the critical value.
F.INV.RT(probability, df1, df2) gives the critical value given the probability of
right side of the critical value.
CHISQ.DIST(x, df, 1)) gives the probability of the left side of value of x of Chi-
Square distribution
CHISQ.DIST.RT(x, df, 1)) gives the probability of the right side of value of x of
Chi-Square distribution. (for getting p-value of a Chi-square test)
Instruction of adding the data analysis in since it is not default setting in Excel. All
temporary settings in the computer in our campus are cleaned up every day. If you
need to add the function in, here is the instruction.
Excel has all this analysis program built in but they are not showing up in the default
setting when you open Excel. You have to do the following steps in Excel to have the
functions available for you to use.
Now when you go back to Excel interface, click the group “Data”, you will see “Data
Analysis” showing on the right side of the ribbon area. Click it, you will be able to
perform different kind of z-tests, t-tests and other analysis.
those t-tests