0% found this document useful (0 votes)
17 views

Mm13 Content Module 9

Uploaded by

mrkpalmares0524
Copyright
© © All Rights Reserved
Available Formats
Download as PDF, TXT or read online on Scribd
0% found this document useful (0 votes)
17 views

Mm13 Content Module 9

Uploaded by

mrkpalmares0524
Copyright
© © All Rights Reserved
Available Formats
Download as PDF, TXT or read online on Scribd
You are on page 1/ 12

ADVANCED STATISTICS Page 1 of 12

ANOVA

Topic: ANOVA

Hello Math majors! Welcome to the world of Statistics!


.

I. INTRODUCTION:

This module will give you an in-depth understanding on using ANOVA as statistical tool for
determining the significant difference between two means.

II. OBJECTIVES:

At the end of this module, you will be able to:


1. Explain when ANOVA is used;
2. Read and interpret EXCEL printout for ANOVA;
3. Generate EXCEL printout for ANOVA; and,
4. Perform post ANOVA test practice intellectual honesty in the interpretation of test results.

III. PRELIMINARY ACTIVITIES:


The one-way analysis of variance (ANOVA) is used to determine whether there are any
statistically significant differences between the means of two or more independent (unrelated)
groups (although you tend to only see it used when there are a minimum of three, rather than two
groups). For example, you could use a one-way ANOVA to understand whether exam
performance differed based on test anxiety levels amongst students, dividing students into three
independent groups (e.g., low, medium and high-stressed students). Also, it is important to realize
that the one-way ANOVA is an omnibus test statistic and cannot tell you which specific groups
were statistically significantly different from each other; it only tells you that at least two groups
were different. Since you may have three, four, five or more groups in your study design,
determining which of these groups differ from each other is important.

Before you proceed to the main lesson, test yourself in this activity.

What is ANOVA?
In your own words, describe it in not more than 5 sentences?
ADVANCED STATISTICS Page 2 of 12
ANOVA

GREAT!!!
You may now proceed to the main lesson.

IV. LESSON PROPER:

Based on the preliminary activities, what did you notice about it?
________________________________________________________
CONGRATULATIONS!
You may now proceed to the lesson.

ANOVA
ANOVA is a statistical technique that assesses potential differences in a scale-level dependent variable by a
nominal-level variable having 2 or more categories. For example, an ANOVA can examine potential differences
in IQ scores by Country (US vs. Canada vs. Italy vs. Spain). Developed by Ronald Fisher in 1918, this test
extends the t and the z test which have the problem of only allowing the nominal level variable to have two
categories. This test is also called the Fisher analysis of variance.

The use of ANOVA depends on the research design. Commonly, ANOVAs are used in three ways: one-way
ANOVA, two-way ANOVA, and N-way ANOVA.

One-Way ANOVA
A one-way ANOVA has just one independent variable. For example, difference in IQ can be assessed by
Country, and County can have 2, 20, or more different categories to compare.

Two-Way ANOVA
A two-way ANOVA (are also called factorial ANOVA) refers to an ANOVA using two independent variables.
Expanding the example above, a 2-way ANOVA can examine differences in IQ scores (the dependent
variable) by Country (independent variable 1) and Gender (independent variable 2). Two-way ANOVA can
be used to examine the interaction between the two independent variables. Interactions indicate that
differences are not uniform across all categories of the independent variables. For example, females may
have higher IQ scores overall compared to males, but this difference could be greater (or less) in European
countries compared to North American countries.

N-Way ANOVA
A researcher can also use more than two independent variables, and this is an n-way ANOVA (with n being
the number of independent variables you have). For example, potential differences in IQ scores can be
examined by Country, Gender, Age group, Ethnicity, etc, simultaneously.
ADVANCED STATISTICS Page 3 of 12
ANOVA
GENERAL PURPOSE AND PROCEDURE

Omnibus ANOVA test:

The null hypothesis for an ANOVA is that there is no significant difference among the groups. The
alternative hypothesis assumes that there is at least one significant difference among the groups. After
cleaning the data, the researcher must test the assumptions of ANOVA. They must then calculate the F-ratio
and the associated probability value (p-value). In general, if the p-value associated with the F is smaller than
.05, then the null hypothesis is rejected and the alternative hypothesis is supported. If the null hypothesis is
rejected, one concludes that the means of all the groups are not equal. Post-hoc tests tell the researcher
which groups are different from each other.

So what if you find statistical significance? Multiple comparison tests

When you conduct an ANOVA, you are attempting to determine if there is a statistically significant
difference among the groups. If you find that there is a difference, you will then need to examine where the
group differences lay.

At this point you could run post-hoc tests which are t tests examining mean differences between the
groups. There are several multiple comparison tests that can be conducted that will control for Type I error
rate, including the Bonferroni, Scheffe, Dunnet, and Tukey tests.

Research Questions the ANOVA Examines


One-way ANOVA: Are there differences in GPA by grade level (freshmen vs. sophomores vs. juniors)?

Two-way ANOVA: Are there differences in GPA by grade level (freshmen vs. sophomores vs. juniors) and
gender (male vs. female)?

Data Level and Assumptions


The level of measurement of the variables and assumptions of the test play an important role in
ANOVA. In ANOVA, the dependent variable must be a continuous (interval or ratio) level of measurement.
The independent variables in ANOVA must be categorical (nominal or ordinal) variables. Like the t-test,
ANOVA is also a parametric test and has some assumptions. ANOVA assumes that the data is normally
distributed. The ANOVA also assumes homogeneity of variance, which means that the variance among the
groups should be approximately equal. ANOVA also assumes that the observations are independent of each
other. Researchers should keep in mind when planning any study to look out for extraneous or confounding
variables. ANOVA has methods (i.e., ANCOVA) to control for confounding variables.
ADVANCED STATISTICS Page 4 of 12
ANOVA
Testing of the Assumptions
1. The population from which samples are drawn should be normally distributed.
2. Independence of cases: the sample cases should be independent of each other.
3. Homogeneity of variance: Homogeneity means that the variance among the groups should be
approximately equal.
These assumptions can be tested using statistical software (like Intellectus Statistics!). The
assumption of homogeneity of variance can be tested using tests such as Levene’s test or the Brown-
Forsythe Test. Normality of the distribution of the scores can be tested using histograms, the values of
skewness and kurtosis, or using tests such as Shapiro-Wilk or Kolmogorov-Smirnov. The assumption of
independence can be determined from the design of the study.

It is important to note that ANOVA is not robust to violations to the assumption of independence. This
is to say, that even if you violate the assumptions of homogeneity or normality, you can conduct the test and
basically trust the findings. However, the results of the ANOVA are invalid if the independence assumption
is violated. In general, with violations of homogeneity the analysis is considered robust if you have equal
sized groups. With violations of normality, continuing with the ANOVA is generally ok if you have a large
sample size.

Simple analysis of variance or One-Way ANOVA is based on two sources of variations, namely:

1. Actual differences of the means due to TREATMENT.


This is represented by the sum of squares between columns (SSb).
2. Chance or experimental ERROR
This is represented by the sums of squares within columns (SSb)

For easy usage, the formula minus the complicated mathematical notations are as follows:
2 (∑ 𝑥)2
1. Total sums of squares: 𝑇𝑆𝑆 = ∑ 𝑥 = − 𝑁
1 (∑ 𝑥)2
2. Sum of the Squares Between Columns: 𝑆𝑆𝑏 = 𝑛𝑜.𝑜𝑓 𝑟𝑜𝑤𝑠 ∑(𝑠𝑢𝑚 𝑜𝑓 𝑒𝑎𝑐ℎ 𝑐𝑜𝑙𝑢𝑚𝑛𝑠 2 − 𝑁
3. Sum of the Squares Within Column: SSw = TSS – SSb
4. Total degree of freedom: df1 = N – 1
5. Between Columns Degree of Freedom: dfb = k – 1
6. Within columns Degree of Freedom: dft = df1 – dfb
𝑆𝑆
7. Mean Sum of Squares Between Columns: 𝑀𝑆𝑆𝑏 = 𝑑𝑓𝑏
𝑏
𝑆𝑆𝑤
8. Mean Sum of Squares Within Columns: 𝑀𝑆𝑆𝑤 = 𝑑𝑓
𝑤
𝑀𝑆𝑆𝑏
9. F – computed: 𝐹𝑐 = 𝑀𝑆𝑆𝑤

Where: N = the number of samples


F = the computed value of F
ADVANCED STATISTICS Page 5 of 12
ANOVA
Ft = the tabular value of F
k = the number of columns
df = the degree of freedom

In this module, formulas will not be used, instead an EXCEL Printout will be given after each problem.
At the same time you will be taught to generate EXCEL Printout of ANOVA single factor.
Under the assumption that the group or levels of the factor being studied represents populations
whose outcome measurements are randomly and independently drawn, follow the normal distribution, and
have equal variances, the null hypothesis of no differences in the population means
Ho: 𝜇1= 𝜇2= 𝜇3... 𝜇𝑛 There is no significant difference among the means of the groups being
compared, and is tested against the alternative hypothesis that the means are not all the same. Thus:
Ha:Not all 𝜇’s are equal.

Example:
The following table indicates the number of bottles of four popular brands of vinegar sold on six
randomly selected days. Test at 0.1 level of significance that there is no significant difference in the average
number of bottles sold of the four brands of vinegar.

A B C D
29 23 45 23
36 19 60 40
22 41 33 42
34 27 36 29
29 12 31 53
45 35 40 32

Solution:
Generate the Microsoft Excel Printout ANOVA.
To generate the printout, follow these steps:
1. Go to Excel
2. Click on Tools
3. Click data analysis
4. Select ANOVA: Single factor
5. Highlight all the data for the Input Range
6. Put/check alpha
7. Click the button for output range
8. Click one cell
9. Click “OK”.
ADVANCED STATISTICS Page 6 of 12
ANOVA
Here is the Microsoft Excel Output:

Below is the 5- step solution:


1. Ho: 𝜇𝐴 = 𝜇𝐵 = 𝜇𝐶 = 𝜇𝐷 : There is no difference in the average number of bottles of vinegar sold.
Ha: Not all the 𝜇′𝑠 are equal or at least one pair of means is significantly different.
2. 𝛼 = 0.01, 𝑝 𝑣𝑎𝑙𝑢𝑒 = 0.105792
3. Decision Rule: Reject H0 if p-value (0.105792) ≤ 𝛼(0.01)
4. Decision: Do not reject H0 because p value (0.105792) > 𝛼(0.01)
5. Conclusion: There is no significant difference in the average number of bottles of vinegar sold.
ADVANCED STATISTICS Page 7 of 12
ANOVA

Activity:
The table below shows the amount of dirt in milligrams that the four brands of laundry detergent have
removed. Test at 𝛼 = 0.05 if there is a significant difference among the mean amounts of dirt
removed by the four brands of laundry detergent.
A B C D
10 11 10 17
12 13 11 15
16 16 15 17
16 18 14 19
14 20 13 21

For your exercises, generate a Microsoft Excel Printout ANOVA and supply the missing data by
referring to the printout.

We had just finished the discussion on ANOVA.. Let’s move on to the next higher level of activity/ies
or exercise/s that demonstrate your potential skills/knowledge of what you have learned.
ADVANCED STATISTICS Page 8 of 12
ANOVA

V. ANALYSIS, APPLICATION AND EXPLORATION

Name: ______________________________ Grade & Section: __________________

Activity:
The table below shows the amount of dirt in milligrams that the four brands of laundry detergent have
removed. Test at 𝛼 = 0.05 if there is a significant difference among the mean amounts of dirt
removed by the four brands of laundry detergent.
A B C D
10 11 10 17
12 13 11 15
16 16 15 17
16 18 14 19
14 20 13 21

For your exercises, generate a Microsoft Excel Printout ANOVA and have a 5 step solution of
hypothesis testing.

Finally, let us summarize the lesson of what we had discussed today.


ADVANCED STATISTICS Page 9 of 12
ANOVA

VI. GENERALIZATION

Develop a graphic organizer which summarizes the important facts about ANOVA. Place your final output
below:

CONGRATULATIONS!
You have come to an end of Module 9.
OOPS! Don’t forget that you have still an assignment to do.
Here it is….
ADVANCED STATISTICS Page 10 of 12
ANOVA

VII. ASSIGNMENT

After your long journey of reading and accomplishing the module, let us now
challenge your mind by answering the evaluation part of this module.
ADVANCED STATISTICS Page 11 of 12
ANOVA

VIII. EVALUATION

Name:______________________________ Grade & Section: __________________


In the following multiple-choice questions, select the best answer.
1. Analysis of variance is a statistical method of comparing the ________ of several populations.
a. standard deviations
b. variances
c. means
d. proportions
ANSWER: _______
2. The ______ sum of squares measures the variability of the observed values around their respective
treatment means.
a. treatment
b. error
c. interaction
d. total
ANSWER: _______

3. The ________ sum of squares measures the variability of the sample treatment means around the
overall mean.
a. treatment
b. error
c. interaction
d. total
4. If the true means of the k populations are equal, then MSTR/MSE should be:
a. more than 1.00
b. close to 1.00
c. close to 0.00
d. close to -1.00
ANSWER: _______

5. If the MSE of an ANOVA for six treatment groups is known, you can compute
a. df1
b. the standard deviation of each treatment group
c. the pooled standard deviation
d. b and c
ANSWER: _______
ADVANCED STATISTICS Page 12 of 12
ANOVA
6. To determine whether the test statistic of ANOVA is statistically significant, it can be compared to
a critical value. What two pieces of information are needed to determine the critical value?
a. sample size, number of groups
b. mean, sample standard deviation
c. expected frequency, obtained frequency
ANSWER: _______

7. Which of the following is an assumption of one-way ANOVA comparing samples from three or
more experimental treatments?
a. All the response variables within the k populations follow a normal distributions.
b. The samples associated with each population are randomly selected and are independent
from all other samples.
c. The response variable within each of the k populations have equal variances.
d. All of the above.
ANSWER: _______

8. The error deviations within the SSE statistic measure distances:


a. within groups
b. between groups
c. both (a) and (b)
d. none of the above
e. between each value and the grand mean
ANSWER: _______

9. When the k population means are truly different from each other, it is likely that the average error
deviation:
a. is relatively large compared to the average treatment deviations
b. is relatively small compared to the average treatment deviations
c. is about equal to the average treatment deviation
d. differ significantly between at least two of the populations
ANSWER: _______

10. As variability due to chance decreases, the value of F will


a. increase CONGRATULATIONS on reaching the end of this module!
b. stay the same You may now proceed to the next module.
c. decrease Don’t forget to submit all the exercises, activities and portfolio
d. can’t tell from the given information on ___________________.
KEEP UP THE GOOD WORK.
Well Done!!!

You might also like

pFad - Phonifier reborn

Pfad - The Proxy pFad of © 2024 Garber Painting. All rights reserved.

Note: This service is not intended for secure transactions such as banking, social media, email, or purchasing. Use at your own risk. We assume no liability whatsoever for broken pages.


Alternative Proxies:

Alternative Proxy

pFad Proxy

pFad v3 Proxy

pFad v4 Proxy