OLS Assumptions

The document discusses the assumptions of the classical linear regression model: the error terms have zero mean, constant variance (homoscedasticity), no autocorrelation, and a normal distribution; the regressors are not collinear; the model is linear in parameters; and the series are stationary. It then presents regression output and discusses multicollinearity, heteroscedasticity, and autocorrelation in turn.


Assumptions

■ Zero mean value of ui, or E(ui | X2i, X3i) = 0 for each i
■ Homoscedasticity, or var(ui) = σ²
■ No multicollinearity among the regressors
■ No autocorrelation (serial correlation) between the error terms
■ Normality of the error terms
■ Linearity in parameters
■ Stationarity of the series
Dependent Variable: PAT
Method: Least Squares
Date: 04/23/19  Time: 23:27
Sample: 1 99
Included observations: 99

Variable             Coefficient   Std. Error   t-Statistic   Prob.
C                      1371.198     588.9139     2.328351     0.0220
DEBT_EQUITY_RATIO       118.7550    196.0949     0.605600     0.5462
NET_SALES                 0.054053    0.007966    6.785659     0.0000

R-squared             0.325295    Mean dependent var      3383.463
Adjusted R-squared    0.311238    S.D. dependent var      6036.003
S.E. of regression    5009.379    Akaike info criterion   19.90585
Sum squared resid     2.41E+09    Schwarz criterion       19.98449
Log likelihood       -982.3394    Hannan-Quinn criter.    19.93766
F-statistic           23.14217    Durbin-Watson stat      1.675624
Prob(F-statistic)     0.000000
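For readers working in Python rather than EViews, a minimal sketch of an equivalent estimation (the file and column names are assumptions for illustration):

```python
import pandas as pd
import statsmodels.formula.api as smf

# Hypothetical file with columns PAT, DEBT_EQUITY_RATIO, NET_SALES.
df = pd.read_csv("pat_data.csv")

# OLS of profit after tax on the debt-equity ratio and net sales,
# mirroring the EViews output above.
model = smf.ols("PAT ~ DEBT_EQUITY_RATIO + NET_SALES", data=df).fit()
print(model.summary())  # coefficients, t-statistics, R-squared, Durbin-Watson d
```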
MULTICOLLINEARITY

DEFINITION

■ Multicollinearity occurs when your model includes independent variables that are correlated not only with your dependent variable but also with each other.

TYPES

Y = β1 + β2X1 + β3X2 + β4X3 + µ

• Perfect collinearity
• Imperfect collinearity

■ Multicollinearity, as we have defined it, refers only to linear relationships among the X variables. It does not rule out nonlinear relationships among them. For example, consider the following regression model:

Y = β1 + β2X1 + β3X1² + β4X1³ + µ
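A small sketch with made-up data illustrating the point: the powers of X1 are related nonlinearly but not through an exact linear relationship, so the model remains estimable even though their pairwise correlations are high:

```python
import numpy as np

rng = np.random.default_rng(0)
x = rng.uniform(1, 10, size=100)

# Design matrix with a constant, X1, X1^2, X1^3.
X = np.column_stack([np.ones_like(x), x, x**2, x**3])

# Full column rank: no exact linear dependence among the columns.
print(np.linalg.matrix_rank(X))  # 4

# Yet the pairwise correlations among X1, X1^2, X1^3 are very high.
print(np.corrcoef(np.vstack([x, x**2, x**3])))
```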
SOURCES

■ The data collection method employed, for example, sampling over a limited range of the values taken by the regressors in the population.
■ Constraints on the model or in the population being sampled. For example, in the regression of electricity consumption on income (X2) and house size (X3) there is a physical constraint in the population in that families with higher incomes generally have larger homes than families with lower incomes.
■ In time series data, the regressors included in the model share a common trend, that is, they all increase or decrease over time.
CONSEQUENCES

■ Although BLUE, the OLS estimators have large variances and covariances, making precise estimation difficult.
■ Because of consequence 1, the confidence intervals tend to be much wider. Hence, the probability of accepting a false hypothesis (i.e., a type II error) increases.
■ Also because of consequence 1, the t ratio of one or more coefficients tends to be statistically insignificant.
■ The OLS estimators and their standard errors can be sensitive to small changes in the data.
DETECTION

■ High R² but few significant t ratios.
■ High pair-wise correlations among regressors.
■ Auxiliary regressions.
■ Variance Inflation Factor (VIF); see the sketch below.
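A minimal sketch of the VIF computation in Python with statsmodels (the data file and column names are the hypothetical ones from the earlier sketch):

```python
import pandas as pd
from statsmodels.stats.outliers_influence import variance_inflation_factor
from statsmodels.tools.tools import add_constant

df = pd.read_csv("pat_data.csv")  # hypothetical file from the example above
X = add_constant(df[["DEBT_EQUITY_RATIO", "NET_SALES"]])

# VIF_j = 1 / (1 - R_j^2), where R_j^2 is the R-squared of the auxiliary
# regression of regressor j on all the other regressors.
vifs = pd.Series(
    [variance_inflation_factor(X.values, i) for i in range(X.shape[1])],
    index=X.columns,
)
print(vifs)  # values above roughly 10 are a common warning sign
```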
REMEDIAL MEASURES

■ Dropping a variable(s).
■ Transformation of variables, for example first differencing (see the sketch after this list):

Yt = β1 + β2X2t + β3X3t + ut
Yt−1 = β1 + β2X2,t−1 + β3X3,t−1 + ut−1
Yt − Yt−1 = β2(X2t − X2,t−1) + β3(X3t − X3,t−1) + vt, where vt = ut − ut−1

■ Additional or new data.
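A minimal sketch of the first-difference transformation with pandas (the file and column names are illustrative; assumes time-ordered data):

```python
import pandas as pd
import statsmodels.formula.api as smf

# Hypothetical time-ordered data with columns Y, X2, X3.
df = pd.read_csv("timeseries.csv")

# First differences remove a common trend shared by the regressors.
d = df[["Y", "X2", "X3"]].diff().dropna()

# The differenced model has no intercept (beta1 cancels), hence "- 1".
model = smf.ols("Y ~ X2 + X3 - 1", data=d).fit()
print(model.summary())
```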


HETEROSCEDASTICITY
Meaning

▰ The data should be homoscedastic, which implies that the error terms have identical variances for all observations at a given value of X; that is, they are equally spread around the conditional mean of Y.
Meaning

▰ Homoscedasticity can also be explained by saying that the Y populations corresponding to different X values have the same variance around the respective X values. The following graphs illustrate this:
Graphical representation of heteroscedasticity

• The following graph shows heteroscedastic data, where the conditional variance of Y varies with a change in the X variable.

• We see that the variance is least when X = X1. When X = X2 or Xn, the variance tends to increase. This raises a question about the reliability of the data.
Reasons for heteroscedasticity

1. The regression model we choose might not suit the data: there might be an incorrect data transformation or an incorrect functional form.
2. We might omit a variable that is in fact important, violating the correct-specification assumption of the CLRM.
3. Heteroscedasticity can also arise from the presence of outliers.
4. Skewness in the distribution of one or more regressors included in the model may also lead to heteroscedasticity. Variables like education, income, and wealth generally have uneven distributions and give heteroscedastic data in most cases.
Consequences

1. Heteroscedasticity does not alter the unbiasedness and consistency properties of OLS estimators.
2. But OLS estimators are no longer minimum-variance, or efficient. That is, they are not best linear unbiased estimators (BLUE); they are simply linear unbiased estimators (LUE).
3. As a result, t and F tests based on the standard assumptions of the CLRM may not be reliable, resulting in erroneous conclusions regarding the statistical significance of the estimated regression coefficients.
Detection: Graphical

▰ Plot the residuals (or squared residuals) against the fitted values of Y or against a regressor; a systematic change in their spread suggests heteroscedasticity.
Detection: Mathematical

▰ Park test
▰ Glejser test
▰ Spearman's rank correlation test
▰ Goldfeld-Quandt test
▰ Breusch-Pagan-Godfrey test
▰ White's general heteroscedasticity test
▰ Koenker-Bassett test
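A minimal sketch of the Breusch-Pagan-Godfrey and White tests with statsmodels (assumes a fitted OLS result named model, as in the earlier sketches):

```python
import statsmodels.stats.diagnostic as dg

# Both tests regress (functions of) the squared residuals on the regressors;
# the null hypothesis is homoscedasticity.
bp_lm, bp_pvalue, _, _ = dg.het_breuschpagan(model.resid, model.model.exog)
w_lm, w_pvalue, _, _ = dg.het_white(model.resid, model.model.exog)

print(f"Breusch-Pagan: LM = {bp_lm:.3f}, p = {bp_pvalue:.4f}")
print(f"White:         LM = {w_lm:.3f}, p = {w_pvalue:.4f}")
# A small p-value (say, below 0.05) rejects homoscedasticity.
```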
An Example
Factors that determine the abortion rate across the 50 states in the USA:
▰ State = name of the state (50 US states).
▰ ABR=Abortion rate, number of abortions per thousand women
aged 15–44 in 1992.
▰ Religion = the percent of a state’s population that is Catholic,
Southern Baptist, Evangelical, or Mormon.
▰ Price = the average price charged in 1993 in non-hospital facilities
for an abortion at 10 weeks with local anesthesia (weighted by the
number of abortions performed in 1992).

An Example
▰ State = name of the state (50 US states).
▰ Laws = a variable that takes the value of 1 if a state enforces a law
that restricts a minor’s access to abortion, 0 otherwise.
▰ Funds = a variable that takes the value of 1 if state funds are
available for use to pay for an abortion under most circumstances,
0 otherwise.
▰ Educ = the percent of a state’s population that is 25 years or older
with a high school degree (or equivalent), 1990.
▰ Income = disposable income per capita, 1992.
▰ Picket = the percentage of respondents that reported experiencing
picketing with physical contact or blocking of patients.

The Econometric Model

Abortioni = β1 + β2Reli + β3Pricei + β4Lawsi + β5Fundsi + β6Educi + β7Incomei + β8Picketi + ui

where i = 1, 2, …, 50.

We would expect ABR to be negatively related to religion, price, laws, picket, and education, and positively related to funds and income. We assume the error term satisfies the standard classical assumptions, including homoscedasticity.

Of course, we will do a post-estimation analysis to see whether this assumption holds in the present case.
AUTOCORRELATION

Introduction

Autocorrelation is a mathematical representation of the degree of similarity between a given time series and a lagged version of itself over successive time intervals.

■ E(ui uj) ≠ 0 for i ≠ j
■ Arises chiefly in time series data
■ ρ (rho) is the coefficient of autocorrelation
■ ut = ρut−1 + εt
■ First-order autocorrelation
Patterns of positive and negative autocorrelation
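The patterns can be reproduced with a small simulation (a sketch; the ρ values are illustrative):

```python
import numpy as np
import matplotlib.pyplot as plt

rng = np.random.default_rng(42)

def ar1(rho, n=200):
    """Simulate u(t) = rho * u(t-1) + eps(t)."""
    u = np.zeros(n)
    eps = rng.standard_normal(n)
    for t in range(1, n):
        u[t] = rho * u[t - 1] + eps[t]
    return u

fig, axes = plt.subplots(2, 1, sharex=True)
axes[0].plot(ar1(0.9))    # positive autocorrelation: long, smooth swings
axes[0].set_title("rho = 0.9 (positive autocorrelation)")
axes[1].plot(ar1(-0.9))   # negative autocorrelation: rapid sign flips
axes[1].set_title("rho = -0.9 (negative autocorrelation)")
plt.tight_layout()
plt.show()
```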
Causes

■ Inertia
■ Specification bias
■ Cobweb phenomenon
■ Lags or autoregression
■ Manipulation of data
■ Nonstationarity
■ Data transformation
Consequences

■ The residual variance is underestimated
■ R² is overestimated
■ F and t test results are misleading

Overall, the results are unreliable.
Tests to detect

■ Graphical method
■ Durbin-Watson d test
■ Runs test
■ Breusch-Godfrey (BG) test
Graphical method

1. A lag-1 plot: a plot of the residuals et versus their lagged values et−1

■ Vertical axis: et for all t
■ Horizontal axis: et−1 for all t

2. Time sequence plot

■ Vertical axis: residuals
■ Horizontal axis: time
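A sketch of both plots for the residuals of a fitted model (assumes a statsmodels result named model, as in the earlier sketches):

```python
import matplotlib.pyplot as plt

e = model.resid.values  # residual series

fig, (ax1, ax2) = plt.subplots(1, 2, figsize=(10, 4))
ax1.scatter(e[:-1], e[1:])   # lag-1 plot: e(t) against e(t-1)
ax1.set_xlabel("e(t-1)")
ax1.set_ylabel("e(t)")
ax2.plot(e)                  # time sequence plot of the residuals
ax2.set_xlabel("time")
ax2.set_ylabel("residual")
plt.tight_layout()
plt.show()
```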
Durbin-Watson d test

■ Null hypothesis: ρ = 0
  Alternate hypothesis: ρ ≠ 0
■ Formula:

  d = Σ(et − et−1)² / Σet²,  numerator summed over t = 2, …, n and denominator over t = 1, …, n

■ Interpretation of test results: since d ≈ 2(1 − ρ̂), a value near 2 suggests no first-order autocorrelation, a value near 0 suggests positive autocorrelation, and a value near 4 suggests negative autocorrelation; compare d with the tabulated critical values dL and dU.
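A sketch with statsmodels (the same statistic appears in the regression summary; assumes a fitted result named model):

```python
from statsmodels.stats.stattools import durbin_watson

d = durbin_watson(model.resid)
print(f"Durbin-Watson d = {d:.4f}")  # near 2: no first-order autocorrelation
```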


Runs test

■ Null hypothesis: the residuals are random (ρ = 0)
  Alternate hypothesis: ρ ≠ 0
■ Formula:

  E(R) = 2N1N2 / N + 1
  var(R) = 2N1N2(2N1N2 − N) / (N²(N − 1))

  N = N1 + N2 = total number of observations
  N1 = number of + symbols
  N2 = number of − symbols
  R = number of runs

■ Construction of a 95% confidence interval:

  E(R) ± 1.96 SE(R)

■ Interpretation of results:

  If R, the number of runs, lies within the confidence interval, we do not reject the null hypothesis. However, if R lies outside the confidence interval, we reject the null hypothesis.
Example

(---------) (+++++++++++++++++++++) (----------)
R = 3

Confidence interval: 95%

R = 3 lies outside the confidence interval. Therefore, we reject the null hypothesis and conclude that the residuals exhibit autocorrelation.
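A pure-Python sketch implementing the formulas above (not a library routine; apply it to a residual series such as model.resid):

```python
import numpy as np

def runs_test(resid, z_crit=1.96):
    """Runs test for randomness of residual signs, using the formulas above."""
    signs = np.sign(resid)
    signs = signs[signs != 0]                 # drop exact zeros
    n1 = int(np.sum(signs > 0))               # number of + residuals
    n2 = int(np.sum(signs < 0))               # number of - residuals
    n = n1 + n2
    runs = 1 + int(np.sum(signs[1:] != signs[:-1]))  # a run ends at each sign change
    mean_r = 2 * n1 * n2 / n + 1
    var_r = 2 * n1 * n2 * (2 * n1 * n2 - n) / (n**2 * (n - 1))
    half_width = z_crit * np.sqrt(var_r)
    lo, hi = mean_r - half_width, mean_r + half_width
    reject = not (lo <= runs <= hi)
    return runs, (lo, hi), reject

# Example: runs, ci, reject = runs_test(model.resid.values)
```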
Remedial measures

■ Explore the possibility of mis-specification of the model
■ Use generalized least squares (GLS) to transform the original model
■ Use the Newey-West procedure to obtain corrected (HAC) standard errors; see the sketch below
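A sketch of the Newey-West correction with statsmodels, reusing the hypothetical data from the earlier sketches (the lag length is an illustrative choice):

```python
import pandas as pd
import statsmodels.formula.api as smf

df = pd.read_csv("pat_data.csv")  # hypothetical file from the example above

# Same OLS coefficients, but heteroscedasticity-and-autocorrelation-
# consistent (HAC) Newey-West standard errors.
model_hac = smf.ols("PAT ~ DEBT_EQUITY_RATIO + NET_SALES", data=df).fit(
    cov_type="HAC", cov_kwds={"maxlags": 4}  # illustrative lag length
)
print(model_hac.summary())
```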
Steps in EViews

■ Estimate the regression equation
■ Note the Durbin-Watson d statistic reported in the output window
■ Compare it with the critical d values
■ Open the residuals graph under 'View' to cross-check your result with the graphical method
THANK YOU!
