Simpreg
MEAN MODEL
  n                              6
  Mean of Y                      53.667
  Std. dev. of Y                 5.955
  Std. error of the mean         2.431
  Lower 95% limit for the mean   47.417
  Upper 95% limit for the mean   59.916
Confidence interval for the mean = Mean +/- (t-value)*(std. error of mean)
Confidence interval for a prediction = Mean +/- (t-value)*(RMSE)
Note that the RMSE for the mean model is just the sample standard deviation of the dependent variable, which is also the sample standard deviation of the errors in this case.
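As a numerical check on these two formulas, here is a minimal Python sketch; it assumes numpy and scipy are available, and the variable names are illustrative rather than part of the worksheet:

    import numpy as np
    from scipy import stats

    y = np.array([45, 58, 50, 54, 62, 53])       # the Y data used throughout this workbook
    n = len(y)                                   # 6
    mean = y.mean()                              # 53.667
    sd = y.std(ddof=1)                           # 5.955 = RMSE of the mean model
    se_mean = sd / np.sqrt(n)                    # 2.431 = std. error of the mean
    t = stats.t.ppf(0.975, df=n - 1)             # 2.571 with 5 degrees of freedom

    ci_mean = (mean - t * se_mean, mean + t * se_mean)   # (47.417, 59.916)
    ci_pred = (mean - t * sd, mean + t * sd)             # (38.358, 68.975)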
SIMPLE REGRESSION OF Y ON X (same data)

Statistics of Y (the mean model, for comparison):
  Mean absolute deviation               4.333
  Sum of squared deviations (SST)       177.333
  Sample variance                       35.467
  Sample std. deviation                 5.955
  t-value (5 d.f., 95%)                 2.571
  X     Y     FCST      UPPER 95%
  18    45    50.252    68.975
  25    58    59.215    68.975
  15    50    46.411    68.975
  22    54    55.374    68.975
  24    62    57.935    68.975
  20    53    52.813    68.975

(FCST = forecast from the fitted regression line; UPPER 95% = the mean model's upper 95%
prediction limit, 53.667 + 2.571*5.955 = 68.975, which is the same for every observation.)
Summary statistics of the columns (ERR = Y - FCST):
                   X         Y         FCST      ERR
  Count            6         6
  Mean             20.667    53.667    53.667    0.000
  Std. dev.        3.777     5.955     4.836     3.475
  Variance         14.267    35.467    23.388    12.079
  Mean abs. error                                2.614
SQ ERR (squared forecast errors):
  27.587   1.476   12.879   1.887   16.528   0.035

  Average squared error (SSE/n)            10.065
  Mean absolute error                       2.614
  Sum of squared errors (SSE)              60.393
  Mean squared error (MSE = SSE/(n-2))     15.098
  Root mean squared error (RMSE)            3.886
  t-value (4 d.f., 95%)                     2.776
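The FCST column above is just a + b*X, where a and b are the least-squares intercept and slope. A minimal numpy sketch of that calculation (the names a, b, fcst, etc. are illustrative, not the worksheet's):

    import numpy as np

    x = np.array([18, 25, 15, 22, 24, 20])
    y = np.array([45, 58, 50, 54, 62, 53])

    # Least-squares slope and intercept for simple regression
    b = np.sum((x - x.mean()) * (y - y.mean())) / np.sum((x - x.mean()) ** 2)   # 1.280
    a = y.mean() - b * x.mean()                                                 # 27.206

    fcst = a + b * x              # 50.252, 59.215, 46.411, 55.374, 57.935, 52.813
    err = y - fcst                # forecast errors
    sse = np.sum(err ** 2)        # 60.393
    mse = sse / (len(x) - 2)      # 15.098
    rmse = np.sqrt(mse)           # 3.886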
The difference between unadjusted and adjusted R-squared is that the former uses VAR(ERR) = SSE/(n-1) as its estimate of the error variance, whereas the latter uses MSE = SSE/(n-2). The former is a biased measure of the error variance, whereas the latter is an unbiased estimate, correcting for the fact that 2 coefficients have been estimated, not 1.
Also note that MSE is not just the sample mean of the squared errors: it is the sum of squared errors divided by n-p (here p = 2), not divided by n.
The RMSE for a regression model is also called the Standard Error of the Estimate (SEE).
The exact confidence interval for a prediction is equal to the prediction +/- (t-value)*(std. dev. of the prediction); however, the std. dev. of the prediction is NOT simply the RMSE of the model (unlike in the mean model). Rather, it includes an additional factor that depends on the standard errors of the coefficients and the values of the independent variables at that point.
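For simple regression that additional factor has a closed form: the std. dev. of a prediction at a new value x0 is SQRT(MSE * (1 + 1/n + (x0 - AVG(X))^2 / SUM((X - AVG(X))^2))). Here is a minimal sketch using that textbook formula; x0 and all variable names are illustrative, not from the worksheet:

    import numpy as np
    from scipy import stats

    x = np.array([18, 25, 15, 22, 24, 20])
    y = np.array([45, 58, 50, 54, 62, 53])
    n = len(x)

    b = np.sum((x - x.mean()) * (y - y.mean())) / np.sum((x - x.mean()) ** 2)
    a = y.mean() - b * x.mean()
    mse = np.sum((y - (a + b * x)) ** 2) / (n - 2)     # 15.098

    x0 = 20                                            # an illustrative new value of X
    pred = a + b * x0                                  # 52.813
    # RMSE inflated by terms reflecting the estimation of the intercept and slope
    se_pred = np.sqrt(mse * (1 + 1/n + (x0 - x.mean()) ** 2 / np.sum((x - x.mean()) ** 2)))
    t = stats.t.ppf(0.975, df=n - 2)                   # 2.776
    interval = (pred - t * se_pred, pred + t * se_pred)    # roughly (41.1, 64.5)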
This worksheet shows the "brute force" calculation of regression coefficients, predictions, and confidence
intervals using matrix algebra. Essentially the same formulas would work for any number of data points and
independent variables, although the arrays would have to be reshaped.
The "Y" vector (dependent variable) and its deviations-from-mean and squared-deviations-from-mean:
  Y        Y-AVG(Y)    (Y-AVG(Y))^2
  45       -8.667       75.111
  58        4.333       18.778
  50       -3.667       13.444
  54        0.333        0.111
  62        8.333       69.444
  53       -0.667        0.444

Average values:
  53.667    0.000       29.556

(In the spreadsheet, the blue cells are "live": you can change their contents and see what happens.)
29.55556 = POPULATION VARIANCE of Y (named VARPY) is the average squared deviation of Y from its mean
35.46667 = SAMPLE VARIANCE of Y (named VARY) is the average squared deviation of Y from its mean ADJUSTED for the estimation of the mean from the finite sample
(i.e., it is the sum of squared deviations from the mean divided by N-1 rather than N)
Here is "X-transpose" (the X matrix transposed, named XT)
   1    1    1    1    1    1
  18   25   15   22   24   20
SQUARED ERRORS:
27.58704
1.476111
12.87938
1.887414
16.52764
0.034938
60.39252 = Sum of Squared Errors (SSE)
The SIMPLE AVERAGE OF THE SQUARED ERRORS is the sum of squared errors divided by N:
10.06542 (This is a BIASED estimate of the average size of a squared error)
R-SQUARED is equal to 1 minus the average squared error divided by the population variance of Y:
0.659441 (This is a BIASED estimate of the fraction of variance "explained" by the model)
The MEAN SQUARED ERROR (MSE) is equal to the Sum of Squared Errors divided by the # Degrees of Freedom:
15.09813 (This is an UNBIASED estimate of the average size of a squared error)
ADJUSTED R-SQUARED is equal to 1 minus the MSE divided by the sample variance of Y:
0.574301 (This is an UNBIASED estimate of the fraction of variance "explained" by the model)
The STANDARD ERROR OF THE ESTIMATE (SEE) is the square root of the MSE:
3.885631
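A minimal numpy sketch of the quantities just defined, taking SSE from above (the variable names are illustrative):

    import numpy as np

    y = np.array([45, 58, 50, 54, 62, 53])
    n = len(y)
    sse = 60.39252                         # Sum of Squared Errors from above
    sst = np.sum((y - y.mean()) ** 2)      # 177.333 = sum of squared deviations of Y

    avg_sq_err = sse / n                               # 10.065  (biased)
    r_squared = 1 - avg_sq_err / (sst / n)             # 0.6594  (biased)
    mse = sse / (n - 2)                                # 15.098  (unbiased)
    adj_r_squared = 1 - mse / (sst / (n - 1))          # 0.5743  (unbiased)
    see = np.sqrt(mse)                                 # 3.886 = Standard Error of the Estimate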
The COVARIANCE MATRIX OF THE COEFFICIENT ESTIMATES ("COVMAT") is equal to X-transpose-X-inverse
times the MSE:
92.917 -4.37422
-4.37422 0.211656
The STANDARD ERRORS OF THE COEFFICIENT ESTIMATES are the square roots of the diagonal elements
of the covariance matrix:
9.639347
0.460061
The T-STATISTICS OF THE COEFFICIENT ESTIMATES are the coefficients divided by their standard errors:
  Intercept:      27.206 / 9.639 = 2.822
  X coefficient:   1.280 / 0.460 = 2.783
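For reference, a minimal numpy sketch of the whole "brute force" matrix calculation described on this page (variable names are illustrative; covmat corresponds to the worksheet's COVMAT):

    import numpy as np

    x = np.array([18, 25, 15, 22, 24, 20])
    y = np.array([45.0, 58, 50, 54, 62, 53])
    X = np.column_stack([np.ones_like(x), x])     # 6x2 design matrix; X.T is the worksheet's XT

    XtX_inv = np.linalg.inv(X.T @ X)
    beta = XtX_inv @ X.T @ y                      # [27.206, 1.280] = intercept and slope

    errors = y - X @ beta
    sse = errors @ errors                         # 60.393
    mse = sse / (len(y) - X.shape[1])             # 15.098 (degrees of freedom = n - 2)

    covmat = mse * XtX_inv                        # [[92.917, -4.374], [-4.374, 0.2117]]
    std_errs = np.sqrt(np.diag(covmat))           # [9.639, 0.460]
    t_stats = beta / std_errs                     # [2.822, 2.783]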
[Chart: Y (data) and FCST (regression forecasts) plotted against X.]
Excel regression
ANOVA
                df    SS          MS         F          Significance F
  Regression     1    116.9408    116.9408   7.745383   0.049663
  Residual       4     60.39252    15.09813
  Total          5    177.3333

                Coefficients   Standard Error   t Stat     P-value    Lower 95%   Upper 95%
  Intercept     27.205607      9.639347         2.82235    0.047714   0.442436    53.96878
  X Variable 1   1.280374      0.460061         2.783053   0.049663   0.003037     2.55771
RESIDUAL OUTPUT

  Observation   Predicted Y   Residuals   Standard Residuals
  1             50.2523       -5.2523     -1.3517
  2             59.2150       -1.2150     -0.3127
  3             46.4112        3.5888      0.9236
  4             55.3738       -1.3738     -0.3536
  5             57.9346        4.0654      1.0463
  6             52.8131        0.1869      0.0481
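The same output can be reproduced in Python; here is a minimal sketch assuming the statsmodels package is available (statsmodels is not part of this worksheet):

    import numpy as np
    import statsmodels.api as sm

    x = np.array([18, 25, 15, 22, 24, 20])
    y = np.array([45, 58, 50, 54, 62, 53])

    X = sm.add_constant(x)              # adds the intercept column
    fit = sm.OLS(y, X).fit()

    print(fit.summary())                # regression statistics, ANOVA, coefficient table
    print(fit.params)                   # [27.206, 1.280]  = Intercept, X Variable 1
    print(fit.bse)                      # [9.639, 0.460]   = standard errors
    print(fit.conf_int(alpha=0.05))     # the Lower 95% / Upper 95% columns
    print(fit.fittedvalues)             # Predicted Y
    print(fit.resid)                    # Residuals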
[Residual plot: Residuals vs. X Variable 1.]
[Line fit plot: Y and Predicted Y vs. X Variable 1.]
SG regression