Output and Questions - LAB 1
Output and Questions - LAB 1
For sample age, report the mean, median, mode, and variability (range,
standard deviation, variance) plus skewness and kurtosis.)
For the sample age, the following statistics can be reported:
o The mean age was 22.93.
o The median age was 22.00.
o The mode of age is 18 a (“a” indicating that there are multiple modes, and 18
is the smallest mode value shown)
o The range of age is 12 (Range: 12 = (Maximum data entry:30) - (Minimum
data entry:18)
o The standard deviation of age is 4.008.
o The variance of age is 16.067.
o The skewness of age is 0.394 (Positively skewed).
o The kurtosis of age is -1.240 (Platykurtic because there is too variation in
the data)
The box reflects about half of the data, meaning that around 50% of data entries are found
between 1.5 and 4. The dark line illustrated in the box plot represents the median number of
visits which is 3, as validated by the box plot. There are also 2 extra data points depicted by dots
on the graph. These dots correspond to individual 4 who visited 9 times and individual 8 who
visited 8 times within the year. These data points are known as outliers.
3. Is age normally distributed for the full dataset? How about when separated by
gender?
Tests of Normality
Kolmogorov-Smirnova Shapiro-Wilk
Statistic df Sig. Statistic df Sig.
*
Age .152 15 .200 .921 15 .203
Since our data is less than 50, we refer to the Shapiro-Wilk test. The significance column
value is greater than 0.05, indicating that the data follows a normal distribution.(0.203 > 0.05)
Tests of Normality
Gende Kolmogorov-Smirnova Shapiro-Wilk
r Statistic df Sig. Statistic df Sig.
Age Male .199 8 .200* .882 8 .196
Femal .135 7 .200 *
.962 7 .832
e
*. This is a lower bound of the true significance.
a. Lilliefors Significance Correction
When separated by gender the significance column value is greater than 0.05 for males
(0.196>0.05) and females (0.832> 0.05), meaning that the data also follows normal
distribution even when separated by gender.
4. What is the average number of visits in the past year and is it normally
distributed?
Statistic
Avrg_visits Mean 3.53
Since our data is less than 50, we refer to the Shapiro-Wilk test. The significance column
value is less than 0.05, indicating that the data does not follow a normal distribution. (0.05 >
0.033)
5. Reverse score the values for self-esteem (so 5s are 1s and 1s are 5s, etc…); then
provide means and standard deviations by gender.
Descriptives
Std.
Gender Statistic Error
esteem_REC Male Mean 3.63 .420
ODE 95% Confidence Lower 2.63
Interval for Mean Bound
Upper 4.62
Bound
5% Trimmed Mean 3.64
Median 4.00
Variance 1.411
Std. Deviation 1.188
Minimum 2
Maximum 5
Range 3
Interquartile Range 3
Skewness -.394 .752
Kurtosis -1.229 1.481
Femal Mean 2.14 .459
e 95% Confidence Lower 1.02
Interval for Mean Bound
Upper 3.27
Bound
5% Trimmed Mean 2.10
Median 2.00
Variance 1.476
Std. Deviation 1.215
Minimum 1
Maximum 4
Range 3
Interquartile Range 2
Skewness .414 .794
Kurtosis -1.525 1.587
Output file for lab 1
SET TLook=None FOOTNOTE=ON AUTORECOVERY=ON Small=0.0001
SUMMARY=None THREADS=AUTO Printback=On SIGLESS=YES TFit=Both
DIGITGROUPING=No LEADZERO=No.
FREQUENCIES VARIABLES=Age
/STATISTICS=STDDEV VARIANCE RANGE MEAN MEDIAN MODE SKEWNESS
SESKEW KURTOSIS SEKURT
/HISTOGRAM NORMAL
/ORDER=ANALYSIS.
Frequencies
Notes
Output Created 17-MAY-2023 11:28:51
Comments
Input Active Dataset DataSet0
Filter <none>
Weight <none>
Split File <none>
N of Rows in Working 15
Data File
Missing Value Definition of Missing User-defined missing
Handling values are treated as
missing.
Cases Used Statistics are based on
all cases with valid data.
Syntax FREQUENCIES
VARIABLES=Age
/STATISTICS=STDDEV
VARIANCE RANGE
MEAN MEDIAN MODE
SKEWNESS SESKEW
KURTOSIS SEKURT
/HISTOGRAM NORMAL
/ORDER=ANALYSIS.
Statistics
Age
N Valid 15
Missing 0
Mean 22.93
Median 22.00
Mode 18a
Std. Deviation 4.008
Variance 16.067
Skewness .394
Std. Error of .580
Skewness
Kurtosis -1.240
Std. Error of Kurtosis 1.121
Range 12
a. Multiple modes exist. The
smallest value is shown
Age
Frequenc Valid Cumulative
y Percent Percent Percent
Valid 18 2 13.3 13.3 13.3
19 2 13.3 13.3 26.7
20 1 6.7 6.7 33.3
21 2 13.3 13.3 46.7
22 1 6.7 6.7 53.3
23 1 6.7 6.7 60.0
24 1 6.7 6.7 66.7
26 1 6.7 6.7 73.3
27 1 6.7 6.7 80.0
28 2 13.3 13.3 93.3
30 1 6.7 6.7 100.0
Total 15 100.0 100.0
Frequency
Histogram
3 Mean = 22.93
Std. Dev. = 4.008
N = 15
0
15 20 25 30 35
Age
FREQUENCIES VARIABLES=Avrg_visits
/HISTOGRAM NORMAL
/ORDER=ANALYSIS.
Frequencies
Notes
Output Created 17-MAY-2023 11:58:33
Comments
Input Active Dataset DataSet0
Filter <none>
Weight <none>
Split File <none>
N of Rows in Working 15
Data File
Missing Value Definition of Missing User-defined missing
values are treated as
Handling missing.
Cases Used Statistics are based on
all cases with valid data.
Syntax FREQUENCIES
VARIABLES=Avrg_visits
/HISTOGRAM NORMAL
/ORDER=ANALYSIS.
Statistics
Avrg_visits
N Valid 15
Missing 0
Avrg_visits
Frequenc Valid Cumulative
y Percent Percent Percent
Valid 1 4 26.7 26.7 26.7
2 2 13.3 13.3 40.0
3 2 13.3 13.3 53.3
4 4 26.7 26.7 80.0
6 1 6.7 6.7 86.7
8 1 6.7 6.7 93.3
9 1 6.7 6.7 100.0
Total 15 100.0 100.0
Frequency
Histogram
4 Mean = 3.53
Std. Dev. = 2.503
N = 15
0
0 2 4 6 8 10
Avrg_visits
EXAMINE VARIABLES=Avrg_visits
/PLOT BOXPLOT STEMLEAF
/COMPARE GROUPS
/STATISTICS NONE
/CINTERVAL 95
/MISSING LISTWISE
/NOTOTAL.
Explore
Notes
Output Created 17-MAY-2023 12:08:34
Comments
Input Active Dataset DataSet0
Filter <none>
Weight <none>
Split File <none>
N of Rows in Working 15
Data File
Missing Value Definition of Missing User-defined missing
Handling values for dependent
variables are treated as
missing.
Cases Used Statistics are based on
cases with no missing
values for any
dependent variable or
factor used.
Syntax EXAMINE
VARIABLES=Avrg_visits
/PLOT BOXPLOT
STEMLEAF
/COMPARE GROUPS
/STATISTICS NONE
/CINTERVAL 95
/MISSING LISTWISE
/NOTOTAL.
Avrg_visits
4.00 1 . 0000
2.00 2 . 00
2.00 3 . 00
4.00 4 . 0000
.00 5 .
1.00 6 . 0
2.00 Extremes (>=8.0)
Stem width: 1
Each leaf: 1 case(s)
10
8
8
Avrg_visits
EXAMINE VARIABLES=Age
/PLOT NPPLOT
/STATISTICS DESCRIPTIVES
/CINTERVAL 95
/MISSING LISTWISE
/NOTOTAL.
Explore
Notes
Output Created 17-MAY-2023 12:46:55
Comments
Input Active Dataset DataSet0
Filter <none>
Weight <none>
Split File <none>
N of Rows in Working 15
Data File
Missing Value Definition of Missing User-defined missing
Handling values for dependent
variables are treated as
missing.
Cases Used Statistics are based on
cases with no missing
values for any
dependent variable or
factor used.
Syntax EXAMINE
VARIABLES=Age
/PLOT NPPLOT
/STATISTICS
DESCRIPTIVES
/CINTERVAL 95
/MISSING LISTWISE
/NOTOTAL.
Descriptives
Statistic Std. Error
Age Mean 22.93 1.035
95% Confidence Interval Lower 20.71
for Mean Bound
Upper 25.15
Bound
5% Trimmed Mean 22.81
Median 22.00
Variance 16.067
Std. Deviation 4.008
Minimum 18
Maximum 30
Range 12
Interquartile Range 8
Skewness .394 .580
Kurtosis -1.240 1.121
Tests of Normality
Kolmogorov-Smirnova Shapiro-Wilk
Statistic df Sig. Statistic df Sig.
Age .152 15 .200 *
.921 15 .203
*. This is a lower bound of the true significance.
a. Lilliefors Significance Correction
Age
Expected Normal
Normal Q-Q Plot of Age
-1
15 20 25 30 35
Observed Value
Dev from Normal
0.2
0.0
-0.2
18 20 22 24 26 28 30
Observed Value
Explore
Notes
Output Created 17-MAY-2023 13:01:05
Comments
Input Active Dataset DataSet0
Filter <none>
Weight <none>
Split File <none>
N of Rows in Working 15
Data File
Missing Value Definition of Missing User-defined missing
Handling values for dependent
variables are treated as
missing.
Cases Used Statistics are based on
cases with no missing
values for any
dependent variable or
factor used.
Syntax EXAMINE
VARIABLES=Age BY
Gender
/PLOT NPPLOT
/STATISTICS NONE
/CINTERVAL 95
/MISSING LISTWISE
/NOTOTAL.
Tests of Normality
Kolmogorov-Smirnova Shapiro-Wilk
Gender Statistic df Sig. Statistic df Sig.
Age Male .199 8 .200 *
.882 8 .196
Female .135 7 .200 *
.962 7 .832
*. This is a lower bound of the true significance.
a. Lilliefors Significance Correction
Age
-1
-2
Observed Value
Expected Normal
-1
-2
15 20 25 30 35
Observed Value
Detrended Normal Q-Q Plots
0.4
0.2
0.0000
-0.2
-0.4
18 20 22 24 26 28
Observed Value
Dev from Normal
Detrended Normal Q-Q Plot of Age
for Gender= Female
0.6
0.4
0.2
0.0000
-0.2
-0.4
18 20 22 24 26 28 30
Observed Value
EXAMINE VARIABLES=Avrg_visits
/PLOT NPPLOT
/STATISTICS DESCRIPTIVES
/CINTERVAL 95
/MISSING LISTWISE
/NOTOTAL.
Explore
Notes
Output Created 17-MAY-2023 13:11:56
Comments
Input Active Dataset DataSet0
Filter <none>
Weight <none>
Split File <none>
N of Rows in Working 15
Data File
Missing Value Definition of Missing User-defined missing
Handling values for dependent
variables are treated as
missing.
Cases Used Statistics are based on
cases with no missing
values for any
dependent variable or
factor used.
Syntax EXAMINE
VARIABLES=Avrg_visits
/PLOT NPPLOT
/STATISTICS
DESCRIPTIVES
/CINTERVAL 95
/MISSING LISTWISE
/NOTOTAL.
Descriptives
Statistic Std. Error
Avrg_visit Mean 3.53 .646
s 95% Confidence Interval Lower 2.15
for Mean Bound
Upper 4.92
Bound
5% Trimmed Mean 3.37
Median 3.00
Variance 6.267
Std. Deviation 2.503
Minimum 1
Maximum 9
Range 8
Interquartile Range 3
Skewness 1.026 .580
Kurtosis .440 1.121
Tests of Normality
Kolmogorov-Smirnova Shapiro-Wilk
Statistic df Sig. Statistic df Sig.
Avrg_visit .226 15 .038 .870 15 .033
s
a. Lilliefors Significance Correction
Avrg_visits
Expected Normal
Normal Q-Q Plot of Avrg_visits
3
-1
-2
0 2 4 6 8 10
Observed Value
Dev from Normal
0.75
0.50
0.25
0.00
-0.25
0 2 4 6 8 10
Observed Value
Explore
Notes
Output Created 17-MAY-2023 14:03:38
Comments
Input Active Dataset DataSet0
Filter <none>
Weight <none>
Split File <none>
N of Rows in Working 15
Data File
Missing Value Definition of Missing User-defined missing
Handling values for dependent
variables are treated as
missing.
Cases Used Statistics are based on
cases with no missing
values for any
dependent variable or
factor used.
Syntax EXAMINE
VARIABLES=esteem_R
ECODE BY Gender
/PLOT NONE
/STATISTICS
DESCRIPTIVES
/CINTERVAL 95
/MISSING LISTWISE
/NOTOTAL.
Gender
Descriptives
Gender Statistic Std. Error
esteem_RECOD Male Mean 3.63 .420
E 95% Confidence Interval Lower 2.63
for Mean Bound
Upper 4.62
Bound
5% Trimmed Mean 3.64
Median 4.00
Variance 1.411
Std. Deviation 1.188
Minimum 2
Maximum 5
Range 3
Interquartile Range 3
Skewness -.394 .752
Kurtosis -1.229 1.481
Female Mean 2.14 .459
95% Confidence Interval Lower 1.02
for Mean Bound
Upper 3.27
Bound
5% Trimmed Mean 2.10
Median 2.00
Variance 1.476
Std. Deviation 1.215
Minimum 1
Maximum 4
Range 3
Interquartile Range 2
Skewness .414 .794
Kurtosis -1.525 1.587