Output and Questions - LAB 1

The document summarizes statistics for a sample's age variable. It reports the mean, median, mode, range, standard deviation, variance, skewness, and kurtosis for the age variable. It shows that age is positively skewed and platykurtic. Age is normally distributed for the full dataset and when separated by gender according to the Shapiro-Wilk test results. The average number of visits in the past year was 3.53 visits, which is not normally distributed. Reverse scoring self-esteem values produced higher mean self-esteem scores for males compared to females.

Uploaded by

Grace Kazeneza

1. For sample age, report the mean, median, mode, and variability (range,
standard deviation, variance) plus skewness and kurtosis.
For the sample age, the following statistics can be reported:
o The mean age was 22.93.
o The median age was 22.00.
o The mode of age is 18 (SPSS footnote "a": multiple modes exist, and 18 is
the smallest mode value shown).
o The range of age is 12 (maximum data entry 30 - minimum data entry 18).
o The standard deviation of age is 4.008.
o The variance of age is 16.067.
o The skewness of age is 0.394 (positively skewed).
o The kurtosis of age is -1.240 (platykurtic: the distribution is flatter
than a normal curve, with lighter tails).
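
The central-tendency and variability figures above can be cross-checked outside SPSS. The sketch below is only an illustration in Python, not part of the original lab run: it rebuilds the 15 ages from the Age frequency table in the output file and recomputes each statistic with the standard library. The variable names are illustrative.

```python
import statistics

# Ages rebuilt from the Age frequency table in the SPSS output (N = 15).
age_counts = {18: 2, 19: 2, 20: 1, 21: 2, 22: 1, 23: 1,
              24: 1, 26: 1, 27: 1, 28: 2, 30: 1}
ages = [age for age, count in age_counts.items() for _ in range(count)]

mean_age = statistics.mean(ages)
median_age = statistics.median(ages)
modes = statistics.multimode(ages)      # every value tied for most frequent
age_range = max(ages) - min(ages)
variance = statistics.variance(ages)    # sample variance (n - 1 denominator)
std_dev = statistics.stdev(ages)        # sample standard deviation

print(f"mean={mean_age:.2f} median={median_age} modes={modes}")
print(f"range={age_range} variance={variance:.3f} sd={std_dev:.3f}")
```

Four ages (18, 19, 21 and 28) each occur twice, which is why SPSS attaches the "a" footnote to the mode and prints only the smallest of them.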

2. Present the frequency distribution of the number of visits and include a
normality curve. Provide a box plot and interpret it.

The box covers the middle half of the data: about 50% of the data entries fall
between 1.5 and 4 visits. The dark line inside the box marks the median number of
visits, which is 3. Two additional data points are drawn as dots above the upper
whisker. They correspond to individual 4, who visited 9 times, and individual 8, who
visited 8 times within the year. These data points are outliers.
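
The box edges and the two outliers can be reproduced from the Avrg_visits frequency table. The following Python sketch is an illustration, not the SPSS procedure itself: it assumes the box edges are Tukey's hinges and that points more than 1.5 box-lengths beyond a hinge are plotted as outliers, which is consistent with the 1.5 and 4 read off the plot. All names are illustrative.

```python
import statistics

# Visit counts rebuilt from the Avrg_visits frequency table (N = 15).
visit_counts = {1: 4, 2: 2, 3: 2, 4: 4, 6: 1, 8: 1, 9: 1}
visits = sorted(v for v, count in visit_counts.items() for _ in range(count))

n = len(visits)
median = statistics.median(visits)

# Tukey's hinges: medians of the lower and upper halves, where each half
# includes the overall median when n is odd.
half = (n + 1) // 2
lower_hinge = statistics.median(visits[:half])
upper_hinge = statistics.median(visits[-half:])

# Points more than 1.5 box-lengths beyond a hinge are treated as outliers.
box_length = upper_hinge - lower_hinge
upper_fence = upper_hinge + 1.5 * box_length
lower_fence = lower_hinge - 1.5 * box_length
outliers = [v for v in visits if v < lower_fence or v > upper_fence]

print(lower_hinge, median, upper_hinge, upper_fence, outliers)
```

Under these assumptions the upper fence is 7.75, so the visit counts of 8 and 9 are the only values flagged, matching the "Extremes (>=8.0)" row of the stem-and-leaf plot.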
3. Is age normally distributed for the full dataset? How about when separated by
gender?

Tests of Normality
        Kolmogorov-Smirnov(a)           Shapiro-Wilk
        Statistic   df   Sig.           Statistic   df   Sig.
Age     .152        15   .200*          .921        15   .203
*. This is a lower bound of the true significance.
a. Lilliefors Significance Correction
Because the sample has fewer than 50 cases (n = 15), we refer to the Shapiro-Wilk test.
The significance value is greater than 0.05 (0.203 > 0.05), so there is no evidence
against normality and age can be treated as normally distributed for the full dataset.

Tests of Normality
                 Kolmogorov-Smirnov(a)           Shapiro-Wilk
      Gender     Statistic   df   Sig.           Statistic   df   Sig.
Age   Male       .199        8    .200*          .882        8    .196
      Female     .135        7    .200*          .962        7    .832
*. This is a lower bound of the true significance.
a. Lilliefors Significance Correction

When separated by gender, the Shapiro-Wilk significance value is greater than 0.05 for
both males (0.196 > 0.05) and females (0.832 > 0.05), so age can also be treated as
normally distributed within each gender.

4. What is the average number of visits in the past year and is it normally
distributed?

Statistic
Avrg_visits Mean 3.53

The average number of visits in the past year was 3.53.


Tests of Normality
                Kolmogorov-Smirnov(a)           Shapiro-Wilk
                Statistic   df   Sig.           Statistic   df   Sig.
Avrg_visits     .226        15   .038           .870        15   .033
a. Lilliefors Significance Correction

Because the sample has fewer than 50 cases (n = 15), we refer to the Shapiro-Wilk test.
The significance value is less than 0.05 (0.033 < 0.05), indicating that the number of
visits does not follow a normal distribution.
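
The same decision rule is applied to every normality test in questions 3 and 4. As a minimal sketch (in Python, with illustrative function and variable names that do not come from SPSS), the rule can be written once and applied to the Shapiro-Wilk significance values reported above:

```python
ALPHA = 0.05  # significance level used throughout the lab

# Shapiro-Wilk significance values copied from the Tests of Normality tables.
shapiro_p = {
    "Age (full sample)": 0.203,
    "Age (male)": 0.196,
    "Age (female)": 0.832,
    "Avrg_visits": 0.033,
}

def normality_decision(p_value, alpha=ALPHA):
    """Return the conclusion drawn from a normality-test p-value."""
    if p_value > alpha:
        return "no evidence against normality"
    return "departs from normality"

decisions = {name: normality_decision(p) for name, p in shapiro_p.items()}
for name, decision in decisions.items():
    print(f"{name}: p = {shapiro_p[name]:.3f} -> {decision}")
```

A value above 0.05 means the test failed to reject normality; it does not prove the variable is normal, which matters with groups as small as 7 or 8 cases.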
5. Reverse score the values for self-esteem (so 5s are 1s and 1s are 5s, etc…); then
provide means and standard deviations by gender.

Descriptives
                Gender                                             Statistic   Std. Error
esteem_RECODE   Male     Mean                                      3.63        .420
                         95% Confidence Interval    Lower Bound    2.63
                         for Mean                   Upper Bound    4.62
                         5% Trimmed Mean                           3.64
                         Median                                    4.00
                         Variance                                  1.411
                         Std. Deviation                            1.188
                         Minimum                                   2
                         Maximum                                   5
                         Range                                     3
                         Interquartile Range                       3
                         Skewness                                  -.394       .752
                         Kurtosis                                  -1.229      1.481
                Female   Mean                                      2.14        .459
                         95% Confidence Interval    Lower Bound    1.02
                         for Mean                   Upper Bound    3.27
                         5% Trimmed Mean                           2.10
                         Median                                    2.00
                         Variance                                  1.476
                         Std. Deviation                            1.215
                         Minimum                                   1
                         Maximum                                   4
                         Range                                     3
                         Interquartile Range                       2
                         Skewness                                  .414        .794
                         Kurtosis                                  -1.525      1.587

After reverse scoring, males had a mean self-esteem score of 3.63 (standard deviation
1.188) and females had a mean of 2.14 (standard deviation 1.215).
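
Reverse scoring on a 1-to-5 scale amounts to replacing each value x with 6 - x. The sketch below shows that mapping in Python followed by a mean and standard deviation per group. The raw scores in it are hypothetical, because the report does not list the individual self-esteem responses, and the function name is illustrative.

```python
import statistics

SCALE_MIN, SCALE_MAX = 1, 5  # the lab's 1-to-5 self-esteem scale

def reverse_score(value, low=SCALE_MIN, high=SCALE_MAX):
    """Flip a rating on a low..high scale: 1<->5, 2<->4, 3 stays 3."""
    return low + high - value

# Hypothetical raw scores, NOT the lab data (the report does not list them).
raw = {"Male": [1, 2, 4, 3], "Female": [5, 4, 2, 5]}

summary = {}
for gender, scores in raw.items():
    recoded = [reverse_score(s) for s in scores]
    summary[gender] = (statistics.mean(recoded), statistics.stdev(recoded))
    print(gender, recoded, summary[gender])
```

Writing the rule as `low + high - value` keeps the midpoint (3) unchanged and works for any scale bounds, so no response is left without a recoded value.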
Output file for lab 1
SET TLook=None FOOTNOTE=ON AUTORECOVERY=ON Small=0.0001
SUMMARY=None THREADS=AUTO Printback=On SIGLESS=YES TFit=Both
DIGITGROUPING=No LEADZERO=No.
FREQUENCIES VARIABLES=Age
/STATISTICS=STDDEV VARIANCE RANGE MEAN MEDIAN MODE SKEWNESS
SESKEW KURTOSIS SEKURT
/HISTOGRAM NORMAL
/ORDER=ANALYSIS.

Frequencies

Notes
Output Created 17-MAY-2023 11:28:51
Comments
Input                    Active Dataset                   DataSet0
                         Filter                           <none>
                         Weight                           <none>
                         Split File                       <none>
                         N of Rows in Working Data File   15
Missing Value Handling   Definition of Missing            User-defined missing values are treated as missing.
                         Cases Used                       Statistics are based on all cases with valid data.
Syntax                   FREQUENCIES VARIABLES=Age
                           /STATISTICS=STDDEV VARIANCE RANGE MEAN MEDIAN MODE SKEWNESS SESKEW KURTOSIS SEKURT
                           /HISTOGRAM NORMAL
                           /ORDER=ANALYSIS.
Resources                Processor Time                   00:00:02.88
                         Elapsed Time                     00:00:01.77

Statistics
Age
N                         Valid      15
                          Missing    0
Mean                                 22.93
Median                               22.00
Mode                                 18(a)
Std. Deviation                       4.008
Variance                             16.067
Skewness                             .394
Std. Error of Skewness               .580
Kurtosis                             -1.240
Std. Error of Kurtosis               1.121
Range                                12
a. Multiple modes exist. The smallest value is shown
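
The skewness, kurtosis and their standard errors in this table can be recomputed from the 15 ages. The Python sketch below uses the usual bias-adjusted sample formulas; it is an assumption that these are the ones behind the SPSS figures, though the results agree with the table to three decimals. The ages are rebuilt from the Age frequency table, and the variable names are illustrative.

```python
import math

# Ages rebuilt from the Age frequency table (N = 15).
ages = [18, 18, 19, 19, 20, 21, 21, 22, 23, 24, 26, 27, 28, 28, 30]
n = len(ages)
mean = sum(ages) / n
s = math.sqrt(sum((x - mean) ** 2 for x in ages) / (n - 1))  # sample SD

# Bias-adjusted sample skewness and excess kurtosis.
sum_z3 = sum(((x - mean) / s) ** 3 for x in ages)
sum_z4 = sum(((x - mean) / s) ** 4 for x in ages)
skewness = n / ((n - 1) * (n - 2)) * sum_z3
kurtosis = (n * (n + 1) / ((n - 1) * (n - 2) * (n - 3)) * sum_z4
            - 3 * (n - 1) ** 2 / ((n - 2) * (n - 3)))

# The standard errors depend only on the sample size.
se_skew = math.sqrt(6 * n * (n - 1) / ((n - 2) * (n + 1) * (n + 3)))
se_kurt = 2 * se_skew * math.sqrt((n * n - 1) / ((n - 3) * (n + 5)))

print(f"skewness={skewness:.3f} (SE {se_skew:.3f})")
print(f"kurtosis={kurtosis:.3f} (SE {se_kurt:.3f})")
```

Both statistics are small relative to their standard errors (0.394 against 0.580, and -1.240 against 1.121), which fits the Shapiro-Wilk result of no significant departure from normality for age.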

Age
                Frequency   Percent   Valid Percent   Cumulative Percent
Valid   18      2           13.3      13.3            13.3
        19      2           13.3      13.3            26.7
        20      1           6.7       6.7             33.3
        21      2           13.3      13.3            46.7
        22      1           6.7       6.7             53.3
        23      1           6.7       6.7             60.0
        24      1           6.7       6.7             66.7
        26      1           6.7       6.7             73.3
        27      1           6.7       6.7             80.0
        28      2           13.3      13.3            93.3
        30      1           6.7       6.7             100.0
        Total   15          100.0     100.0
[Histogram of Age with normal curve. x-axis: Age; y-axis: Frequency. Mean = 22.93, Std. Dev. = 4.008, N = 15]

FREQUENCIES VARIABLES=Avrg_visits
/HISTOGRAM NORMAL
/ORDER=ANALYSIS.

Frequencies

Notes
Output Created 17-MAY-2023 11:58:33
Comments
Input                    Active Dataset                   DataSet0
                         Filter                           <none>
                         Weight                           <none>
                         Split File                       <none>
                         N of Rows in Working Data File   15
Missing Value Handling   Definition of Missing            User-defined missing values are treated as missing.
                         Cases Used                       Statistics are based on all cases with valid data.
Syntax                   FREQUENCIES VARIABLES=Avrg_visits
                           /HISTOGRAM NORMAL
                           /ORDER=ANALYSIS.
Resources                Processor Time                   00:00:00.61
                         Elapsed Time                     00:00:00.27

Statistics
Avrg_visits
N Valid 15
Missing 0

Avrg_visits
                Frequency   Percent   Valid Percent   Cumulative Percent
Valid   1       4           26.7      26.7            26.7
        2       2           13.3      13.3            40.0
        3       2           13.3      13.3            53.3
        4       4           26.7      26.7            80.0
        6       1           6.7       6.7             86.7
        8       1           6.7       6.7             93.3
        9       1           6.7       6.7             100.0
        Total   15          100.0     100.0
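
The Percent and Cumulative Percent columns, and the mean of 3.53 visits, follow directly from the frequency counts. A short Python sketch of that arithmetic is given here as an illustration; the variable names are illustrative and the counts are copied from the table above.

```python
# Frequency counts copied from the Avrg_visits table (value -> count).
visit_counts = {1: 4, 2: 2, 3: 2, 4: 4, 6: 1, 8: 1, 9: 1}
total = sum(visit_counts.values())

rows = []
running = 0
for value, count in sorted(visit_counts.items()):
    running += count                               # cases at or below this value
    percent = round(100 * count / total, 1)
    cumulative = round(100 * running / total, 1)
    rows.append((value, count, percent, cumulative))

# Mean as a frequency-weighted average of the observed values.
mean_visits = sum(v * c for v, c in visit_counts.items()) / total

for row in rows:
    print(row)
print(f"mean visits = {mean_visits:.2f}")
```

Because no cases are missing, Percent and Valid Percent are identical in this table.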
[Histogram of Avrg_visits with normal curve. x-axis: Avrg_visits; y-axis: Frequency. Mean = 3.53, Std. Dev. = 2.503, N = 15]

EXAMINE VARIABLES=Avrg_visits
/PLOT BOXPLOT STEMLEAF
/COMPARE GROUPS
/STATISTICS NONE
/CINTERVAL 95
/MISSING LISTWISE
/NOTOTAL.

Explore

Notes
Output Created 17-MAY-2023 12:08:34
Comments
Input                    Active Dataset                   DataSet0
                         Filter                           <none>
                         Weight                           <none>
                         Split File                       <none>
                         N of Rows in Working Data File   15
Missing Value Handling   Definition of Missing            User-defined missing values for dependent variables are treated as missing.
                         Cases Used                       Statistics are based on cases with no missing values for any dependent variable or factor used.
Syntax                   EXAMINE VARIABLES=Avrg_visits
                           /PLOT BOXPLOT STEMLEAF
                           /COMPARE GROUPS
                           /STATISTICS NONE
                           /CINTERVAL 95
                           /MISSING LISTWISE
                           /NOTOTAL.
Resources                Processor Time                   00:00:00.17
                         Elapsed Time                     00:00:00.19

Case Processing Summary
                                  Cases
                Valid             Missing           Total
                N    Percent      N    Percent      N    Percent
Avrg_visits     15   100.0%       0    0.0%         15   100.0%

Avrg_visits

Avrg_visits Stem-and-Leaf Plot

Frequency Stem & Leaf

4.00 1 . 0000
2.00 2 . 00
2.00 3 . 00
4.00 4 . 0000
.00 5 .
1.00 6 . 0
2.00 Extremes (>=8.0)

Stem width: 1
Each leaf: 1 case(s)

[Box plot of Avrg_visits]

EXAMINE VARIABLES=Age
/PLOT NPPLOT
/STATISTICS DESCRIPTIVES
/CINTERVAL 95
/MISSING LISTWISE
/NOTOTAL.

Explore
Notes
Output Created 17-MAY-2023 12:46:55
Comments
Input                    Active Dataset                   DataSet0
                         Filter                           <none>
                         Weight                           <none>
                         Split File                       <none>
                         N of Rows in Working Data File   15
Missing Value Handling   Definition of Missing            User-defined missing values for dependent variables are treated as missing.
                         Cases Used                       Statistics are based on cases with no missing values for any dependent variable or factor used.
Syntax                   EXAMINE VARIABLES=Age
                           /PLOT NPPLOT
                           /STATISTICS DESCRIPTIVES
                           /CINTERVAL 95
                           /MISSING LISTWISE
                           /NOTOTAL.
Resources                Processor Time                   00:00:00.22
                         Elapsed Time                     00:00:00.35

Case Processing Summary


Cases
Valid Missing Total
N Percent N Percent N Percent
Age 15 100.0% 0 0.0% 15 100.0%

Descriptives
                                                  Statistic   Std. Error
Age   Mean                                        22.93       1.035
      95% Confidence Interval    Lower Bound      20.71
      for Mean                   Upper Bound      25.15
      5% Trimmed Mean                             22.81
      Median                                      22.00
      Variance                                    16.067
      Std. Deviation                              4.008
      Minimum                                     18
      Maximum                                     30
      Range                                       12
      Interquartile Range                         8
      Skewness                                    .394        .580
      Kurtosis                                    -1.240      1.121
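
The standard error and the 95% confidence interval in this table follow from the mean, the standard deviation and the sample size. The Python sketch below illustrates that arithmetic; the critical value 2.145 is a t-table value for 14 degrees of freedom that is hard-coded here as an assumption rather than taken from the SPSS output, and the variable names are illustrative.

```python
import math

n = 15              # valid cases
mean_age = 22.93    # Mean from the Descriptives table
std_dev = 4.008     # Std. Deviation from the Descriptives table
t_crit = 2.145      # assumed: two-tailed 95% critical t for df = n - 1 = 14

std_error = std_dev / math.sqrt(n)   # standard error of the mean
margin = t_crit * std_error
ci_lower = mean_age - margin
ci_upper = mean_age + margin

print(f"SE = {std_error:.3f}")
print(f"95% CI = ({ci_lower:.2f}, {ci_upper:.2f})")
```

The interval is built with the t distribution rather than the normal distribution because the population standard deviation is unknown and the sample is small.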

Tests of Normality
        Kolmogorov-Smirnov(a)           Shapiro-Wilk
        Statistic   df   Sig.           Statistic   df   Sig.
Age     .152        15   .200*          .921        15   .203
*. This is a lower bound of the true significance.
a. Lilliefors Significance Correction

Age
[Normal Q-Q Plot of Age. x-axis: Observed Value; y-axis: Expected Normal]
[Detrended Normal Q-Q Plot of Age. x-axis: Observed Value; y-axis: Dev from Normal]

EXAMINE VARIABLES=Age BY Gender
/PLOT NPPLOT
/STATISTICS NONE
/CINTERVAL 95
/MISSING LISTWISE
/NOTOTAL.

Explore

Notes
Output Created 17-MAY-2023 13:01:05
Comments
Input                    Active Dataset                   DataSet0
                         Filter                           <none>
                         Weight                           <none>
                         Split File                       <none>
                         N of Rows in Working Data File   15
Missing Value Handling   Definition of Missing            User-defined missing values for dependent variables are treated as missing.
                         Cases Used                       Statistics are based on cases with no missing values for any dependent variable or factor used.
Syntax                   EXAMINE VARIABLES=Age BY Gender
                           /PLOT NPPLOT
                           /STATISTICS NONE
                           /CINTERVAL 95
                           /MISSING LISTWISE
                           /NOTOTAL.
Resources                Processor Time                   00:00:00.52
                         Elapsed Time                     00:00:00.58

Gender

Case Processing Summary


Cases
Valid Missing Total
Gender N Percent N Percent N Percent
Age Male 8 100.0% 0 0.0% 8 100.0%
Female 7 100.0% 0 0.0% 7 100.0%

Tests of Normality
                 Kolmogorov-Smirnov(a)           Shapiro-Wilk
      Gender     Statistic   df   Sig.           Statistic   df   Sig.
Age   Male       .199        8    .200*          .882        8    .196
      Female     .135        7    .200*          .962        7    .832
*. This is a lower bound of the true significance.
a. Lilliefors Significance Correction

Age

Normal Q-Q Plots
[Normal Q-Q Plot of Age for Gender = Male. x-axis: Observed Value; y-axis: Expected Normal]
[Normal Q-Q Plot of Age for Gender = Female. x-axis: Observed Value; y-axis: Expected Normal]

Detrended Normal Q-Q Plots
[Detrended Normal Q-Q Plot of Age for Gender = Male. x-axis: Observed Value; y-axis: Dev from Normal]
[Detrended Normal Q-Q Plot of Age for Gender = Female. x-axis: Observed Value; y-axis: Dev from Normal]

EXAMINE VARIABLES=Avrg_visits
/PLOT NPPLOT
/STATISTICS DESCRIPTIVES
/CINTERVAL 95
/MISSING LISTWISE
/NOTOTAL.

Explore

Notes
Output Created 17-MAY-2023 13:11:56
Comments
Input                    Active Dataset                   DataSet0
                         Filter                           <none>
                         Weight                           <none>
                         Split File                       <none>
                         N of Rows in Working Data File   15
Missing Value Handling   Definition of Missing            User-defined missing values for dependent variables are treated as missing.
                         Cases Used                       Statistics are based on cases with no missing values for any dependent variable or factor used.
Syntax                   EXAMINE VARIABLES=Avrg_visits
                           /PLOT NPPLOT
                           /STATISTICS DESCRIPTIVES
                           /CINTERVAL 95
                           /MISSING LISTWISE
                           /NOTOTAL.
Resources                Processor Time                   00:00:00.30
                         Elapsed Time                     00:00:00.32

Case Processing Summary
                                  Cases
                Valid             Missing           Total
                N    Percent      N    Percent      N    Percent
Avrg_visits     15   100.0%       0    0.0%         15   100.0%

Descriptives
                                                          Statistic   Std. Error
Avrg_visits   Mean                                        3.53        .646
              95% Confidence Interval    Lower Bound      2.15
              for Mean                   Upper Bound      4.92
              5% Trimmed Mean                             3.37
              Median                                      3.00
              Variance                                    6.267
              Std. Deviation                              2.503
              Minimum                                     1
              Maximum                                     9
              Range                                       8
              Interquartile Range                         3
              Skewness                                    1.026       .580
              Kurtosis                                    .440        1.121

Tests of Normality
                Kolmogorov-Smirnov(a)           Shapiro-Wilk
                Statistic   df   Sig.           Statistic   df   Sig.
Avrg_visits     .226        15   .038           .870        15   .033
a. Lilliefors Significance Correction

Avrg_visits
[Normal Q-Q Plot of Avrg_visits. x-axis: Observed Value; y-axis: Expected Normal]
[Detrended Normal Q-Q Plot of Avrg_visits. x-axis: Observed Value; y-axis: Dev from Normal]

RECODE Esteem (1=5) (2=4) (3=3) (4=2) (5=1) INTO esteem_RECODE.
EXECUTE.
EXAMINE VARIABLES=esteem_RECODE BY Gender
/PLOT NONE
/STATISTICS DESCRIPTIVES
/CINTERVAL 95
/MISSING LISTWISE
/NOTOTAL.

Explore

Notes
Output Created 17-MAY-2023 14:03:38
Comments
Input                    Active Dataset                   DataSet0
                         Filter                           <none>
                         Weight                           <none>
                         Split File                       <none>
                         N of Rows in Working Data File   15
Missing Value Handling   Definition of Missing            User-defined missing values for dependent variables are treated as missing.
                         Cases Used                       Statistics are based on cases with no missing values for any dependent variable or factor used.
Syntax                   EXAMINE VARIABLES=esteem_RECODE BY Gender
                           /PLOT NONE
                           /STATISTICS DESCRIPTIVES
                           /CINTERVAL 95
                           /MISSING LISTWISE
                           /NOTOTAL.
Resources                Processor Time                   00:00:00.00
                         Elapsed Time                     00:00:00.00

Gender

Case Processing Summary
                                           Cases
                           Valid             Missing           Total
                Gender     N    Percent      N    Percent      N    Percent
esteem_RECODE   Male       8    100.0%       0    0.0%         8    100.0%
                Female     7    100.0%       0    0.0%         7    100.0%

Descriptives
                Gender                                             Statistic   Std. Error
esteem_RECODE   Male     Mean                                      3.63        .420
                         95% Confidence Interval    Lower Bound    2.63
                         for Mean                   Upper Bound    4.62
                         5% Trimmed Mean                           3.64
                         Median                                    4.00
                         Variance                                  1.411
                         Std. Deviation                            1.188
                         Minimum                                   2
                         Maximum                                   5
                         Range                                     3
                         Interquartile Range                       3
                         Skewness                                  -.394       .752
                         Kurtosis                                  -1.229      1.481
                Female   Mean                                      2.14        .459
                         95% Confidence Interval    Lower Bound    1.02
                         for Mean                   Upper Bound    3.27
                         5% Trimmed Mean                           2.10
                         Median                                    2.00
                         Variance                                  1.476
                         Std. Deviation                            1.215
                         Minimum                                   1
                         Maximum                                   4
                         Range                                     3
                         Interquartile Range                       2
                         Skewness                                  .414        .794
                         Kurtosis                                  -1.525      1.587
