Lecture 1 - Introduction To Statistics
Lecture 1 - Introduction To Statistics
INTRODUCTION TO STATISTICS
www.unimy.edu.my
• Menuju masa depan (The Future)
Contents
▪ 5 Steps in Statistics : Collect, Organize, Present, Analyze, Interpret
▪ Types of Statistics: Descriptive Statistics vs. Inferential Statistics
▪ Types of Study: Observational Study vs. Experimental Study
▪ Population vs. Sample
▪ Parameter vs. Statistic
▪ Sources of Data/Data Collection – Survey, Interview, Census, Experimental
▪ Sampling Methods – Sampling with Probability vs. Sampling with non-Probability
▪ Types of Variables: Quantitative (Continuous/Discrete), Qualitative (Categorical)
▪ Measurement Scales: Nominal, Ordinal, Interval, Ratio
Statistics lets you know about the factors, risks, effects associated with making decision and allows
you to understand and reduce the variation (error) in the decision-making process.
Statistics provides you with methods for making better sense of the numbers used every
day to describe or analyze the world we live in.
What is Statistics?
The science of collecting, organizing, presenting, analyzing, and interpreting data to assist in making
more effective decisions
• Collect
• Organize
• Present
• Analyze
• Interpret
Types of Statistics
DESCRIPTIVE STATISTICS - consists of procedures used to summarize and describe the important
characteristics of a set of measurements.
Example:
• Tabulating
• Graphing
• Numerical Measures
• Correlation
Example:
• Sampling distribution
• Hypothesis testing
• Confidence interval
• Linear Regression (Statistical Model)
Basic Vocabulary of Statistics
DATA- Data are the different values associated with a variable (or collection of variables)
PRIMARY DATA are collected specifically for the analysis desired
SECONDARY DATA have already been compiled and are available for statistical analysis
VARIABLE- A variable is a characteristic that changes or varies over time and/or for different
individuals or objects under consideration.
A CONSTANT has a fixed numerical value.
EXPERIMENTAL UNIT -An experimental unit is the individual or object on which a variable is
measured. A single measurement or data value results when a variable is actually measured
on an experimental unit.
DISCRETE- A discrete variable can assume only a finite or countable number of values.
CONTINUOUS-A continuous variable can assume the infinitely many values corresponding to the
points on a line interval.
11
Suppose we want to know the average height of
male students in UNIMY.
What is the data, variable, experimental unit,
Example of population, sample, parameter, statistic?
a study Data-Male student in UNIMY
Variable-height
EU-Male student
Population-All male students
Sample-100 male students
Parameter-Average height all male students
Statistics-Average height of 100 male students
13
Source of •Surveys through questionnaires
Data/Data •Interviews.
•Census
15
Types of Study
The researcher do not Example:
apply any treatment
A study to see if
to the sample under
1. Observational customers who enter
study – can only
a clothing store pay
Study observe the behavior
cash or using credit
of the
card for their goods.
objects/subjects.
(i) with replacement: a member of the population may be chosen more than once (picking the
candy from the bowl)
(ii) without replacement: a member of the population may be chosen only once (lottery ticket)
Sampling methods
Sampling methods can be:
(i)random (each member of the population has an equal chance of being selected)
(ii) stratified sample - divide the population into groups called strata and then take a sample from
each stratum. Each group have similar characteristics.
(iii) cluster sample - divide the population into groups called strata and then take a sample from each
stratum randomly. Each group have different characteristics
(iv) systematic sample- randomly select a starting point and take every n-th piece of data from a listing
of the population.
22
Probability
Sampling
23
Non-
probability
sampling
24
Qualitative Data
Qualitative data are generally described by words or letters. They are not as widely used a
quantitative data because many numerical techniques do not apply to the qualitative data. For
example, it does not make sense to find an average hair color or blood type.
Ordinal (if it takes the form of a word with more than two options (education - primary school,
secondary school and university).
Quantitative Data
Quantitative data are always numbers and are the result of counting or measuring attributes of a
population.
discrete (if it is the result of counting (the number of students of a given ethnic group in a class, the
number of books on a shelf, ...)
Ordinal – contain more information. Consists of distinct categories in which order is implied. Values
in one category are larger or smaller than values in other categories (e.g. rating-excelent, good, fair,
poor)
Ratio – consists of numerical measurements where the distance between numbers is of a known,
constant size, in addition, there is a nonarbitrary zero point.
28
Nominal