Lear NEOnlne
Standard Deviation
About Standard Deviations
The Standard Deviation is the positive square root of the variance. One of the most basic
approaches of Statistical analysis is the Standard Deviation. The Standard Deviation,
abbreviated as SD and represented by the letter ", indicates how far a value has varied from
the mean value. A low Standard Deviation indicates that the values are close to the mean,
whereas a large Standard Deviation indicates that the values are significantly different from the
mean. Let's look at how to determine the Standard Deviation of grouped and ungrouped data,
as well as the random variable's Standard Deviation.
What is Standard Deviation?
In descriptive Statistics, the Standard Deviation is the degree of dispersion or scatter of data
points relative to the mean. It is a measure of the data points’ Deviation from the mean and
describes how the values are distributed over the data sample. The Standard Deviation of a
sample, Statistical population, random variable, data collection, or probability distribution is the
square root of the variance.
When we have a certain amount of observations and they are all different, the value's mean
Deviation from the mean is then calculated.
On the other hand, the sum of squares of deviations from the mean does not appear to be a
reliable measure of dispersion. When the average of the squared differences from the mean is
low, the observations are close to the mean. This is a less dispersed level of dispersion. If this
number is large, it implies that the observations are dispersed from the mean to a greater
extent.
Standard deviation is the measurement of the dispersion of the data set from its mean value. It
is always measured in arithmetic value. Standard deviation is always positive and is denoted by
o (sigma). Standard Deviation is very accurate and is preferred from other measures of
dispersion.Lear NEOnlne
The Standard Deviation is calculated as The square root of variance by determining each data
point's deviation relative to the arithmetic mean. In case the data-points are far from the mean,
it denotes a higher deviation within the set of data. Hence, it indicates more spread out the
data, the higher is the standard deviation. The formula to calculate Standard Deviation is:
where:
x(i) = value of the i'th point in the set of data
x(bar) = the mean-value of the set of data
n= the number of data-points in the set of data
(Image to be added soon)
Properties of Standard Deviation
* Standard Deviation is only used in measuring dispersion or spread around the mean value
of the data set.
+ Standard deviation is always in positive value
+ It determines the dispersion or variation that exists from the average value.
+ Standard deviation is a very sensitive outlier. Any single outlier can distort the picture of
dispersion.
+ For the data set with an approximately same mean value, the greater the dispersion or
spread, the greater the Standard deviation.
* Standard deviation is zero when the values of a particular data set are the same.
* While analyzing the normally distrinuted data, the Standard Deviation is used in conjunction
along with the mean to calculate the data intervals.
If & = mean, S = Standard Deviation, and x = Value in the Data set, then
£Lear NEOnlne
around 68% of the Data is in the interval:- -S?/N)-(> /NP xc
D
step-deviation of Observations relative to an Assumed mean. It is calculated as D'=
(X-AYIC
C= Common Factor chosen.
Sigma for Discrete Series
There are two ways to calculate Standard Deviation in discrete series, theses are:
Direct Method
We know that in the discrete series, another frequency column is added: the direct method
formula to calculate SD is:
Standard deviation (a) = v(5fD?)/N)
Short-Cut Method
Standard deviation (0) = \( fD?/N)-(S) fD/Ny
Sigma for Frequency Distribution
Three different methods can be used to calculate standard deviation in frequency distribution
series; these methods are:
Direct Method
The direct method employed to derive standard deviation in a frequency distribution is very
similar to the discrete series done above. The value of observation (when used) in the
frequency distribution is the only difference between the two series. Here, the mid-value of the
class is determined by dividing the sum' of the upper value of the class and the lower value of
the class. The value thus derived is used for calculation. The formula is;
Standard Deviation (0) = \(5102//N)
In the calculation, D = Deviation of an item that is relative to mean value and is calculated as,Lear NEOnlne
D = Xi-Mean
F = frequencies corresponding to the Observations
N = The summation of the frequency.
Step-Deviation Method
The step-deviation method is the shortcut method to determine the Standard Deviation. The
formula is
Standard Deviation (a) = \(3> fD"2/N)-(S> fD'/N)? x c
In the above calculation, D'= Step-Deviation of the observations relative to the assumed value.
It is calculated as- D'= (Xi-A)/C
N= The Summation of Frequency.
= Common Factor chosen
Did You Know?
Without Standard Deviation D, one can't compare two sets of data effectively. Suppose there
are two data sets having the same average. Does that imply that the sets of data are exactly
the same? No. For ex. the data sets - 199, 200, 201, and other 0, 200, 400 have the same 200
average, yet they have different standard deviations. Here, the first data has a small standard
deviation (s=1) in comparison to the second set of data (s=200).UeamUVEOnIne
Hi Ayush,
Best courses for you
Full syllabus LIVE Crash course One-to-one LIVE
courses classes
ing from 267,000 Starting from €13,000 ting rom €4,000/hr
Books
Starting from £2,489
Show more coursesLear NEOnlne
Book your Free Demo session
Get a flavour of LIVE classes here at Vedantu
Vedantu Improvement Promise
We promise improvement in marks
or get your fees back. T&C Apply"