尊敬的 微信汇率:1円 ≈ 0.046166 元 支付宝汇率:1円 ≈ 0.046257元 [退出登录]
SlideShare a Scribd company logo
DATA SUMMERISATION
Dr Vincent Yusuph
Ecohas Kibaha 2023
OBJECTIVES
At the end of this session you should be able to:
 Explain data summarization
 Explain the characteristics, uses, advantages, and
disadvantages of each measure of location.
 Calculate mode, mean and median
 Compute and interpret variance, and the standard
deviation
 Identify the position of the arithmetic mean, median, and mode
for both a symmetrical and a skewed distribution.
 Explain the characteristics, uses, advantages, and disadvantages
of this measure of dispersion
4.3
Data summarisation
 Measures of Central Location
 Mean, Median, Mode
 Measures of Variability/spread
 Range, Standard Deviation, Variance, Coefficient
of Variation
 Measures of Relative Standing
 Percentiles, Quartiles
MEASURES OF CENTRAL
TENDENCY/LOCATION
 Often we need to summarise frequency
distributions in a few numbers for ease
of reporting or comparison
 Recall: with qualitative data, useful
summary statistics include ratio,
proportion, rate
Measures of central tendency/location
 The statistical methods used to measure
central tendency include the following
1. Mean
2. Median
3. Mode
MEAN
 Refers to arithmetic mean
 It is obtained by adding the individual observations divided
by the total number of observations.
 Advantages – it is easy to calculate. most useful of all the
averages.
 Disadvantages – influenced by abnormal values.
 Examples: In this case it will be (8 + 16 + 15 + 17 + 18 + 20
+ 25)/7 which comes to 17
Characteristics of the Mean
It is calculated by
summing the values
and dividing by the
number of values.
It requires the interval scale.
All values are used.
It is unique.
The sum of the deviations from the mean is 0.
The Arithmetic Mean
is the most widely used
measure of location and
shows the central value of
the data.
The major characteristics of the mean are:
Average
Joe
3- 7
Population Mean
N
X



where
µ is the population mean
N is the total number of observations.
X is a particular value.
 indicates the operation of adding.
For ungrouped data, the
Population Mean is the
sum of all the population
values divided by the total
number of population
values:
3- 8
Example 1
500
,
48
4
000
,
73
...
000
,
56






N
X

Find the mean mileage for the cars.
A Parameter is a measurable characteristic of a
population.
The Musenge
family owns
four cars.
The following
is the current
mileage on
each of the
four cars.
56,000
23,000
42,000
73,000
3- 9
Example 2
4
.
15
5
77
5
0
.
15
...
0
.
14







n
X
X
A statistic is a measurable characteristic of a sampl
A sample of
five
executives
received the
following
bonus last
year ($000):
14.0,
15.0,
17.0,
16.0,
15.0
3- 10
Statistics is a pattern language
Population Sample
Size N n
Mean
Variance
Standard
Deviation
Properties of the Arithmetic Mean
Every set of interval-level and ratio-level data has a
mean.
All the values are included in computing the mean.
A set of data has a unique mean.
The mean is affected by unusually large or small
data values.
The arithmetic mean is the only measure of location
where the sum of the deviations of each value from
the mean is zero.
Properties of the Arithmetic Mean
3- 12
MEDIAN
 When all the observation are arranged either in ascending
order or descending order, the middle observation is known
as median.
 In case of even number the average of the two middle values
is taken.
 Median is better indicator of central value as it is not affected
by the extreme values
 Example : The median of 4, 1, and 7 is 4 because when the
numbers are put in order (1 , 4, 7) , the number 4 is in the
middle.
The Median
There are as many
values above the
median as below it in
the data array.
For an even set of values, the median will be the
arithmetic average of the two middle numbers and is
found at the (n+1)/2 ranked observation.
The Median is the
midpoint of the values
after they have been
ordered from the smallest
to the largest.
3- 14
The ages for a sample of five BSc.HLS.III
students are: 21, 25, 19, 20, 22.
Arranging the data
in ascending order
gives:
19, 20, 21, 22, 25.
Thus the median is
21.
The median (continued)
3- 15
Example 5
Arranging the data in
ascending order gives:
73, 75, 76, 80
Thus the median is 75.5.
The heights of four basketball players, in inches,
are: 76, 73, 80, 75.
The median is found
at the (n+1)/2 =
(4+1)/2 =2.5th data
point.
3- 16
Properties of the Median
There is a unique median for each data set.
It is not affected by extremely large or small
values and is therefore a valuable measure of
location when such values occur.
It can be computed for ratio-level, interval-
level, and ordinal-level data.
It can be computed for an open-ended
frequency distribution if the median does not
lie in an open-ended class.
Properties of the Median
3- 17
MODE
 Most frequently occurring observation in a data is called
mode
 Not often used in medical statistics.
 EXAMPLE
 Number of decayed teeth in 10 children
 2,2,4,1,3,0,10,2,3,8
 Mean = 34 / 10 = 3.4
 Median = (0,1,2,2,2,3,3,4,8,10) = 2+3 /2
= 2.5
 Mode = 2 ( 3 Times)
Symmetric distribution: A distribution having the
same shape on either side of the center
Skewed distribution: One whose shapes on either
side of the center differ; a nonsymmetrical distribution.
Can be positively or negatively skewed, or bimodal
The Relative Positions of the Mean, Median, and Mode
3- 19
RELATIONSHIP BETWEEN
MEAN, MEDIAN, MODE
The Relative Positions of the Mean, Median, and Mode:
Symmetric Distribution
Zero skewness Mean
=Median
=Mode
Mode
Median
Mean
3- 21
The Relative Positions of the Mean, Median, and Mode:
Right Skewed Distribution
 Positively skewed: Mean and median are to the right of the
mode.
Mean>Median>Mode
Mode
Median
Mean
3- 22
Negatively Skewed: Mean and Median are to the left of the Mode.
Mean<Median<Mode
The Relative Positions of the Mean, Median, and
Mode: Left Skewed Distribution
Mode
Mean
Median
3- 23
CHOICE OF APPROPRIATE
MEASURE
 For symmetric distributions, mean is
preferred to median or mode:
 utilises all values
 mathematical niceties
 For asymmetric distributions, mean not
suitable:
 mean is sensitive to extreme values
 median more preferred since it is not
affected by extreme values
• Measures of central location fail to tell the
whole story about the distribution; that is,
how much are the observations spread out
around the mean value?
Measures of spread…
Measures of Variability…
For example, two sets of class
grades are shown. The mean
(=50) is the same in each case…
But, the red class has greater
variability than the blue class.
Dispersion
refers to the
spread or
variability in
the data.
Measures of dispersion include the following: range,
mean deviation, variance, and standard
deviation.
Range = Largest value – Smallest
value
Measures of Dispersion
0
5
10
15
20
25
30
0 2 4 6 8 10 12
3- 27
The following represents the current year’s Return
on Equity of the 25 companies in an investor’s
portfolio.
-8.1 3.2 5.9 8.1 12.3
-5.1 4.1 6.3 9.2 13.3
-3.1 4.6 7.9 9.5 14.0
-1.4 4.8 7.9 9.7 15.0
1.2 5.7 8.0 10.3 22.1
Example 9
Highest value: 22.1 Lowest value: -8.1
Range = Highest value – lowest value
= 22.1-(-8.1)
= 30.2
3- 28
Range…
 Its major advantage is the ease with which it can be
computed.
 Its major shortcoming is its failure to provide
information on the dispersion of the observations
between the two end points.
 Hence we need a measure of variability that
incorporates all the data and not just two
observations. Hence…
Variance: the
arithmetic mean
of the squared
deviations from
the mean.
Standard deviation: The
square root of the variance.
Variance and standard Deviation
3- 30
Not influenced by extreme values.
The units are awkward, the square of the
original units.
All values are used in the calculation.
The major characteristics of the
Population Variance are:
Population Variance
3- 31
Population Variance formula:
 (X - )2
N

=
X is the value of an observation in the
population
m is the arithmetic mean of the population
N is the number of observations in the
population

Population Standard Deviation formula:
2

Variance and standard deviation
3- 32
(-8.1-6.62)2 + (-5.1-6.62)2 + ... + (22.1-6.62)2
25




= 42.227
= 6.498
In Example 9, the variance and standard deviation are:
 (X - )2
N

=
Example 9 continued
3- 33
Sample variance (s2)
s2 =
(X - X)2
n-1
Sample standard deviation (s)
2
s
s 
Sample variance and standard deviation
3- 34
40
.
7
5
37




n
X
X
     
30
.
5
1
5
2
.
21
1
5
4
.
7
6
...
4
.
7
7
1
2
2
2
2













n
X
X
s
Example 11
The hourly wages earned by a sample of five students
are:
$7, $5, $11, $8, $6.
Find the sample variance and standard deviation.
30
.
2
30
.
5
2


 s
s
3- 35
Empirical Rule: For any symmetrical, bell-
shaped distribution:
About 68% of the observations will lie within 1s
the mean
About 95% of the observations will lie within 2s
of the mean
Virtually all the observations will be within 3s of
the mean
Interpretation and Uses of the
Standard Deviation
3- 36
4.37
The Empirical Rule…
 Approximately 68% of all observations fall
 within one standard deviation of the mean.

 Approximately 95% of all observations fall
 within two standard deviations of the mean.
 Approximately 99.7% of all observations fall
 within three standard deviations of the mean.
Bell-Shaped Curve showing the relationship between and .
 
3  1  1  3
68%
95%
99.7%
Interpretation and Uses of the Standard Deviation
3- 38
Interpreting the standard deviation
 The greater the variation in the data the
greater the standard deviation
 If all the values are the same the standard
deviation is zero
 For a symmetrical distribution almost all the
data will be contained within three standard
deviations
Coefficient of Variation…
 The coefficient of variation of a set of observations
is the standard deviation of the observations divided
by their mean,
 that is:
 Population coefficient of variation = CV =
 Sample coefficient of variation = cv =
4.41
Coefficient of Variation…
 This coefficient provides a
 proportionate measure of variation, e.g.
 A standard deviation of 10 may be perceived
as large when the mean value is 100, but only
moderately large when the mean value is 500.
4.42
Measures of Variability…
 If data are symmetric, with no serious outliers,
use range and standard deviation.
 If comparing variation across two data sets,
use coefficient of variation.
 The measures of variability introduced in this
section can be used only for interval data.
RECAP QUESTIONS

More Related Content

Similar to 5.DATA SUMMERISATION.ppt

Empirics of standard deviation
Empirics of standard deviationEmpirics of standard deviation
Empirics of standard deviation
Adebanji Ayeni
 
Newbold_chap03.ppt
Newbold_chap03.pptNewbold_chap03.ppt
Newbold_chap03.ppt
cfisicaster
 
QT1 - 03 - Measures of Central Tendency
QT1 - 03 - Measures of Central TendencyQT1 - 03 - Measures of Central Tendency
QT1 - 03 - Measures of Central Tendency
Prithwis Mukerjee
 
QT1 - 03 - Measures of Central Tendency
QT1 - 03 - Measures of Central TendencyQT1 - 03 - Measures of Central Tendency
QT1 - 03 - Measures of Central Tendency
Prithwis Mukerjee
 
LESSON-8-ANALYSIS-INTERPRETATION-AND-USE-OF-TEST-DATA.pptx
LESSON-8-ANALYSIS-INTERPRETATION-AND-USE-OF-TEST-DATA.pptxLESSON-8-ANALYSIS-INTERPRETATION-AND-USE-OF-TEST-DATA.pptx
LESSON-8-ANALYSIS-INTERPRETATION-AND-USE-OF-TEST-DATA.pptx
MarjoriAnneDelosReye
 
Descriptive statistics
Descriptive statisticsDescriptive statistics
Descriptive statistics
Murugesan Kandan
 
Stat11t chapter3
Stat11t chapter3Stat11t chapter3
Stat11t chapter3
raylenepotter
 
polar pojhjgfnbhggnbh hnhghgnhbhnhbjnhhhhhh
polar pojhjgfnbhggnbh hnhghgnhbhnhbjnhhhhhhpolar pojhjgfnbhggnbh hnhghgnhbhnhbjnhhhhhh
polar pojhjgfnbhggnbh hnhghgnhbhnhbjnhhhhhh
NathanAndreiBoongali
 
Data Description-Numerical Measure-Chap003 2 2.ppt
Data Description-Numerical Measure-Chap003 2 2.pptData Description-Numerical Measure-Chap003 2 2.ppt
Data Description-Numerical Measure-Chap003 2 2.ppt
ArkoKesha
 
Ppt central tendency measures
Ppt central tendency measuresPpt central tendency measures
Ppt central tendency measures
MtMt37
 
Topic 8a Basic Statistics
Topic 8a Basic StatisticsTopic 8a Basic Statistics
Topic 8a Basic Statistics
Yee Bee Choo
 
Statistics
StatisticsStatistics
Statistics
Deepanshu Sharma
 
Chapter 11 Psrm
Chapter 11 PsrmChapter 11 Psrm
Chapter 11 Psrm
mandrewmartin
 
Statistics.pdf
Statistics.pdfStatistics.pdf
Statistics.pdf
Shruti Nigam (CWM, AFP)
 
03 ch ken black solution
03 ch ken black solution03 ch ken black solution
03 ch ken black solution
Krunal Shah
 
Statistics 3, 4
Statistics 3, 4Statistics 3, 4
Statistics 3, 4
Diana Diana
 
Measures of dispersion
Measures  of  dispersionMeasures  of  dispersion
Measures of dispersion
Southern Range, Berhampur, Odisha
 
UNIT III -Measures of Central Tendency 2.ppt
UNIT III -Measures of Central Tendency 2.pptUNIT III -Measures of Central Tendency 2.ppt
UNIT III -Measures of Central Tendency 2.ppt
HaiderAli252366
 
UNIT III -Measures of Central Tendency 2.ppt
UNIT III -Measures of Central Tendency 2.pptUNIT III -Measures of Central Tendency 2.ppt
UNIT III -Measures of Central Tendency 2.ppt
EdwinDagunot4
 
Class1.ppt
Class1.pptClass1.ppt
Class1.ppt
Gautam G
 

Similar to 5.DATA SUMMERISATION.ppt (20)

Empirics of standard deviation
Empirics of standard deviationEmpirics of standard deviation
Empirics of standard deviation
 
Newbold_chap03.ppt
Newbold_chap03.pptNewbold_chap03.ppt
Newbold_chap03.ppt
 
QT1 - 03 - Measures of Central Tendency
QT1 - 03 - Measures of Central TendencyQT1 - 03 - Measures of Central Tendency
QT1 - 03 - Measures of Central Tendency
 
QT1 - 03 - Measures of Central Tendency
QT1 - 03 - Measures of Central TendencyQT1 - 03 - Measures of Central Tendency
QT1 - 03 - Measures of Central Tendency
 
LESSON-8-ANALYSIS-INTERPRETATION-AND-USE-OF-TEST-DATA.pptx
LESSON-8-ANALYSIS-INTERPRETATION-AND-USE-OF-TEST-DATA.pptxLESSON-8-ANALYSIS-INTERPRETATION-AND-USE-OF-TEST-DATA.pptx
LESSON-8-ANALYSIS-INTERPRETATION-AND-USE-OF-TEST-DATA.pptx
 
Descriptive statistics
Descriptive statisticsDescriptive statistics
Descriptive statistics
 
Stat11t chapter3
Stat11t chapter3Stat11t chapter3
Stat11t chapter3
 
polar pojhjgfnbhggnbh hnhghgnhbhnhbjnhhhhhh
polar pojhjgfnbhggnbh hnhghgnhbhnhbjnhhhhhhpolar pojhjgfnbhggnbh hnhghgnhbhnhbjnhhhhhh
polar pojhjgfnbhggnbh hnhghgnhbhnhbjnhhhhhh
 
Data Description-Numerical Measure-Chap003 2 2.ppt
Data Description-Numerical Measure-Chap003 2 2.pptData Description-Numerical Measure-Chap003 2 2.ppt
Data Description-Numerical Measure-Chap003 2 2.ppt
 
Ppt central tendency measures
Ppt central tendency measuresPpt central tendency measures
Ppt central tendency measures
 
Topic 8a Basic Statistics
Topic 8a Basic StatisticsTopic 8a Basic Statistics
Topic 8a Basic Statistics
 
Statistics
StatisticsStatistics
Statistics
 
Chapter 11 Psrm
Chapter 11 PsrmChapter 11 Psrm
Chapter 11 Psrm
 
Statistics.pdf
Statistics.pdfStatistics.pdf
Statistics.pdf
 
03 ch ken black solution
03 ch ken black solution03 ch ken black solution
03 ch ken black solution
 
Statistics 3, 4
Statistics 3, 4Statistics 3, 4
Statistics 3, 4
 
Measures of dispersion
Measures  of  dispersionMeasures  of  dispersion
Measures of dispersion
 
UNIT III -Measures of Central Tendency 2.ppt
UNIT III -Measures of Central Tendency 2.pptUNIT III -Measures of Central Tendency 2.ppt
UNIT III -Measures of Central Tendency 2.ppt
 
UNIT III -Measures of Central Tendency 2.ppt
UNIT III -Measures of Central Tendency 2.pptUNIT III -Measures of Central Tendency 2.ppt
UNIT III -Measures of Central Tendency 2.ppt
 
Class1.ppt
Class1.pptClass1.ppt
Class1.ppt
 

More from chusematelephone

hepatitis.pdfnzjsnnssnnznsbsbbsbshdhsjai
hepatitis.pdfnzjsnnssnnznsbsbbsbshdhsjaihepatitis.pdfnzjsnnssnnznsbsbbsbshdhsjai
hepatitis.pdfnzjsnnssnnznsbsbbsbshdhsjai
chusematelephone
 
SHOULDER DYSTOCIA AND UTERINE ATONY (0).pptx
SHOULDER DYSTOCIA AND UTERINE ATONY (0).pptxSHOULDER DYSTOCIA AND UTERINE ATONY (0).pptx
SHOULDER DYSTOCIA AND UTERINE ATONY (0).pptx
chusematelephone
 
Confidentiality in Medical Practice.ppt
Confidentiality in Medical Practice.pptConfidentiality in Medical Practice.ppt
Confidentiality in Medical Practice.ppt
chusematelephone
 
CONTRACEPTIVES.ppt
CONTRACEPTIVES.pptCONTRACEPTIVES.ppt
CONTRACEPTIVES.ppt
chusematelephone
 
Partogram2.ppt
Partogram2.pptPartogram2.ppt
Partogram2.ppt
chusematelephone
 
5.Meningitis (2).ppt
5.Meningitis (2).ppt5.Meningitis (2).ppt
5.Meningitis (2).ppt
chusematelephone
 
SESSION 8.3; NUTITIONAL REQUIRMENTS FOR ADULTS.pptx
SESSION 8.3; NUTITIONAL REQUIRMENTS FOR ADULTS.pptxSESSION 8.3; NUTITIONAL REQUIRMENTS FOR ADULTS.pptx
SESSION 8.3; NUTITIONAL REQUIRMENTS FOR ADULTS.pptx
chusematelephone
 
immunopathology-1-130219045942-phpapp01.pdf
immunopathology-1-130219045942-phpapp01.pdfimmunopathology-1-130219045942-phpapp01.pdf
immunopathology-1-130219045942-phpapp01.pdf
chusematelephone
 
SESSION 8.2; NUTRITIONAL REQUIREMENTS TO CHILDREN.pptx
SESSION 8.2; NUTRITIONAL REQUIREMENTS TO CHILDREN.pptxSESSION 8.2; NUTRITIONAL REQUIREMENTS TO CHILDREN.pptx
SESSION 8.2; NUTRITIONAL REQUIREMENTS TO CHILDREN.pptx
chusematelephone
 
SESSION 7; NUTRITIONAL DISORDERS.pptx
SESSION 7; NUTRITIONAL DISORDERS.pptxSESSION 7; NUTRITIONAL DISORDERS.pptx
SESSION 7; NUTRITIONAL DISORDERS.pptx
chusematelephone
 
SESSION 7; NUTRITIONAL DISORDERS.pptx
SESSION 7; NUTRITIONAL DISORDERS.pptxSESSION 7; NUTRITIONAL DISORDERS.pptx
SESSION 7; NUTRITIONAL DISORDERS.pptx
chusematelephone
 
GRAM STAINING- GLM.pptx
GRAM STAINING- GLM.pptxGRAM STAINING- GLM.pptx
GRAM STAINING- GLM.pptx
chusematelephone
 
GRAM POSITIVE BACTERIA.pptx
GRAM POSITIVE BACTERIA.pptxGRAM POSITIVE BACTERIA.pptx
GRAM POSITIVE BACTERIA.pptx
chusematelephone
 
SESSION 3 - Computer Software-1.pptx
SESSION 3 - Computer Software-1.pptxSESSION 3 - Computer Software-1.pptx
SESSION 3 - Computer Software-1.pptx
chusematelephone
 
Session 2.pptx
Session 2.pptxSession 2.pptx
Session 2.pptx
chusematelephone
 
LEVEL OF MEASUREMENTS_2.ppt
LEVEL OF MEASUREMENTS_2.pptLEVEL OF MEASUREMENTS_2.ppt
LEVEL OF MEASUREMENTS_2.ppt
chusematelephone
 
SESSION 7.ppt
SESSION 7.pptSESSION 7.ppt
SESSION 7.ppt
chusematelephone
 

More from chusematelephone (17)

hepatitis.pdfnzjsnnssnnznsbsbbsbshdhsjai
hepatitis.pdfnzjsnnssnnznsbsbbsbshdhsjaihepatitis.pdfnzjsnnssnnznsbsbbsbshdhsjai
hepatitis.pdfnzjsnnssnnznsbsbbsbshdhsjai
 
SHOULDER DYSTOCIA AND UTERINE ATONY (0).pptx
SHOULDER DYSTOCIA AND UTERINE ATONY (0).pptxSHOULDER DYSTOCIA AND UTERINE ATONY (0).pptx
SHOULDER DYSTOCIA AND UTERINE ATONY (0).pptx
 
Confidentiality in Medical Practice.ppt
Confidentiality in Medical Practice.pptConfidentiality in Medical Practice.ppt
Confidentiality in Medical Practice.ppt
 
CONTRACEPTIVES.ppt
CONTRACEPTIVES.pptCONTRACEPTIVES.ppt
CONTRACEPTIVES.ppt
 
Partogram2.ppt
Partogram2.pptPartogram2.ppt
Partogram2.ppt
 
5.Meningitis (2).ppt
5.Meningitis (2).ppt5.Meningitis (2).ppt
5.Meningitis (2).ppt
 
SESSION 8.3; NUTITIONAL REQUIRMENTS FOR ADULTS.pptx
SESSION 8.3; NUTITIONAL REQUIRMENTS FOR ADULTS.pptxSESSION 8.3; NUTITIONAL REQUIRMENTS FOR ADULTS.pptx
SESSION 8.3; NUTITIONAL REQUIRMENTS FOR ADULTS.pptx
 
immunopathology-1-130219045942-phpapp01.pdf
immunopathology-1-130219045942-phpapp01.pdfimmunopathology-1-130219045942-phpapp01.pdf
immunopathology-1-130219045942-phpapp01.pdf
 
SESSION 8.2; NUTRITIONAL REQUIREMENTS TO CHILDREN.pptx
SESSION 8.2; NUTRITIONAL REQUIREMENTS TO CHILDREN.pptxSESSION 8.2; NUTRITIONAL REQUIREMENTS TO CHILDREN.pptx
SESSION 8.2; NUTRITIONAL REQUIREMENTS TO CHILDREN.pptx
 
SESSION 7; NUTRITIONAL DISORDERS.pptx
SESSION 7; NUTRITIONAL DISORDERS.pptxSESSION 7; NUTRITIONAL DISORDERS.pptx
SESSION 7; NUTRITIONAL DISORDERS.pptx
 
SESSION 7; NUTRITIONAL DISORDERS.pptx
SESSION 7; NUTRITIONAL DISORDERS.pptxSESSION 7; NUTRITIONAL DISORDERS.pptx
SESSION 7; NUTRITIONAL DISORDERS.pptx
 
GRAM STAINING- GLM.pptx
GRAM STAINING- GLM.pptxGRAM STAINING- GLM.pptx
GRAM STAINING- GLM.pptx
 
GRAM POSITIVE BACTERIA.pptx
GRAM POSITIVE BACTERIA.pptxGRAM POSITIVE BACTERIA.pptx
GRAM POSITIVE BACTERIA.pptx
 
SESSION 3 - Computer Software-1.pptx
SESSION 3 - Computer Software-1.pptxSESSION 3 - Computer Software-1.pptx
SESSION 3 - Computer Software-1.pptx
 
Session 2.pptx
Session 2.pptxSession 2.pptx
Session 2.pptx
 
LEVEL OF MEASUREMENTS_2.ppt
LEVEL OF MEASUREMENTS_2.pptLEVEL OF MEASUREMENTS_2.ppt
LEVEL OF MEASUREMENTS_2.ppt
 
SESSION 7.ppt
SESSION 7.pptSESSION 7.ppt
SESSION 7.ppt
 

Recently uploaded

Emotion-Focused Couples Therapy - Marital and Family Therapy and Counselling ...
Emotion-Focused Couples Therapy - Marital and Family Therapy and Counselling ...Emotion-Focused Couples Therapy - Marital and Family Therapy and Counselling ...
Emotion-Focused Couples Therapy - Marital and Family Therapy and Counselling ...
PsychoTech Services
 
Helminthiasis or Worm infestation in Children for Nursing students
Helminthiasis or Worm infestation in Children for Nursing studentsHelminthiasis or Worm infestation in Children for Nursing students
Helminthiasis or Worm infestation in Children for Nursing students
RAJU B N
 
2nd-generation Antihistaminic Part I.pptx
2nd-generation Antihistaminic Part I.pptx2nd-generation Antihistaminic Part I.pptx
2nd-generation Antihistaminic Part I.pptx
Madhumita Dixit
 
Unlimited Fun With Call Girls Gurgaon ✅ 9711199012 💘 FULL CASH PAYMENT
Unlimited Fun With Call Girls Gurgaon ✅ 9711199012 💘 FULL CASH PAYMENTUnlimited Fun With Call Girls Gurgaon ✅ 9711199012 💘 FULL CASH PAYMENT
Unlimited Fun With Call Girls Gurgaon ✅ 9711199012 💘 FULL CASH PAYMENT
keshavtiwari584
 
Part III - Cumulative Grief: Learning how to honor the many losses that occur...
Part III - Cumulative Grief: Learning how to honor the many losses that occur...Part III - Cumulative Grief: Learning how to honor the many losses that occur...
Part III - Cumulative Grief: Learning how to honor the many losses that occur...
bkling
 
Call Girls Lucknow 9024918724 Vip Call Girls Lucknow
Call Girls Lucknow 9024918724 Vip Call Girls LucknowCall Girls Lucknow 9024918724 Vip Call Girls Lucknow
Call Girls Lucknow 9024918724 Vip Call Girls Lucknow
nandinirastogi03
 
All about shoulder Joint ..
All about shoulder Joint .. All about shoulder Joint ..
All about shoulder Joint ..
Aya Reyad
 
Discovering Human Gut Microbiome Dynamics
Discovering Human Gut Microbiome DynamicsDiscovering Human Gut Microbiome Dynamics
Discovering Human Gut Microbiome Dynamics
Larry Smarr
 
mortality & morbidity indicators of health .pptx
mortality & morbidity indicators of health .pptxmortality & morbidity indicators of health .pptx
mortality & morbidity indicators of health .pptx
swarnkarmadhu
 
Allopurinol (Anti-gout drug).pptx
Allopurinol (Anti-gout drug).pptxAllopurinol (Anti-gout drug).pptx
Allopurinol (Anti-gout drug).pptx
Madhumita Dixit
 
Breast cancer :Receptor (ER/PR/HER2 NEU) Discordance.pptx
Breast cancer :Receptor (ER/PR/HER2 NEU) Discordance.pptxBreast cancer :Receptor (ER/PR/HER2 NEU) Discordance.pptx
Breast cancer :Receptor (ER/PR/HER2 NEU) Discordance.pptx
Dr. Sumit KUMAR
 
Call Girls Gorakhpur 7742996321 Gorakhpur Escorts Service
Call Girls Gorakhpur 7742996321 Gorakhpur Escorts ServiceCall Girls Gorakhpur 7742996321 Gorakhpur Escorts Service
Call Girls Gorakhpur 7742996321 Gorakhpur Escorts Service
kapilsharma3523
 
Call Girls Kolkata 💯Call Us 🔝 7374876321 🔝 💃 Top Class Call Girl Service A...
Call Girls Kolkata   💯Call Us 🔝 7374876321 🔝 💃  Top Class Call Girl Service A...Call Girls Kolkata   💯Call Us 🔝 7374876321 🔝 💃  Top Class Call Girl Service A...
Call Girls Kolkata 💯Call Us 🔝 7374876321 🔝 💃 Top Class Call Girl Service A...
daljeetsingh9909
 
Hemodialysis: Chapter 6, Hemodialysis Adequacy and Dose - Dr.Gawad
Hemodialysis: Chapter 6, Hemodialysis Adequacy and Dose - Dr.GawadHemodialysis: Chapter 6, Hemodialysis Adequacy and Dose - Dr.Gawad
Hemodialysis: Chapter 6, Hemodialysis Adequacy and Dose - Dr.Gawad
NephroTube - Dr.Gawad
 
Applications of NMR in Protein Structure Prediction.pptx
Applications of NMR in Protein Structure Prediction.pptxApplications of NMR in Protein Structure Prediction.pptx
Applications of NMR in Protein Structure Prediction.pptx
Anagha R Anil
 
Call Girls Omr Road 8824825030 Top Class Chennai Escorts Available
Call Girls Omr Road 8824825030 Top Class Chennai Escorts AvailableCall Girls Omr Road 8824825030 Top Class Chennai Escorts Available
Call Girls Omr Road 8824825030 Top Class Chennai Escorts Available
simrankaur
 
Cirrhosis of liver - Seminar. pptx
Cirrhosis of liver - Seminar.       pptxCirrhosis of liver - Seminar.       pptx
Cirrhosis of liver - Seminar. pptx
Shashi Prakash
 
Call Girl Gurgaon Just Call 9999965857 Top Class Call Girl Service Available
Call Girl Gurgaon Just Call 9999965857 Top Class Call Girl Service AvailableCall Girl Gurgaon Just Call 9999965857 Top Class Call Girl Service Available
Call Girl Gurgaon Just Call 9999965857 Top Class Call Girl Service Available
gurkirankumar98700
 
Call Girls in Kolkata 💯Call Us 🔝 7374876321 🔝 💃 Top Class Call Girl Servic...
Call Girls in Kolkata   💯Call Us 🔝 7374876321 🔝 💃  Top Class Call Girl Servic...Call Girls in Kolkata   💯Call Us 🔝 7374876321 🔝 💃  Top Class Call Girl Servic...
Call Girls in Kolkata 💯Call Us 🔝 7374876321 🔝 💃 Top Class Call Girl Servic...
daljeetsingh9909
 
Congenital Disorders of digestive tract.pptx
Congenital Disorders of digestive tract.pptxCongenital Disorders of digestive tract.pptx
Congenital Disorders of digestive tract.pptx
Bhavyakelawadiya
 

Recently uploaded (20)

Emotion-Focused Couples Therapy - Marital and Family Therapy and Counselling ...
Emotion-Focused Couples Therapy - Marital and Family Therapy and Counselling ...Emotion-Focused Couples Therapy - Marital and Family Therapy and Counselling ...
Emotion-Focused Couples Therapy - Marital and Family Therapy and Counselling ...
 
Helminthiasis or Worm infestation in Children for Nursing students
Helminthiasis or Worm infestation in Children for Nursing studentsHelminthiasis or Worm infestation in Children for Nursing students
Helminthiasis or Worm infestation in Children for Nursing students
 
2nd-generation Antihistaminic Part I.pptx
2nd-generation Antihistaminic Part I.pptx2nd-generation Antihistaminic Part I.pptx
2nd-generation Antihistaminic Part I.pptx
 
Unlimited Fun With Call Girls Gurgaon ✅ 9711199012 💘 FULL CASH PAYMENT
Unlimited Fun With Call Girls Gurgaon ✅ 9711199012 💘 FULL CASH PAYMENTUnlimited Fun With Call Girls Gurgaon ✅ 9711199012 💘 FULL CASH PAYMENT
Unlimited Fun With Call Girls Gurgaon ✅ 9711199012 💘 FULL CASH PAYMENT
 
Part III - Cumulative Grief: Learning how to honor the many losses that occur...
Part III - Cumulative Grief: Learning how to honor the many losses that occur...Part III - Cumulative Grief: Learning how to honor the many losses that occur...
Part III - Cumulative Grief: Learning how to honor the many losses that occur...
 
Call Girls Lucknow 9024918724 Vip Call Girls Lucknow
Call Girls Lucknow 9024918724 Vip Call Girls LucknowCall Girls Lucknow 9024918724 Vip Call Girls Lucknow
Call Girls Lucknow 9024918724 Vip Call Girls Lucknow
 
All about shoulder Joint ..
All about shoulder Joint .. All about shoulder Joint ..
All about shoulder Joint ..
 
Discovering Human Gut Microbiome Dynamics
Discovering Human Gut Microbiome DynamicsDiscovering Human Gut Microbiome Dynamics
Discovering Human Gut Microbiome Dynamics
 
mortality & morbidity indicators of health .pptx
mortality & morbidity indicators of health .pptxmortality & morbidity indicators of health .pptx
mortality & morbidity indicators of health .pptx
 
Allopurinol (Anti-gout drug).pptx
Allopurinol (Anti-gout drug).pptxAllopurinol (Anti-gout drug).pptx
Allopurinol (Anti-gout drug).pptx
 
Breast cancer :Receptor (ER/PR/HER2 NEU) Discordance.pptx
Breast cancer :Receptor (ER/PR/HER2 NEU) Discordance.pptxBreast cancer :Receptor (ER/PR/HER2 NEU) Discordance.pptx
Breast cancer :Receptor (ER/PR/HER2 NEU) Discordance.pptx
 
Call Girls Gorakhpur 7742996321 Gorakhpur Escorts Service
Call Girls Gorakhpur 7742996321 Gorakhpur Escorts ServiceCall Girls Gorakhpur 7742996321 Gorakhpur Escorts Service
Call Girls Gorakhpur 7742996321 Gorakhpur Escorts Service
 
Call Girls Kolkata 💯Call Us 🔝 7374876321 🔝 💃 Top Class Call Girl Service A...
Call Girls Kolkata   💯Call Us 🔝 7374876321 🔝 💃  Top Class Call Girl Service A...Call Girls Kolkata   💯Call Us 🔝 7374876321 🔝 💃  Top Class Call Girl Service A...
Call Girls Kolkata 💯Call Us 🔝 7374876321 🔝 💃 Top Class Call Girl Service A...
 
Hemodialysis: Chapter 6, Hemodialysis Adequacy and Dose - Dr.Gawad
Hemodialysis: Chapter 6, Hemodialysis Adequacy and Dose - Dr.GawadHemodialysis: Chapter 6, Hemodialysis Adequacy and Dose - Dr.Gawad
Hemodialysis: Chapter 6, Hemodialysis Adequacy and Dose - Dr.Gawad
 
Applications of NMR in Protein Structure Prediction.pptx
Applications of NMR in Protein Structure Prediction.pptxApplications of NMR in Protein Structure Prediction.pptx
Applications of NMR in Protein Structure Prediction.pptx
 
Call Girls Omr Road 8824825030 Top Class Chennai Escorts Available
Call Girls Omr Road 8824825030 Top Class Chennai Escorts AvailableCall Girls Omr Road 8824825030 Top Class Chennai Escorts Available
Call Girls Omr Road 8824825030 Top Class Chennai Escorts Available
 
Cirrhosis of liver - Seminar. pptx
Cirrhosis of liver - Seminar.       pptxCirrhosis of liver - Seminar.       pptx
Cirrhosis of liver - Seminar. pptx
 
Call Girl Gurgaon Just Call 9999965857 Top Class Call Girl Service Available
Call Girl Gurgaon Just Call 9999965857 Top Class Call Girl Service AvailableCall Girl Gurgaon Just Call 9999965857 Top Class Call Girl Service Available
Call Girl Gurgaon Just Call 9999965857 Top Class Call Girl Service Available
 
Call Girls in Kolkata 💯Call Us 🔝 7374876321 🔝 💃 Top Class Call Girl Servic...
Call Girls in Kolkata   💯Call Us 🔝 7374876321 🔝 💃  Top Class Call Girl Servic...Call Girls in Kolkata   💯Call Us 🔝 7374876321 🔝 💃  Top Class Call Girl Servic...
Call Girls in Kolkata 💯Call Us 🔝 7374876321 🔝 💃 Top Class Call Girl Servic...
 
Congenital Disorders of digestive tract.pptx
Congenital Disorders of digestive tract.pptxCongenital Disorders of digestive tract.pptx
Congenital Disorders of digestive tract.pptx
 

5.DATA SUMMERISATION.ppt

  • 1. DATA SUMMERISATION Dr Vincent Yusuph Ecohas Kibaha 2023
  • 2. OBJECTIVES At the end of this session you should be able to:  Explain data summarization  Explain the characteristics, uses, advantages, and disadvantages of each measure of location.  Calculate mode, mean and median  Compute and interpret variance, and the standard deviation  Identify the position of the arithmetic mean, median, and mode for both a symmetrical and a skewed distribution.  Explain the characteristics, uses, advantages, and disadvantages of this measure of dispersion
  • 3. 4.3 Data summarisation  Measures of Central Location  Mean, Median, Mode  Measures of Variability/spread  Range, Standard Deviation, Variance, Coefficient of Variation  Measures of Relative Standing  Percentiles, Quartiles
  • 4. MEASURES OF CENTRAL TENDENCY/LOCATION  Often we need to summarise frequency distributions in a few numbers for ease of reporting or comparison  Recall: with qualitative data, useful summary statistics include ratio, proportion, rate
  • 5. Measures of central tendency/location  The statistical methods used to measure central tendency include the following 1. Mean 2. Median 3. Mode
  • 6. MEAN  Refers to arithmetic mean  It is obtained by adding the individual observations divided by the total number of observations.  Advantages – it is easy to calculate. most useful of all the averages.  Disadvantages – influenced by abnormal values.  Examples: In this case it will be (8 + 16 + 15 + 17 + 18 + 20 + 25)/7 which comes to 17
  • 7. Characteristics of the Mean It is calculated by summing the values and dividing by the number of values. It requires the interval scale. All values are used. It is unique. The sum of the deviations from the mean is 0. The Arithmetic Mean is the most widely used measure of location and shows the central value of the data. The major characteristics of the mean are: Average Joe 3- 7
  • 8. Population Mean N X    where µ is the population mean N is the total number of observations. X is a particular value.  indicates the operation of adding. For ungrouped data, the Population Mean is the sum of all the population values divided by the total number of population values: 3- 8
  • 9. Example 1 500 , 48 4 000 , 73 ... 000 , 56       N X  Find the mean mileage for the cars. A Parameter is a measurable characteristic of a population. The Musenge family owns four cars. The following is the current mileage on each of the four cars. 56,000 23,000 42,000 73,000 3- 9
  • 10. Example 2 4 . 15 5 77 5 0 . 15 ... 0 . 14        n X X A statistic is a measurable characteristic of a sampl A sample of five executives received the following bonus last year ($000): 14.0, 15.0, 17.0, 16.0, 15.0 3- 10
  • 11. Statistics is a pattern language Population Sample Size N n Mean Variance Standard Deviation
  • 12. Properties of the Arithmetic Mean Every set of interval-level and ratio-level data has a mean. All the values are included in computing the mean. A set of data has a unique mean. The mean is affected by unusually large or small data values. The arithmetic mean is the only measure of location where the sum of the deviations of each value from the mean is zero. Properties of the Arithmetic Mean 3- 12
  • 13. MEDIAN  When all the observation are arranged either in ascending order or descending order, the middle observation is known as median.  In case of even number the average of the two middle values is taken.  Median is better indicator of central value as it is not affected by the extreme values  Example : The median of 4, 1, and 7 is 4 because when the numbers are put in order (1 , 4, 7) , the number 4 is in the middle.
  • 14. The Median There are as many values above the median as below it in the data array. For an even set of values, the median will be the arithmetic average of the two middle numbers and is found at the (n+1)/2 ranked observation. The Median is the midpoint of the values after they have been ordered from the smallest to the largest. 3- 14
  • 15. The ages for a sample of five BSc.HLS.III students are: 21, 25, 19, 20, 22. Arranging the data in ascending order gives: 19, 20, 21, 22, 25. Thus the median is 21. The median (continued) 3- 15
  • 16. Example 5 Arranging the data in ascending order gives: 73, 75, 76, 80 Thus the median is 75.5. The heights of four basketball players, in inches, are: 76, 73, 80, 75. The median is found at the (n+1)/2 = (4+1)/2 =2.5th data point. 3- 16
  • 17. Properties of the Median There is a unique median for each data set. It is not affected by extremely large or small values and is therefore a valuable measure of location when such values occur. It can be computed for ratio-level, interval- level, and ordinal-level data. It can be computed for an open-ended frequency distribution if the median does not lie in an open-ended class. Properties of the Median 3- 17
  • 18. MODE  Most frequently occurring observation in a data is called mode  Not often used in medical statistics.  EXAMPLE  Number of decayed teeth in 10 children  2,2,4,1,3,0,10,2,3,8  Mean = 34 / 10 = 3.4  Median = (0,1,2,2,2,3,3,4,8,10) = 2+3 /2 = 2.5  Mode = 2 ( 3 Times)
  • 19. Symmetric distribution: A distribution having the same shape on either side of the center Skewed distribution: One whose shapes on either side of the center differ; a nonsymmetrical distribution. Can be positively or negatively skewed, or bimodal The Relative Positions of the Mean, Median, and Mode 3- 19
  • 21. The Relative Positions of the Mean, Median, and Mode: Symmetric Distribution Zero skewness Mean =Median =Mode Mode Median Mean 3- 21
  • 22. The Relative Positions of the Mean, Median, and Mode: Right Skewed Distribution  Positively skewed: Mean and median are to the right of the mode. Mean>Median>Mode Mode Median Mean 3- 22
  • 23. Negatively Skewed: Mean and Median are to the left of the Mode. Mean<Median<Mode The Relative Positions of the Mean, Median, and Mode: Left Skewed Distribution Mode Mean Median 3- 23
  • 24. CHOICE OF APPROPRIATE MEASURE  For symmetric distributions, mean is preferred to median or mode:  utilises all values  mathematical niceties  For asymmetric distributions, mean not suitable:  mean is sensitive to extreme values  median more preferred since it is not affected by extreme values
  • 25. • Measures of central location fail to tell the whole story about the distribution; that is, how much are the observations spread out around the mean value? Measures of spread…
  • 26. Measures of Variability… For example, two sets of class grades are shown. The mean (=50) is the same in each case… But, the red class has greater variability than the blue class.
  • 27. Dispersion refers to the spread or variability in the data. Measures of dispersion include the following: range, mean deviation, variance, and standard deviation. Range = Largest value – Smallest value Measures of Dispersion 0 5 10 15 20 25 30 0 2 4 6 8 10 12 3- 27
  • 28. The following represents the current year’s Return on Equity of the 25 companies in an investor’s portfolio. -8.1 3.2 5.9 8.1 12.3 -5.1 4.1 6.3 9.2 13.3 -3.1 4.6 7.9 9.5 14.0 -1.4 4.8 7.9 9.7 15.0 1.2 5.7 8.0 10.3 22.1 Example 9 Highest value: 22.1 Lowest value: -8.1 Range = Highest value – lowest value = 22.1-(-8.1) = 30.2 3- 28
  • 29. Range…  Its major advantage is the ease with which it can be computed.  Its major shortcoming is its failure to provide information on the dispersion of the observations between the two end points.  Hence we need a measure of variability that incorporates all the data and not just two observations. Hence…
  • 30. Variance: the arithmetic mean of the squared deviations from the mean. Standard deviation: The square root of the variance. Variance and standard Deviation 3- 30
  • 31. Not influenced by extreme values. The units are awkward, the square of the original units. All values are used in the calculation. The major characteristics of the Population Variance are: Population Variance 3- 31
  • 32. Population Variance formula:  (X - )2 N  = X is the value of an observation in the population m is the arithmetic mean of the population N is the number of observations in the population  Population Standard Deviation formula: 2  Variance and standard deviation 3- 32
  • 33. (-8.1-6.62)2 + (-5.1-6.62)2 + ... + (22.1-6.62)2 25     = 42.227 = 6.498 In Example 9, the variance and standard deviation are:  (X - )2 N  = Example 9 continued 3- 33
  • 34. Sample variance (s2) s2 = (X - X)2 n-1 Sample standard deviation (s) 2 s s  Sample variance and standard deviation 3- 34
  • 35. 40 . 7 5 37     n X X       30 . 5 1 5 2 . 21 1 5 4 . 7 6 ... 4 . 7 7 1 2 2 2 2              n X X s Example 11 The hourly wages earned by a sample of five students are: $7, $5, $11, $8, $6. Find the sample variance and standard deviation. 30 . 2 30 . 5 2    s s 3- 35
  • 36. Empirical Rule: For any symmetrical, bell- shaped distribution: About 68% of the observations will lie within 1s the mean About 95% of the observations will lie within 2s of the mean Virtually all the observations will be within 3s of the mean Interpretation and Uses of the Standard Deviation 3- 36
  • 37. 4.37 The Empirical Rule…  Approximately 68% of all observations fall  within one standard deviation of the mean.   Approximately 95% of all observations fall  within two standard deviations of the mean.  Approximately 99.7% of all observations fall  within three standard deviations of the mean.
  • 38. Bell-Shaped Curve showing the relationship between and .   3  1  1  3 68% 95% 99.7% Interpretation and Uses of the Standard Deviation 3- 38
  • 39. Interpreting the standard deviation  The greater the variation in the data the greater the standard deviation  If all the values are the same the standard deviation is zero  For a symmetrical distribution almost all the data will be contained within three standard deviations
  • 40. Coefficient of Variation…  The coefficient of variation of a set of observations is the standard deviation of the observations divided by their mean,  that is:  Population coefficient of variation = CV =  Sample coefficient of variation = cv =
  • 41. 4.41 Coefficient of Variation…  This coefficient provides a  proportionate measure of variation, e.g.  A standard deviation of 10 may be perceived as large when the mean value is 100, but only moderately large when the mean value is 500.
  • 42. 4.42 Measures of Variability…  If data are symmetric, with no serious outliers, use range and standard deviation.  If comparing variation across two data sets, use coefficient of variation.  The measures of variability introduced in this section can be used only for interval data.
  翻译: