Documenti di Didattica
Documenti di Professioni
Documenti di Cultura
http://statisticsbyjim.com/basics/probability-distributions/
https://en.wikipedia.org/wiki/Probability_distribution
© 2019 SAP SE or an SAP affiliate company. All rights reserved. ǀ PUBLIC 2
Properties of Distributions
Types of probability distribution
p(S)
1/6
0.16
0.14 5/36
0.12
1/9
0.10
0.08 1/12
0.06
1/18
0.04 The probability mass
1/36 function (pmf) of counts
0.02
from two dice
2 3 4 5 6 7 8 9 10 11 12
S
https://en.wikipedia.org/wiki/Probability_distribution
© 2019 SAP SE or an SAP affiliate company. All rights reserved. ǀ PUBLIC 4
Properties of Distributions
Discrete probability example
Number of
Probability
Heads
0 0.25
1 0.50
2 0.25
0.4
0.3
0.2
34.1% 34.1%
0.1
2.1% 2.1%
0.1% 13.6% 13.6% 0.1%
0.0
https://en.wikipedia.org/wiki/Probability_distribution
© 2019 SAP SE or an SAP affiliate company. All rights reserved. ǀ PUBLIC 6
Properties of Distributions
Continuous probability example 1
a
Refer to https://stattrek.com/probability-distributions/discrete-continuous.aspx for more information.
© 2019 SAP SE or an SAP affiliate company. All rights reserved. ǀ PUBLIC 7
Properties of Distributions
Continuous probability example 2
Number of
Probability
Heads
0 0.25
1 0.50
2 0.25
Discrete Continuous
open@sap.com
Follow all of SAP
www.sap.com/contactsap
50
40
30
20
10
0
100 120 140 160 180 200
The Normal Distribution
https://www.mathsisfun.com/data/standard-normal-distribution.html
https://en.wikipedia.org/wiki/Normal_distribution
© 2019 SAP SE or an SAP affiliate company. All rights reserved. ǀ PUBLIC 2
The Normal Distribution
Characteristics
Symmetry
50% 50%
© 2019 SAP SE or an SAP affiliate company. All rights reserved. ǀ PUBLIC 3
The Normal Distribution
Standard deviation
Standardize
19.1% 19.1%
15.0% 15.0%
0.5% 9.2% 9.2% 0.5%
4.4% 4.4%
0.1% 0.1%
1.7% 1.7%
34.1% 34.1%
0.1
2.1% 2.1%
0.1% 13.6% 13.6% 0.1%
0.0
𝑥−𝜇
-3σ -2σ -1σ 0 1σ 2σ 3σ 𝑧=
𝜎
ഥ
𝒙 − 𝟑𝒔 ഥ
𝒙 − 𝟐𝒔 ഥ
𝒙−𝒔 ഥ
𝒙 ഥ
𝒙+𝒔 ഥ
𝒙 + 𝟐𝒔 ഥ
𝒙 + 𝟑𝒔
© 2019 SAP SE or an SAP affiliate company. All rights reserved. ǀ PUBLIC 7
The Normal Distribution
Rules of thumb for detecting outliers
Not unusual
Moderately Moderately
unusual unusual
Outliers Outliers
https://en.wikipedia.org/wiki/Central_limit_theorem
https://machinelearningmastery.com/a-gentle-introduction-to-the-central-limit-theorem-for-machine-learning/
© 2019 SAP SE or an SAP affiliate company. All rights reserved. ǀ PUBLIC 9
The Normal Distribution
Summary
open@sap.com
Follow all of SAP
www.sap.com/contactsap
Positive Kurtosis
Negative Kurtosis
Normal Distribution
https://www.statisticshowto.datasciencecentral.com/probability-and-statistics/statistics-definitions/kurtosis-leptokurtic-platykurtic/
© 2019 SAP SE or an SAP affiliate company. All rights reserved. ǀ PUBLIC 2
Kurtosis and Skewness
Kurtosis
▪ Data sets with high, positive kurtosis tend to ▪ Data sets with low kurtosis tend to have
have heavy tails, or outliers. light tails, or lack of outliers.
35
30
25
20
15
10
5
0
1 2 3 4 5 6 7 8 9 10 11 2 3 4 5 6 7 8 9 10 11
▪ This distribution has positive kurtosis ▪ This distribution has low kurtosis (no tails)
(heavier tails compared to the normal
distribution)
https://en.wikipedia.org/wiki/Kurtosis
https://www.spcforexcel.com/knowledge/basic-statistics/are-skewness-and-kurtosis-useful-statistics
© 2019 SAP SE or an SAP affiliate company. All rights reserved. ǀ PUBLIC 3
Kurtosis and Skewness
Excess kurtosis
0.8
D, 3
S, 2 Key:
L, 1.2
0.7 N, 0 Red, kurt 3, Laplace (D)ouble exponential
C, -0.59376
W, -1 distribution;
0.6 U, -1.2
Orange, kurt 2, hyperbolic (S)ecant distribution;
0.1
0
-5 -4 -3 -2 -1 0 1 2 3 4 5
https://www.statisticshowto.datasciencecentral.com/probability-and-statistics/statistics-definitions/kurtosis-leptokurtic-platykurtic/
© 2019 SAP SE or an SAP affiliate company. All rights reserved. ǀ PUBLIC 4
Kurtosis and Skewness
Kurtosis in financial markets
https://www.statisticshowto.datasciencecentral.com/probability-and-statistics/statistics-definitions/kurtosis-leptokurtic-platykurtic/
© 2019 SAP SE or an SAP affiliate company. All rights reserved. ǀ PUBLIC 5
Kurtosis and Skewness
Introduction to skewness
Value of function
Random variable
https://www.itl.nist.gov/div898/handbook/eda/section3/eda35b.htm
https://whatis.techtarget.com/definition/skewness
© 2019 SAP SE or an SAP affiliate company. All rights reserved. ǀ PUBLIC 6
Kurtosis and Skewness
Mean and median
1 2 3 4 5 6
von Hippel, Paul T. (2005). "Mean, Median, and Skew: Correcting a Textbook Rule". Journal of Statistics Education. 13 (2).
https://en.wikipedia.org/wiki/Skewness
© 2019 SAP SE or an SAP affiliate company. All rights reserved. ǀ PUBLIC 7
Kurtosis and Skewness
Why is skew important?
0,4
0,3
2,1% 2,1%
0,1% 13,6% 13,6% 0,1%
0,0
https://www.sheffield.ac.uk/polopoly_fs/1.579181!/file/stcp-marshallsamuels-NormalityS.pdf
https://www.quora.com/How-does-skewness-impact-regression-model
https://www.itl.nist.gov/div898/handbook/eda/section3/eda35b.htm
https://www.linkedin.com/pulse/question-does-skewness-variable-impact-predictive-data-mosaddar for more information
© 2019 SAP SE or an SAP affiliate company. All rights reserved. ǀ PUBLIC 8
Kurtosis and Skewness
Summary
open@sap.com
Follow all of SAP
www.sap.com/contactsap
a
© 2019 SAP SE or an SAP affiliate company. All rights reserved. ǀ PUBLIC 3
Using the Normal Distribution to Calculate Probability
Empirical rule recap
50% 50%
34% 34%
2.35% 2.35%
0.15% 13.5% 13.5% 0.15%
μ-3σ μ -2σ μ -σ μ μ+σ μ+2σ μ+3σ
68%
95%
99.7%
Mean = 1.5
Question
▪ 95% of students at school are between
1.2m and 1.8m tall.
▪ Assuming this data is normally
distributed, calculate the mean and
standard deviation.
95%
2 SD 2 SD
http://davidmlane.com/hyperstat/z_table.html 4 SD
Question
▪ On average, a light bulb lasts 300
days with a standard deviation of
50 days.
▪ Assuming that bulb life is normally
distributed, what is the probability
that the light bulb will last at most
365 days?
https://www.hackmath.net/en/calculator/normal-distribution
https://www.hackmath.net/en/calculator/normal-distribution?mean=300&sd=50&above=&area=below&below=365&ll=&ul=&outsideLL=
&outsideUL=&draw=Calculate
© 2019 SAP SE or an SAP affiliate company. All rights reserved. ǀ PUBLIC 7
Using the Normal Distribution to Calculate Probability
Example 2
Question
▪ Scores on an IQ test are normally
distributed.
▪ If the test has a mean of 110 and a
standard deviation of 20, what is the
probability that a person who takes
the test will score between 90 and
120?
https://www.hackmath.net/en/calculator/normal-distribution?mean=110&sd=20&above=&below=&area=between&ll=90&ul=120&outsideLL=
&outsideUL=&draw=Calculate
© 2019 SAP SE or an SAP affiliate company. All rights reserved. ǀ PUBLIC 8
Using the Normal Distribution to Calculate Probability
Example 3
Question
▪ A student achieved a score
of 900 in an exam.
▪ The mean test score was
825 with a standard
deviation of 100.
▪ Assuming that test scores
are normally distributed,
what proportion of students
achieved a higher score
than 900?
https://www.hackmath.net/en/calculator/normal-distribution?mean=825&sd=100&area=above&above=900&below=&ll=&ul=&outsideLL=
&outsideUL=&draw=Calculate
© 2019 SAP SE or an SAP affiliate company. All rights reserved. ǀ PUBLIC 9
Using the Normal Distribution to Calculate Probability
Summary
https://stattrek.com/probability-distributions/normal.aspx
https://www.mathsisfun.com/data/standard-normal-distribution.html
https://statistics.laerd.com/statistical-guides/normal-distribution-calculations.php
© 2019 SAP SE or an SAP affiliate company. All rights reserved. ǀ PUBLIC 10
Thank you.
Contact information:
open@sap.com
Follow all of SAP
www.sap.com/contactsap
Ho: P = 0.5
H1: P ≠ 0.5
α < 0.05
https://stattrek.com/hypothesis-test/hypothesis-testing.aspx
© 2019 SAP SE or an SAP affiliate company. All rights reserved. ǀ PUBLIC 3
Hypothesis Testing
Testing
Hypothesis Testing
Region of Acceptance
Two-Tailed Test
Non-Rejection Region
Reject Reject
Hypothesis Hypothesis
http://www.stat.yale.edu/Courses/1997-98/101/sigtest.htm
https://blog.minitab.com/blog/adventures-in-statistics-2/understanding-hypothesis-tests-significance-levels-alpha-and-p-values-in-statistics
© 2019 SAP SE or an SAP affiliate company. All rights reserved. ǀ PUBLIC 6
Hypothesis Testing
Decision errors
Truth
H0 is True H0 is False
Type II Error
H0 Not Rejected Correct Decision
β
Statistician’s opinion
(based on the sample
data and decision rule)
Type I Error
H0 Rejected Correct Decision
α
https://en.wikipedia.org/wiki/Power_(statistics)
© 2019 SAP SE or an SAP affiliate company. All rights reserved. ǀ PUBLIC 7
Hypothesis Testing
Summary
open@sap.com
Follow all of SAP
www.sap.com/contactsap