Introduction to Statistics
Descriptive Analysis
Inferential Analysis
Descriptive Statistics
Acceptance Sampling
Introduction to Statistics
The Nature of Statistics and the Collection of Data
What is Statistics?
A branch of mathematics used to summarize, analyze, and interpret a group of numbers or observations
Descriptive Statistics
Procedures used to summarize, organize, and make sense of a set of scores or observations
Typically presented graphically, in tabular form (in tables), or as summary statistics (single values)
Inferential Statistics
Procedures that allow researchers to infer or generalize observations made with samples to the larger population from which they were selected
Before we go…
Which types of tables, graphs, and summary measures should we use with our data?
Data
Measurements or observations that are typically numeric
Internal vs. External Data
Elementary Units & Variables
Observational Study (Survey) vs. Experiment
Population vs. Sample
Qualitative vs. Quantitative Variables
Census vs. Sample Survey
Why Sampling?
Reducing cost of collecting and processing data
Sampling can provide more detailed information than a census
Census is senseless whenever the acquisition of the desired information destroys the elementary units of interest
Census is senseless whenever it produces information that comes too late
Samples Types & Errors
Sampling Techniques
Probability: Simple Random, Systematic, Stratified, Cluster
Non-Probability
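The four probability techniques named above can be illustrated with a minimal sketch. The population of 100 numbered units, the two strata, and the ten clusters are hypothetical, and the helper names are illustrative rather than taken from the slides.

```python
import random

def simple_random_sample(units, n, rng):
    # Every unit has the same chance of selection.
    return rng.sample(units, n)

def systematic_sample(units, n, rng):
    # Pick a random start, then take every k-th unit.
    k = len(units) // n
    start = rng.randrange(k)
    return [units[start + i * k] for i in range(n)]

def stratified_sample(strata, fraction, rng):
    # Draw a simple random sample of the same fraction from each stratum.
    sample = []
    for members in strata.values():
        size = max(1, round(len(members) * fraction))
        sample.extend(rng.sample(members, size))
    return sample

def cluster_sample(clusters, n_clusters, rng):
    # Select whole clusters at random and keep every unit inside them.
    chosen = rng.sample(sorted(clusters), n_clusters)
    return [unit for name in chosen for unit in clusters[name]]

rng = random.Random(0)                      # fixed seed for repeatability
population = list(range(1, 101))            # hypothetical units 1..100
srs = simple_random_sample(population, 10, rng)
sys_sample = systematic_sample(population, 10, rng)
strata = {"small": population[:60], "large": population[60:]}
strat = stratified_sample(strata, 0.1, rng)
clusters = {f"c{i}": population[i * 10:(i + 1) * 10] for i in range(10)}
clus = cluster_sample(clusters, 2, rng)
```

Stratified sampling takes some units from every group, whereas cluster sampling takes all units from a few groups.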
Random error: random fluctuations in the measurements
Systematic error (constant offset)
Variable Data Types
Variable Data Type: Qualitative (Categorical) or Quantitative
Measurement Level:
Nominal (no natural order between the categories)
Ordinal (ordering)
Interval (no true zero): continuous data where the differences (intervals) between the numbers are comparable
Ratio (there is a true zero)
Discrete (variable takes on a limited number of outcomes)
Continuous (variable can take on the tiniest fractional values)
Minitab 16 Software
A statistical software package used to analyze data
o Graphing data
Tables
Graphs: Pie Charts, Stem-&-Leaf Diagrams, Box-&-Whisker Diagrams
Absolute Frequency Distribution
Class (size of profit in millions of dollars) | Absolute Class Frequency (number of companies in class): Tally | Count
Total | | 100
Relative Frequency Distribution
Class (size of profit in millions of dollars) | Absolute Class Frequency (number of companies in class) | Relative Class Frequency (proportion of all companies in class)
-1,500 to under 0 | 3 | .03
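The relative class frequency is the absolute class frequency divided by the total count. The sketch below shows that step. Only the first class (3 companies out of 100) comes from the table; the other two classes and their counts are hypothetical placeholders chosen so that the total is 100.

```python
def relative_frequencies(class_counts):
    # Divide each absolute class frequency by the total number of observations.
    total = sum(class_counts.values())
    return {label: count / total for label, count in class_counts.items()}

absolute = {
    "-1,500 to under 0": 3,      # row taken from the table
    "0 to under 1,500": 57,      # hypothetical class and count
    "1,500 and over": 40,        # hypothetical class and count
}
relative = relative_frequencies(absolute)
for label, share in relative.items():
    print(f"{label}: {share:.2f}")
```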
Measures of Central Tendency (Location): Mean µ, Median M, Mode Mo, Quartiles (Percentiles) (data levels: Ordinal, Nominal, Continuous)
Measures of Dispersion (Variability): Range, Variance σ², Standard Deviation σ (data levels: Ordinal, Continuous)
Measures of Shape: Skewness Sk, Kurtosis K (data level: Continuous)
Proportion π
Standard Normal Distribution
Statistics Formulas
Descriptive Statistics
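Statistic | Formula
Mean | $\bar{x} = \frac{\sum x}{n}$
Median (50% Quartile) | $m = x_{\frac{n+1}{2}}$ (middle value)
Mode | Most frequent value
Range | Maximum - Minimum
Variance | $s^2 = \frac{\sum (x - \bar{x})^2}{n - 1}$
Standard Deviation | $s = \sqrt{s^2}$
Skewness | $sk = \frac{\bar{x} - mo}{s}$
Kurtosis | $k = \frac{\sum (x - \bar{x})^4 / n}{s^4}$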
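The formulas in the table can be applied directly to a small data set. The following is a minimal sketch that follows the table term by term; the six data values are hypothetical, and for an even number of observations the median is taken as the average of the two middle values, a common convention for a fractional position (n+1)/2.

```python
import math
from collections import Counter

def describe(data):
    xs = sorted(data)
    n = len(xs)
    mean = sum(xs) / n
    mid = n // 2
    # Odd n: the middle value. Even n: average of the two middle values.
    median = xs[mid] if n % 2 == 1 else (xs[mid - 1] + xs[mid]) / 2
    mode = Counter(xs).most_common(1)[0][0]              # most frequent value
    variance = sum((x - mean) ** 2 for x in xs) / (n - 1)  # sample variance s^2
    s = math.sqrt(variance)                               # standard deviation
    return {
        "mean": mean,
        "median": median,
        "mode": mode,
        "range": xs[-1] - xs[0],
        "variance": variance,
        "std_dev": s,
        "skewness": (mean - mode) / s,                    # sk = (mean - mode) / s
        "kurtosis": (sum((x - mean) ** 4 for x in xs) / n) / s ** 4,
    }

summary = describe([2, 4, 4, 5, 7, 8])   # hypothetical observations
for name, value in summary.items():
    print(name, round(value, 4))
```

Statistical packages may use slightly different definitions of skewness and kurtosis, so their output need not match these textbook formulas exactly.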
Quartiles
Order Data
Cut into 4 equal parts
Cuts = Quartiles
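The steps above (order the data, cut it into four equal parts, read off the cut points) can be sketched with Python's standard library. The eleven values are hypothetical, and `statistics.quantiles` uses its default interpolation rule here; other software may use a slightly different rule and report slightly different quartiles.

```python
from statistics import quantiles

data = [9, 3, 11, 1, 7, 5, 2, 10, 4, 8, 6]   # hypothetical observations

ordered = sorted(data)                        # step 1: order the data
q1, q2, q3 = quantiles(ordered, n=4)          # steps 2-3: three cuts, four parts
print(ordered)
print(q1, q2, q3)
```

The second cut is the median, which is why the formula table lists the median as the 50% quartile.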
Skewness
Kurtosis
Minitab
Application
Inferential Statistics
Inferential Analysis
Hypothesis Testing
Relationship among Variables
Hypothesis Testing
(Significance Testing)
A systematic approach to assessing
tentative beliefs about reality.
1. State the Hypothesis: H0 vs. Ha
2. Select a test statistic: z or t
3. Derive a decision rule: Level of Significance α
4. Take a sample, compute the test statistic, & confront it with the decision rule: Significance Value (p-value)
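The four steps can be traced in a short sketch of a two-sided one-sample z test. The sample values, the hypothesized mean of 50, and the known population standard deviation of 3 are all hypothetical; a t test would be the usual choice when the population standard deviation is unknown.

```python
import math
from statistics import NormalDist, mean

def one_sample_z_test(sample, mu0, sigma, alpha=0.05):
    # Step 1: H0: mu = mu0 versus Ha: mu != mu0.
    # Step 2: the test statistic is z (sigma is assumed known).
    n = len(sample)
    z = (mean(sample) - mu0) / (sigma / math.sqrt(n))
    # Step 3: decision rule, reject H0 when the p-value is below alpha.
    p_value = 2 * (1 - NormalDist().cdf(abs(z)))
    # Step 4: confront the computed statistic with the decision rule.
    return z, p_value, p_value < alpha

sample = [52, 48, 55, 51, 54, 50, 53, 49, 56, 52]   # hypothetical measurements
z, p_value, reject_h0 = one_sample_z_test(sample, mu0=50, sigma=3)
print(round(z, 3), round(p_value, 4), reject_h0)
```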
Making a Decision
Types of Error
Test of Normality
Relationship among
Variables
The relationship between two variables can be checked by drawing scatterplots or running statistical tests.
Scatterplots
Minitab
Application
Correlation
Perfect
Weak
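The labels above describe the strength of a linear relationship. One common way to quantify it, not spelled out on the slide, is Pearson's correlation coefficient r, which ranges from -1 to 1. The sketch below computes r for two hypothetical data sets, one perfectly linear and one only weakly related.

```python
import math

def pearson_r(xs, ys):
    # r = sum of cross-deviations / sqrt(product of squared deviations)
    mx = sum(xs) / len(xs)
    my = sum(ys) / len(ys)
    sxy = sum((x - mx) * (y - my) for x, y in zip(xs, ys))
    sxx = sum((x - mx) ** 2 for x in xs)
    syy = sum((y - my) ** 2 for y in ys)
    return sxy / math.sqrt(sxx * syy)

x = [1, 2, 3, 4, 5]                           # hypothetical values
r_perfect = pearson_r(x, [2, 4, 6, 8, 10])    # y is exactly 2x
r_weak = pearson_r(x, [3, 1, 4, 2, 3])        # little linear pattern
print(round(r_perfect, 3), round(r_weak, 3))
```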
Minitab
Application
Testing Relationship among
Variables
Variables | Test
Both Variables are Nominal | Chi-square
Independent Variable is Nominal & Dependent Variable is Interval or Ratio | T-Test (Independent Variable has only two categories)
Independent Variable is Nominal & Dependent Variable is Interval or Ratio | ANOVA (Independent Variable has more than two categories)
Contingency Table
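A contingency table of two nominal variables is the input to the chi-square test of independence. The sketch below computes the chi-square statistic and its degrees of freedom from observed counts. The 2x2 table is hypothetical; the statistic is then compared with the critical value for the chosen significance level (3.841 for 1 degree of freedom at α = 0.05).

```python
def chi_square_statistic(table):
    # Expected count for a cell = row total * column total / grand total.
    row_totals = [sum(row) for row in table]
    col_totals = [sum(col) for col in zip(*table)]
    total = sum(row_totals)
    chi2 = 0.0
    for i, row in enumerate(table):
        for j, observed in enumerate(row):
            expected = row_totals[i] * col_totals[j] / total
            chi2 += (observed - expected) ** 2 / expected
    df = (len(table) - 1) * (len(table[0]) - 1)
    return chi2, df

observed = [[30, 20],    # hypothetical counts: rows are categories of one
            [20, 30]]    # nominal variable, columns of the other
chi2, df = chi_square_statistic(observed)
CRITICAL_DF1_ALPHA_05 = 3.841
print(chi2, df, chi2 > CRITICAL_DF1_ALPHA_05)
```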
Acceptance Sampling
The process of randomly inspecting a sample of goods and deciding whether to accept the entire lot based on the results
Statistical Process Control (SPC)
A statistical tool that involves inspecting a random sample of the output from a process and deciding whether the process is producing products with characteristics that fall within a predetermined range
Process Capability
The ability of a production process to meet or exceed preset specifications
All three of these statistical quality control categories are helpful in measuring and evaluating
the quality of products or services. However, statistical process control (SPC) tools are used most
frequently because they identify quality problems during the production process.
Why Is SPC the Most Important Tool of SQC?
or process
Some Information about SPC
Describing certain characteristics of a product & a process
Measures of Variability (standard deviation & range)
Flow Chart
Cause-&-Effect Diagram
Histogram
Scatter Diagram
1. Control Chart
A graph that shows whether a sample of data falls within the common or normal range of variation
Characteristics measured by Control Chart: Variables, Attributes
Construction (proportion defective):
The center line is the average proportion defective in the population, $\bar{p}$.
Construct the upper and lower control limits of the chart:
$UCL = \bar{p} + z\sigma_p$
$LCL = \bar{p} - z\sigma_p$
$\sigma_p = \sqrt{\frac{\bar{p}(1 - \bar{p})}{n}}$
Construction (number of defects):
The center line is the average number of defects, $\bar{c}$.
Construct the upper and lower control limits of the chart:
$UCL = \bar{c} + z\sqrt{\bar{c}}$
$LCL = \bar{c} - z\sqrt{\bar{c}}$
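The control limits for both attribute charts can be computed as in the sketch below. The input values (average proportion defective 0.10 with samples of 100, average of 4 defects) and the choice z = 3 are hypothetical. Clamping a negative lower limit to zero is a common convention added here, not something stated on the slide.

```python
import math

def p_chart_limits(p_bar, n, z=3.0):
    # sigma_p = sqrt(p_bar * (1 - p_bar) / n)
    sigma_p = math.sqrt(p_bar * (1 - p_bar) / n)
    lcl = max(0.0, p_bar - z * sigma_p)   # a proportion cannot be negative
    return lcl, p_bar, p_bar + z * sigma_p

def c_chart_limits(c_bar, z=3.0):
    # The standard deviation of a defect count is taken as sqrt(c_bar).
    half_width = z * math.sqrt(c_bar)
    lcl = max(0.0, c_bar - half_width)    # a count cannot be negative
    return lcl, c_bar, c_bar + half_width

p_lcl, p_center, p_ucl = p_chart_limits(p_bar=0.10, n=100)
c_lcl, c_center, c_ucl = c_chart_limits(c_bar=4.0)
print(p_lcl, p_center, p_ucl)
print(c_lcl, c_center, c_ucl)
```

A sample whose proportion defective or defect count falls outside these limits suggests the process is out of control.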
Minitab
Application
Process Capability
$C_{pk} = \min\left(\frac{USL - \bar{\bar{x}}}{3\sigma}, \frac{\bar{\bar{x}} - LSL}{3\sigma}\right)$
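As a worked illustration of the index above, the sketch below evaluates both terms and takes the smaller one. The specification limits, process mean, and standard deviation are hypothetical numbers.

```python
def cpk(usl, lsl, process_mean, sigma):
    # Distance from the mean to each specification limit, in units of 3 sigma.
    upper = (usl - process_mean) / (3 * sigma)
    lower = (process_mean - lsl) / (3 * sigma)
    return min(upper, lower)

# Hypothetical process: specification 9.4 to 10.6, mean 10.1, sigma 0.1
index = cpk(usl=10.6, lsl=9.4, process_mean=10.1, sigma=0.1)
print(round(index, 3))
```

Because the mean sits closer to the upper limit, the upper term determines the index; a centered process would give equal terms.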
Acceptance Sampling
No 100% Inspection
Double-Sampling Plan
A plan in which management specifies two sample sizes and two acceptance numbers; if the quality of the lot is very good or very bad, the consumer can make a decision to accept or reject the lot on the basis of the first sample, which is smaller than in the single-sampling plan.
Sampling by Attribute
Sampling by Variable
The Single Sampling Procedure
Take a Random Sample of size n from the Lot of size N
d ≤ c? Yes: Accept Lot
d ≤ c? No: Reject Lot
$AOQ = P_{ac} \, P \, \frac{N - n}{N}$
proportion defective.
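The single sampling rule and the average outgoing quality formula can be sketched as follows. The probability of acceptance is computed here with a binomial model, which is an assumption that works as an approximation when the lot is large relative to the sample. The plan parameters (N = 1000, n = 10, c = 1) and the 10% proportion defective are hypothetical.

```python
from math import comb

def accept_lot(defectives_found, c):
    # Decision rule: accept the lot when d <= c.
    return defectives_found <= c

def acceptance_probability(n, c, p):
    # P(d <= c) when each sampled item is defective with probability p
    # (binomial model, assumed here).
    return sum(comb(n, d) * p ** d * (1 - p) ** (n - d) for d in range(c + 1))

def average_outgoing_quality(lot_size, n, c, p):
    # AOQ = Pac * P * (N - n) / N
    return acceptance_probability(n, c, p) * p * (lot_size - n) / lot_size

p_ac = acceptance_probability(n=10, c=1, p=0.10)
aoq = average_outgoing_quality(lot_size=1000, n=10, c=1, p=0.10)
print(round(p_ac, 4), round(aoq, 4))
print(accept_lot(defectives_found=1, c=1), accept_lot(defectives_found=2, c=1))
```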
Create a Sampling Plan
Compare a Sampling Plan
2. Check Sheet
A simple document that is used for collecting data in real-time and at the location where the data is generated.
3. Pareto Chart
A bar chart that is used to analyze the frequency of problems or causes in a process
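The numbers behind a Pareto chart are the causes sorted from most to least frequent, together with a running cumulative percentage. A minimal sketch, using hypothetical defect causes and counts:

```python
def pareto_table(counts):
    # Sort causes by frequency (largest first) and accumulate percentages.
    total = sum(counts.values())
    ordered = sorted(counts.items(), key=lambda item: item[1], reverse=True)
    rows, running = [], 0
    for cause, count in ordered:
        running += count
        rows.append((cause, count, 100 * running / total))
    return rows

defects = {"scratches": 15, "misalignment": 50, "cracks": 25, "other": 10}
rows = pareto_table(defects)    # hypothetical tallies, e.g. from a check sheet
for cause, count, cumulative in rows:
    print(f"{cause:<14}{count:>4}{cumulative:>8.1f}%")
```

The bars of the chart use the counts, and the cumulative column shows how few causes account for most of the problems.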
4. Flow Chart
MS Visio Software
5. Cause-&-Effect Diagram
Fishbone Diagram: helps organize ideas & identify relationships, and encourages brainstorming for ideas
6. Histogram
A graphical representation of the distribution of data
7. Scatterplot
A graph of plotted points that show the relationship between two sets of data
Thank You!
Presenter: Marwa Abo Amra
statistician.marwa@gmail.com