Sei sulla pagina 1di 6

9/17/2019

2. Data analysis and


statistics

[2] CHAP.2

Data Analysis 9/17/2019

2
Outline

 Statistical measures
 Data analysis using Pivot tables

Data Analysis 9/17/2019

1
9/17/2019

3
Statistical measures

 Box Plots

Data Analysis 9/17/2019

 Dot‐Scale Diagrams
shows a histogram of data
values as dots
corresponding to
individual data points,
along with the mean,
median, first and third
quartiles, and 1, 2, and 3
standard deviation ranges
from the mean.

Data Analysis 9/17/2019

2
9/17/2019

 Outliers
“Mild” outliers are often defined as being
between 1.5*IQR and 3*IQR to the left of Q1 or to
the right of Q3 , and “extreme” outliers as more
than 3*IQR away from these quartiles.

Data Analysis 9/17/2019

6
Review

 A frequency distribution is a table that shows the number of observations in


each of several nonoverlapping groups.
 A graphical depiction of a frequency distribution for numerical data in the
form of a column chart is called a histogram

Data Analysis 9/17/2019

3
9/17/2019

7
Data analysis using PivotTables

 A PivotTable is a table of statistics that summarizes the


data of a more extensive table (such as from a
database, spreadsheet, or business intelligence
program).

Data Analysis 9/17/2019

8
PivotTable in
Excel
 Given the dataset,
 Insert a PivotTable
 Select row and column
labels
 Adjust Value Field
Settings

Data Analysis 9/17/2019

4
9/17/2019

9
PivotTable in R

 https://www.rforexcelusers.com/make-pivottable-in-r/

Data Analysis 9/17/2019

10
PivotTable in Python

 https://pandas.pydata.org/pandas-
docs/stable/reference/api/pandas.pivot_table.html
 https://pbpython.com/pandas-pivot-table-explained.html

Data Analysis 9/17/2019

10

5
9/17/2019

11
Exercises

 Given the Accounting Department Survey Data


 Use Box plot to describe Years of Service, Years Undergraduate Study
 Describe Age Group using histogram
 The Excel file Cell Phone Survey reports opinions of a sample of consumers
regarding the signal strength, value for the dollar, and customer service for
their cell phone carriers. Use PivotTables to find the following:
a. The average signal strength by type of carrier.
b. Average value for the dollar by type of carrier and usage level.
c. Variance of perception of customer service by carrier and gender.
What conclusions might you reach from this information?

Data Analysis 9/17/2019

11

12
Reading

 [1] 1
 [2] 2
 [3] 2, 3

Data Analysis 9/17/2019

12

Potrebbero piacerti anche