Sei sulla pagina 1di 6

DATA QUALITY

ASSESSMENT
april 15, 2020

Shashank vikram singh

Overview
Primarily, Sprocket Central Pty Ltd needs help with its customer and transactions data. The
organisation has a large dataset relating to its customers, but their team is unsure how to
effectively analyse it to help optimise its marketing strategy.

Goals
1. To explore the given data set and analyze the quality of the data set.Find the flaws and
issues with the dataset.

2. Find measures to optimize the data set for future analysis.

Specifications
The given data set named “KPMG_VI_New_raw_data_update_final.xlsx” contains 5
sheets(datasets) namely CustomerDemographic ,NewCustomerList , CustomerAddress,
Transactions,Title sheet.

Milestones

1. CustomerDemographic dataset:

When we analyse the data set the first problem we encounter one problem that is the
columns aren't properly named, all the columns are unnamed on further analysis we
realize that first row of the dataset are row of column name so we fix this.
2
Thereafter we observe that there are many numbers of fields which are empty.

Next we see that in gender columns there is no uniformity in the fields

So we fixed it.

Next,we have a column of date of birth but date of birth is not really usefull for our
analysis so we make a column with age of customers.
3

Next,we se there is a column names default with contains random characters which
makes no sense so to make a futher analysis easy we will drop this column.
4
2. NewCustomerList dataset:

Similar to the the demographic dataset the columns are not properly named over
here.

We fixed it.

Next there are some column whose names aren't provided.


5
3. Transaction dataset

We observe that columns are not properly named.

secondly ,there are lot of field in the number of purchases columns which are missing
and that particular filed plays a really important role in out analysis so we will drop all
rows with missing columns.
6

**PLEASE FIND THE CODE IN THE GIVEN GIT REPO**

GITHUB:https://github.com/sparky1911/k_p_m_g

Potrebbero piacerti anche