MGTSC 645 - Assignment 2 - Shivani Gupta

MGTSC 645 Shivani Gupta

Assignment 2 1646112

Decision Tree:

The bank dataset contains several fields (age, balance, job, education, deposit, etc.). Python code
was written for the decision tree portion of the analysis.

The mean age for the dataset is: 41.232

The number of observations is: 11162

The number of individuals with age less than 65 is: 10737
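
A minimal sketch of how these three figures could be computed with pandas. The small DataFrame is a made-up stand-in for the bank data (the report does not show the file or how it was loaded), so only the `age` column name comes from the text; the variable names are assumptions.

```python
import pandas as pd

# Hypothetical stand-in for the bank dataset; the values are illustrative only.
bank = pd.DataFrame({"age": [25, 41, 59, 67, 38, 72]})

mean_age = bank["age"].mean()                # average of the age column
n_obs = len(bank)                            # number of rows (observations)
n_under_65 = int((bank["age"] < 65).sum())   # rows where age is below 65

print(f"Mean age: {mean_age:.3f}")
print(f"Observations: {n_obs}")
print(f"Age < 65: {n_under_65}")
```

On the real dataset the same three expressions would yield the values reported above.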

The image below shows the output for the three results above:

After cleaning the data and removing all rows that contain unknown values, the number of observations left is: 2675
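
The cleaning step could look roughly like the sketch below. It assumes that missing categories are stored as the literal string "unknown" and that a row is dropped if any of its cells holds that value; the sample rows and the names `has_unknown` and `bank_clean` are hypothetical.

```python
import pandas as pd

# Hypothetical sample; "unknown" marks a missing category.
bank = pd.DataFrame({
    "age": [30, 45, 52, 61],
    "education": ["primary", "unknown", "tertiary", "secondary"],
    "contact": ["cellular", "telephone", "unknown", "cellular"],
})

# Flag rows where at least one column holds the string "unknown".
has_unknown = bank.eq("unknown").any(axis=1)

# Keep only the rows without any unknown cell.
bank_clean = bank.loc[~has_unknown].reset_index(drop=True)

print(len(bank_clean))
```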

The image below shows the output of the code after data cleaning:

After data cleaning, dummy variables are created for the columns Marital, Education, Housing, Loan,
Contact, Poutcome and Deposit, such that the number of dummy variables for each column is one
less than its number of categories. For example, Marital has three possible values: divorced, single
and married. Therefore, two dummy variables are created for the Marital categorical variable.

This step brings the dataset to a total of 20 columns. A sample of the dataset is shown in the
output image below:

When converting a categorical variable into dummy variables, a variable with k possible values
needs only (k-1) dummy variables to be completely defined, because the remaining category is implied
when all the others are 0. Therefore, one dummy variable for each column is dropped.
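
As an illustration of the (k-1) rule, here is a small sketch using `pandas.get_dummies` with `drop_first=True`. Whether the original code used this function is an assumption, and the sample column is made up.

```python
import pandas as pd

# Hypothetical sample of the marital column with its three categories.
df = pd.DataFrame({"marital": ["married", "single", "divorced", "married"]})

# drop_first=True keeps k-1 indicator columns per categorical variable;
# the dropped category ("divorced", first alphabetically) becomes the baseline.
dummies = pd.get_dummies(df, columns=["marital"], drop_first=True)

print(list(dummies.columns))
```

A divorced customer is then represented by both remaining indicators being 0, so no information is lost by dropping the third column.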

The decision tree for the bank dataset will look like this:

Balance?
    Low: Own a house? (No / Yes)
    Medium: Education? (Primary / Secondary / Tertiary)
    High (>$2500): Marital Status? (Married / Divorced / Single)

Lower-level nodes: Job? (Management / Student / Retired), Any loan? (Yes / No), Payment Days? (<100 / >100)

Then the data is split for training and testing with 30% of the data to be used for testing the model.

The decision tree is then built using the sklearn DecisionTreeClassifier.
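
The split and the model fit can be sketched as follows. The data here is synthetic and the `random_state` values are assumptions added for reproducibility; only the 30% test share and the use of `DecisionTreeClassifier` come from the text.

```python
import numpy as np
from sklearn.model_selection import train_test_split
from sklearn.tree import DecisionTreeClassifier

# Synthetic stand-in data: 200 rows, 4 numeric features, binary target.
rng = np.random.default_rng(0)
X = rng.normal(size=(200, 4))
y = (X[:, 0] + X[:, 1] > 0).astype(int)

# Hold out 30% of the rows for testing; the remaining 70% train the model.
X_train, X_test, y_train, y_test = train_test_split(
    X, y, test_size=0.3, random_state=0
)

# Fit the tree on the training rows and predict the held-out rows.
tree = DecisionTreeClassifier(random_state=0)
tree.fit(X_train, y_train)
y_pred = tree.predict(X_test)

print(X_train.shape, X_test.shape)
```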

The decision tree model is trained on 70% of the data and takes into account the columns age,
marital, education, balance, housing, loan, contact, day, month, duration, campaign, pdays,
previous, poutcome and deposit. It then returns the confusion matrix. The confusion matrix displays
the number of observations for which the model's prediction was the same as the actual data. It also
provides the number of observations for which the model predicted differently.

Based on that, the classification report is generated, which gives the accuracy, precision, f1-score
and support values for the model.

The output image below shows the confusion matrix and the classification report.

The accuracy of the model is: 99.13%

Whereas the precision is: 99.38%
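
To show how accuracy and precision follow from a confusion matrix, here is a small worked example in plain Python. The labels are made up and do not reproduce the figures above; in sklearn the same quantities are available through `confusion_matrix` and `classification_report` in `sklearn.metrics`.

```python
# Made-up labels for illustration: 1 = positive class, 0 = negative class.
y_true = [1, 0, 1, 1, 0, 0, 1, 0, 0, 1]
y_pred = [1, 0, 1, 0, 0, 1, 1, 0, 0, 1]

# Count the four cells of a binary confusion matrix.
tp = sum(t == 1 and p == 1 for t, p in zip(y_true, y_pred))
tn = sum(t == 0 and p == 0 for t, p in zip(y_true, y_pred))
fp = sum(t == 0 and p == 1 for t, p in zip(y_true, y_pred))
fn = sum(t == 1 and p == 0 for t, p in zip(y_true, y_pred))

accuracy = (tp + tn) / len(y_true)   # share of all predictions that are correct
precision = tp / (tp + fp)           # share of predicted positives that are correct

# Rows are actual classes, columns are predicted classes (0 first, then 1).
print([[tn, fp], [fn, tp]])
print(accuracy, precision)
```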

Neural Network

For the neural network portion, the following variables are used to predict default: education,
job, balance, loan, deposit and housing. The structure of the neural network is as follows:

Input Layer: Education, Job, Balance, Loan, Deposit, Housing
Hidden Layer: three nodes (1, 2, 3)
Output Layer: Default
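
A network with six inputs, one hidden layer of three nodes and a single output could be set up as in the sketch below. The report does not say which library was used, so sklearn's `MLPClassifier` is an assumption, as are `max_iter`, `random_state` and the synthetic data.

```python
import numpy as np
from sklearn.neural_network import MLPClassifier

# Synthetic stand-in for the six inputs (education, job, balance, loan,
# deposit, housing) and a binary "default" target; the values are random.
rng = np.random.default_rng(0)
X = rng.normal(size=(300, 6))
y = (X[:, 2] - X[:, 3] > 0).astype(int)

# One hidden layer with three units: six inputs, three hidden nodes, one output.
net = MLPClassifier(hidden_layer_sizes=(3,), max_iter=2000, random_state=0)
net.fit(X, y)

# Weight matrices: inputs -> hidden layer, hidden layer -> output.
print([w.shape for w in net.coefs_])
```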

The output of the code is shown in the image below:

The accuracy of this model is: 98.45%

The accuracy of the neural network (98.45%) is slightly lower than that of the decision tree (99.13%).
This is because the neural network code uses a limited number of variables to predict default, whereas
the decision tree considers all the variables.
