Benvenuto in Scribd!

HW 8

Caricato da

Il 0% ha trovato utile questo documento (0 voti)

19 visualizzazioni2 pagine

This document contains 4 homework problems related to evaluating machine learning models. Problem 1 involves calculating evaluation metrics like accuracy and F1 score from a confusion matrix on test data. Problem 2 compares 2 regression models using sum of squared errors and R^2. Problem 3 plots ROC curves and calculates AUC and Gini coefficients to compare 2 binary classification models. Problem 4 analyzes changes in a model's prediction frequencies over time to determine if retraining is needed.

Descrizione originale:

kkjhljljkl

Titolo originale

hw8

Copyright

Formati disponibili

PDF, TXT o leggi online da Scribd

Condividi questo documento

Condividi o incorpora il documento

Opzioni di condivisione

Hai trovato utile questo documento?

Questo contenuto è inappropriato?

Segnala questo documento

Copyright:

Formati disponibili

Scarica in formato PDF, TXT o leggi online su Scribd

Segnala contenuti inappropriati

Il 0% ha trovato utile questo documento (0 voti)

19 visualizzazioni2 pagine

HW 8

Caricato da

Annie Zou

Copyright:

Formati disponibili

Scarica in formato PDF, TXT o leggi online su Scribd

Segnala contenuti inappropriati

Salta alla pagina

Sei sulla pagina 1di 2

Cerca all'interno del documento

ELE 364: HW #8

1. A test is used to predict if a person has a certain disease. The table below shows the test predictions
on 20 subjects. Based on the table, compute the following evaluation measures.

ID Target Prediction ID Target Prediction

1 True True 11 False False
2 False False 12 True True
3 True True 13 True False
4 False True 14 True True
5 False True 15 False True
6 True True 16 True True
7 False False 17 True True
8 True False 18 False False
9 True True 19 True True
10 True True 20 False False

(a) A confusion matrix and classification accuracy.

(b) The average class accuracy using the arithmetic mean and harmonic mean.
(c) The precision, recall, and F1 measure.
(d) Another test whose confusion matrix is given below is also used to detect the same disease.
Which performance metrics would you use to compare the tests? Which test is better?
Target ↓ | Prediction → True False
True 9 1
False 5 5

2. The table below shows the predictions made for a continuous target feature by two different
prediction models for a test set.

ID Target Model 1 Prediction Model 2 Prediction

1 1.200 1.400 1.700
2 2.100 2.500 2.700
3 3.700 3.300 3.300
4 4.100 4.700 4.800
5 5.300 6.300 4.900
6 5.700 5.100 5.300
7 3.200 3.000 2.900
8 7.100 6.700 6.500
9 4.300 4.700 4.900
10 4.500 4.000 4.400

(a) Based on these predictions, calculate the sum of squared errors. Which model is better based
on this evaluation measure.
(b) Calculate the R2 measure. Which model is better based on the R2 measure.
(c) Based on the evaluation measures calculated, which model do you think is performing better
for this dataset?

3. A company develops two different models to predict the behavior of its stock in the market. The
tools predict whether the stock will rise or fall. They measure the true positive rate (TPR) and
false positive rate (FPR) based on the prediction scores and resulting predictions of the models.
The table below shows the TPR and FPR calculated at four threshold values. The threshold
values are different for each model.

Model 1 Model2
TPR 0.4 0.5 0.6 0.9 1 TPR 0.2 0.3 0.7 1 1
FPR 0.3 0.5 0.7 0.9 1 FPR 0.2 0.4 0.6 0.8 1

(a) Plot the ROC curve for each model. Assume the TPR value between two threshold values is
equal to the TPR value calculated at the higher threshold value. For example, for model 1,
TPR is 0.4 when FPR is 0.2 and TPR is 0.9 at FPR=0.8.
(b) Compute the area under the curve (AUC) for each model. Which model performs better?
(c) Compute the Gini coefficient for each model.

4. A news agency is using a model to predict the party affiliation of its subscribers. The table below
shows the prediction frequencies of the model at the time the model was built, for the month after
deployment, and for a month-long period one year after deployment.

Target Original Sample 1 Sample 2

Party A 300 160 350
Party B 400 180 200
Non-affiliated 200 120 300

(a) Draw the bar plots of these three sets of prediction frequencies. Does the model need to be
retrained at these points based on the frequency plots?
(b) Calculate the stability index for the periods of Sample 1 and Sample 2, and determine whether
the model should be retrained at these points. Does the change in the prediction distribution
indicate that the model does not work well anymore?

Potrebbero piacerti anche

MBA 2nd Sem Exam Paper
Documento24 pagine
MBA 2nd Sem Exam Paper
keyur
Nessuna valutazione finora
Assessing Forecasting Error: The Prediction Interval
Documento10 pagine
Assessing Forecasting Error: The Prediction Interval
doxamaria
Nessuna valutazione finora
Tutorial 2
Documento5 pagine
Tutorial 2
Kim Ngọc Huyền
Nessuna valutazione finora
2018 Qtii
Documento10 pagine
2018 Qtii
Vaishigan Paramananthasivam
Nessuna valutazione finora
Dec 2013 2810007
Documento4 pagine
Dec 2013 2810007
Grishma Bhindora
Nessuna valutazione finora
List of Correction For Applied Statistics Module
Documento26 pagine
List of Correction For Applied Statistics Module
Thurgah Vshiny
Nessuna valutazione finora
Pre Board - 2 11 Eco
Documento3 pagine
Pre Board - 2 11 Eco
NDA Aspirant
Nessuna valutazione finora
DL ClassProject Report - 2019A7PS0029P
Documento8 pagine
DL ClassProject Report - 2019A7PS0029P
Sankha Das
Nessuna valutazione finora
Data Mining A Tutorial Based Primer 2nd Roiger Solution Manual
Documento12 pagine
Data Mining A Tutorial Based Primer 2nd Roiger Solution Manual
JeremyGibsonwpoy
100% (37)
Fa17 Practice Midterm2
Documento6 pagine
Fa17 Practice Midterm2
Mygod
Nessuna valutazione finora
1 Econreview-Questions
Documento26 pagine
1 Econreview-Questions
glenia
Nessuna valutazione finora
1 Econreview-Questions
Documento26 pagine
1 Econreview-Questions
glenia
Nessuna valutazione finora
1 Econreview-Questions
Documento26 pagine
1 Econreview-Questions
glenia
Nessuna valutazione finora
Sample Questions PUHE6003
Documento19 pagine
Sample Questions PUHE6003
Duane Lewis
Nessuna valutazione finora
T1 Q. T Nov. 2018
Documento2 pagine
T1 Q. T Nov. 2018
Govind N V
Nessuna valutazione finora
Standard Deviation Practice
Documento7 pagine
Standard Deviation Practice
Trisha Bechard
100% (1)
The University of Auckland
Documento24 pagine
The University of Auckland
Goutam Das
Nessuna valutazione finora
CE251 Java Practical List PDF
Documento11 pagine
CE251 Java Practical List PDF
yash
Nessuna valutazione finora
CE251 Java Practical List
Documento11 pagine
CE251 Java Practical List
NEEL SOJITRA
Nessuna valutazione finora
Parul University: Seat No
Documento2 pagine
Parul University: Seat No
Abhay Singh B
Nessuna valutazione finora
Question Bank (Unit I To IV)
Documento73 pagine
Question Bank (Unit I To IV)
Mukesh sahani
Nessuna valutazione finora
11 4variationswithinadataset
Documento4 pagine
11 4variationswithinadataset
Christian Batista
Nessuna valutazione finora
Endterm - 1
Documento14 pagine
Endterm - 1
himanshubahmani
Nessuna valutazione finora
Paper Open Elective
Documento3 pagine
Paper Open Elective
CHANDERHASS
Nessuna valutazione finora
MS4610 - Introduction To Data Analytics Final Exam Date: November 24, 2021, Duration: 1 Hour, Max Marks: 75
Documento11 pagine
MS4610 - Introduction To Data Analytics Final Exam Date: November 24, 2021, Duration: 1 Hour, Max Marks: 75
Mohd Saud
Nessuna valutazione finora
BA9201 Statistics For Managemant JAN 2012
Documento10 pagine
BA9201 Statistics For Managemant JAN 2012
Sivakumar Natarajan
Nessuna valutazione finora
MPM 68 Individual 2 Assign 1
Documento4 pagine
MPM 68 Individual 2 Assign 1
eyob yohannes
Nessuna valutazione finora
Model Paper - Business Statistics
Documento7 pagine
Model Paper - Business Statistics
Ishini Saparamadu
Nessuna valutazione finora
BA7102-Statistics For Management PDF
Documento12 pagine
BA7102-Statistics For Management PDF
suriya
Nessuna valutazione finora
PS - Gtu Paper
Documento3 pagine
PS - Gtu Paper
sujal patel
Nessuna valutazione finora
Exam 1 Notes
Documento2 pagine
Exam 1 Notes
BBYPENNY
Nessuna valutazione finora
Classification Accuracy Logarithmic Loss Confusion Matrix Area Under Curve F1 Score Mean Absolute Error
Documento9 pagine
Classification Accuracy Logarithmic Loss Confusion Matrix Area Under Curve F1 Score Mean Absolute Error
Harpreet Singh Bagga
Nessuna valutazione finora
Revision Questions
Documento2 pagine
Revision Questions
Dimpho Sonjani-Sibiya
Nessuna valutazione finora
Cccu Cge13101 Exam2013a
Documento13 pagine
Cccu Cge13101 Exam2013a
Ping Fan
Nessuna valutazione finora
Laboratory Activity 2 Statistics
Documento17 pagine
Laboratory Activity 2 Statistics
JENNEFER LEE
Nessuna valutazione finora
Gujarat Technological University
Documento3 pagine
Gujarat Technological University
Aniket Patel
Nessuna valutazione finora
Chapter 4 (Hypothesis Testing)
Documento20 pagine
Chapter 4 (Hypothesis Testing)
Dyg Nademah Pengiran Mustapha
Nessuna valutazione finora
Test Bank Questions Chapters 1 and 2
Documento3 pagine
Test Bank Questions Chapters 1 and 2
Khánh Huyền
Nessuna valutazione finora
Test Bank Questions Chapters 1 and 2
Documento3 pagine
Test Bank Questions Chapters 1 and 2
Anonymous 8ooQmMoNs1
50% (2)
10 Ai Evaluation tp01
Documento5 pagine
10 Ai Evaluation tp01
tanjirouchihams12
Nessuna valutazione finora
Practice Exam 09 Multiple Choice
Documento11 pagine
Practice Exam 09 Multiple Choice
Amanda Brown
Nessuna valutazione finora
Group 8 - Business Stats Project - Installment I
Documento16 pagine
Group 8 - Business Stats Project - Installment I
Abhee Raj
Nessuna valutazione finora
Introduction To Econometrics, Tutorial
Documento22 pagine
Introduction To Econometrics, Tutorial
agonza70
Nessuna valutazione finora
Homework Week 13
Documento2 pagine
Homework Week 13
Nigar Qurbanova
Nessuna valutazione finora
Quamet MSC530M G02 3T 2020-2021 - Part IV
Documento2 pagine
Quamet MSC530M G02 3T 2020-2021 - Part IV
MingLi Jiang
Nessuna valutazione finora
EvaluationQuestions Class 10 Ai
Documento6 pagine
EvaluationQuestions Class 10 Ai
kritavearn
Nessuna valutazione finora
QB For ADS
Documento12 pagine
QB For ADS
Kunj Trivedi
Nessuna valutazione finora
UNIT4 Confusion Matrix
Documento12 pagine
UNIT4 Confusion Matrix
Jaya Sankar
Nessuna valutazione finora
Python Learning
Documento21 pagine
Python Learning
Vinay
Nessuna valutazione finora
Output Iteman Analisis 10 Soal
Documento18 pagine
Output Iteman Analisis 10 Soal
rumifasabrun09
Nessuna valutazione finora
Take Home Test MEI 2014
Documento3 pagine
Take Home Test MEI 2014
மோகனா Karunakaran
Nessuna valutazione finora
Ignou June New
Documento7 pagine
Ignou June New
Anonymous WtjVcZCg
Nessuna valutazione finora
Q.B Statistics
Documento7 pagine
Q.B Statistics
Navkar Jain Sahab
Nessuna valutazione finora
Multiple Choice Test Bank Questions No Feedback - Chapter 3
Documento5 pagine
Multiple Choice Test Bank Questions No Feedback - Chapter 3
Đức Nghĩa
100% (1)
Evolutionary Algorithms for Food Science and Technology
Da Everand
Evolutionary Algorithms for Food Science and Technology
Evelyne Lutton
Nessuna valutazione finora
Practical 1-1 Merged
Documento16 pagine
Practical 1-1 Merged
pese4095
Nessuna valutazione finora
Exam AFF700 211210 - Solutions
Documento11 pagine
Exam AFF700 211210 - Solutions
nnajichinedu20
Nessuna valutazione finora
8102 Stats
Documento5 pagine
8102 Stats
gaurav jain
Nessuna valutazione finora
Guided Randomness in Optimization, Volume 1
Da Everand
Guided Randomness in Optimization, Volume 1
Maurice Clerc
Nessuna valutazione finora
Advanced Portfolio Management: A Quant's Guide for Fundamental Investors
Da Everand
Advanced Portfolio Management: A Quant's Guide for Fundamental Investors
Giuseppe A. Paleologo
Nessuna valutazione finora
Exotic DVM 11 3 Complete
Documento12 pagine
Exotic DVM 11 3 Complete
Luc Card
Nessuna valutazione finora
Pulmonary Embolism
Documento48 pagine
Pulmonary Embolism
ganga2424
100% (3)
Manual For Tacho Universal Edition 2006: Legal Disclaimer
Documento9 pagine
Manual For Tacho Universal Edition 2006: Legal Disclaimer
boirx
Nessuna valutazione finora
Case Study - Kelompok 2
Documento5 pagine
Case Study - Kelompok 2
elida wen
Nessuna valutazione finora
Binary Options
Documento24 pagine
Binary Options
samsa7
Nessuna valutazione finora
Introduction To Retail Loans
Documento2 pagine
Introduction To Retail Loans
Sameer Shah
Nessuna valutazione finora
The Kicker Transcription
Documento4 pagine
The Kicker Transcription
miles
Nessuna valutazione finora
5066452
Documento53 pagine
5066452
jlcheefei9258
Nessuna valutazione finora
Syncope
Documento105 pagine
Syncope
John Das
Nessuna valutazione finora
Solubility Product Constants
Documento6 pagine
Solubility Product Constants
Bilal Ahmed
Nessuna valutazione finora
Cyclic Meditation
Documento8 pagine
Cyclic Meditation
Satadal Gupta
Nessuna valutazione finora
Mahesh R Pujar: (Volume3, Issue2)
Documento6 pagine
Mahesh R Pujar: (Volume3, Issue2)
Ignited Minds
Nessuna valutazione finora
Forex Day Trading System
Documento17 pagine
Forex Day Trading System
Social Malik
100% (1)
Odisha State Museum-1
Documento26 pagine
Odisha State Museum-1
ajitkpatnaik
Nessuna valutazione finora
Facebook: Daisy Buchanan
Documento5 pagine
Facebook: Daisy Buchanan
belenrichardi
Nessuna valutazione finora
Mechanical Engineering - Workshop Practice - Laboratory Manual
Documento77 pagine
Mechanical Engineering - Workshop Practice - Laboratory Manual
rajeevranjan_br
100% (4)
Gaming Ports Mikrotik
Documento6 pagine
Gaming Ports Mikrotik
Ray Ohms
Nessuna valutazione finora
Genomic Tools For Crop Improvement
Documento41 pagine
Genomic Tools For Crop Improvement
Neeru Redhu
Nessuna valutazione finora
CN1111 Tutorial 4 Question
Documento3 pagine
CN1111 Tutorial 4 Question
thenewperson
0% (1)
Pioneer vsx-1020-k 1025-k SM PDF
Documento132 pagine
Pioneer vsx-1020-k 1025-k SM PDF
luisclaudio31
Nessuna valutazione finora
Moquerio - Defense Mechanism Activity
Documento3 pagine
Moquerio - Defense Mechanism Activity
Roxan Moquerio
Nessuna valutazione finora
Retail Banking Black Book
Documento95 pagine
Retail Banking Black Book
omprakash shinde
Nessuna valutazione finora
JFC 180BB
Documento2 pagine
JFC 180BB
nazmul
Nessuna valutazione finora
A Hybrid Genetic-Neural Architecture For Stock Indexes Forecasting
Documento31 pagine
A Hybrid Genetic-Neural Architecture For Stock Indexes Forecasting
Maurizio Idini
Nessuna valutazione finora
Alternative Network Letter Vol 7 No.1-Apr 1991-EQUATIONS
Documento16 pagine
Alternative Network Letter Vol 7 No.1-Apr 1991-EQUATIONS
Equitable Tourism Options (EQUATIONS)
Nessuna valutazione finora
Semi Detailed Lesson Plan
Documento2 pagine
Semi Detailed Lesson Plan
Jean-jean Dela Cruz Camat
Nessuna valutazione finora
Stentofon Pulse: IP Based Intercom System
Documento22 pagine
Stentofon Pulse: IP Based Intercom System
Craig
Nessuna valutazione finora
Shelly e Commerce
Documento13 pagine
Shelly e Commerce
Varun_Arya_8382
Nessuna valutazione finora
Marketing Channels: A Strategic Tool of Growing Importance For The Next Millennium
Documento59 pagine
Marketing Channels: A Strategic Tool of Growing Importance For The Next Millennium
Anonymous ibmeej9
Nessuna valutazione finora
BLP#1 - Assessment of Community Initiative (3 Files Merged)
Documento10 pagine
BLP#1 - Assessment of Community Initiative (3 Files Merged)
John Gladhimer Canlas
Nessuna valutazione finora