
Prediction and accuracy

Metrics for Performance Evaluation…


                          PREDICTED CLASS
                          Class=Yes    Class=No
ACTUAL      Class=Yes     a (TP)       b (FN)
CLASS       Class=No      c (FP)       d (TN)

• Most widely-used metric:


ad TP  TN
Accuracy  
a  b  c  d TP  TN  FP  FN
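As a minimal sketch (not part of the original slides), the formula translates directly into Python; the counts used below are taken from model M1 on the "Computing Cost of Classification" slide further down:

```python
# Accuracy from the four confusion-matrix cells (a=TP, b=FN, c=FP, d=TN).
def accuracy(tp, fn, fp, tn):
    return (tp + tn) / (tp + tn + fp + fn)

# Counts from model M1 on a later slide: 400 correct out of 500 = 0.8
print(accuracy(tp=150, fn=40, fp=60, tn=250))
```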
Limitation of Accuracy
• Consider a 2-class problem
– Number of Class 0 examples = 9990
– Number of Class 1 examples = 10

• If the model predicts every example to be class 0, its accuracy is 9990/10000 = 99.9%
– Accuracy is misleading because the model does not detect a single class 1 example (see the sketch below)

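A quick sketch of the degenerate classifier described above, in plain Python:

```python
# 2-class problem from the slide: 9990 class-0 examples, 10 class-1 examples.
y_true = [0] * 9990 + [1] * 10
y_pred = [0] * 10000  # model predicts everything to be class 0

accuracy = sum(p == t for p, t in zip(y_pred, y_true)) / len(y_true)
detected = sum(1 for p, t in zip(y_pred, y_true) if t == 1 and p == 1)
print(accuracy)  # 0.999 -- looks excellent
print(detected)  # 0     -- yet not one class 1 example is detected
```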


Cost Matrix
                          PREDICTED CLASS
C(i|j)                    Class=Yes    Class=No
ACTUAL      Class=Yes     C(Yes|Yes)   C(No|Yes)
CLASS       Class=No      C(Yes|No)    C(No|No)

C(i|j): Cost of misclassifying class j example as class i

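One natural way to represent C(i|j) in code is a nested mapping indexed by actual class, then predicted class. The structure below mirrors the slide's matrix, but the numeric costs are illustrative assumptions only, not values from the slides:

```python
# Illustrative cost matrix C(i|j), stored as COST[actual][predicted].
# The numbers are assumed for illustration: here, missing a Yes
# (C(No|Yes)) is far more expensive than a false alarm (C(Yes|No)).
COST = {
    "Yes": {"Yes": 0, "No": 100},  # C(Yes|Yes) = 0,  C(No|Yes) = 100
    "No":  {"Yes": 1, "No": 0},    # C(Yes|No)  = 1,  C(No|No)  = 0
}
print(COST["Yes"]["No"])  # 100: cost of predicting No for an actual Yes
```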


Computing Cost of Classification

Model M1 (Accuracy = 80%):
                    PREDICTED CLASS
                    +      -
ACTUAL      +       150    40
CLASS       -       60     250

Model M2 (Accuracy = 90%):
                    PREDICTED CLASS
                    +      -
ACTUAL      +       250    45
CLASS       -       5      200

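Continuing the illustrative (assumed) costs from the sketch above, relabeled Yes/No as +/- to match this slide, the total cost of each model follows from its confusion matrix, and the more accurate model M2 can still incur the higher cost:

```python
# Total cost = sum over cells of count(actual, predicted) * C(predicted|actual).
# Cost values are illustrative assumptions, not given on the slides.
COST = {("+", "+"): 0, ("+", "-"): 100, ("-", "+"): 1, ("-", "-"): 0}

# Confusion matrices from the slide, keyed as (actual, predicted).
M1 = {("+", "+"): 150, ("+", "-"): 40, ("-", "+"): 60, ("-", "-"): 250}
M2 = {("+", "+"): 250, ("+", "-"): 45, ("-", "+"): 5, ("-", "-"): 200}

def total_cost(counts):
    return sum(n * COST[cell] for cell, n in counts.items())

print(total_cost(M1))  # 4060
print(total_cost(M2))  # 4505: 90% accuracy, yet costlier than M1
```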


Predictor Error Measures

• Measure predictor accuracy: measure how far off the predicted value is from the actual known value
• Loss function: measures the error between the actual value yi and the predicted value yi'
– Absolute error: |yi – yi'|
– Squared error: (yi – yi')²
• Test error (generalization error): the average loss over the test set
– Mean absolute error: (Σ_{i=1..d} |yi – yi'|) / d
– Mean squared error: (Σ_{i=1..d} (yi – yi')²) / d
– Relative absolute error: (Σ_{i=1..d} |yi – yi'|) / (Σ_{i=1..d} |yi – ȳ|)
– Relative squared error: (Σ_{i=1..d} (yi – yi')²) / (Σ_{i=1..d} (yi – ȳ)²)
where ȳ is the mean of the actual values over the d test tuples
• The mean squared error exaggerates the presence of outliers, so the (square) root mean squared error is popularly used; similarly, the root relative squared error (see the sketch below)
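A minimal Python sketch of the four measures plus RMSE; the sample values at the end are made up purely for illustration:

```python
# Error measures for actual values y and predictions y_hat over d test tuples.
def error_measures(y, y_hat):
    d = len(y)
    y_bar = sum(y) / d  # mean of the actual values
    abs_errs = [abs(a - p) for a, p in zip(y, y_hat)]
    sq_errs = [(a - p) ** 2 for a, p in zip(y, y_hat)]
    mae = sum(abs_errs) / d                                # mean absolute error
    mse = sum(sq_errs) / d                                 # mean squared error
    rae = sum(abs_errs) / sum(abs(a - y_bar) for a in y)   # relative absolute error
    rse = sum(sq_errs) / sum((a - y_bar) ** 2 for a in y)  # relative squared error
    rmse = mse ** 0.5                                      # root mean squared error
    return mae, mse, rae, rse, rmse

print(error_measures([3.0, 5.0, 7.0], [2.5, 5.5, 8.0]))
```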
Evaluating the Accuracy of a Classifier or Predictor (I)
• Holdout method
– Given data is randomly partitioned into two independent sets
• Training set (e.g., 2/3) for model construction
• Test set (e.g., 1/3) for accuracy estimation
– Random sampling: a variation of holdout
• Repeat holdout k times, accuracy = avg. of the accuracies obtained
• Cross-validation (k-fold, where k = 10 is most popular; see the sketch below)
– Randomly partition the data into k mutually exclusive subsets D1, …, Dk, each of approximately equal size
– At the i-th iteration, use Di as the test set and the remaining subsets as the training set
– Leave-one-out: k folds where k = # of tuples; for small-sized data
– Stratified cross-validation: folds are stratified so that class dist. in each
fold is approx. the same as that in the initial data

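A sketch of both partitioning schemes on integer indices, assuming model construction and accuracy estimation happen elsewhere; only the splitting logic from the slide is shown:

```python
import random

def holdout_split(n, train_frac=2/3, seed=0):
    """Randomly partition n indices into a training set and a test set."""
    idx = list(range(n))
    random.Random(seed).shuffle(idx)
    cut = int(n * train_frac)  # e.g., 2/3 for training, 1/3 for testing
    return idx[:cut], idx[cut:]

def kfold_splits(n, k=10, seed=0):
    """Yield (train, test) index lists for k-fold cross-validation."""
    idx = list(range(n))
    random.Random(seed).shuffle(idx)
    folds = [idx[i::k] for i in range(k)]  # k mutually exclusive subsets D1..Dk
    for i in range(k):                     # i-th iteration: Di is the test set
        train = [j for f in folds[:i] + folds[i + 1:] for j in f]
        yield train, folds[i]

for train, test in kfold_splits(30, k=3):
    print(len(train), len(test))  # ~20 train, ~10 test per fold
```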


Evaluating the Accuracy of a Classifier or Predictor (II)

• Bootstrap
– Works well with small data sets
– Samples the given training tuples uniformly with replacement
• i.e., each time a tuple is selected, it is equally likely to be selected
again and re-added to the training set
• Several bootstrap methods exist; a common one is the .632 bootstrap
– Suppose we are given a data set of d tuples. The data set is sampled d times, with replacement, resulting in a training set of d samples. The data tuples that did not make it into the training set form the test set. About 63.2% of the original data will end up in the bootstrap sample, and the remaining 36.8% will form the test set (since (1 – 1/d)^d ≈ e^(-1) = 0.368)
– Repeat the sampling procedure k times; the overall accuracy of the model (sketched in code below) is:
acc(M) = Σ_{i=1..k} ( 0.632 × acc(Mi)_test_set + 0.368 × acc(Mi)_train_set )

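A sketch of the k-round procedure, where acc_fn(train_idx, eval_idx) stands for a hypothetical helper (not defined on the slides) that builds the model on the training indices and returns its accuracy on the evaluation indices:

```python
import random

def bootstrap_632(d, k, acc_fn, seed=0):
    """Sum of 0.632 * test-set accuracy + 0.368 * train-set accuracy over k rounds."""
    rng = random.Random(seed)
    total = 0.0
    for _ in range(k):
        train = [rng.randrange(d) for _ in range(d)]       # d samples, with replacement
        in_train = set(train)
        test = [i for i in range(d) if i not in in_train]  # ~36.8% of the tuples
        total += (0.632 * acc_fn(train, test)              # acc(Mi) on its test set
                  + 0.368 * acc_fn(train, list(range(d)))) # acc(Mi) on the original
    return total                                           # tuples ("train set" term)
```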
