
Stat 928: Statistical Learning Theory Lecture: 19

Perceptron Lower Bound & The Winnow Algorithm


Instructor: Sham Kakade
1 Lower Bound
Theorem 1.1. Suppose $A = \{x \in \mathbb{R}^d : \|x\| \le 1\}$ and $\frac{1}{\gamma^2} \le d$. Then for any deterministic algorithm, there exists a
data set which is separable by a margin of $\gamma$ on which the algorithm makes at least $\lfloor \frac{1}{\gamma^2} \rfloor$ mistakes.
Proof. Let $n = \lfloor \frac{1}{\gamma^2} \rfloor$. Note that $n \le d$ and $\gamma^2 n \le 1$. Let $e_i$ be the unit vector with a 1 in the $i$th coordinate and zeroes
in the others. Consider $e_1, \ldots, e_n$. We now claim that, for any $b \in \{-1, +1\}^n$, there is a $w$ with $\|w\| \le 1$ such that
\[
\forall i \in [n], \quad b_i (w \cdot e_i) = \gamma .
\]
To see this, simply choose $w_i = \gamma b_i$. Then the above equality is true. Moreover, $\|w\|^2 = \gamma^2 \sum_{i=1}^n b_i^2 = \gamma^2 n \le 1$.
Now given an algorithm $\mathcal{A}$, define the data set $(x_i, y_i)_{i=1}^n$ as follows. Let $x_i = e_i$ for all $i$ and $y_1 = -\mathcal{A}(x_1)$, i.e., the label opposite to the algorithm's first prediction. Define $y_i$ for $i > 1$ recursively as
\[
y_i = -\mathcal{A}(x_1, y_1, \ldots, x_{i-1}, y_{i-1}, x_i) .
\]
It is clear that the algorithm makes $n$ mistakes when run on this data set. By the above claim, no matter what the $y_i$'s turn
out to be, the data set is separable by a margin of $\gamma$.
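The construction is easy to simulate. Below is a minimal sketch (not part of the original notes) of the adversarial data set, assuming the deterministic algorithm is exposed as a callable `algorithm(history, x)` returning a prediction in $\{-1,+1\}$; the function name and interface are illustrative only.

import numpy as np

def adversarial_dataset(algorithm, gamma, d):
    # Build the lower-bound data set against a deterministic online algorithm.
    # `algorithm(history, x)` is assumed to return a prediction in {-1, +1}
    # given the labelled history [(x_1, y_1), ...] and the new point x.
    n = int(np.floor(1.0 / gamma**2))
    assert n <= d, "the construction needs 1/gamma^2 <= d"
    history, xs, ys = [], [], []
    for i in range(n):
        x = np.zeros(d)
        x[i] = 1.0                        # x_i = e_i
        y = -algorithm(history, x)        # label is the opposite of the prediction
        history.append((x, y))
        xs.append(x)
        ys.append(y)
    # Separating vector from the claim in the proof: w_i = gamma * y_i.
    w = np.zeros(d)
    w[:n] = gamma * np.array(ys)
    # Sanity checks: every example has margin exactly gamma, and ||w|| <= 1.
    assert all(np.isclose(y * np.dot(w, x), gamma) for x, y in zip(xs, ys))
    assert np.linalg.norm(w) <= 1.0 + 1e-12
    return xs, ys, w

Running it against any deterministic online learner, e.g. a perceptron's $\mathrm{sgn}(w \cdot x)$ rule, forces a mistake on each of the $n = \lfloor 1/\gamma^2 \rfloor$ rounds while the returned $w$ separates the data with margin $\gamma$.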
2 The Winnow Algorithm
Algorithm 1 WINNOW
Input parameter: $\eta > 0$ (learning rate)
$w_1 \leftarrow \frac{1}{d}\mathbf{1}$
for $t = 1$ to $T$ do
  Receive $x_t \in \mathbb{R}^d$
  Predict $\mathrm{sgn}(w_t \cdot x_t)$
  Receive $y_t \in \{-1, +1\}$
  if $\mathrm{sgn}(w_t \cdot x_t) \ne y_t$ then
    $\forall i \in [d],\; w_{t+1,i} \leftarrow \frac{w_{t,i} \exp(\eta y_t x_{t,i})}{Z_t}$ where $Z_t = \sum_{i=1}^d w_{t,i} \exp(\eta y_t x_{t,i})$
  else
    $w_{t+1} \leftarrow w_t$
  end if
end for
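For concreteness, here is a short Python transcription of Algorithm 1 (a sketch, not part of the notes); the stream interface and the convention $\mathrm{sgn}(0) = +1$ are assumptions made for illustration.

import numpy as np

def winnow(stream, d, eta):
    # Run WINNOW on an iterable of (x, y) pairs with x in R^d and y in {-1, +1}.
    # Returns the final weight vector and the number of mistakes made.
    w = np.full(d, 1.0 / d)              # w_1 <- (1/d) * 1
    mistakes = 0
    for x, y in stream:
        prediction = 1 if np.dot(w, x) >= 0 else -1   # sgn(w_t . x_t), taking sgn(0) = +1
        if prediction != y:
            mistakes += 1
            w = w * np.exp(eta * y * x)  # multiplicative update ...
            w = w / w.sum()              # ... normalised by Z_t, so w stays a distribution
        # else: w_{t+1} = w_t
    return w, mistakes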
Theorem 2.1. Suppose Assumption M holds. Further assume that $w^\star \ge 0$. Let
\[
M_T := \sum_{t=1}^T \mathbf{1}[\mathrm{sgn}(w_t \cdot x_t) \ne y_t]
\]
denote the number of mistakes the WINNOW algorithm makes. Then, for a suitable choice of $\eta$, we have
\[
M_T \le \frac{2 \|x_{1:T}\|_\infty^2 \|w^\star\|_1^2}{\gamma^2} \ln d .
\]
Proof. Let $u^\star = w^\star / \|w^\star\|_1$. Since we assume $w^\star \ge 0$, $u^\star$ is a probability distribution. At all times, the weight
vector $w_t$ maintained by WINNOW is also a probability distribution. Let us measure the progress of the algorithm by
analyzing the relative entropy between these two distributions at time $t$. Accordingly, define
\[
\Psi_t := \sum_{i=1}^d u^\star_i \ln \frac{u^\star_i}{w_{t,i}} .
\]
When there is no mistake, $\Psi_{t+1} = \Psi_t$. On a round when a mistake occurs, we have
\[
\Psi_{t+1} - \Psi_t = \sum_{i=1}^d u^\star_i \ln \frac{w_{t,i}}{w_{t+1,i}}
= \sum_{i=1}^d u^\star_i \ln \frac{Z_t}{\exp(\eta y_t x_{t,i})}
= \ln(Z_t) - \eta y_t \sum_{i=1}^d u^\star_i x_{t,i}
= \ln(Z_t) - \eta y_t (u^\star \cdot x_t)
\le \ln(Z_t) - \eta \gamma / \|w^\star\|_1 , \qquad (1)
\]
where the last inequality follows from the definition of $u^\star$ and Assumption M (which gives $y_t (w^\star \cdot x_t) \ge \gamma$, hence $y_t (u^\star \cdot x_t) \ge \gamma / \|w^\star\|_1$). Let $L = \|x_{1:T}\|_\infty$. Then $y_t x_{t,i} \in [-L, L]$ for all $t, i$. Then we can bound
\[
Z_t = \sum_{i=1}^d w_{t,i} e^{\eta y_t x_{t,i}}
\]
using the convexity of the function $z \mapsto e^{\eta z}$ on the interval $[-L, L]$ as follows.
\[
Z_t \le \sum_{i=1}^d w_{t,i} \left[ \frac{1 + y_t x_{t,i}/L}{2} e^{\eta L} + \frac{1 - y_t x_{t,i}/L}{2} e^{-\eta L} \right]
= \frac{e^{\eta L} + e^{-\eta L}}{2} \sum_{i=1}^d w_{t,i} + \frac{e^{\eta L} - e^{-\eta L}}{2L} \left[ y_t \sum_{i=1}^d w_{t,i} x_{t,i} \right]
= \frac{e^{\eta L} + e^{-\eta L}}{2} + \frac{e^{\eta L} - e^{-\eta L}}{2L} \, y_t (w_t \cdot x_t)
\le \frac{e^{\eta L} + e^{-\eta L}}{2}
\]
because having a mistake implies $y_t (w_t \cdot x_t) \le 0$ and $e^{\eta L} - e^{-\eta L} > 0$. So we have proved
\[
\ln(Z_t) \le \ln\left( \frac{e^{\eta L} + e^{-\eta L}}{2} \right) . \qquad (2)
\]
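The first inequality above is just the chord bound for the convex function $z \mapsto e^{\eta z}$ on $[-L, L]$, applied to each summand with weight $w_{t,i}$. A quick numeric spot-check of that pointwise bound (a sketch with arbitrary illustrative values of $\eta$ and $L$):

import numpy as np

# Spot-check  e^{eta z} <= (1 + z/L)/2 * e^{eta L} + (1 - z/L)/2 * e^{-eta L}  on [-L, L].
eta, L = 0.7, 2.5                         # arbitrary illustrative values
for z in np.linspace(-L, L, 101):
    lhs = np.exp(eta * z)
    rhs = (1 + z / L) / 2 * np.exp(eta * L) + (1 - z / L) / 2 * np.exp(-eta * L)
    assert lhs <= rhs + 1e-12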
Define
\[
C(\eta) := \eta \gamma / \|w^\star\|_1 - \ln\left( \frac{e^{\eta L} + e^{-\eta L}}{2} \right) .
\]
Combining (1) and (2) then gives us
\[
\Psi_{t+1} - \Psi_t \le -C(\eta)\, \mathbf{1}[y_t \ne \mathrm{sgn}(w_t \cdot x_t)] .
\]
Unwinding the recursion gives
\[
\Psi_{T+1} - \Psi_1 \le -C(\eta)\, M_T .
\]
Since relative entropy is always non-negative, $\Psi_{T+1} \ge 0$. Further,
\[
\Psi_1 = \sum_{i=1}^d u^\star_i \ln(d u^\star_i) \le \sum_{i=1}^d u^\star_i \ln d = \ln d ,
\]
which gives us
\[
0 - \ln d \le -C(\eta)\, M_T
\]
and therefore $M_T \le \frac{\ln d}{C(\eta)}$.
Setting
\[
\eta = \frac{1}{2L} \ln\left( \frac{L + \gamma/\|w^\star\|_1}{L - \gamma/\|w^\star\|_1} \right)
\]
to maximize the denominator $C(\eta)$ gives
\[
M_T \le \frac{\ln d}{g\left( \frac{\gamma}{L \|w^\star\|_1} \right)}
\]
where $g(\epsilon) := \frac{1+\epsilon}{2} \ln(1+\epsilon) + \frac{1-\epsilon}{2} \ln(1-\epsilon)$. Finally, noting that $g(\epsilon) \ge \epsilon^2/2$ proves the theorem.
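As a sanity check on the last two steps (again a sketch with arbitrary illustrative values, not part of the notes): the stated $\eta$ should attain $C(\eta) = g(\gamma / (L \|w^\star\|_1))$, and $g(\epsilon) \ge \epsilon^2/2$.

import numpy as np

gamma, L, w_star_l1 = 0.3, 1.0, 2.0       # arbitrary values with gamma < L * ||w*||_1
eps = gamma / (L * w_star_l1)

def C(eta):
    # C(eta) = eta * gamma / ||w*||_1 - ln((e^{eta L} + e^{-eta L}) / 2)
    return eta * gamma / w_star_l1 - np.log((np.exp(eta * L) + np.exp(-eta * L)) / 2)

def g(e):
    return (1 + e) / 2 * np.log(1 + e) + (1 - e) / 2 * np.log(1 - e)

eta_star = 1 / (2 * L) * np.log((L + gamma / w_star_l1) / (L - gamma / w_star_l1))
assert np.isclose(C(eta_star), g(eps))                                # optimal value equals g(eps)
assert all(C(eta_star) >= C(e) for e in np.linspace(0.01, 2.0, 200))  # eta_star maximises C
assert g(eps) >= eps**2 / 2                                           # so M_T <= 2 ln d / eps^2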