Classifiers
x → f → y_est
f(x,w,b) = sign(w . x - b)
(Figure legend: one marker denotes +1, the other denotes -1)
Support Vector Machines: Slide 3
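The decision rule f(x,w,b) = sign(w . x - b) can be sketched in a few lines. This is only an illustration: the function name `linear_classify`, the sample points, and the choice to label a score of exactly 0 as +1 are assumptions, not part of the slides.

```python
import numpy as np

def linear_classify(X, w, b):
    """Apply f(x, w, b) = sign(w . x - b) to each row of X.

    Assumption: a score of exactly 0 is labelled +1 (the slides do not say).
    """
    scores = X @ w - b
    return np.where(scores >= 0, 1, -1)

# Hypothetical example data: three 2-D points and an arbitrary (w, b).
X = np.array([[2.0, 0.0],
              [0.0, 0.0],
              [-1.0, 3.0]])
w = np.array([1.0, 1.0])
b = 1.0

labels = linear_classify(X, w, b)
```

Many different (w, b) pairs can produce the same labels on a training set, which is the point of the question that follows.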
Any of these would be fine... but which is best?
[Figure labels: Minus-Plane, Boundary]
How do we compute M in terms of w and b?
• Plus-plane = { x : w . x + b = +1 }
• Minus-plane = { x : w . x + b = -1 }
Claim: The vector w is perpendicular to the Plus Plane.
Why? Let u and v be two vectors on the Plus Plane. What is w . (u - v)?
And so of course the vector w is also perpendicular to the Minus Plane.
Support Vector Machines: Slide 14
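A quick numerical check of the claim, as a sketch only: the helper `point_on_plane`, the particular w and b, and the random seed are all hypothetical choices made for illustration. Two points on the same plane satisfy the same equation, so subtracting the equations should give w . (u - v) = 0.

```python
import numpy as np

def point_on_plane(w, b, level, rng):
    """Return a random x with w . x + b == level.

    Assumption: the last component of w is non-zero, so we can pick the
    other coordinates freely and solve for the last one.
    """
    x = rng.normal(size=w.shape)
    x[-1] = (level - b - w[:-1] @ x[:-1]) / w[-1]
    return x

# Hypothetical weight vector and offset.
w = np.array([2.0, -1.0, 0.5])
b = 0.3
rng = np.random.default_rng(0)

# Two points on the Plus Plane, two on the Minus Plane.
u, v = (point_on_plane(w, b, +1.0, rng) for _ in range(2))
p, q = (point_on_plane(w, b, -1.0, rng) for _ in range(2))

dot_plus = float(w @ (u - v))   # w against a direction lying in the Plus Plane
dot_minus = float(w @ (p - q))  # w against a direction lying in the Minus Plane
```

Both dot products come out as zero up to floating-point rounding, which is what perpendicularity to each plane means.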
Classifier
M = Margin Width
[Figure labels: x+, x-]
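One way to answer the question of how M depends on w and b is sketched below. It assumes x- is any point on the Minus Plane and x+ is the closest Plus Plane point to it, so that x+ = x- + λw for some scalar λ (because w is perpendicular to both planes). The symbol λ is introduced here for the derivation only.

```latex
\begin{align*}
w \cdot x^{+} + b &= +1, \qquad w \cdot x^{-} + b = -1, \qquad x^{+} = x^{-} + \lambda w \\
w \cdot (x^{-} + \lambda w) + b &= +1
  \;\Rightarrow\; -1 + \lambda\,(w \cdot w) = +1
  \;\Rightarrow\; \lambda = \frac{2}{w \cdot w} \\
M = \lVert x^{+} - x^{-} \rVert &= \lambda \lVert w \rVert = \frac{2}{\sqrt{w \cdot w}}
\end{align*}
```

Under these assumptions the margin width depends only on w, not on b.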
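\[
\begin{aligned}
K(x_i, x_j) &= (1 + x_i^T x_j)^2 \\
&= 1 + x_{i1}^2 x_{j1}^2 + 2\,x_{i1} x_{j1} x_{i2} x_{j2} + x_{i2}^2 x_{j2}^2 + 2\,x_{i1} x_{j1} + 2\,x_{i2} x_{j2} \\
&= [\,1 \;\; x_{i1}^2 \;\; \sqrt{2}\,x_{i1} x_{i2} \;\; x_{i2}^2 \;\; \sqrt{2}\,x_{i1} \;\; \sqrt{2}\,x_{i2}\,]^T \, [\,1 \;\; x_{j1}^2 \;\; \sqrt{2}\,x_{j1} x_{j2} \;\; x_{j2}^2 \;\; \sqrt{2}\,x_{j1} \;\; \sqrt{2}\,x_{j2}\,] \\
&= \varphi(x_i)^T \varphi(x_j),
\end{aligned}
\]
where, for a 2-dimensional vector x = [x_1 x_2],
\[
\varphi(x) = [\,1 \;\; x_1^2 \;\; \sqrt{2}\,x_1 x_2 \;\; x_2^2 \;\; \sqrt{2}\,x_1 \;\; \sqrt{2}\,x_2\,]
\]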
Thus, a kernel function implicitly maps data to a high-dimensional space
(without the need to compute each φ(x) explicitly).
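The identity can be checked numerically. The sketch below compares the quadratic kernel evaluated directly in 2 dimensions with the dot product of the explicit 6-dimensional feature vectors; the function names and the two sample points are hypothetical, chosen only for illustration.

```python
import numpy as np

def quadratic_kernel(xi, xj):
    """K(xi, xj) = (1 + xi^T xj)^2, computed in the original 2-D space."""
    return (1.0 + xi @ xj) ** 2

def phi(x):
    """Explicit feature map for the quadratic kernel on 2-D inputs."""
    x1, x2 = x
    r2 = np.sqrt(2.0)
    return np.array([1.0, x1 ** 2, r2 * x1 * x2, x2 ** 2, r2 * x1, r2 * x2])

# Hypothetical sample points.
xi = np.array([0.5, -1.5])
xj = np.array([2.0, 1.0])

k_implicit = quadratic_kernel(xi, xj)  # one 2-D dot product, then a square
k_explicit = phi(xi) @ phi(xj)         # a 6-D dot product after mapping
```

Both routes give the same number, but the kernel route never builds the 6-dimensional vectors.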
Support Vector Machines: Slide 16
What Functions are Kernels?
For some functions K(xi,xj), checking that K(xi,xj) = φ(xi)^T φ(xj) can be cumbersome.
Mercer’s theorem:
Every semi-positive definite symmetric function is a kernel
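Semi-positive definite symmetric functions correspond to a semi-positive definite symmetric Gram matrix:
\[
K = \begin{bmatrix}
K(x_1,x_1) & K(x_1,x_2) & K(x_1,x_3) & \dots & K(x_1,x_n) \\
K(x_2,x_1) & K(x_2,x_2) & K(x_2,x_3) & \dots & K(x_2,x_n) \\
\dots & \dots & \dots & \dots & \dots \\
K(x_n,x_1) & K(x_n,x_2) & K(x_n,x_3) & \dots & K(x_n,x_n)
\end{bmatrix}
\]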
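On a finite sample, the Gram-matrix condition can be tested directly. The sketch below builds the Gram matrix for a set of points and checks symmetry and non-negative eigenvalues. The helper names, the tolerance, the random sample, and the counterexample function are assumptions for illustration. Passing this check on one sample does not prove a function is a kernel, but failing it shows that it is not.

```python
import numpy as np

def gram_matrix(X, kernel):
    """Return the n x n matrix with entries K[i, j] = kernel(X[i], X[j])."""
    n = len(X)
    K = np.empty((n, n))
    for i in range(n):
        for j in range(n):
            K[i, j] = kernel(X[i], X[j])
    return K

def is_symmetric_psd(K, tol=1e-8):
    """True if K is symmetric and its smallest eigenvalue is >= -tol."""
    if not np.allclose(K, K.T):
        return False
    return bool(np.linalg.eigvalsh(K).min() >= -tol)

def quadratic_kernel(xi, xj):
    return (1.0 + xi @ xj) ** 2

def neg_sq_distance(xi, xj):
    # Hypothetical counterexample: symmetric, but its Gram matrix has a
    # zero diagonal and so cannot be positive semi-definite unless it is
    # identically zero.
    return -float(np.sum((xi - xj) ** 2))

rng = np.random.default_rng(0)
X = rng.normal(size=(5, 2))  # five hypothetical 2-D points

quad_ok = is_symmetric_psd(gram_matrix(X, quadratic_kernel))
neg_ok = is_symmetric_psd(gram_matrix(X, neg_sq_distance))
```

The eigenvalue check uses `numpy.linalg.eigvalsh`, which is intended for symmetric matrices; the small tolerance absorbs floating-point rounding.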
K(xi,xj) = xi^T xj
Mapping Φ: x → φ(x), where φ(x) is x itself