Lecture 3

1
Data Analytics
Wiki Supervised
Learning Definition
Supervised learning is the Data miningtask of

inferring a function from labeled training data.The
training data consist of a set of training examples. In
supervised learning, each example is a pair consisting
of an input object (typically a vector) and a desired
output value (also called thesupervisory signal).
A supervised learning algorithm analyzes the

training data and produces an inferred function,
which can be used for mapping new examples. An
optimal scenario will allow for the algorithm to
correctly determine the class labels for unseen
instances. This requires the learning algorithm to
generalize from the training data to unseen situations
in a reasonable way.
Wiki Unsupervised
Learning Definition
In Data mining, the problem of

unsupervised learning is that of
trying to find hidden structure in
unlabeled data. Since the examples
given to the learner are unlabeled,
there is no error or reward signal to
evaluate a potential solution.
Supervised learning
From a theoretical point of view, supervised and
unsupervised learning differ only in the causal
structure of the model. In supervised learning, the
model defines the effect one set of observations,
called inputs, has on another set of observations,
called outputs. In other words, the inputs are
assumed to be at the beginning and outputs at the
end of the causal chain. The models can include
mediating variables between the inputs and
outputs.
Unsupervised learning,
All the observations are assumed to be caused by
latent variables, that is, the observations are
assumed to be at the end of the causal chain. In
practice, models for supervised learning often
leave the probability for inputs undefined. This
model is not needed as long as the inputs are
available, but if some of the input values are
missing, it is not possible to infer anything about
the outputs. If the inputs are also modelled, then
missing inputs cause no problem since they can
be considered latent variables as in unsupervised
learning.
Supervised Vs
Unsuspervised
Example:
suppose
you had a basket and it is

fulled with some different kinds of
fruits, your task is to arrange them as
groups.
For
understanding let me clear the

names of the fruits in our basket.
We
havefour types of fruits. They

are: apple, banana, grape and
cherry.
Supervised Learning :
You already learnfrom your previous work about

the physical characters of fruits.
So arranging the same type of fruits at one place

is easy now.
Your previous work is called as training data in

data mining.
so you already learn the things from your train

data, this is because of response variable.
Response variable mean just a decision

variable.
You can observe response variable below (FRUIT

NAME) .
NO.
SIZE
Big
COLOR
SHAPE
FRUIT NAME
Red
Rounded
shape with a
Apple
depression
at the top
Small
Red
Heartshaped to
nearly
globular
Big
Green
Long curving
Banana
cylinder
Green
Round to
oval,Bunch
shape
Cylindrical
Small
Cherry
Grape
Suppose
1
0
you have taken an new fruit from the basket then you will
see the size , color and shape of that particular fruit.
If size is Big , color is Red , shape is rounded shape with a
depression at the top, you will conform the fruit name as apple and
you will put in apple group.
Likewise for other fruits also.
Job of groping fruits was done and happy ending.
You can observe in the table that a column was labeled as FRUIT
NAME this is called as response variable.
If you learn the thing before from training data and then applying
that knowledge to the test data(for new fruit), This type of learning
is called as Supervised Learning.
Classification come under Supervised learning.
UnSupervised
Learning :
Suppose you had a basket and it is fulled with some

different typesfruits, your task is to arrange them as
groups.
This time you dont know any thing about that

fruits,honestly saying this is the first time you have
seen them.
so how will you arrange them.
What will you do first???
You will take a fruit and you will arrange them by

considering physical character of that particular fruit.
suppose you have considered color.
Then you will arrange them on considering base

condition as color.
1
1
UnSupervised
Learning
:
Then the groups will be some thing like this.
RED
COLOR GROUP: apples & cherry fruits.
GREEN
so
1
2
COLOR GROUP: bananas & grapes.
now you will take another physical character such assize .
RED
COLOR AND BIG SIZE: apple.
RED
COLOR AND SMALL SIZE: cherry fruits.
GREEN
COLOR AND BIG SIZE: bananas.
GREEN
COLOR AND SMALL SIZE: grapes.
job done happy ending.

Here
you didnt know learn any thing before ,means no train data and no
response variable.
This
type of learning is know unsupervised learning.
Clustering comes under unsupervised learning.
Supervised Vs
UnSupervised Learning
With unsupervised learning it is possible to learn larger

and more complex models than with supervised
learning. This is because in supervised learning one is
trying to find the connection between two sets of
observations. The difficulty of the learning task
increases exponentially in the number of steps
between the two sets and that is why supervised
learning cannot, in practice, learn models with deep
hierarchies.
In unsupervised learning, the learning can proceed

hierarchically from the observations into ever more
abstract levels of representation. Each additional
hierarchy needs to learn only one step and therefore
the learning time increases (approximately) linearly in
the number of levels in the model hierarchy.
1
3
Supervised Vs
If the causal relation between the input and

output observations is complex -- in a sense
there is a large causal gap -- it is often easier to
bridge the gap using unsupervised learning
instead of supervised learning. This is depicted
in figure3. Instead of finding the causal
pathway from inputs to outputs, one starts
building the model upwards from both sets of
observations in the hope that in higher levels of
abstraction the gap is easier to bridge. Notice
also that the input and output observations are
in symmetrical positions in the model.
1
4
Supervised Vs
1
5
Unsupervised learning can be used for bridging the causal gap

between input and output observations. The latent variables in
the higher levels of abstraction are the causes for both sets of
observations and mediate the dependence between inputs and
outputs.
Supervised Vs
1
6

Lecture 3

Caricato da

Informazioni sul documento

Titolo originale

Copyright

Formati disponibili

Condividi questo documento

Condividi o incorpora il documento

Opzioni di condivisione

Hai trovato utile questo documento?

Questo contenuto è inappropriato?

Copyright:

Formati disponibili

Lecture 3

Caricato da

Copyright:

Formati disponibili

1

Supervised learning is the Data miningtask of

A supervised learning algorithm analyzes the

In Data mining, the problem of

you had a basket and it is

understanding let me clear the

havefour types of fruits. They

You already learnfrom your previous work about

So arranging the same type of fruits at one place

Your previous work is called as training data in

so you already learn the things from your train

Response variable mean just a decision

You can observe response variable below (FRUIT

Suppose you had a basket and it is fulled with some

This time you dont know any thing about that

so how will you arrange them.

What will you do first???

You will take a fruit and you will arrange them by

Then you will arrange them on considering base

COLOR GROUP: apples & cherry fruits.

COLOR GROUP: bananas & grapes.

now you will take another physical character such assize .

COLOR AND BIG SIZE: apple.

COLOR AND SMALL SIZE: cherry fruits.

COLOR AND BIG SIZE: bananas.

COLOR AND SMALL SIZE: grapes.

job done happy ending.

type of learning is know unsupervised learning.

Clustering comes under unsupervised learning.

With unsupervised learning it is possible to learn larger

In unsupervised learning, the learning can proceed

If the causal relation between the input and

Unsupervised learning can be used for bridging the causal gap

Potrebbero piacerti anche