
Income Lot_Size Ownership

60 18.4 Owner
85.5 16.8 Owner
64.8 21.6 Owner
61.5 20.8 Owner
87 23.6 Owner
110.1 19.2 Owner
108 17.6 Owner
82.8 22.4 Owner
69 20 Owner
93 20.8 Owner
51 22 Owner
81 20 Owner
75 19.6 Nonowner
52.8 20.8 Nonowner
64.8 17.2 Nonowner
43.2 20.4 Nonowner
84 17.6 Nonowner
49.2 17.6 Nonowner
59.4 16 Nonowner
66 18.4 Nonowner
47.4 16.4 Nonowner
33 18.8 Nonowner
51 14 Nonowner
63 14.8 Nonowner
Split Owner Nonowner Gini Entropy Gini_Gain Entropy_Gain
Before_Split 12 12 0.5 1 0.141 0.221
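
The "before split" impurity values in the row above can be reproduced with a few lines of Python. This is a minimal sketch: the function names `gini` and `entropy` are illustrative and do not come from the sheet, and the only inputs are the class counts (12 owners, 12 non-owners).

```python
from math import log2

def gini(counts):
    """Gini impurity 1 - sum(p_i^2) for a list of class counts."""
    total = sum(counts)
    if total == 0:
        return 0.0
    return 1.0 - sum((c / total) ** 2 for c in counts)

def entropy(counts):
    """Shannon entropy in bits; empty classes are skipped (0 * log2(0) taken as 0)."""
    total = sum(counts)
    if total == 0:
        return 0.0
    return -sum((c / total) * log2(c / total) for c in counts if c > 0)

root = [12, 12]  # owners, non-owners before any split
print("Gini:", gini(root))
print("Entropy:", entropy(root))
```

With an even 12/12 split both measures are at their two-class maximum, 0.5 for Gini and 1 for entropy.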

Split on Income (left: Income <= threshold, right: Income > threshold). Total observations: 24

Income Left_Owner Left_Nonowner Gini_Left Right_Owner Right_Nonowner Gini_Right Gini_Score Entropy_Left Entropy_Right Entropy_Total
33 0 1 0.000 12 11 0.499 0.478 Err:502 0.999 Err:502
43.2 0 2 0.000 12 10 0.496 0.455 Err:502 0.994 Err:502
47.4 0 3 0.000 12 9 0.490 0.429 Err:502 0.985 Err:502
49.2 0 4 0.000 12 8 0.480 0.400 Err:502 0.971 Err:502
51 1 5 0.278 11 7 0.475 0.426 0.650 0.964 0.886
52.8 1 6 0.245 11 6 0.457 0.395 0.592 0.937 0.836
59.4 1 7 0.219 11 5 0.430 0.359 0.544 0.896 0.779
60 2 7 0.346 10 5 0.444 0.407 0.764 0.918 0.861
61.5 3 7 0.420 9 5 0.459 0.443 0.881 0.940 0.916
63 3 8 0.397 9 4 0.426 0.413 0.845 0.890 0.870
64.8 4 9 0.426 8 3 0.397 0.413 0.890 0.845 0.870
66 4 10 0.408 8 2 0.320 0.371 0.863 0.722 0.804
69 5 10 0.444 7 2 0.346 0.407 0.918 0.764 0.861
75 5 11 0.430 7 1 0.219 0.359 0.896 0.544 0.779
81 6 11 0.457 6 1 0.245 0.395 0.937 0.592 0.836
82.8 7 11 0.475 5 1 0.278 0.426 0.964 0.650 0.886
84 7 12 0.465 5 0 0.000 0.368 0.949 Err:502 Err:502
85.5 8 12 0.480 4 0 0.000 0.400 0.971 Err:502 Err:502
87 9 12 0.490 3 0 0.000 0.429 0.985 Err:502 Err:502
93 10 12 0.496 2 0 0.000 0.455 0.994 Err:502 Err:502
108 11 12 0.499 1 0 0.000 0.478 0.999 Err:502 Err:502
110.1 12 12 0.500 0 0 #DIV/0! #DIV/0! 1.000 #DIV/0! #DIV/0!

Minimum Gini_Score (Income): 0.359

Split on Lot_Size (left: Lot_Size <= threshold, right: Lot_Size > threshold)

Split Owner Nonowner Gini Entropy Gini_Gain Entropy_Gain
Before_Split 12 12 0.5 1 0.093 0.139

Lot_Size Left_Owner Left_Nonowner Gini_Left Right_Owner Right_Nonowner Gini_Right Gini_Score Entropy_Left Entropy_Right Entropy_Total
14 0 1 0.000 12 11 0.499 0.478 Err:502 0.999 Err:502
14.8 0 2 0.000 12 10 0.496 0.455 Err:502 0.994 Err:502
16 0 3 0.000 12 9 0.490 0.429 Err:502 0.985 Err:502
16.4 0 4 0.000 12 8 0.480 0.400 Err:502 0.971 Err:502
16.8 1 4 0.320 11 8 0.488 0.453 0.722 0.982 0.928
17.2 1 5 0.278 11 7 0.475 0.426 0.650 0.964 0.886
17.6 2 7 0.346 10 5 0.444 0.407 0.764 0.918 0.861
18.4 3 8 0.397 9 4 0.426 0.413 0.845 0.890 0.870
18.8 3 9 0.375 9 3 0.375 0.375 0.811 0.811 0.811
19.2 4 9 0.426 8 3 0.397 0.413 0.890 0.845 0.870
19.6 4 10 0.408 8 2 0.320 0.371 0.863 0.722 0.804
20 6 10 0.469 6 2 0.375 0.438 0.954 0.811 0.907
20.4 6 11 0.457 6 1 0.245 0.395 0.937 0.592 0.836
20.8 8 12 0.480 4 0 0.000 0.400 0.971 Err:502 Err:502
21.6 9 12 0.490 3 0 0.000 0.429 0.985 Err:502 Err:502
22 10 12 0.496 2 0 0.000 0.455 0.994 Err:502 Err:502
22.4 11 12 0.499 1 0 0.000 0.478 0.999 Err:502 Err:502
23.6 12 12 0.500 0 0 #DIV/0! #DIV/0! 1.000 #DIV/0! #DIV/0!
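
The threshold scan tabulated above can be sketched in Python as follows. The data rows are copied from the Income / Lot_Size / Ownership table at the top of the sheet; the helper names (`gini`, `entropy`, `weighted`, `scan`) are illustrative assumptions, and the split rule "left is value <= threshold" is inferred from the cumulative counts in the table.

```python
from math import log2

# (Income, Lot_Size) pairs copied from the ownership table
OWNERS = [(60, 18.4), (85.5, 16.8), (64.8, 21.6), (61.5, 20.8), (87, 23.6), (110.1, 19.2),
          (108, 17.6), (82.8, 22.4), (69, 20), (93, 20.8), (51, 22), (81, 20)]
NONOWNERS = [(75, 19.6), (52.8, 20.8), (64.8, 17.2), (43.2, 20.4), (84, 17.6), (49.2, 17.6),
             (59.4, 16), (66, 18.4), (47.4, 16.4), (33, 18.8), (51, 14), (63, 14.8)]

def gini(counts):
    total = sum(counts)
    return 0.0 if total == 0 else 1.0 - sum((c / total) ** 2 for c in counts)

def entropy(counts):
    # Empty classes are skipped, i.e. 0 * log2(0) is treated as 0.
    total = sum(counts)
    if total == 0:
        return 0.0
    return -sum((c / total) * log2(c / total) for c in counts if c > 0)

def weighted(impurity, left, right):
    """Size-weighted average impurity of the two child nodes."""
    n = sum(left) + sum(right)
    return sum(left) / n * impurity(left) + sum(right) / n * impurity(right)

def scan(col):
    """Yield (threshold, weighted Gini, weighted entropy) for each distinct value."""
    for t in sorted({row[col] for row in OWNERS + NONOWNERS}):
        left = [sum(r[col] <= t for r in OWNERS), sum(r[col] <= t for r in NONOWNERS)]
        right = [len(OWNERS) - left[0], len(NONOWNERS) - left[1]]
        yield t, weighted(gini, left, right), weighted(entropy, left, right)

for name, col in (("Income", 0), ("Lot_Size", 1)):
    t, g, e = min(scan(col), key=lambda row: row[1])  # lowest weighted Gini
    print(f"{name}: threshold {t}, Gini score {g:.3f}, entropy total {e:.3f}")
```

Skipping empty classes in `entropy` is the usual way to avoid the `Err:502` cells, which arise when a child node is pure and the sheet takes the logarithm of zero. Under that convention the pure-side rows become computable. In this sketch the Income split at 84 comes out at roughly 0.752 weighted entropy, so those rows are worth including when searching by entropy.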

9999.000 0.000
Purchase by Income

Income_Gr Gender Rural/Urb Purchased
Low M U No
Low F U No
Low F R No
Middle M U No
Middle M R No
Middle F R Yes
Middle F U Yes
High M U Yes
High F R Yes
High F U Yes

Purchased counts, Gini and entropy by attribute value (10 records in total)

Attribute Value Yes No Gini Entropy
Income Low 0 3 0 0
Income Middle 2 2 0.5 1
Income High 3 0 0 0
Gender M 1 3 0.375 0.811278
Gender F 4 2 0.444444 0.918296
Geography U 3 3 0.5 1
Geography R 2 2 0.5 1

Weighted scores and gain by attribute

Attribute Weighted_Gini Weighted_Entropy Entropy_Gain
Income 0.2 0.4 0.6
Gender 0.416667 0.875489 0.124511
Geography 0.5 1 0
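
For categorical attributes the same impurity measures are applied once per attribute value and then averaged by group size. The sketch below recomputes the per-attribute scores from the Yes/No counts listed above; the dictionary layout and the name `attribute_summary` are illustrative assumptions, not part of the sheet.

```python
from math import log2

# (Yes, No) purchase counts per attribute value, taken from the count tables
COUNTS = {
    "Income": {"Low": (0, 3), "Middle": (2, 2), "High": (3, 0)},
    "Gender": {"M": (1, 3), "F": (4, 2)},
    "Geography": {"U": (3, 3), "R": (2, 2)},
}

def gini(counts):
    total = sum(counts)
    return 0.0 if total == 0 else 1.0 - sum((c / total) ** 2 for c in counts)

def entropy(counts):
    # Empty classes are skipped, so a pure group has entropy 0.
    total = sum(counts)
    if total == 0:
        return 0.0
    return -sum((c / total) * log2(c / total) for c in counts if c > 0)

def attribute_summary(groups):
    """Return (weighted Gini, weighted entropy, entropy gain) for one attribute."""
    n = sum(sum(c) for c in groups.values())
    parent = [sum(c[0] for c in groups.values()), sum(c[1] for c in groups.values())]
    w_gini = sum(sum(c) / n * gini(c) for c in groups.values())
    w_entropy = sum(sum(c) / n * entropy(c) for c in groups.values())
    return w_gini, w_entropy, entropy(parent) - w_entropy

for name, groups in COUNTS.items():
    w_gini, w_entropy, gain = attribute_summary(groups)
    print(f"{name}: weighted Gini {w_gini:.6f}, weighted entropy {w_entropy:.6f}, gain {gain:.6f}")
```

Income gives the largest entropy gain (0.6) because its Low and High groups are pure, which makes it the natural first split under this criterion.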
No Subject Income_Group Gender Rural/Urban Purchased Income_Code Gender_Code Urban_Code Distance
1 Ajay Low M U No 1 1 1 2.24
2 Amit Low F U No 1 0 1 2.45
3 Rita Low F R No 1 0 0 2.24
4 Sumitra Middle M U No 2 1 1 1.41
5 Ravi Middle M R No 2 1 0 1.00
6 Kiran Middle F R Yes 2 0 0 1.41
7 Jai Middle F U Yes 2 0 1 1.73
8 Anselm High M U Yes 3 1 1 1.00
9 Farhan High F R Yes 3 0 0 1.00
10 Jaideep High F U Yes 3 0 1 1.41

Bhagirath High M R ? 3 1 0

Distance is the Euclidean distance from each subject to Bhagirath on the three coded columns.

Smallest distances

Rank Distance Subject Purchased
1 1 Ravi No
2 1 Anselm Yes
3 1 Farhan Yes

Therefore Bhagirath: Yes
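
The nearest-neighbour vote for Bhagirath can be reproduced with the short sketch below. The coding (Low/Middle/High as 1/2/3, M as 1, U as 1) follows the coded columns of the sheet; `knn_predict` and the choice k = 3 are illustrative, with k matching the three smallest distances listed.

```python
from collections import Counter
from math import dist

# (subject, (income code, gender code, urban code), purchased)
TRAIN = [
    ("Ajay", (1, 1, 1), "No"), ("Amit", (1, 0, 1), "No"), ("Rita", (1, 0, 0), "No"),
    ("Sumitra", (2, 1, 1), "No"), ("Ravi", (2, 1, 0), "No"), ("Kiran", (2, 0, 0), "Yes"),
    ("Jai", (2, 0, 1), "Yes"), ("Anselm", (3, 1, 1), "Yes"), ("Farhan", (3, 0, 0), "Yes"),
    ("Jaideep", (3, 0, 1), "Yes"),
]

def knn_predict(query, k=3):
    """Return the majority label among the k nearest rows, plus those rows."""
    ranked = sorted(TRAIN, key=lambda row: dist(row[1], query))
    nearest = ranked[:k]
    votes = Counter(label for _, _, label in nearest)
    return votes.most_common(1)[0][0], nearest

bhagirath = (3, 1, 0)  # High, M, R
label, nearest = knn_predict(bhagirath)
for name, point, purchased in nearest:
    print(name, round(dist(point, bhagirath), 2), purchased)
print("Prediction for Bhagirath:", label)
```

Because the income code spans 1 to 3 while the other two codes span 0 to 1, income differences weigh more heavily in the distance; rescaling the codes would be a reasonable variation to try.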
No Income Lot_Size Ownership
1 60 18.4 Owner
2 85.5 16.8 Owner
3 64.8 21.6 Owner
4 61.5 20.8 Owner
5 87 23.6 Owner
6 110.1 19.2 Owner
7 108 17.6 Owner
8 82.8 22.4 Owner
9 69 20 Owner
10 93 20.8 Owner
11 51 22 Owner
12 81 20 Owner
13 75 19.6 Nonowner
14 52.8 20.8 Nonowner
15 64.8 17.2 Nonowner
16 43.2 20.4 Nonowner
17 84 17.6 Nonowner
18 49.2 17.6 Nonowner
19 59.4 16 Nonowner
20 66 18.4 Nonowner
21 47.4 16.4 Nonowner
22 33 18.8 Nonowner
23 51 14 Nonowner
24 63 14.8 Nonowner
25 60 23 ?

1 2 3 4 5 6 7
1 0
2 25.55015 0
3 5.768882 21.24924 0
4 2.830194 24.33105 3.395585 0
5 27.49618 6.963476 22.28991 25.65326 0
6 50.10639 24.7168 45.36353 48.62633 23.51531 0
7 48.00667 22.51422 43.38479 46.60998 21.84033 2.640076 0
8 23.14822 6.216912 18.01777 21.36001 4.368066 27.48691 25.65307
9 9.141116 16.80744 4.494441 7.542546 18.35647 41.10779 39.07378
10 33.08716 8.5 28.21135 31.5 6.621178 17.17469 15.33754
11 9.693297 34.88968 13.8058 10.56835 36.03554 59.16629 57.16957
12 21.06086 5.521775 16.27882 19.5164 6.997142 29.11099 27.10646
13 15.04792 10.86692 10.39423 13.55323 12.64911 35.10228 33.06055
14 7.589466 32.94374 12.02664 8.7 34.31443 57.32233 55.29268
15 4.947727 20.70386 4.4 4.883646 23.10411 45.34413 43.20185
16 16.91863 42.45292 21.63331 18.30437 43.91674 66.91076 64.86047
17 24.01333 1.7 19.61224 22.72642 6.708204 26.149 24
18 10.82959 36.30881 16.10466 12.70945 38.27323 60.92101 58.8
19 2.473863 26.11226 7.77946 5.239275 28.62726 50.80089 48.62633
20 6 19.56553 3.417601 5.1 21.63423 44.10726 42.00762
21 12.75774 38.1021 18.1604 14.77058 40.24922 62.76249 60.61188
22 27.00296 52.53808 31.92303 28.57009 54.21291 77.10104 75.0096
23 10.01798 34.61344 15.75436 12.5096 37.25802 59.32832 57.11357
24 4.68615 22.58871 7.034202 6.184658 25.56247 47.30507 45.08703
25 4.6 26.2429 5 2.662705 27.00667 50.24391 48.30279
[Chart: vertical axis 15 to 25, horizontal axis 20 to 110]

8 9 10 11 12 13 14 15 16 17

25.65307 39.07378 15.33754 57.16957 27.10646 33.06055 55.29268 43.20185 64.86047 24


0 14.00714 10.32473 31.80252 3 8.28734 30.04264 18.73606 39.65047 4.947727
14.00714 0 24.01333 18.11077 12 6.013319 16.21974 5.047772 25.8031 15.19079
10.32473 24.01333 0 42.01714 12.02664 18.03996 40.2 28.42886 49.80161 9.551963
31.80252 18.11077 42.01714 0 30.06659 24.1197 2.163331 14.61095 7.962412 33.29204
3 12 12.02664 30.06659 0 6.013319 28.21135 16.44019 37.80212 3.841875
8.28734 6.013319 18.03996 24.1197 6.013319 0 22.23241 10.47855 31.81006 9.219544
30.04264 16.21974 40.2 2.163331 28.21135 22.23241 0 12.52837 9.60833 31.36367
18.73606 5.047772 28.42886 14.61095 16.44019 10.47855 12.52837 0 21.83575 19.20417
39.65047 25.8031 49.80161 7.962412 37.80212 31.81006 9.60833 21.83575 0 40.89597
4.947727 15.19079 9.551963 33.29204 3.841875 9.219544 31.36367 19.20417 40.89597 0
33.94113 19.94492 43.91674 4.753946 31.89044 25.8774 4.816638 15.60513 6.621178 34.8
24.25943 10.4 33.94113 10.32279 21.96725 16.01 8.160882 5.531727 16.7869 24.65198
17.26963 3.4 27.10646 15.42595 15.08509 9.079648 13.41641 1.697056 22.88755 18.01777
35.90487 21.89795 45.81179 6.657327 33.79231 27.78489 6.96563 17.41838 5.8 36.61967
49.92995 36.01999 60.03332 18.28223 48.015 42.00762 19.90075 31.84023 10.32473 51.01412
32.89073 18.97367 42.54692 8 30.59412 24.64467 7.034202 14.16616 10.0896 33.19578
21.20849 7.939773 30.59412 13.99428 18.73606 12.9244 11.83385 3 20.57669 21.18584
22.80789 9.486833 33.07325 9.055385 21.2132 15.38051 7.528612 7.528612 17 24.6

Three smallest distances from observation 25

Rank 1 2 3
Distance 2.662705 4.6 5
Observation 4 1 3

18 19 20 21 22 23 24 25

58.8 48.62633 42.00762 60.61188 75.0096 57.11357 45.08703 48.30279


33.94113 24.25943 17.26963 35.90487 49.92995 32.89073 21.20849 22.80789
19.94492 10.4 3.4 21.89795 36.01999 18.97367 7.939773 9.486833
43.91674 33.94113 27.10646 45.81179 60.03332 42.54692 30.59412 33.07325
4.753946 10.32279 15.42595 6.657327 18.28223 8 13.99428 9.055385
31.89044 21.96725 15.08509 33.79231 48.015 30.59412 18.73606 21.2132
25.8774 16.01 9.079648 27.78489 42.00762 24.64467 12.9244 15.38051
4.816638 8.160882 13.41641 6.96563 19.90075 7.034202 11.83385 7.528612
15.60513 5.531727 1.697056 17.41838 31.84023 14.16616 3 7.528612
6.621178 16.7869 22.88755 5.8 10.32473 10.0896 20.57669 17
34.8 24.65198 18.01777 36.61967 51.01412 33.19578 21.18584 24.6
0 10.32473 16.81904 2.163331 16.24438 4.024922 14.08119 12.07477
10.32473 0 7.02282 12.00666 26.54807 8.634813 3.794733 7.025667
16.81904 7.02282 0 18.70722 33.00242 15.63202 4.68615 7.560423
2.163331 12.00666 18.70722 0 14.59863 4.326662 15.68184 14.22392
16.24438 26.54807 33.00242 14.59863 0 18.62901 30.26549 27.32471
4.024922 8.634813 15.63202 4.326662 18.62901 0 12.02664 12.72792
14.08119 3.794733 4.68615 15.68184 30.26549 12.02664 0 8.731552
12.07477 7.025667 7.560423 14.22392 27.32471 12.72792 8.731552 0
2.662705
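
The pairwise distance matrix above and the lookup of the nearest neighbours of observation 25 can be sketched as follows. The points are copied from the numbered Income / Lot_Size table; plain Euclidean distance on the unscaled values is assumed because it reproduces the tabulated distances (for example 25.55015 between observations 1 and 2). The names `POINTS`, `LABELS` and `D` are illustrative.

```python
from collections import Counter
from math import dist

# Observations 1..24 (labelled) followed by observation 25 (the query)
POINTS = [(60, 18.4), (85.5, 16.8), (64.8, 21.6), (61.5, 20.8), (87, 23.6), (110.1, 19.2),
          (108, 17.6), (82.8, 22.4), (69, 20), (93, 20.8), (51, 22), (81, 20),
          (75, 19.6), (52.8, 20.8), (64.8, 17.2), (43.2, 20.4), (84, 17.6), (49.2, 17.6),
          (59.4, 16), (66, 18.4), (47.4, 16.4), (33, 18.8), (51, 14), (63, 14.8),
          (60, 23)]
LABELS = ["Owner"] * 12 + ["Nonowner"] * 12

# Full symmetric 25 x 25 matrix of Euclidean distances
D = [[dist(p, q) for q in POINTS] for p in POINTS]

query = 24  # zero-based index of observation 25
nearest = sorted(range(len(LABELS)), key=lambda i: D[query][i])[:3]
for i in nearest:
    print("observation", i + 1, "distance", round(D[query][i], 6), LABELS[i])

votes = Counter(LABELS[i] for i in nearest)
print("Prediction for observation 25:", votes.most_common(1)[0][0])
```

Income is measured on a much wider range than Lot_Size, so it dominates the unscaled distance; standardising both columns first is a common variation.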
No Subject Income_Group Gender Rural/Urban Purchased Income_Code Gender_Code Urban_Code
1 Ajay Low M U No 1 1 1
2 Amit Low F U No 1 0 1
3 Rita Low F R No 1 0 0
4 Sumitra Middle M U No 2 1 1
5 Ravi Middle M R No 2 1 0
6 Kiran Middle F R Yes 2 0 0
7 Jai Middle F U Yes 2 0 1
8 Anselm High M U Yes 3 1 1
9 Farhan High F R Yes 3 0 0
10 Jaideep High F U Yes 3 0 1

11 Bhagirath High M R ? 3 1 0
12 Aman Middle F U ? 2 0 1

Purchased by Income_Group
Income_Group Yes No Total P(value|Yes) P(value|No)
Low 0 3 3 0 0.6
Middle 2 2 4 0.4 0.4
High 3 0 3 0.6 0
Total 5 5 10

Purchased by Gender
Gender Yes No Total P(value|Yes) P(value|No)
M 1 3 4 0.2 0.6
F 4 2 6 0.8 0.4
Total 5 5 10

Purchased by Rural/Urban
Rural/Urban Yes No Total P(value|Yes) P(value|No)
U 3 3 6 0.6 0.6
R 2 2 4 0.4 0.4
Total 5 5 10

Part 1: P(Yes|Bhagirath)
= P(Yes|High, Male, Rural)
numerator = P(High|Yes) * P(Male|Yes) * P(Rural|Yes) * P(Yes)
= 0.6 * 0.2 * 0.4 * 0.5
= 0.024

Part 2: P(No|Bhagirath)
= P(No|High, Male, Rural)
numerator = P(High|No) * P(Male|No) * P(Rural|No) * P(No)
= 0 * 0.6 * 0.4 * 0.5
= 0

Sum of numerators = 0.024
Normalized: P(Yes|Bhagirath) = 1, P(No|Bhagirath) = 0

P(Yes|Aman)
= P(Yes|Middle, Female, Urban)
numerator = P(Middle|Yes) * P(Female|Yes) * P(Urban|Yes) * P(Yes)
= 0.4 * 0.8 * 0.6 * 0.5
= 0.096 (only numerator)

P(No|Aman)
= P(No|Middle, Female, Urban)
numerator = P(Middle|No) * P(Female|No) * P(Urban|No) * P(No)
= 0.4 * 0.4 * 0.6 * 0.5
= 0.048 (only numerator)

Sum of numerators = 0.144
Yes|Aman = 0.096 / 0.144 = 0.666667
No|Aman = 0.048 / 0.144 = 0.333333
Total = 1
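
The Naive Bayes arithmetic for Bhagirath and Aman can be checked with the sketch below. It estimates every conditional probability as a raw frequency from the ten labelled rows, multiplies by the class prior, and normalizes the two numerators. The function name `posteriors` is illustrative, and no smoothing is applied, which is an assumption matching the zero entries in the sheet's tables.

```python
from collections import Counter

# (income group, gender, rural/urban, purchased) for subjects 1..10
TRAIN = [
    ("Low", "M", "U", "No"), ("Low", "F", "U", "No"), ("Low", "F", "R", "No"),
    ("Middle", "M", "U", "No"), ("Middle", "M", "R", "No"), ("Middle", "F", "R", "Yes"),
    ("Middle", "F", "U", "Yes"), ("High", "M", "U", "Yes"), ("High", "F", "R", "Yes"),
    ("High", "F", "U", "Yes"),
]

def posteriors(query):
    """Return normalized class probabilities for a (income, gender, area) tuple."""
    class_counts = Counter(row[-1] for row in TRAIN)
    scores = {}
    for label, n_label in class_counts.items():
        score = n_label / len(TRAIN)  # prior, e.g. P(Yes) = 5 / 10
        for i, value in enumerate(query):
            matches = sum(1 for row in TRAIN if row[-1] == label and row[i] == value)
            score *= matches / n_label  # e.g. P(High | Yes) = 3 / 5
        scores[label] = score
    total = sum(scores.values())  # assumes at least one class has a nonzero score
    return {label: score / total for label, score in scores.items()}

print("Bhagirath:", posteriors(("High", "M", "R")))
print("Aman:", posteriors(("Middle", "F", "U")))
```

A single zero frequency, such as no High-income subject among the non-purchasers, forces the whole product for that class to zero. Adding a small count to every cell (Laplace smoothing) is a common way to soften this.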
No Subject Income_Gr Gender Rural/Urb Purchased Income_Code Gender_Code Urban_Code
1 Ajay Low M U No 1 1 1
2 Amit Low F U No 1 0 1
3 Rita Low F R No 1 0 0
4 Sumitra Middle M U No 2 1 1
5 Ravi Middle M R No 2 1 0
6 Kiran Middle F R Yes 2 0 0
7 Jai Middle F U Yes 2 0 1
8 Anselm High M U Yes 3 1 1
9 Farhan High F R Yes 3 0 0
10 Jaideep High F U Yes 3 0 1

Bhagirath High M R 3 1 0

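P(Yes|Bhagirath)
= P(Yes|High, Male, Rural)
= P(High|Yes) * P(Male|Yes) * P(Rural|Yes) * P(Yes) / P(Bhagirath)

Numerator = 0.6 * 0.2 * 0.4 * 0.5 = 0.024
P(Bhagirath) = P(High) * P(M) * P(R) = 0.3 * 0.4 * 0.4 = 0.048
Prob = 0.024 / 0.048 = 0.5

P(No|Bhagirath)
= P(No|High, Male, Rural)
= P(High|No) * P(Male|No) * P(Rural|No) * P(No) / P(Bhagirath)

Numerator = 0 * 0.6 * 0.4 * 0.5 = 0
P(Bhagirath) = P(High) * P(M) * P(R) = 0.3 * 0.4 * 0.4 = 0.048
Prob = 0 / 0.048 = 0

The two values do not sum to 1 because P(Bhagirath) is approximated by a product of marginals, so they are normalized:

Class Yes No
Normalize 1 0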
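
The difference between dividing by the product of marginals and normalizing the numerators can be seen in a few lines. This is a small sketch using only the probabilities quoted in the derivation above; the variable names are illustrative.

```python
# Numerators P(x | class) * P(class) for Bhagirath (High, M, R)
numerators = {
    "Yes": 0.6 * 0.2 * 0.4 * 0.5,
    "No": 0.0 * 0.6 * 0.4 * 0.5,
}

# Evidence approximated as P(High) * P(M) * P(R), as in the sheet
evidence_from_marginals = 0.3 * 0.4 * 0.4

divided = {label: n / evidence_from_marginals for label, n in numerators.items()}
normalized = {label: n / sum(numerators.values()) for label, n in numerators.items()}

print("Divided by product of marginals:", divided)
print("Normalized over both classes:", normalized)
```

Dividing by the product of marginals yields 0.5 and 0, which is not a proper distribution over the two classes. Dividing by the sum of the numerators gives 1 and 0, and the class ranking is the same either way.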
