Clustering With WEKA Explorer: Lab Exercise Four

Caricato da

ardindavirman

Il 0% ha trovato utile questo documento (0 voti)

304 visualizzazioni11 pagine

Weka Program Documentation

Titolo originale

Weka LabFour

Copyright

Formati disponibili

PDF, TXT o leggi online da Scribd

Condividi questo documento

Condividi o incorpora il documento

Opzioni di condivisione

Hai trovato utile questo documento?

Questo contenuto è inappropriato?

Segnala questo documento

Weka Program Documentation

Copyright:

Formati disponibili

Scarica in formato PDF, TXT o leggi online su Scribd

Segnala contenuti inappropriati

Il 0% ha trovato utile questo documento (0 voti)

304 visualizzazioni11 pagine

Clustering With WEKA Explorer: Lab Exercise Four

Caricato da

ardindavirman

Weka Program Documentation

Copyright:

Formati disponibili

Scarica in formato PDF, TXT o leggi online su Scribd

Segnala contenuti inappropriati

Salta alla pagina

Sei sulla pagina 1di 11

Cerca all'interno del documento

Lab Exercise Four

Clustering with WEKA Explorer

1. Open a terminal window from the left bar. Go to directory /opt/weka-3-6-13, then
type command :
java –jar weka.jar.

2. Fire up WEKA to get the GUI Chooser panel. Select Explorer from the four
choices on the right side.

3. We are on Preprocess now. Click the Open file button to bring up a standard
dialog through which you can select a file. Choose the telco_labFour.csv file.

4. You could ignore irrelevant attributes during the clustering process, like custIds.
To identify redundant attributes, we could check the correlation from
Visualization of the data set under Visualize Tab. age and agecat are correlated.
One of them should be ignored. We keep age for clustering purpose; also ed
(removing edcat), then we have 8 attributes left for clustering (we will ignore
custIds, agecat and edcat when we perform clustering.
5. Before we do clustering with Weka, we need to normalize your numeric data
values (use Normalize filter). Since we have the class label, we would like to set
it to nominal before normalization. This information will be used to evaluate the
clustering performance.

6. To perform clustering on the data set, click Cluster tab and choose
SimpleKMeans algorithm. We set k = 2 for this data set. Choose Classes to
clusters evaluation and select the last attribute as class label. Check Store clusters
for visualization. Click Ignore attributes and select custIds, agecat, edcat, and
the last attribute churn. Then click Start.
1839+623 = 2462
7. You could visualize the clustering results by right-clicking the result list and
choose visualize clusters assignments. You could select different combination of
two attributes as X and Y.
8. You could save the clustering results by clicking Save button on the Visualization
panel. The results are saved in a .arff file. You could use Weka to open it and
view the results.

9. If the data set has no class labels, then when you perform clustering on the data
set, choose Use Training Dataset as Cluster mode.
10. HierarchicalClusterer implements agglomerative (bottom-up) generation of
hierarchical clusters. Several different link types, which are ways of measuring
the distance between clusters, are available as options.
11. Since the Hierarchical Clustering algorithm builds a tree for the whole dataset,
let’s practice this algorithm on a smaller dataset due to the memory space
limitation. Open glass.arff dataset used in the previous lab, first normalize all the
numeric values in the dataset into [0,1]. Then chose HierarchicalCluster cluster.
Since this datset has 6 classes, we set numClusters as 6. To save time, we set
printNewick as False.
12. After you run the clustering algorithm, you could right click the clustering result
and check its hierarchical tree by clicking Visualize Tree.

13. Since the performance of HierarchicalClustering is not good, we could run K-

means algorithm on the same dataset and compare their performance. Do not
forget to set the number of clusters to 6.
14. Save the clustering results as glass_kmeans_result.arff file and reopen it with
Weka. Since clustering results are saved as the last column, they are considered as
class labels for the datset. The original class labels are considered as a feature of
the dataset, you could click the Type attribute to see the overlapping among
clusters vs. classes.

Potrebbero piacerti anche

Beginning C# 6 Programming with Visual Studio 2015
Da Everand
Beginning C# 6 Programming with Visual Studio 2015
Benjamin Perkins
Nessuna valutazione finora
Diffuse Algorithms for Neural and Neuro-Fuzzy Networks: With Applications in Control Engineering and Signal Processing
Da Everand
Diffuse Algorithms for Neural and Neuro-Fuzzy Networks: With Applications in Control Engineering and Signal Processing
Boris.A Skorohod
Nessuna valutazione finora
Django-autocomplete-light Documentation
Documento49 pagine
Django-autocomplete-light Documentation
HERNAN DARIO MENDIETA
Nessuna valutazione finora
Java Programming: From Problem Analysis To Program Design
Documento75 pagine
Java Programming: From Problem Analysis To Program Design
rehpzupista
Nessuna valutazione finora
A Practical Guide To Amplicon and Metagenomic Analysis of Microbiome Data
Documento16 pagine
A Practical Guide To Amplicon and Metagenomic Analysis of Microbiome Data
LuisFrancisco Bustamante
Nessuna valutazione finora
COBOL Reserved Words
Documento8 pagine
COBOL Reserved Words
Palem Sudeep
Nessuna valutazione finora
Useful R Packages For Data Analysis
Documento3 pagine
Useful R Packages For Data Analysis
tejukmr
Nessuna valutazione finora
Pandas Cheat Sheet
Documento2 pagine
Pandas Cheat Sheet
Josh Iz
Nessuna valutazione finora
24051
Documento3 pagine
24051
bhuvi2312
100% (1)
Java Programming Tutorial 1: How To Start "Hello World" Program in Java
Documento14 pagine
Java Programming Tutorial 1: How To Start "Hello World" Program in Java
Habibur Rahman
Nessuna valutazione finora
Lecture 4 - Decisions and Conditions - EDITED
Documento24 pagine
Lecture 4 - Decisions and Conditions - EDITED
Pearl Kimberly Quidor Lavarez
100% (1)
Crystal Report 8.5 For Visual - 89028
Documento8 pagine
Crystal Report 8.5 For Visual - 89028
dsadsad
Nessuna valutazione finora
Java Reference
Documento679 pagine
Java Reference
Kausthub Thapar
Nessuna valutazione finora
Week 3 Teradata Exercise Guide: Managing Big Data With Mysql Dr. Jana Schaich Borg, Duke University
Documento6 pagine
Week 3 Teradata Exercise Guide: Managing Big Data With Mysql Dr. Jana Schaich Borg, Duke University
Nuts
Nessuna valutazione finora
Week 2 Teradata Practice Exercises Guide
Documento7 pagine
Week 2 Teradata Practice Exercises Guide
Felicia Cristina Gune
50% (2)
Java Games With Greenfoot Lesson4
Documento9 pagine
Java Games With Greenfoot Lesson4
Jessica Chiang
Nessuna valutazione finora
Exercises Java Advanced Features
Documento26 pagine
Exercises Java Advanced Features
Costicã Pardaillan Lupu
Nessuna valutazione finora
OFcheat Sheet
Documento1 pagina
OFcheat Sheet
mimi
Nessuna valutazione finora
3.5.7 Lab - Create A Python Unit Test - ILM
Documento9 pagine
3.5.7 Lab - Create A Python Unit Test - ILM
Ilham Ciplen
Nessuna valutazione finora
Assdadasx CC C
Documento46 pagine
Assdadasx CC C
rama
Nessuna valutazione finora
Vs Code Cheatsheet
Documento12 pagine
Vs Code Cheatsheet
Dev Rathore
Nessuna valutazione finora
FLEXnet ID Dongle Drivers
Documento24 pagine
FLEXnet ID Dongle Drivers
Anonymous 44Espf
Nessuna valutazione finora
User Manual Selligent Designer en v5.0
Documento112 pagine
User Manual Selligent Designer en v5.0
Shiva_1912
Nessuna valutazione finora
My ML Lab Manual
Documento21 pagine
My ML Lab Manual
Anonymous qZP5Zyb2
Nessuna valutazione finora
C Cheatsheet CodeWithHarry PDF
Documento11 pagine
C Cheatsheet CodeWithHarry PDF
Race VinDiesel
Nessuna valutazione finora
What Is An AS400
Documento159 pagine
What Is An AS400
therock_023
Nessuna valutazione finora
Python Basics
Documento69 pagine
Python Basics
Second Books
Nessuna valutazione finora
CS261 CourseNotes PDF
Documento271 pagine
CS261 CourseNotes PDF
April Brown
Nessuna valutazione finora
Datastructure Unit 1 SKM
Documento110 pagine
Datastructure Unit 1 SKM
krishna moorthy
Nessuna valutazione finora
9780387986326
Documento1 pagina
9780387986326
Zeeshan Ahmad
Nessuna valutazione finora
The Essential R Reference
Da Everand
The Essential R Reference
Mark Gardener
Nessuna valutazione finora
Loops in C and C++ - GeeksforGeeks
Documento1 pagina
Loops in C and C++ - GeeksforGeeks
Ateeq Ahmed
Nessuna valutazione finora
Java Programming Syllabus
Documento2 pagine
Java Programming Syllabus
Ramanagouda Khanapur
100% (1)
PHP Listado de Ejemplos
Documento137 pagine
PHP Listado de Ejemplos
lee9120
Nessuna valutazione finora
Object Oriented Programming
Documento11 pagine
Object Oriented Programming
Ali Shana'a
Nessuna valutazione finora
EMBO 2013 HandoutDeuteration
Documento54 pagine
EMBO 2013 HandoutDeuteration
Rigel_T
Nessuna valutazione finora
Topic 4 - Repetition Control Structure
Documento124 pagine
Topic 4 - Repetition Control Structure
Winter Nai
Nessuna valutazione finora
1-4 Introduction To JAVA
Documento30 pagine
1-4 Introduction To JAVA
ecardnyl25
Nessuna valutazione finora
EMBEDDED SQL
Documento35 pagine
EMBEDDED SQL
nairarung
Nessuna valutazione finora
LibreOffice Calc Guide 7.4
Documento541 pagine
LibreOffice Calc Guide 7.4
yaatrin
Nessuna valutazione finora
Java Programming Language Midterm
Documento188 pagine
Java Programming Language Midterm
Aldwin Harvey Salumbides
Nessuna valutazione finora
The Intuition Behind PCA: Machine Learning Assignment
Documento11 pagine
The Intuition Behind PCA: Machine Learning Assignment
Palash Ghosh
Nessuna valutazione finora
Data Analysis With Python - FreeCodeCamp PDF
Documento28 pagine
Data Analysis With Python - FreeCodeCamp PDF
CHOVATIYA HEMAL
Nessuna valutazione finora
Data Structure and Algorithm
Documento179 pagine
Data Structure and Algorithm
Giovanni de los Santos
Nessuna valutazione finora
Logistics TTP Exercises
Documento7 pagine
Logistics TTP Exercises
RayClydeJabonillo
Nessuna valutazione finora
Alexander Shvets Design Patterns Explained Simply
Documento2 pagine
Alexander Shvets Design Patterns Explained Simply
Andrea Griffin
Nessuna valutazione finora
Curso de Python 3
Documento158 pagine
Curso de Python 3
Rogério Santos
Nessuna valutazione finora
Data Structures and Algorithms Course Introduction
Documento28 pagine
Data Structures and Algorithms Course Introduction
Rajat Garg
Nessuna valutazione finora
Ejemplos Con Base de Datos Summit Sporting Goods
Documento14 pagine
Ejemplos Con Base de Datos Summit Sporting Goods
Oskar Cadena
Nessuna valutazione finora
OPNQRYF (Open Query File) Command Description
Documento56 pagine
OPNQRYF (Open Query File) Command Description
nairarung
Nessuna valutazione finora
Android SQLite Database Tutorial
Documento11 pagine
Android SQLite Database Tutorial
MR. Muslih
Nessuna valutazione finora
PASCAL Tutorial
Documento46 pagine
PASCAL Tutorial
mickaylia green
Nessuna valutazione finora
Loops in C Language
Documento13 pagine
Loops in C Language
muhammad_ameen
Nessuna valutazione finora
Chapter 01 - JAVA Programming
Documento29 pagine
Chapter 01 - JAVA Programming
sunnyxm
Nessuna valutazione finora
pytest Documentation Release 2.5.2
Documento193 pagine
pytest Documentation Release 2.5.2
eze
Nessuna valutazione finora
Introduction To Parallel Computing
Documento34 pagine
Introduction To Parallel Computing
Muhammed İkbaL Gürbüz
100% (1)
UNIT I Basics of C Programming
Documento44 pagine
UNIT I Basics of C Programming
courageouscse
Nessuna valutazione finora
Sigmod278 Silberstein
Documento12 pagine
Sigmod278 Silberstein
Dmytro Shteflyuk
Nessuna valutazione finora
Simultaneous multithreading A Complete Guide
Da Everand
Simultaneous multithreading A Complete Guide
Gerardus Blokdyk
Nessuna valutazione finora
PrimeFaces Beginner's Guide
Da Everand
PrimeFaces Beginner's Guide
K. Siva Prasad Reddy
Nessuna valutazione finora
Kali Linux Installation Requirements
Documento16 pagine
Kali Linux Installation Requirements
ardindavirman
Nessuna valutazione finora
PDM Sept2012 Demandforecasting and Inventoryplanning
Documento21 pagine
PDM Sept2012 Demandforecasting and Inventoryplanning
ardindavirman
Nessuna valutazione finora
Assignment II - IT Project Management LTA Per.3
Documento4 pagine
Assignment II - IT Project Management LTA Per.3
ardindavirman
100% (1)
1 Mazda Padang Sales SPV Arif - Abdillah spvpdg78!: No Name Dealer Name Position Username Password WRS Password
Documento2 pagine
1 Mazda Padang Sales SPV Arif - Abdillah spvpdg78!: No Name Dealer Name Position Username Password WRS Password
ardindavirman
Nessuna valutazione finora
Tamil Literary Garden 2010 Lifetime Achievement Award Ceremony
Documento20 pagine
Tamil Literary Garden 2010 Lifetime Achievement Award Ceremony
Anthony Vimal
Nessuna valutazione finora
JEE Test Series Schedule
Documento4 pagine
JEE Test Series Schedule
B.K.Sivaraj raj
Nessuna valutazione finora
IS BIOCLIMATIC ARCHITECTURE A NEW STYLE OF DESIGN
Documento5 pagine
IS BIOCLIMATIC ARCHITECTURE A NEW STYLE OF DESIGN
Jorge Dávila
Nessuna valutazione finora
Philippine Development Plan (Optimized)
Documento413 pagine
Philippine Development Plan (Optimized)
herbertjohn24
Nessuna valutazione finora
Db2 Compatibility PDF
Documento23 pagine
Db2 Compatibility PDF
Muhammed Abdul Qader
Nessuna valutazione finora
Signal Processing Problems Chapter 12
Documento20 pagine
Signal Processing Problems Chapter 12
C
Nessuna valutazione finora
MySQL Cursor With Example
Documento7 pagine
MySQL Cursor With Example
Nizar Achmad
Nessuna valutazione finora
Activity Design Scouting
Documento10 pagine
Activity Design Scouting
Honeyjo Nette
100% (9)
Trendline Mastery: Course Outline: 3. Interview of Peter Bain by Frank Paul
Documento5 pagine
Trendline Mastery: Course Outline: 3. Interview of Peter Bain by Frank Paul
nacare
Nessuna valutazione finora
RBI and Maintenance For RCC Structure Seminar
Documento4 pagine
RBI and Maintenance For RCC Structure Seminar
coxshuler
Nessuna valutazione finora
Literary Text Analysis Worksheet
Documento1 pagina
Literary Text Analysis Worksheet
api-403444340
Nessuna valutazione finora
Physical Properties of Sea Water
Documento45 pagine
Physical Properties of Sea Water
jisu
Nessuna valutazione finora
ASTM C 136 Sieve Analysis of Fine and Coarse Aggregates (D)
Documento5 pagine
ASTM C 136 Sieve Analysis of Fine and Coarse Aggregates (D)
Yasir Dharejo
Nessuna valutazione finora
Companies Database
Documento2 pagine
Companies Database
NIRAJ KUMAR
Nessuna valutazione finora
Identifying Research Problems
Documento29 pagine
Identifying Research Problems
Edel Borden Paclian
Nessuna valutazione finora
Neptune Sign House Aspect
Documento80 pagine
Neptune Sign House Aspect
mesagirl
94% (53)
【小马过河】35 TOEFL iBT Speaking Frequent Words
Documento10 pagine
【小马过河】35 TOEFL iBT Speaking Frequent Words
kakiwn
Nessuna valutazione finora
Gpredict User Manual 1.2
Documento64 pagine
Gpredict User Manual 1.2
Will Jackson
Nessuna valutazione finora
Consistency Models
Documento42 pagine
Consistency Models
Pixel Dinosaur
Nessuna valutazione finora
An Approach To The Aural Analysis of Emergent Musical Forms
Documento25 pagine
An Approach To The Aural Analysis of Emergent Musical Forms
mykhos
0% (1)
Ice Cream: Uses and Method of Manufacture
Documento6 pagine
Ice Cream: Uses and Method of Manufacture
Mari Liz
Nessuna valutazione finora
Roland Barthes and Struggle With Angel
Documento8 pagine
Roland Barthes and Struggle With Angel
Abdullah Bektas
0% (1)
Mitchell 1986
Documento34 pagine
Mitchell 1986
Sara Veronica Florentin Cuenca
Nessuna valutazione finora
Lenovo IdeaPad U350 UserGuide V1.0
Documento138 pagine
Lenovo IdeaPad U350 UserGuide V1.0
Marc Bengtsson
Nessuna valutazione finora
Relay Testing Management Software
Documento10 pagine
Relay Testing Management Software
chichid2008
Nessuna valutazione finora
Balkan Tribes
Documento3 pagine
Balkan Tribes
CANELO_PIANO
Nessuna valutazione finora
Lesson Rubric Team Group (Lesson Plan 1)
Documento2 pagine
Lesson Rubric Team Group (Lesson Plan 1)
Yodalis Vazquez
Nessuna valutazione finora
DPCA OHE Notice of Appeal 4-11-2018 Final
Documento22 pagine
DPCA OHE Notice of Appeal 4-11-2018 Final
branax2000
Nessuna valutazione finora
Sprite Graphics For The Commodore 64
Documento200 pagine
Sprite Graphics For The Commodore 64
scottmac67
Nessuna valutazione finora
10 1016@j Ultras 2016 09 002
Documento11 pagine
10 1016@j Ultras 2016 09 002
Ismahene Smaheno
Nessuna valutazione finora