K-Means Problem

Uploaded by Gentian Strana

Pattern Recognition

University of Prishtina

Second part of the project (Submission deadline: 23.01.2017)

Problem Description

An unclassified data set contains hidden structure. The task in this assignment
is to perform a comprehensive cluster analysis in order to reveal that structure
and the groups of similar data.
The archive contains an unlabeled data set, test.txt, and a set of initial
centroids, centroids.txt. Both files have the following format:
[attribute1_value <space> attribute2_value <space> ... <space>
attribute90_value].
The unlabeled data set includes 350 samples and the initial centroids set consists
of 15 samples. Data instances in both files have 90 attributes.
Finally, prepare an academic report and deliver it together with the source code
and any additional material that you used during your work.
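
The file format described above (one sample per line, values separated by spaces) could be read with a few lines of Python. This is only a minimal sketch: the function name `load_samples` and its `n_attributes` parameter are assumptions made for illustration, and the demo uses a tiny in-memory stand-in with 3 attributes rather than the real 90-attribute files.

```python
import io


def load_samples(lines, n_attributes=90):
    """Parse whitespace-separated real values, one sample per line.

    `lines` can be any iterable of strings, for example an open file.
    """
    samples = []
    for line_no, line in enumerate(lines, start=1):
        fields = line.split()
        if not fields:
            continue  # skip blank lines
        if len(fields) != n_attributes:
            raise ValueError(
                f"line {line_no}: expected {n_attributes} values, got {len(fields)}"
            )
        samples.append([float(value) for value in fields])
    return samples


# In-memory stand-in for a data file (3 attributes instead of 90).
demo = io.StringIO("0.5 1.0 2.0\n3.0 4.5 6.0\n")
rows = load_samples(demo, n_attributes=3)
print(len(rows), "samples,", len(rows[0]), "attributes each")

# With the real files one would presumably write something like:
# with open("test.txt") as f:
#     data = load_samples(f)
```

Checking the number of values per line early makes a malformed row fail loudly instead of silently distorting the distances later on.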

Tasks:

1. Implement a simple K-means method that can handle real-valued attributes.
Add functionality to your program that allows the use of the Euclidean,
Manhattan (City Block), Squared Euclidean (the same as the Euclidean distance,
but without taking the square root) and Chebyshev distances. You are free to
use any kind of weights (for features or data instances) in the program if
necessary.

2. Rescale the attribute values in order to obtain normalized data within the
range [0,1], which is more suitable and reliable for proper cluster analysis.
You can use the following equation for rescaling: xNew = (x - Min) / (Max - Min).
Feel free to use your own rescaling method.

3. Cluster the unlabeled data set. You may use the provided initial centroids
set or generate your own. Consider the following stopping criteria:
3.1 Maximum number of iterations: 100.
3.2 The clusters are consistent (no changes in the group matrix or in the
centroids during the current iteration, which means that the clusters have
stabilized).
4. Cluster analysis can also be stated more formally as an optimization
procedure that tries to minimize the Residual Sum of Squares (RSS) objective
function:

$$\mathrm{RSS} = \sum_{k=1}^{K} \sum_{x \in \omega_k} \lVert x - \mu(\omega_k) \rVert^2$$

where μ(ωk) is the centroid of a particular cluster k, K is the total number of
clusters, and x is a data sample in cluster ωk.

4.1 Report the value of the RSS function at each iteration of your program for
a particular distance measure and number of clusters K.

4.2 Discuss how the RSS value changes (does it increase or decrease, and why?)
during cluster analysis, from the first iteration to the last.

5. Try different numbers of clusters in your program (K = 2...15) and build a
plot that shows the dependency between K and the value of the RSS function at
the last iteration.

5.1 What is the optimal number of clusters K for the given data set?

5.2 Did you get any empty clusters? What is a possible solution to this
problem?
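
The tasks above can be sketched in plain Python as follows. This is an illustrative outline under stated assumptions, not a reference solution: the names `DISTANCES`, `rescale`, `rss` and `kmeans` are hypothetical, the RSS is always measured with the squared Euclidean distance regardless of the metric used for assignment, and an empty cluster simply keeps its previous centroid, which is only one of several possible answers to task 5.2.

```python
import math


def euclidean_sq(a, b):
    """Squared Euclidean distance (no square root)."""
    return sum((x - y) ** 2 for x, y in zip(a, b))


# The four distance measures named in task 1.
DISTANCES = {
    "euclidean": lambda a, b: math.sqrt(euclidean_sq(a, b)),
    "euclidean_squared": euclidean_sq,
    "manhattan": lambda a, b: sum(abs(x - y) for x, y in zip(a, b)),
    "chebyshev": lambda a, b: max(abs(x - y) for x, y in zip(a, b)),
}


def rescale(rows):
    """Min-max rescale every attribute to [0, 1] (task 2).

    Assumption: a constant attribute (Max == Min) is mapped to 0.
    """
    lo = [min(col) for col in zip(*rows)]
    hi = [max(col) for col in zip(*rows)]
    return [
        [(v - l) / (h - l) if h > l else 0.0 for v, l, h in zip(row, lo, hi)]
        for row in rows
    ]


def rss(rows, labels, centroids):
    """Residual Sum of Squares of the current clustering (task 4)."""
    return sum(euclidean_sq(r, centroids[k]) for r, k in zip(rows, labels))


def kmeans(rows, centroids, metric="euclidean", max_iter=100):
    """Return (labels, centroids, RSS value after each iteration)."""
    dist = DISTANCES[metric]
    centroids = [list(c) for c in centroids]
    labels, history = None, []
    for _ in range(max_iter):  # stopping criterion 3.1
        new_labels = [
            min(range(len(centroids)), key=lambda k: dist(r, centroids[k]))
            for r in rows
        ]
        if new_labels == labels:  # stopping criterion 3.2: nothing changed
            break
        labels = new_labels
        for k in range(len(centroids)):
            members = [r for r, lab in zip(rows, labels) if lab == k]
            if members:  # assumption: an empty cluster keeps its old centroid
                centroids[k] = [sum(col) / len(members) for col in zip(*members)]
        history.append(rss(rows, labels, centroids))
    return labels, centroids, history


# Toy data with two obvious groups (2 attributes instead of 90).
raw = [[1.0, 1.0], [1.0, 2.0], [2.0, 1.0], [8.0, 8.0], [9.0, 8.0], [8.0, 9.0]]
data = rescale(raw)
labels, centers, history = kmeans(data, [data[0], data[3]], metric="manhattan")
print(labels)
print(history)
```

Initial centroids taken from a file would need to be rescaled with the same Min and Max as the data so that both live on the same scale. Running `kmeans` for each K from 2 to 15 and recording `history[-1]` gives the points for the plot requested in task 5.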
