Documenti di Didattica
Documenti di Professioni
Documenti di Cultura
Unsupervised Learning
Luc Anselin
http://spatial.uchicago.edu
• ci’cj = 0
• components are orthogonal to each other
• Σk aik2 = 1
• the sum of the squared loadings equals one
• computation
• matrix decomposition
Variable2
Variable2
1 1 1
4 8 4 8 4 8
2 3 2 3 5 7
Variable1 Variable1 Variable1
2 and 3 are the closest points. They become a cluster. 5 and 7 are the closest points. They become a cluster. 1 and the cluster of 2 and 3 are the closest points.
2 5 2 5 2 5
3 3 3
7 7 7
Variable2
Variable2
Variable2
1 1 1
4 8 4 8 4 8
2 3 1 5 7 2 3 1 4 5 7 2 3 1 4 5 7 8
Variable1 Variable1 Variable1
4 and the cluster of 1, 2, and 3 are the closest points. 8 and the cluster of 5 and 7 are the closest points. The two remaining clusters are the closest points.
Variable2
Variable2
1 1 1 for four clusters
4 8 4 8 4 8
2 3 1 4 5 7 8 2 3 1 4 5 7 8 2 3 1 4 5 7 8
Variable1 Variable1 Variable1
The algorithm has finished. Rewind algorithm to reveal desired number of clusters.
2 2 2 2
3 3 3 3
4 4 4 4
δ x x5
1 5 1 5 1 5 1
δ δ δ
types of linkages
Source: Grolemund and Wickham (2016)
k=4
k=6
k=4
k=6
Randomly assign points to k groups. (Here k = 3). Compute centroid of each group. Reassign each point to group of closest centroid.
x x x
x
x x
x x x
Re-compute centroid of each group. Reassign each point to group of closest centroid. Re-compute centroid of each group.
x x
x x
x x
Reassign each point to group of closest centroid. Re-compute centroid of each group. Stop when group membership ceases to change.
• replicability
• set random seed
k=4
k=6
• spatial similarity
• only contiguous objects in same group
• shape
• compactness
• multi-objective approach
• automatic zoning
• graph-based approaches
• explicit optimization
• inefficient approach
• heuristic
• graph pruning
• several heuristics
• Assuncao et al (2006)
• algorithm
k=4
k=6
• additional constraints
• districting: target population size
• number of clusters
• exogenous
• endogenous, max-p region problem