Skip to article frontmatterSkip to article content

What is Clustering

Basic idea of clustering

Identifying breeds of dogs

Identify dogs that look similar from the photos. Source: Solving the mystery of my dog’s breed with ML

Identify dogs that look similar from the photos. Source: Solving the mystery of my dog’s breed with ML

Identifying individual pet

Looking for the same person or pet in the photo gallery. Image Source: iOS 17 Photos App Recognizes Your Pets

Looking for the same person or pet in the photo gallery. Image Source: iOS 17 Photos App Recognizes Your Pets

Group together similar/near instances

The data points.

The data points.

2 big groups.

2 big groups.

Another 2 big groups.

Another 2 big groups.

Those on the left split into 2 small groups.

Those on the left split into 2 small groups.

Those on the right split into 2 small groups.

Those on the right split into 2 small groups.

4 small groups.

4 small groups.

What could ‘similar’ or ‘near’ mean?

The two distances assume the value ranges are equal (similar) among dimensions. (Equal aspect)

How to measure similarity/dissimilarity?

The clustering results are highly dependent on the similarity metric (or distance measure) chosen to evaluate the proximity between data points within the cluster analysis.

What is spatial clustering?

].column[

What is spatial clustering?

On Spatial Point Patterns

Using the GPS records of a person moving within a neighborhood to detect clusters (with noises) and identify the place where the participant had visited by follow-up interview and manual checking. Feng et al. 2024.

Using the GPS records of a person moving within a neighborhood to detect clusters (with noises) and identify the place where the participant had visited by follow-up interview and manual checking. Feng et al. 2024.

On Attribute-based Spatial Clustering

The study run clustering on the kernel density of various GPS types and identify clusters with similar density of the different POI types. Chin et al. 2023

The study run clustering on the kernel density of various GPS types and identify clusters with similar density of the different POI types. Chin et al. 2023

Types of clustering methods

Partitioning:

Hierarchical Clustering:

References
  1. Feng, C., Chin, W. C. B., Gao, S., Chua, V., & Ho, E. L. (2024). Illustrating a Splatial Framework to Aging: Absolute, Relative, Relational, and Mental Space in Singapore. Transactions in GIS, 28(7), 2281–2294. 10.1111/tgis.13235