Data clustering basics with k-means algorithm

Data Clustering¶

This post is based on (Gan).

Data clustering is a process of assigning a set of records into subsets, called clusters, such that records in the same cluster are similar and records in different clusters are quite distinct.

A typical clustering process involves the following five steps:

more ...