Data clustering basics with k-means algorithm
Data Clustering¶
This post is based on (Gan).
Data clustering is a process of assigning a set of records into subsets, called clusters, such that records in the same cluster are similar and records in different clusters are quite distinct.
A typical clustering process involves the following five steps: