K-Means
#Computers
A method of clustering data by minimizing $\displaystyle J$, or the distortion measure
Training
- Initialize $\displaystyle \left{ \mu_{k} \right}$, or the cluster prototypes to some values
- Assuming $\displaystyle \left{ \mu_{k} \right}$ is fixed, minimize $\displaystyle J$ by varying $\displaystyle \left{ r_{nk} \right}$, or the indicator functions for each data point
- I.e. reassign classes for each data point based on distance from cluster prototype
- $$
r_{nk}=\begin{cases}
1, & k=\text{argmin}{j}\left\lVert x{n}-\mu_{j}\right\rVert^{2}_{2} \\ 0, & \text{otherwise}
\end{cases}
$$
- Assuming $\displaystyle \left{ r_{nk} \right}$ is fixed, minimize $\displaystyle J$ by reassigning the cluster prototypes
- Terminate if $\displaystyle J$ stops changing much or repeat the above steps