Gaussian Mixture Model Clustering
Each point belongs to every cluster, but has a different level of membership. It assumes that each cluster has a certain statistical distribution.
Expectation - Maximization Algorithm
Advantages :
- Soft Clustering, sample members of multiple clusters.
- Cluster shape flexibility. (cluster can contain another cluster inside of it)
Disadvantages :
- Sensitive to initialization values
- Slow convergence rate
- Possible to converge to a local optimum
Process:
- Initialize k Gaussian distributions
- Soft cluster the data into the Gaussian distributions
(Expectation step)
- Re-estimate the Gaussian
(Maximization step)
- Evaluate log-likelihood to check for convergence
- Repeat from Step.2 until convergence