Contents

DATA100-L24: Clustering

Contents

introduction to clustering

/datal24/image.png /datal24/image-1.png no label at all 😢

K-means clustering

算法动画演示

K-Means vs KNN /datal24/image-2.png

minimizing inertia

convex?? 损失函数不一定凸,梯度下降难顶 how to see which one is better ❓ /datal24/image-3.png /datal24/image-4.png 但是找到全局最优解非常困难 /datal24/image-5.png

agglomerative clustering

演示见上面链接以及lec code!

和CS61B的minimum spanning tree类似,每次合并两个最近的点,直到终止条件

/datal24/image-6.png /datal24/image-7.png /datal24/image-8.png outlier 有时忽略处理或者自成一类

picking K

/datal24/image-9.png

/datal24/image-10.png Smax? /datal24/image-11.png

can s be negative? /datal24/image-12.png /datal24/image-14.png /datal24/image-13.png

summary

/datal24/image-15.png