Contents

DATA100-L7: Visualization Ⅰ

distribution 定义

/datal7/image.png

bar plots for distribution

data8 example

/datal7/image-1.png

compound way

/datal7/image-2.png

seaborn example

/datal7/image-3.png

1
2
3
4
import seaborn as sns
sns.countplot(x='variable', data=df)
# rug plot
sns.rugplot(x='variable', data=df, color='black')

plotly example

/datal7/image-4.png

处理异常值(outliers)和峰值(mode)

density curve

密度曲线看峰 /datal7/image-5.png

  • 箱型图 /datal7/image-6.png /datal7/image-7.png
1
2
import seaborn as sns
sns.boxplot(x='variable', data=df)
  • violin plot 和箱型图对比来看,violin plot宽度有意义
1
2
import seaborn as sns
sns.violinplot(x='variable', data=df)

/datal7/image-8.png 处理overplotting random jitter /datal7/image-9.png