
L10-Training I

Training I

Activation Functions

- Sigmoid function: $\sigma(x) = \frac{1}{1 + e^{-x}}$
  - not zero-centered
  - saturates at both ends
  - outputs are always positive, so the gradients on the weights are always all positive or all negative :(
  - exp() is expensive to compute, but that is not really a problem on GPUs
- tanh function: $\tanh(x) = \frac{e^x - e^{-x}}{e^x + e^{-x}}$
  - a rescaled variant of sigmoid (zero-centered, but it still saturates)
- ReLU function: $f(x) = \max(0, x)$
  - does not saturate (in the positive region)
  - fast to compute
  - not zero-centered
  - dead ReLU problem ==> leaky ReLU
- Leaky ReLU function: $f(x) = \max(0.01x, x)$
  - fixes the dead ReLU problem ==> PReLU function: make the 0.01 slope a learnable parameter
- ELU function: $f(x) = \begin{cases} x & x \geq 0 \\ \alpha(e^x - 1) & x < 0 \end{cases}$

Data Preprocessing

See the related DATA-100 course material.
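As a quick reference, here is a minimal sketch of these activations in PyTorch; the 0.01 slope for Leaky ReLU/PReLU and `alpha=1.0` for ELU are assumed values matching the formulas above, and everything comes from the standard `torch` / `torch.nn.functional` API:

```python
import torch
import torch.nn.functional as F

x = torch.linspace(-5, 5, steps=11)

sigmoid = torch.sigmoid(x)      # 1 / (1 + e^{-x}): saturates, always positive
tanh = torch.tanh(x)            # zero-centered sigmoid variant, still saturates
relu = F.relu(x)                # max(0, x)
leaky = F.leaky_relu(x, 0.01)   # max(0.01x, x): avoids dead ReLU
elu = F.elu(x, alpha=1.0)       # x for x >= 0, alpha * (e^x - 1) for x < 0

# PReLU: same shape as Leaky ReLU, but the negative slope is a learnable parameter
prelu = torch.nn.PReLU(init=0.01)
out = prelu(x)
```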

L9-Hardware and Software

Hardware and Software

Hardware

- eecs 598.009 GPU programming! I would actually really like to learn some CUDA programming
- TensorFlow supports TPUs; what about PyTorch?
- The computation graph is stored in GPU memory

Software

- The point of deep learning frameworks:
  - allow rapid prototyping
  - automatically compute gradients
  - run it all efficiently on GPUs (or else)
- PyTorch: defining sigmoid as a custom autograd Function reduces the number of computation-graph nodes, because the backward pass is rewritten by hand

```python
import torch

class Sigmoid(torch.autograd.Function):
    @staticmethod
    def forward(ctx, input):
        # Forward pass: compute sigmoid and stash the input for backward
        y = 1 / (1 + torch.exp(-input))
        ctx.save_for_backward(input)
        return y

    @staticmethod
    def backward(ctx, grad_output):
        # Backward pass: recompute sigmoid and apply its local gradient y * (1 - y)
        input, = ctx.saved_tensors
        y = 1 / (1 + torch.exp(-input))
        return grad_output * y * (1 - y)
```
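A short usage sketch (not from the original notes): a custom Function is invoked through `.apply`, and `torch.autograd.gradcheck` can verify the hand-written backward against numerical gradients (it expects double-precision inputs):

```python
# Hypothetical usage of the Sigmoid Function defined above
x = torch.randn(4, dtype=torch.double, requires_grad=True)

y = Sigmoid.apply(x)   # one graph node instead of several elementwise ops
y.sum().backward()     # runs the hand-written backward
print(x.grad)

# Check the custom backward against numerical gradients
from torch.autograd import gradcheck
print(gradcheck(Sigmoid.apply, (x,)))  # True if the gradient matches
```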