DATA100-L15: Cross Validation, Regularization

2024-07-19 70 words One minute

Contents

Cross Validation

1
2


from sklearn.utils import shuffle
training_set, dev_set = np.split(shuffle(data), [int(.8*len(data))])

比较validation error和training error，选择最优的模型。

K=1 is equivalent to holdout method.

provide an unbiased estimate of the model’s performance on new, unseen data.

the small the ball, the simpler the model 拉格朗日思想，$\alpha$ 越大，约束越强，模型越简单。岭回归

标准化数据，be on the same scale