注册并分享邀请链接,可获得视频播放与邀请奖励。

Yun-Ta Tsai (@yunta_tsai) “Getting used to being liked likely means you are overfit to RLHF. The problem wi” — TopicDigg

Yun-Ta Tsai 的个人资料封面
Yun-Ta Tsai 的头像
Yun-Ta Tsai
@yunta_tsai
Sr. Staff Engineer @Tesla_AI
加入 October 2022
217 正在关注    112.3K 粉丝
Getting used to being liked likely means you are overfit to RLHF. The problem with overfitting is that the pain overwhelms the limbic system once you try to sample trajectories outside the known distribution. As more people like you, your sampling regime becomes smaller and smaller to avoid negative feedback. Eventually you get stuck and become a slave to your own feelings. That’s why I have never seen a model student happy once they become a “model”. Their weights are frozen and cannot be updated anymore. They cannot risk being better than their own SOTA.
显示更多
0
86
513
76
转发到社区