Francesco Bertolotti (@f14bertolotti) “Stellar performance from a 3B model. These results were achieved primarily throu” — TopicDigg

Francesco Bertolotti
@f14bertolotti
AI Researcher
Joined October 2021
141 Following    1.9K Followers
Stellar performance from a 3B model. These results were achieved primarily through post-training refinements on Qwen2.5-Coder. The paper doesn't provide many details, but it appears they distill from RL ckpts and then do a final RL-based instruct RL. 🔗