注册并分享邀请链接，可获得视频播放与邀请奖励。

立即注册

Francesco Bertolotti 的头像

Francesco Bertolotti (@f14bertolotti)

@f14bertolotti

AI Researcher

141 正在关注 1.9K 粉丝

Francesco Bertolotti@f14bertolotti

2026.06.16 05:20

Stellar performance from a 3B model. These results were achieved primarily through post-training refinements on Qwen2.5-Coder. The paper doesn't provide many details, but it appears they distill from RL ckpts and then do a final RL-based instruct RL. 🔗

显示更多

0

0

20

457

70

转发到社区

热门用户

@aleabitoreddit

898.2K 粉丝

1.1M 粉丝

26M 粉丝

45.1M 粉丝

46.7M 粉丝

@YGBABYMONSTER_

858.8K 粉丝

6.3M 粉丝

BTS JAPAN OFFICIAL

@BTS_jp_official

13.7M 粉丝

2.7M 粉丝

1.2M 粉丝

731.1K 粉丝

TWICE JAPAN OFFICIAL

@JYPETWICE_JAPAN

3.5M 粉丝

12.6M 粉丝

427.9K 粉丝

ポケモン公式

3M 粉丝