Jiayi Weng (@Trinkle23897) “Codex grew programmatic policies with no neural nets: max score on Breakout, and” — TopicDigg
Jiayi Weng
@Trinkle23897
MTS @openai, author of the entire post-training RL infra, core contributor of ChatGPT/GPT4/GPT4o etc. 30U30
Joined June 2014
181 Following
11.8K Followers
Jiayi Weng
@Trinkle23897
2026.05.08 03:49
Codex grew programmatic policies with no neural nets: max score on Breakout, and SOTA-level scores on MuJoCo. Maybe heuristics were not too weak. Maybe they were just too expensive to maintain. Maybe it's the next paradigm.