注册并分享邀请链接,可获得视频播放与邀请奖励。

Deli Chen (@victor207755822) “🧵 Deli AutoResearch SKILL is now officially open source! 🎉 https://t.co/V3lwwd” — TopicDigg

Deli Chen 的个人资料封面
Deli Chen 的头像
Deli Chen
@victor207755822
Deep Learning Researcher @deepseek_ai | #AGIforEveryone# Prev. BS and MS @PKU1898 | | All opinions are my own. | INTP-T | 人心惟危,道心惟微
加入 December 2023
181 正在关注    31K 粉丝
🧵 Deli AutoResearch SKILL is now officially open source! 🎉 Alongside it, we’re dropping our 4th survey paper — this time on Self-play. Inspired by AlphaZero, we got a powerful insight: prior knowledge doesn’t always lift the ceiling. Models can discover more globally optimal solutions just by playing against themselves. The biggest change in this paper? For the first time, the AutoResearch Agent autonomously planned GPU experiments — and submitted actual RL runs on the DeepSeek 285B model. The entire RL pipeline — experiment design, code writing, running, debugging, and conclusion summarization — was 100% automated, with zero human intervention from me. This was incredibly difficult, but an incredibly important step. GRPO is the tool being called by the AutoResearch Agent here. We see this as the beginning of our Continual Learning research journey. 🚀 As always, this is my personal research project, unaffiliated with any organization. All views are my own. #AI# #ReinforcementLearning# #SelfPlay# #OpenSource# #AutoML# #ContinualLearning# #DeepSeek#
显示更多
0
15
1.1K
168
转发到社区