Jim Fan (@DrJimFan) “How to build *TruthGPT*? I listened to a talk by the legendary @johnschulman2. I”

Jim Fan

@DrJimFan

NVIDIA Director of Robotics & Distinguished Scientist. Co-Lead of GEAR lab. Solving Physical AGI, one motor at a time. Stanford Ph.D. OpenAI's 1st intern.

加入 December 2012

3.1K 正在关注 409.8K 粉丝

Jim Fan@DrJimFan

2023.04.21 17:03

How to build *TruthGPT*? I listened to a talk by the legendary @johnschulman2. It's densely packed with lots of deep insight. Key takeaways: - Supervised finetuning (or behavior cloning) makes the model prone to hallucination, while RL mitigates it. - NLP is far from done! 1/🧵

显示更多