注册并分享邀请链接,可获得视频播放与邀请奖励。

Yann LeCun (@ylecun) “I started using the concept in 2016 (e.g. in my NIPS 216 keynote, in which I cal” — TopicDigg

Yann LeCun 的个人资料封面
Yann LeCun 的头像
Yann LeCun
@ylecun
Professor at NYU & Executive Chairman at AMI Labs. Ex-Chief AI Scientist at Meta. Researcher in AI, Machine Learning, Robotics, etc. ACM Turing Award Laureate.
加入 June 2009
784 正在关注    1.2M 粉丝
I started using the concept in 2016 (e.g. in my NIPS 216 keynote, in which I called it a "world simulator"). I published papers on video prediction in 2016. This was meant to be a key step to train world models. Ha&Schmi appeared in 2018. The slide below is from a talk I gave at Brown in Nov 2017. Full deck here: We were hoping to train world models through video prediction. At the time, we were using generative architectures. We tried latent-variable models and GAN-style training. But never quite worked on natural video. Around 2021, I realized that predicting at the pixel level was not a good idea. That's when the JEPA concept emerged: find an abstract representation within which predictions are performed.
显示更多