Andrej Karpathy (@karpathy) “The non-obvious crux of the shift is an empirical finding, emergent only at scal” — TopicDigg

注册并分享邀请链接，可获得视频播放与邀请奖励。

立即注册

Andrej Karpathy 的头像

Andrej Karpathy

@karpathy

I like training large deep neural nets.

加入 April 2009

1.1K 正在关注 3M 粉丝

Andrej Karpathy@karpathy

2022.11.18 01:37

The non-obvious crux of the shift is an empirical finding, emergent only at scale, and well-articulated in the GPT-3 paper ( Basically, Transformers demonstrate the ability of "in-context" learning. At run-time, in the activations. No weight updates.

显示更多

0

0

5

221

19

转发到社区

热门用户

@aleabitoreddit

825.9K 粉丝

川沐｜Trumoo🐮

255K 粉丝

24.3K 粉丝

74.3K 粉丝

14.9K 粉丝

潘驴邓晓闲缺一

21.3K 粉丝

@nmb48_official

134.2K 粉丝

SODクリエイト_info【SODGROUP】@作品情報配信

33.5K 粉丝

브라운더스트2 공식트위터

19.8K 粉丝

@hkt48_official_

150.9K 粉丝

승리의 여신: 니케 - 신규 업데이트

76.5K 粉丝

250.5K 粉丝

185.2K 粉丝

2M 粉丝

@official_TPE48

10.9K 粉丝