注册并分享邀请链接,可获得视频播放与邀请奖励。

levi (@levidiamode) “Day 134/365 of GPU Programming Spending the day reading the papers of benchmarks” — TopicDigg

levi 的个人资料封面
levi 的头像
levi
@levidiamode
365 days of GPU programming ▓▓▓▓▓▓▓▓▓▓▓▓▓░░░░░░░░░░░░░░░░░░░░░░░░░░ 133/365
加入 June 2025
602 正在关注    4.5K 粉丝
Day 134/365 of GPU Programming Spending the day reading the papers of benchmarks I've been repeatedly seeing. Starting with MMLU, GPQA, LongBench and NoLiMa and their different iterations (v1 vs v2, standard vs pro, etc). Working on inference optimization the past few days made me realize I don't really know anything about benchmarks, so trying to become more aware of various benchmarks, their strengths and limitations. Any other benchmarks I should look into more deeply?
显示更多
0
0
104
6
转发到社区