注册并分享邀请链接,可获得视频播放与邀请奖励。

Max Lv (@m0d8ye) “基于开源大模型和分布式网络的 inference 服务成为可能。接下来就是 web3 inference” — TopicDigg

Max Lv 的个人资料封面
Max Lv 的头像
Max Lv
@m0d8ye
Chip Architect (2013-Present) | Bitcoin Enthusiast since 2011. Open source developer: Views are my own.
加入 June 2009
273 正在关注    18.9K 粉丝
基于开源大模型和分布式网络的 inference 服务成为可能。接下来就是 web3 inference 服务了吧。
Wow, it has happened! 30.55 tok/s on GLM-5.2 4-bit (from @Zai_org) ran by six RTX Pro 6000's across the USA scattered over WAN! I can't believe this. It was an insane build, you can read more about it on
显示更多