
Google Gemma (@googlegemma) “16 parallel runs of Gemma 4 26B A4B on a single NVIDIA DGX Spark! Pushing 18 tok” — TopicDigg

Google Gemma
@googlegemma
The official home of Google's Gemma. Lightweight, state-of-the-art open models by Google DeepMind, built on Gemini tech. What will you build? 🚀💻
Joined April 2025
0 Following    87.9K Followers
16 parallel runs of Gemma 4 26B A4B on a single NVIDIA DGX Spark! Pushing 18 tok/s per instance and a 300 tok/s aggregate. It can even hit 32 parallel runs. This level of concurrency highlights how efficient the architecture is.
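As a rough sanity check on the figures quoted in the post, the per-instance rate times the instance count should land near the stated aggregate. The sketch below assumes throughput scales linearly across instances and that the quoted numbers are rounded. The function name `aggregate_throughput` is hypothetical and only for illustration. The post gives no per-instance rate for 32 runs, so that case is left out.

```python
def aggregate_throughput(instances: int, tokens_per_second_each: float) -> float:
    """Estimate total tokens/s, assuming every instance runs at the same rate."""
    return instances * tokens_per_second_each


# Figures quoted in the post: 16 instances, 18 tok/s each, 300 tok/s aggregate.
quoted_aggregate = 300.0
estimate = aggregate_throughput(16, 18.0)

# Relative gap between the linear estimate and the quoted aggregate.
relative_gap = abs(quoted_aggregate - estimate) / quoted_aggregate

print(f"{estimate:.0f} tok/s estimated, {relative_gap:.0%} off the quoted aggregate")
```

The linear estimate comes to 288 tok/s, which is consistent with the quoted 300 tok/s if both figures were rounded.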