注册并分享邀请链接,可获得视频播放与邀请奖励。

Max Lv (@m0d8ye) “1T 模型跑在 8 卡系统达到 1000 tokens/sec 这不就是明牌 B300 NVL8 了吗?” — TopicDigg

Max Lv 的个人资料封面
Max Lv 的头像
Max Lv
@m0d8ye
Chip Architect (2013-Present) | Bitcoin Enthusiast since 2011. Open source developer: Views are my own.
加入 June 2009
273 正在关注    18.9K 粉丝
1T 模型跑在 8 卡系统达到 1000 tokens/sec 这不就是明牌 B300 NVL8 了吗?
🚀 1,000+ TOKENS/S ON A 1T MODEL! 🚀 We are thrilled to release Xiaomi MiMo-V2.5-Pro-UltraSpeed in collaboration with @TileRT_AI , breaking the 1,000 tokens/s output speed on a 1 Trillion parameter model for the FIRST TIME! Not wafer-scale integration like Cerebras. Not pure on-chip SRAM chips like Groq. We achieve 1,000 tps on a 1T MoE model using just a SINGLE, STANDARD 8-GPGPU NODE. Read the full technical deep dive: Want to experience the future of real-time AI? 👉 Apply for UltraSpeed now: ⏳ Limited-Time Access: Application-based · Jun 8 – Jun 23 (PDT) 💬 Chat Experience: Completely FREE for a limited time — try the blazing-fast web chat now. ⚡ UltraSpeed API: Just 3x the price for a ~10x boost in output experience. 🤝 Enterprise & Large-Scale Needs: business-mimo@xiaomi.com
显示更多
0
15
87
5
转发到社区