Composer 2.5 being Pareto dominant in coding per CursorBench is important.
This is after only a few weeks of supplemental training and/or RL in the Colossus 2 cluster.
The 1.5 trillion parameter version of Grok will likely be a much better base model than Kimi. We shall see.