Cerebras Kimi K2.6 Hits 981 tok/s, Beating Top GPUs in Inference
Cerebras launches 1T-parameter Kimi K2.6 on Wafer-Scale Engine 3, reaching 981 tokens/sec—6.7× faster than top cloud GPUs and 23× market average.
22 min0
Cerebras launches 1T-parameter Kimi K2.6 on Wafer-Scale Engine 3, reaching 981 tokens/sec—6.7× faster than top cloud GPUs and 23× market average.