Kimi K2: Open Agentic Intelligence (English version)
Posted by: wx****a3
2025-08-21
6 MB
32 pages
File list:
Kimi K2 开放代理人工智能(英文版).pdf
Summary
We introduce Kimi K2, a Mixture-of-Experts (MoE) large language model with 32 billion activated
parameters and 1 trillion total parameters. We propose the MuonClip optimizer, which improves upon
Muon with a novel QK-clip technique to address training instability while enjoying the advanced
token efficiency of Muon. Based on MuonClip, K2 was pre-trained on 15.5 trillion tokens with zero
loss spikes. After pre-training, K2 undergoes a multi-stage post-training process, highlighted by a
large-scal


