思考模式切换
1 硬分叉
2 软切换
messages = [
{"role": "user", "content": "问题 /no_think"},
{"role": "assistant", "content": "<think>\n\n</think>\n"}
]3 动态计算预算
4 训练阶段的差异
Last updated
messages = [
{"role": "user", "content": "问题 /no_think"},
{"role": "assistant", "content": "<think>\n\n</think>\n"}
]Last updated