[Klaud Cold] Update kimik2.5-fp4-b200-vllm vLLM image to v0.24.0 / 将 kimik2.5-fp4-b200-vllm 的 vLLM 镜像 升级至 v0.24.0#2075
Conversation
中文:将 kimik2.5-fp4-b200-vllm 的 vLLM 镜像 升级至 v0.24.0。 Co-Authored-By: Claude Fable 5 <noreply@anthropic.com>
65a6dcf to
2d9ba77
Compare
|
Thanks for the contribution! Please reach out to respective companies' CODEOWNER to fill in the latest PR_REVIEW_CHECKLIST.md before pinging core maintainer on Slack for review. In order for the signoff PR check bot to trigger, you must follow the PR_REVIEW_CHECKLIST.md template correctly, including the phrase For PR verification, add the PR authors are responsible for ensuring that after merging, all GitHub Action jobs fully pass. A lot of the time, failures are just flakes and simply re-running the failed jobs will fix it. See GitHub's docs on re-running failed jobs 感谢你的贡献!请联系相应公司的 CODEOWNER 填写最新的 PR_REVIEW_CHECKLIST.md,然后再在 Slack 上联系核心维护者进行审阅。为了触发 signoff PR 检查机器人,你必须正确遵循 PR_REVIEW_CHECKLIST.md 模板,包括保留英文语句 如需进行 PR 验证,请为此 PR 添加 PR 作者有责任确保合并后所有 GitHub Action 任务完全通过。 很多时候失败只是偶发抖动(flake),重新运行失败的任务即可解决。参见 GitHub 关于重新运行失败任务的文档 |
3 similar comments
|
Thanks for the contribution! Please reach out to respective companies' CODEOWNER to fill in the latest PR_REVIEW_CHECKLIST.md before pinging core maintainer on Slack for review. In order for the signoff PR check bot to trigger, you must follow the PR_REVIEW_CHECKLIST.md template correctly, including the phrase For PR verification, add the PR authors are responsible for ensuring that after merging, all GitHub Action jobs fully pass. A lot of the time, failures are just flakes and simply re-running the failed jobs will fix it. See GitHub's docs on re-running failed jobs 感谢你的贡献!请联系相应公司的 CODEOWNER 填写最新的 PR_REVIEW_CHECKLIST.md,然后再在 Slack 上联系核心维护者进行审阅。为了触发 signoff PR 检查机器人,你必须正确遵循 PR_REVIEW_CHECKLIST.md 模板,包括保留英文语句 如需进行 PR 验证,请为此 PR 添加 PR 作者有责任确保合并后所有 GitHub Action 任务完全通过。 很多时候失败只是偶发抖动(flake),重新运行失败的任务即可解决。参见 GitHub 关于重新运行失败任务的文档 |
|
Thanks for the contribution! Please reach out to respective companies' CODEOWNER to fill in the latest PR_REVIEW_CHECKLIST.md before pinging core maintainer on Slack for review. In order for the signoff PR check bot to trigger, you must follow the PR_REVIEW_CHECKLIST.md template correctly, including the phrase For PR verification, add the PR authors are responsible for ensuring that after merging, all GitHub Action jobs fully pass. A lot of the time, failures are just flakes and simply re-running the failed jobs will fix it. See GitHub's docs on re-running failed jobs 感谢你的贡献!请联系相应公司的 CODEOWNER 填写最新的 PR_REVIEW_CHECKLIST.md,然后再在 Slack 上联系核心维护者进行审阅。为了触发 signoff PR 检查机器人,你必须正确遵循 PR_REVIEW_CHECKLIST.md 模板,包括保留英文语句 如需进行 PR 验证,请为此 PR 添加 PR 作者有责任确保合并后所有 GitHub Action 任务完全通过。 很多时候失败只是偶发抖动(flake),重新运行失败的任务即可解决。参见 GitHub 关于重新运行失败任务的文档 |
|
Thanks for the contribution! Please reach out to respective companies' CODEOWNER to fill in the latest PR_REVIEW_CHECKLIST.md before pinging core maintainer on Slack for review. In order for the signoff PR check bot to trigger, you must follow the PR_REVIEW_CHECKLIST.md template correctly, including the phrase For PR verification, add the PR authors are responsible for ensuring that after merging, all GitHub Action jobs fully pass. A lot of the time, failures are just flakes and simply re-running the failed jobs will fix it. See GitHub's docs on re-running failed jobs 感谢你的贡献!请联系相应公司的 CODEOWNER 填写最新的 PR_REVIEW_CHECKLIST.md,然后再在 Slack 上联系核心维护者进行审阅。为了触发 signoff PR 检查机器人,你必须正确遵循 PR_REVIEW_CHECKLIST.md 模板,包括保留英文语句 如需进行 PR 验证,请为此 PR 添加 PR 作者有责任确保合并后所有 GitHub Action 任务完全通过。 很多时候失败只是偶发抖动(flake),重新运行失败的任务即可解决。参见 GitHub 关于重新运行失败任务的文档 |
|
see unofficial run visualizer at https://inferencex.semianalysis.com/inference?unofficialRun=28720105405 |
There was a problem hiding this comment.
LGTM — mechanical vLLM image bump for kimik2.5-fp4-b200-vllm from v0.22.0 to v0.24.0 with matching perf-changelog entry.
Extended reasoning...
Overview
This PR bumps the vLLM image for the single recipe kimik2.5-fp4-b200-vllm from vllm/vllm-openai:v0.22.0 to vllm/vllm-openai:v0.24.0 in configs/nvidia-master.yaml, and appends a matching entry to perf-changelog.yaml linking back to this PR. Two lines of substantive change across two files.
Security risks
None. This is a container image tag update in a benchmark recipe config — no auth, crypto, permissions, or user-facing surface touched.
Level of scrutiny
Low. This is a mechanical Klaud Cold image bump that matches the exact pattern of recent merged commits (e.g. 2d9ba77 which is the same PR title/shape). The full-sweep-fail-fast label is applied, so the benchmark sweep will empirically validate whether v0.24.0 actually works before merge.
Other factors
The perf-changelog entry is well-formed and correctly references PR #2075. No outstanding reviewer comments beyond the standard recipe-reminder bot messages. Bug hunting system found nothing.
|
see unofficial run visualizer at https://inferencex.semianalysis.com/inference?unofficialRun=28720105405 |
|
Canary blocked by cluster-side model staging, not the image bump: enroot cannot mount |
|
Closing: blocked by NVIDIA cluster-side model staging (NVFP4 checkpoint missing/unreadable) or image-pull lock issues. Will re-cut once the cluster staging is fixed. |
Summary
Update vLLM image from vllm/vllm-openai:v0.22.0 to vllm/vllm-openai:v0.24.0
Recipes touched:
kimik2.5-fp4-b200-vllm中文说明
将 vLLM 镜像 从 vllm/vllm-openai:v0.22.0 升级至 vllm/vllm-openai:v0.24.0。涉及配置:
kimik2.5-fp4-b200-vllm。Test plan
🤖 Generated with Claude Code