[Klaud Cold] [AMD] gpt-oss-fp4-mi355x (vllm): W4A8 moe optimizations and vllm image bump / gpt-oss-fp4-mi355x(vLLM):W4A8 MoE 优化与 vLLM 镜像升级#2051
Conversation
# Conflicts: # perf-changelog.yaml
|
Thanks for the contribution! Please reach out to respective companies' CODEOWNER to fill in the latest PR_REVIEW_CHECKLIST.md before pinging core maintainer on Slack for review. In order for the signoff PR check bot to trigger, you must follow the PR_REVIEW_CHECKLIST.md template correctly, including the phrase For PR verification, add the PR authors are responsible for ensuring that after merging, all GitHub Action jobs fully pass. A lot of the time, failures are just flakes and simply re-running the failed jobs will fix it. See GitHub's docs on re-running failed jobs 感谢你的贡献!请联系相应公司的 CODEOWNER 填写最新的 PR_REVIEW_CHECKLIST.md,然后再在 Slack 上联系核心维护者进行审阅。为了触发 signoff PR 检查机器人,你必须正确遵循 PR_REVIEW_CHECKLIST.md 模板,包括保留英文语句 如需进行 PR 验证,请为此 PR 添加 PR 作者有责任确保合并后所有 GitHub Action 任务完全通过。 很多时候失败只是偶发抖动(flake),重新运行失败的任务即可解决。参见 GitHub 关于重新运行失败任务的文档 |
perf-changelog.yaml resolved by taking main's entries and re-appending this PR's gptoss-fp4-mi355x-vllm entry at the tail. 中文:将 origin/main 合并进本分支;perf-changelog.yaml 按惯例处理 - 采用 main 的条目并将本 PR 的条目重新追加到末尾。 Co-Authored-By: Claude Fable 5 <noreply@anthropic.com>
Image & model
a8w4 optimizations picked up via the image
Extend the concs sweep coverage