-
Notifications
You must be signed in to change notification settings - Fork 217
Pull requests: SemiAnalysisAI/InferenceX
Author
Label
Projects
Milestones
Reviews
Assignee
Sort
Pull requests list
[AMD] gpt-oss-fp4-mi355x (vllm): W4A8 moe optimizations and vllm image bump
#2051
opened Jul 4, 2026 by
xiaohuguo2023
Collaborator
Loading…
[NV] llm-d-vllm: Add llm-d to the InferenceX benchmarking framework
full-sweep-enabled
#2050
opened Jul 4, 2026 by
ezrasilvera
Collaborator
Loading…
3 tasks
[codex] Remove stale CollectiveX workflow / 删除过时的 CollectiveX 工作流
#2040
opened Jul 4, 2026 by
Oseltamivir
Collaborator
Loading…
[Klaud Cold] Remove disallowed --hf-overrides indexer override from DSV4 ATOM disagg / 移除 DSV4 ATOM disagg 中不允许的 --hf-overrides indexer 覆盖
full-sweep-fail-fast
#2038
opened Jul 4, 2026 by
functionstackx
Collaborator
Loading…
[Klaud Cold] Remove disallowed --hf-overrides indexer override from DSV4 FP4 MI355X ATOM / 移除 DSV4 FP4 MI355X ATOM 中不允许的 --hf-overrides indexer 覆盖
full-sweep-fail-fast
#2037
opened Jul 4, 2026 by
functionstackx
Collaborator
Loading…
[WIP] Update Minimax M3 FP4 B200 Eagle
full-sweep-enabled
#2007
opened Jul 3, 2026 by
wzhao18
Collaborator
Loading…
Update Minimax M3 FP4 B300 Eagle
full-sweep-enabled
#2006
opened Jul 3, 2026 by
wzhao18
Collaborator
Loading…
Finalize CollectiveX v1 cross-vendor EP benchmark suite / 完成 CollectiveX v1 跨厂商 EP 基准测试套件
#2004
opened Jul 3, 2026 by
Oseltamivir
Collaborator
Loading…
[AMD] MiniMax-M3 MXFP8 MI355X vLLM: nightly + AITER-on TP4 + emulatin linear / MiniMax-M3 MXFP8 MI355X vLLM:升级 nightly + 启用 AITER TP4 + emulation linear
full-sweep-enabled
#2003
opened Jul 3, 2026 by
hongxiayang
Collaborator
Loading…
[AMD] MiniMax-M3 FP4/FP8 MI355X ATOMESH (disagg): refactor config & add MTP recipes / 重构配置并新增 MTP 配方 / 설정 리팩토링 및 MTP 레시피 추가
AMD
evals-only
Suppress throughput and run only eval jobs; combine with all-evals to expand selection
full-sweep-enabled
#2000
opened Jul 3, 2026 by
seungrokj
Collaborator
Loading…
8 tasks
[WIP] Test Kimi 2.5 B300 Agg
full-sweep-enabled
#1998
opened Jul 3, 2026 by
wzhao18
Collaborator
Loading…
[DNM][AMD] agentX benchmark (v1.0) / agentX 基准测试 (v1.0) / agentX 벤치마크 (v1.0)
#1996
opened Jul 3, 2026 by
seungrokj
Collaborator
Loading…
chore(deps): bump the github-actions group across 1 directory with 3 updates
dependencies
Pull requests that update a dependency file
github_actions
Pull requests that update GitHub Actions code
#1995
opened Jul 3, 2026 by
dependabot
Bot
Loading…
Update Minimax M3 B300 FP4 vllm
full-sweep-enabled
#1994
opened Jul 2, 2026 by
wzhao18
Collaborator
Loading…
[NV] perf: update MiniMax-M3 FP4 B300 vLLM MTP
full-sweep-fail-fast
#1991
opened Jul 2, 2026 by
anish-shanbhag
Collaborator
Loading…
[WIP] [do not merge] Add MiniMax-M3 FP4 B200 Dynamo-vLLM disagg config
full-sweep-fail-fast-no-canary
Full sweep, no canary gate; first failure in a matrix cancels that matrix
#1982
opened Jul 2, 2026 by
jasonlizhengjian
Collaborator
Loading…
[AMD] DeepSeek-V4 FP4 MI355X vLLM MTP: bump image to latest nightly / DeepSeek-V4 FP4 MI355X vLLM MTP:升级镜像至最新 nightly
full-sweep-fail-fast
#1981
opened Jul 2, 2026 by
Fangzhou-Ai
Collaborator
Loading…
[AMD] DeepSeek-V4 FP4 MI355X vLLM STP: bump image to latest nightly / DeepSeek-V4 FP4 MI355X vLLM STP:升级镜像至最新 nightly
full-sweep-fail-fast
#1980
opened Jul 2, 2026 by
Fangzhou-Ai
Collaborator
Loading…
[AMD] MiniMax-M3 FP4 MI355X vLLM MTP: close gap vs ATOM (INT4 all-reduce + index-sharing) / MiniMax-M3 FP4 MI355X vLLM MTP:缩小与 ATOM 的性能差距(INT4 all-reduce + 跨层索引共享)
full-sweep-fail-fast
#1979
opened Jul 2, 2026 by
Fangzhou-Ai
Collaborator
Loading…
[AMD] MiniMax-M3 FP4 MI355X vLLM STP: close gap vs ATOM (INT4 all-reduce + index-sharing) / MiniMax-M3 FP4 MI355X vLLM STP:缩小与 ATOM 的性能差距(INT4 all-reduce + 跨层索引共享)
full-sweep-fail-fast
#1969
opened Jul 1, 2026 by
Fangzhou-Ai
Collaborator
Loading…
test the GB300 cluster after the node patch
full-sweep-enabled
#1961
opened Jun 30, 2026 by
richardhuo-nv
Collaborator
Loading…
Update Qwen3.5 FP4 MI355X MTP recipe with tuned env/flags / 使用调优的环境变量和参数更新 Qwen3.5 FP4 MI355X MTP 配方
#1957
opened Jun 29, 2026 by
amd-fuyuajin
Collaborator
Loading…
[AMD] Enable AITER MoE for MiniMax-M3 MI355X vLLM MTP benchmarks / 为 MiniMax-M3 MI355X vLLM MTP 基准测试启用 AITER MoE
#1955
opened Jun 29, 2026 by
Fangzhou-Ai
Collaborator
•
Draft
2 of 3 tasks
Previous Next
ProTip!
no:milestone will show everything without a milestone.