cuda.core: make managed-prefetch test page-size aware by rparolin · Pull Request #2268 · NVIDIA/cuda-python

rparolin · 2026-06-25T21:35:38Z

Summary

TestPrefetchBatch.test_per_buffer_location started failing on the CTK 13.x linux-aarch64 CI configs after the runners moved to the nvidia-64k (64 KB page-size) kernel:

>       assert last0 == _HOST_LOCATION_ID
E       assert 0 == -1

Root cause

The test hardcoded a 4096-byte allocation and assumed two pooled buffers landed on separate physical pages. Managed-memory prefetch and CU_MEM_RANGE_ATTRIBUTE_LAST_PREFETCH_LOCATION operate at page granularity, and ManagedMemoryResource is a pool, so the two allocate(4096) calls are packed adjacently (pointers 4 KB apart).

4 KB pages: each 4 KB buffer is its own page → per-buffer prefetch is independent → passes.
64 KB pages: both buffers share one 64 KB page → prefetching bufs[1] to the device migrates the whole shared page → querying bufs[0] reports device 0 instead of host (-1) → assert 0 == -1.

The prefetch itself worked correctly; the test's premise (sub-page allocations are independently prefetchable) only holds when buffer size ≥ page size. The failure is latent on any genuine 64 KB-page platform (Grace / Grace-Blackwell), so reverting the runner kernel only masks it.
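
The page arithmetic behind the two cases can be sketched in plain Python. This is only an illustration of the reasoning above, not code from the PR: the helper names and the base address are hypothetical, and no CUDA calls are involved.

```python
def pages_touched(ptr: int, size: int, page_size: int) -> set[int]:
    """Return the indices of every page overlapped by [ptr, ptr + size)."""
    first = ptr // page_size
    last = (ptr + size - 1) // page_size
    return set(range(first, last + 1))


def buffers_share_page(ptr_a: int, ptr_b: int, size: int, page_size: int) -> bool:
    """True if two equally sized buffers overlap at least one common page."""
    return bool(pages_touched(ptr_a, size, page_size) & pages_touched(ptr_b, size, page_size))


# Hypothetical addresses: a base aligned to both page sizes, and a second
# buffer packed 4 KB after it, as the pool does for two allocate(4096) calls.
BASE = 0x7F00_0000_0000
PTR0, PTR1 = BASE, BASE + 4096

shared_on_4k = buffers_share_page(PTR0, PTR1, 4096, 4 * 1024)
shared_on_64k = buffers_share_page(PTR0, PTR1, 4096, 64 * 1024)
```

With 4 KB pages the two buffers fall on different pages, so `shared_on_4k` is false; with 64 KB pages they land on the same page, so `shared_on_64k` is true and a prefetch of one buffer also moves the other.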

Fix

Derive _MANAGED_TEST_ALLOCATION_SIZE from mmap.PAGESIZE so each buffer occupies a full page on every platform (no hardcoded page size).
Add a precondition asserting the two buffers sit on distinct physical pages, so a future pool-packing change fails loudly instead of silently migrating a shared page.

Verification

pytest tests/memory/test_managed_ops.py → 34 passed, 1 skipped on an x86 (4 KB page) RTX 5880 Ada box, including the guarded test_per_buffer_location.

🤖 Generated with Claude Code

TestPrefetchBatch.test_per_buffer_location hardcoded a 4096-byte allocation and assumed two pooled buffers landed on separate physical pages. Managed-memory prefetch and CU_MEM_RANGE_ATTRIBUTE_LAST_PREFETCH_LOCATION operate at page granularity, so on nvidia-64k aarch64 kernels both 4 KB buffers shared one 64 KB page; prefetching buf[1] to the device migrated the shared page and buf[0]'s host prefetch reported device 0 (assert 0 == -1).

Derive the allocation size from mmap.PAGESIZE so each buffer occupies a full page on every platform, and add a precondition asserting the two buffers sit on distinct pages so a pool-packing regression fails loudly.

Co-Authored-By: Claude Opus 4.8 (1M context) <noreply@anthropic.com>

mdboom · 2026-06-25T21:41:07Z

Should fix #2267.

github-actions · 2026-06-25T21:54:40Z

Doc Preview CI
🚀 View preview at https://nvidia.github.io/cuda-python/pr-preview/pr-2268/
https://nvidia.github.io/cuda-python/pr-preview/pr-2268/cuda-core/
https://nvidia.github.io/cuda-python/pr-preview/pr-2268/cuda-bindings/
https://nvidia.github.io/cuda-python/pr-preview/pr-2268/cuda-pathfinder/
Preview will be ready when the GitHub Pages deployment is complete.

rparolin added this to the cuda.core next milestone Jun 25, 2026

rparolin added the bug, test, and cuda.core labels Jun 25, 2026

rparolin requested review from kkraus14 and leofang June 25, 2026 21:36

rparolin self-assigned this Jun 25, 2026

rparolin modified the milestones: cuda.core next, cuda.bindings 13.4.0 & 12.9.8 Jun 25, 2026

kkraus14 approved these changes Jun 26, 2026

rparolin wants to merge 1 commit into NVIDIA:main from rparolin:fix/managed-prefetch-page-size

3 participants
