Avoid eager C-order copy in NibabelReader (fixes #8107) by aymuos15 · Pull Request #8825 · Project-MONAI/MONAI

aymuos15 · 2026-04-13T10:24:40Z

Summary

NibabelReader._get_array_data forced np.asanyarray(img.dataobj, order="C"), triggering a full dense memory reorder on every load on top of the file read/decompression step. This is the hot path reported in Very Slow Loading of NIfTI (.nii.gz) Files Compared to SimpleITK #8107.
Drop the forced C-order conversion and keep nibabel's native (F-order) layout. This aligns the CPU path with the existing GPU/cupy branch just above, which already returns F-order via .reshape(data_shape, order="F").
Downstream MONAI conversion paths (convert_to_tensor/convert_to_numpy in monai/utils/type_conversion.py, monai/data/image_writer.py, recon utils) already call ascontiguousarray where they actually need C-contiguous memory, so the reader does not need to pay that cost eagerly at load time.

Biggest wins are on uncompressed .nii, where nibabel's memmap view is returned lazily rather than being materialized by a forced reorder. Compressed .nii.gz still pays the decompression cost but skips the subsequent reorder pass, which matches the "twice as long" observation from @ericspod in the issue thread.

Compatibility note

The returned array's memory layout changes from C-contiguous to whatever nibabel provides (typically F-contiguous). Any external caller consuming reader.get_data(...)[0] directly via .tobytes() or a raw C-extension buffer without first calling ascontiguousarray would see a different byte order. All in-repo consumers already guard themselves.

Test plan

New regression test in tests/data/test_init_reader.py loads a small NIfTI through NibabelReader for both .nii and .nii.gz, asserts array equality, and asserts the returned data is not C-contiguous (i.e. no eager C copy).
pytest tests/data/test_init_reader.py passes locally.
runtests.sh-equivalent checks: ruff, black --skip-magic-trailing-comma --check, isort --check, pycln, pre-commit hooks — all clean on touched files.

Fixes #8107

Nibabel exposes NIfTI voxel buffers in their native Fortran layout, but MONAI was forcing np.asanyarray(img.dataobj, order="C") in NibabelReader._get_array_data(). For compressed .nii.gz inputs that adds a full dense memory reorder on top of the file read/decompression step, which is the hot path reported in issue Project-MONAI#8107. Drop the forced C-order conversion and keep nibabel's native array layout instead. Downstream MONAI conversion paths already handle contiguity when they actually need it, so the reader does not need to pay that cost eagerly at load time. Add a regression test that loads a small NIfTI image through NibabelReader and asserts the returned data is still correct while preserving the native F-contiguous layout. This guards against reintroducing the eager copy in the reader path. Signed-off-by: Soumya Snigdha Kundu <soumya_snigdha.kundu@kcl.ac.uk>

Exercise both .nii and .nii.gz inputs in the tiny layout regression test so the reader path stays covered without adding a benchmark or a heavier fixture. Signed-off-by: Soumya Snigdha Kundu <soumya_snigdha.kundu@kcl.ac.uk>

coderabbitai · 2026-04-13T10:24:50Z

No actionable comments were generated in the recent review. 🎉

ℹ️ Recent review info

⚙️ Run configuration

Configuration used: Path: .coderabbit.yaml

Review profile: CHILL

Plan: Pro

Run ID: 105123e7-44f8-4a59-b8bb-164b4ba564e3

📥 Commits

Reviewing files that changed from the base of the PR and between f1873ec and 2ca1aa9.

📒 Files selected for processing (1)

monai/data/image_reader.py

🚧 Files skipped from review as they are similar to previous changes (1)

monai/data/image_reader.py

📝 Walkthrough

Walkthrough

Removed the explicit order="C" argument from np.asanyarray(img.dataobj) in NibabelReader._get_array_data, letting NumPy decide memory layout instead of forcing a C-contiguous copy. Added a unit test test_nibabel_reader_avoids_eager_c_order_copy (skipped if nibabel absent) that writes small NIfTI files (.nii and .nii.gz), reads them with NibabelReader(mmap=False), verifies data equality with the source array, and asserts the resulting ndarray is not C-contiguous.

Estimated code review effort

🎯 2 (Simple) | ⏱️ ~10 minutes

🚥 Pre-merge checks | ✅ 5

✅ Passed checks (5 passed)

Check name	Status	Explanation
Title check	✅ Passed	Title clearly describes the main change: removing forced C-order conversion in NibabelReader and referencing the fixed issue.
Description check	✅ Passed	Description includes summary, compatibility notes, and test plan matching template structure with technical details about the performance issue and solution.
Linked Issues check	✅ Passed	Changes directly address `#8107`'s root cause: eliminating numpy.asanyarray with forced C-order that consumed ~20s in profiling, replacing with native nibabel layout.
Out of Scope Changes check	✅ Passed	All changes (removing order parameter, adding regression test) are within scope of fixing the performance issue identified in `#8107`.
Docstring Coverage	✅ Passed	No functions found in the changed files to evaluate docstring coverage. Skipping docstring coverage check.

_{✏️ Tip: You can configure your own custom pre-merge checks in the settings.}

✨ Finishing Touches

🧪 Generate unit tests (beta)

Create PR with unit tests

Thanks for using CodeRabbit! It's free for OSS, and your support helps us grow. If you like it, consider giving us a shout-out.

❤️ Share

_{Comment @coderabbitai help to get the list of available commands and usage tips.}

coderabbitai

🧹 Nitpick comments (1)

tests/data/test_init_reader.py (1)
84-84: Add a docstring to the new test method.

Line 84 introduces a new definition without a docstring; add a short Google-style docstring.

As per coding guidelines, "Docstrings should be present for all definition which describe each variable, return value, and raised exception in the appropriate section of the Google-style of docstrings."
🤖 Prompt for AI Agents
Verify each finding against the current code and only fix it if needed.

In `@tests/data/test_init_reader.py` at line 84, Add a Google-style docstring to
the test function test_nibabel_reader_avoids_eager_c_order_copy describing what
the test verifies (that the nibabel reader avoids an eager C-order copy),
include a short "Args:" section only if the test takes parameters (omit
otherwise), and include a "Raises:" or "Returns:" section only if the test
explicitly raises or returns something (omit otherwise); place the docstring
immediately under the def line in triple quotes.

🤖 Prompt for all review comments with AI agents

Verify each finding against the current code and only fix it if needed.

Nitpick comments:
In `@tests/data/test_init_reader.py`:
- Line 84: Add a Google-style docstring to the test function
test_nibabel_reader_avoids_eager_c_order_copy describing what the test verifies
(that the nibabel reader avoids an eager C-order copy), include a short "Args:"
section only if the test takes parameters (omit otherwise), and include a
"Raises:" or "Returns:" section only if the test explicitly raises or returns
something (omit otherwise); place the docstring immediately under the def line
in triple quotes.

ℹ️ Review info

⚙️ Run configuration

Configuration used: Path: .coderabbit.yaml

Review profile: CHILL

Plan: Pro

Run ID: be4e5a1e-b10b-4097-a28d-5d3c9fda14f0

📥 Commits

Reviewing files that changed from the base of the PR and between 5a2d0a7 and f1873ec.

📒 Files selected for processing (2)

monai/data/image_reader.py
tests/data/test_init_reader.py

atharvajoshi01

Removing the explicit order='C' avoids a potentially expensive copy when the underlying NIfTI data is already in Fortran order (which is the native layout for medical imaging volumes). NumPy's default behavior without the order arg will return the data in its native layout, which is what downstream code expects anyway. Test coverage looks good.

ericspod · 2026-04-16T11:49:35Z

This change was originally added here explicitly for a fix. It's not stated and I'm struggling to refresh my memory but it could be MetaTensor or lazy transform related given the context. @wyli or @atbenmurray may remember better than I, but it seemed this served a purpose so more investigation is needed with transforms.

aymuos15 · 2026-04-16T11:55:14Z

Ahh okay, my bad. Should've looked into that. My internal testing showed this was better. Ill have a more deeper dive on this.

ericspod · 2026-04-16T11:59:51Z

Ahh okay, my bad. Should've looked into that. My internal testing showed this was better. Ill have a more deeper dive on this.

Thanks for the effort, it's possible we can do this change now with other changes that have been done since, or add a constructor argument to choose ordering. If you could please look into it further that would be great.

atbenmurray · 2026-05-01T13:04:52Z

I wasn't involved in this one, I think

aymuos15 · 2026-05-04T08:55:00Z

convert_to_tensor already calls np.ascontiguousarray at the tensor boundary (type_conversion.py:171, added in #4539, 2022-06), seven months before #5916 added the reader-level copy. So we can actually drop order="C". The contiguity guarantee is preserved, and end-to-end pipelines (LoadImage+EnsureChannelFirst+Spacing) produce bit-identical output for F-order vs C-order reader paths.

aymuos15 · 2026-05-04T08:56:44Z

Code to benchmark:

import gc, os, statistics, tempfile, time, tracemalloc
import numpy as np
import nibabel as nib
from monai.data import NibabelReader

SIZES = [(128, 128, 128), (256, 256, 128), (256, 256, 256), (512, 512, 256)]
DTYPES = [np.int16, np.float32]
SUFFIXES = [".nii", ".nii.gz"]
N_TRIALS = 5
N_INNER = 3

class CReader(NibabelReader):
    def _get_array_data(self, img, filename):
        return np.asanyarray(img.dataobj, order="C")

class NativeReader(NibabelReader):
    def _get_array_data(self, img, filename):
        return np.asanyarray(img.dataobj)

def time_load(reader_cls, fn):
    times = []
    for _ in range(N_INNER):
        gc.collect()
        r = reader_cls(mmap=False)
        t0 = time.perf_counter()
        img = r.read(fn)
        data, _ = r.get_data(img)
        _ = data.shape
        t1 = time.perf_counter()
        times.append(t1 - t0)
        del data, img, r
    return min(times)

def peak_mem(reader_cls, fn):
    gc.collect()
    tracemalloc.start()
    r = reader_cls(mmap=False)
    img = r.read(fn)
    data, _ = r.get_data(img)
    _, peak = tracemalloc.get_traced_memory()
    tracemalloc.stop()
    del data, img, r
    return peak

print(f"{'Size':<20}{'dtype':<10}{'fmt':<8}{'C-order (ms)':>14}{'Native (ms)':>14}{'speedup':>10}{'peak C (MB)':>14}{'peak N (MB)':>14}")
print("-" * 120)
with tempfile.TemporaryDirectory() as tdir:
    for size in SIZES:
        for dtype in DTYPES:
            arr = np.random.RandomState(0).randint(0, 1000, size=size).astype(dtype)
            for suf in SUFFIXES:
                fn = os.path.join(tdir, f"vol_{size}_{dtype.__name__}{suf}")
                nib.save(nib.Nifti1Image(arr, np.eye(4)), fn)
                with open(fn, "rb") as f: f.read()  # warm OS cache

                trials_c = [time_load(CReader, fn) for _ in range(N_TRIALS)]
                trials_n = [time_load(NativeReader, fn) for _ in range(N_TRIALS)]
                peak_c = peak_mem(CReader, fn)
                peak_n = peak_mem(NativeReader, fn)

                med_c = statistics.median(trials_c) * 1000
                med_n = statistics.median(trials_n) * 1000
                speedup = med_c / med_n if med_n else float("inf")
                print(f"{str(size):<20}{dtype.__name__:<10}{suf:<8}"
                      f"{med_c:>14.2f}{med_n:>14.2f}{speedup:>9.2f}x"
                      f"{peak_c/1e6:>14.1f}{peak_n/1e6:>14.1f}")
                os.remove(fn)

Table:

Size	dtype	fmt	C-order (ms)	Native (ms)	speedup	peak C (MB)	peak N (MB)
(128, 128, 128)	int16	.nii	5.02	1.55	3.23x	8.4	4.2
(128, 128, 128)	int16	.nii.gz	32.77	27.79	1.18x	8.5	8.5
(128, 128, 128)	float32	.nii	6.44	2.47	2.61x	16.8	8.4
(128, 128, 128)	float32	.nii.gz	52.71	49.13	1.07x	16.9	16.9
(256, 256, 128)	int16	.nii	38.31	3.47	11.05x	33.6	16.8
(256, 256, 128)	int16	.nii.gz	151.18	115.48	1.31x	33.7	33.7
(256, 256, 128)	float32	.nii	84.59	13.53	6.25x	67.1	33.6
(256, 256, 128)	float32	.nii.gz	269.72	211.42	1.28x	67.2	67.2
(256, 256, 256)	int16	.nii	224.89	15.01	14.99x	67.1	33.6
(256, 256, 256)	int16	.nii.gz	419.21	228.68	1.83x	67.2	67.2
(256, 256, 256)	float32	.nii	323.08	30.47	10.60x	134.2	67.1
(256, 256, 256)	float32	.nii.gz	788.41	417.67	1.89x	134.4	134.4
(512, 512, 256)	int16	.nii	1172.43	51.28	22.86x	268.5	134.2
(512, 512, 256)	int16	.nii.gz	2238.94	933.13	2.40x	268.6	268.6
(512, 512, 256)	float32	.nii	1436.58	118.83	12.09x	536.9	268.5
(512, 512, 256)	float32	.nii.gz	2992.65	1569.16	1.91x	537.0	537.0

ericspod · 2026-05-05T16:51:58Z

@coderabbitai Is the statement about not needing order="C" in #8825 (comment) correct?

coderabbitai · 2026-05-05T16:52:48Z

Tip

For best results, initiate chat on the files or code changes.

🧩 Analysis chain

🏁 Script executed:

#!/bin/bash
# Check type_conversion.py for ascontiguousarray
rg -n "ascontiguousarray" --type py -C 5

# Check NibabelReader._get_array_data and surrounding context
rg -n "_get_array_data" --type py -C 10

# Check if there's any direct use of NibabelReader output that bypasses convert_to_tensor
rg -n "get_data\|_get_array_data\|NibabelReader" --type py -C 3 monai/data/image_reader.py | head -100

Length of output: 33345

🏁 Script executed:

#!/bin/bash
# Get full NibabelReader._get_array_data body
sed -n '1192,1250p' monai/data/image_reader.py

# Also check how the result of NibabelReader.get_data is consumed - specifically MetaTensor construction
rg -n "convert_to_tensor\|convert_to_numpy" monai/data/image_reader.py -A2 -B2

# Check if lazy resampling/transforms consume the array directly without type conversion
rg -n "c_contiguous\|is_contiguous\|order.*['\"]C['\"]" --type py monai/ -C 3

Length of output: 2707

@ericspod Yes, the claim is correct for standard MONAI pipeline usage. Here's the verification:

convert_to_tensor already enforces contiguity — type_conversion.py line 171:

# unconditional for ndim > 0 numpy arrays
data = np.ascontiguousarray(data)

And convert_to_numpy does the same at line 236. Since meta_tensor.py:555 calls convert_to_tensor(im, ...) when constructing a MetaTensor, any array passing through LoadImage will be made C-contiguous regardless of what the reader returns.

The GPU path in NibabelReader._get_array_data already returns F-order (reshape(data_shape, order="F")), so removing order="C" on the CPU path makes the two paths consistent.

One caveat worth noting: callers using NibabelReader directly (e.g., reader.get_data(img)) without subsequently calling convert_to_tensor/convert_to_numpy will now receive F-order arrays. This is a behavioral change at the raw API level, even if it's fine within the standard pipeline. The PR's regression test actually asserts not data.flags.c_contiguous, which is correct for the reader in isolation but could surprise direct API users. Worth documenting in the reader's docstring or release notes.

Signed-off-by: Eric Kerfoot <17726042+ericspod@users.noreply.github.com>

ericspod · 2026-05-05T16:59:28Z

Hi @aymuos15 I've made a very minor change to the documentation based on what Coderabbit said. As a sanity check it agrees with your assessment and since our tests pass we should be good. I'm aware a lot of our tests only check the shape of output arrays rather than contents so it's possible they are not thorough enough, but from our analysis I'm confident to merge this.

aymuos15 · 2026-05-05T17:02:24Z

Thank you very much for double checking everything. I did run quite a few personal tests apart from shapes to confirm. I am quite confident as well! Do you think this warrants another PR for stronger tests around this?

ericspod · 2026-05-05T17:05:52Z

Thank you very much for double checking everything. I did run quite a few personal tests apart from shapes to confirm. I am quite confident as well! Do you think this warrants another PR for stronger tests around this?

For this specifically we're good I feel. There's still a lot of test refactoring to do, and tighter loading test criteria would be part of that bigger effort. If you wanted to have a look at things that wouldn't hurt, perhaps coordinate with @garciadias who was doing some of that earlier.

aymuos15 · 2026-05-05T17:10:06Z

I will speak to him then, thank you.

aymuos15 added 2 commits April 13, 2026 11:05

Broaden NibabelReader layout regression coverage

e7412d5

Exercise both .nii and .nii.gz inputs in the tiny layout regression test so the reader path stays covered without adding a benchmark or a heavier fixture. Signed-off-by: Soumya Snigdha Kundu <soumya_snigdha.kundu@kcl.ac.uk>

aymuos15 requested review from KumoLiu, Nic-Ma, ericspod and virginiafdez as code owners April 13, 2026 10:24

aymuos15 changed the base branch from main to dev April 13, 2026 10:26

Merge branch 'dev' into fix-8107-nibabel-load

f1873ec

coderabbitai Bot reviewed Apr 13, 2026

View reviewed changes

atharvajoshi01 reviewed Apr 15, 2026

View reviewed changes

Merge branch 'dev' into fix-8107-nibabel-load

12c644d

Merge branch 'dev' into fix-8107-nibabel-load

c03a9db

ericspod approved these changes May 5, 2026

View reviewed changes

ericspod reviewed May 5, 2026

View reviewed changes

Comment thread monai/data/image_reader.py Outdated

Apply suggestion from @ericspod

2ca1aa9

Signed-off-by: Eric Kerfoot <17726042+ericspod@users.noreply.github.com>

ericspod enabled auto-merge (squash) May 5, 2026 16:59

ericspod merged commit 862f3a6 into Project-MONAI:dev May 5, 2026
26 checks passed

Conversation

aymuos15 commented Apr 13, 2026

Summary

Compatibility note

Test plan

Uh oh!

coderabbitai Bot commented Apr 13, 2026 • edited Loading Uh oh! There was an error while loading. Please reload this page.

Uh oh!

Walkthrough

Estimated code review effort

Uh oh!

coderabbitai Bot left a comment

Choose a reason for hiding this comment

Uh oh!

atharvajoshi01 left a comment

Choose a reason for hiding this comment

Uh oh!

ericspod commented Apr 16, 2026

Uh oh!

aymuos15 commented Apr 16, 2026

Uh oh!

ericspod commented Apr 16, 2026

Uh oh!

atbenmurray commented May 1, 2026

Uh oh!

aymuos15 commented May 4, 2026

Uh oh!

aymuos15 commented May 4, 2026

Uh oh!

ericspod commented May 5, 2026

Uh oh!

coderabbitai Bot commented May 5, 2026

Uh oh!

Uh oh!

ericspod commented May 5, 2026

Uh oh!

aymuos15 commented May 5, 2026

Uh oh!

ericspod commented May 5, 2026

Uh oh!

aymuos15 commented May 5, 2026

Uh oh!

Uh oh!

Reviewers

Assignees

Labels

Projects

Milestone

Development

Uh oh!

4 participants

coderabbitai Bot commented Apr 13, 2026 •

edited

Loading