[python-package] Added tests on Booster.shuffle_model()#7168
daguirre11 wants to merge 1 commit into lightgbm-org:master
Conversation
jameslamb
left a comment
Thanks for starting this! This test should be made significantly stronger to give us high confidence that the behavior of shuffle_models() is correct.
I left some guidance in comments. But if you're feeling like it's too much for you to investigate right now, please let me know and we can close this so someone else can contribute it.
```python
train_set = lgb.Dataset(X_train, label=y_train)
booster = lgb.Booster(
    params={"objective": "binary", "verbose": -1},
    train_set=train_set,
)
for _ in range(2):
    booster.update()
```
Suggested change:

```python
booster = lgb.train(
    params={
        "objective": "binary",
        "num_iterations": 10,
        "num_leaves": 7,
        "verbose": -1,
    },
    train_set=lgb.Dataset(X, label=y),
)
```
Let's use lgb.train() for this instead of a for loop with booster.update(), and let's make the model smaller so the test runs faster.
```python
model_str_before = booster.model_to_string()
booster.shuffle_models(start_iteration=0, end_iteration=-1)
model_str_after = booster.model_to_string()
assert model_str_before != model_str_after
```
This is not a very strong test. For example, it'd pass if shuffle_models() corrupted the model in some serious and incorrect way. This should be made much stricter.
To do that, you'll have to look a bit deeper into what the function is doing. Start with the docstring:
LightGBM/python-package/lightgbm/basic.py
Lines 4509 to 4515 in e3d5270
The test should train 10 trees (for example) and:

- omit the first 2 trees, and confirm that their placement is not changed
- omit the final tree, and confirm that its placement isn't changed
- confirm that the set of trees is identical and only the ordering is different
- confirm that booster.predict() (with start_iteration left at its default) produces identical results before and after (ordering should not affect the predictions if you predict with all trees)
- check that the expected behavior happens if start_iteration is negative or end_iteration is larger than the number of trees in the model
```python
X_train, _, y_train, _ = train_test_split(
    *load_breast_cancer(return_X_y=True),
    test_size=0.1,
    random_state=42,
)
```
Suggested change:

```python
X, y = load_breast_cancer(return_X_y=True)
```
The test isn't using the held-out validation data, let's skip the unnecessary train-test splitting.
@jameslamb sorry for the late response, I will address all your comments on the PR today. Thank you for reviewing!
Contributes to #7031

BEFORE
AFTER

2 line difference in coverage for /lightgbm/basic.py

From my understanding, shuffle_models literally just reorders the trees by calling the C API LGBM_BoosterShuffleModels. The predictions will be the same but the actual model itself will be different, hence why model_to_string() is different before and after (Tree=0 and Tree=1 switch positions). Please let me know if I am understanding this correctly.