
use the user provide process group for autotuning #1823

Merged
shunting314 merged 1 commit into main from shunting314/stack/21
Apr 2, 2026


shunting314 (Contributor) commented Mar 26, 2026

Stacked PRs:


use the user provide process group for autotuning

Previously, when autotuning a distributed kernel, we assumed the process group was dist.group.WORLD. That is often true, but it can be wrong in more complex setups such as multi-dimensional parallelism. I added an example kernel for SP+TP (sequence parallelism combined with tensor parallelism).

With 4 GPUs, there are 2 TP groups, each handling a different part of the sequence. In this case, autotuning should synchronize across ranks using the user-specified process group rather than the world group.
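To make the 4-GPU case concrete, here is a minimal sketch of how the ranks might be partitioned. It is not the code from this PR: the helper names `tp_groups` and `sync_peers` are hypothetical, and it assumes each TP group consists of contiguous ranks, which the description does not state. It only computes which ranks would need to synchronize with each other during autotuning; in real code the groups would be created with something like torch.distributed.new_group or a device mesh.

```python
def tp_groups(world_size: int, tp_size: int) -> list[list[int]]:
    """Partition ranks into TP groups.

    Assumption (not from the PR): each TP group holds contiguous ranks.
    """
    if world_size % tp_size != 0:
        raise ValueError("world_size must be divisible by tp_size")
    return [
        list(range(start, start + tp_size))
        for start in range(0, world_size, tp_size)
    ]


def sync_peers(rank: int, world_size: int, tp_size: int) -> list[int]:
    """Return the ranks that `rank` would synchronize with while autotuning,
    i.e. the members of its own TP group rather than every rank in WORLD."""
    for group in tp_groups(world_size, tp_size):
        if rank in group:
            return group
    raise ValueError("rank out of range")


if __name__ == "__main__":
    # 4 GPUs with TP degree 2 gives 2 TP groups, one per sequence shard.
    print(tp_groups(4, 2))      # [[0, 1], [2, 3]]
    print(sync_peers(3, 4, 2))  # [2, 3]
```

Under this layout, rank 3 only needs to coordinate with rank 2, which is why a synchronization over dist.group.WORLD would involve ranks that are not part of the same kernel launch.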

shunting314 added a commit that referenced this pull request Mar 26, 2026
stack-info: PR: #1823, branch: shunting314/stack/21
shunting314 force-pushed the shunting314/stack/21 branch from 0fd4cbe to 8242314 March 26, 2026 06:31
meta-cla bot added the CLA Signed label Mar 26, 2026
shunting314 requested review from jansel and yf225 March 26, 2026 06:33
shunting314 marked this pull request as draft March 26, 2026 06:37
shunting314 changed the base branch from shunting314/stack/20 to main March 26, 2026 06:37
shunting314 added a commit that referenced this pull request Mar 26, 2026
stack-info: PR: #1823, branch: shunting314/stack/21
shunting314 force-pushed the shunting314/stack/21 branch from 8242314 to cd1a8dd March 26, 2026 06:37
shunting314 changed the base branch from main to shunting314/stack/20 March 26, 2026 06:37
shunting314 marked this pull request as ready for review March 26, 2026 06:37
shunting314 marked this pull request as draft March 26, 2026 06:49
shunting314 changed the base branch from shunting314/stack/20 to main March 26, 2026 06:49
shunting314 force-pushed the shunting314/stack/21 branch from cd1a8dd to 5a1af0f March 26, 2026 06:49
shunting314 changed the base branch from main to shunting314/stack/20 March 26, 2026 06:49
shunting314 marked this pull request as ready for review March 26, 2026 06:49
shunting314 marked this pull request as draft March 26, 2026 06:51
shunting314 changed the base branch from shunting314/stack/20 to main March 26, 2026 06:51
shunting314 force-pushed the shunting314/stack/21 branch from 5a1af0f to 41697a4 March 26, 2026 06:51
shunting314 changed the base branch from main to shunting314/stack/20 March 26, 2026 06:52
shunting314 marked this pull request as ready for review March 26, 2026 06:52
shunting314 marked this pull request as draft March 26, 2026 06:54
shunting314 changed the base branch from shunting314/stack/20 to main March 26, 2026 06:54
shunting314 force-pushed the shunting314/stack/21 branch from 41697a4 to 4075b49 March 26, 2026 06:54
shunting314 marked this pull request as draft March 27, 2026 20:52
shunting314 changed the base branch from shunting314/stack/22 to main March 27, 2026 20:52
shunting314 added a commit that referenced this pull request Mar 27, 2026
stack-info: PR: #1823, branch: shunting314/stack/21
shunting314 force-pushed the shunting314/stack/21 branch from 39e0be8 to 161d0a0 March 27, 2026 20:52
shunting314 changed the base branch from main to shunting314/stack/22 March 27, 2026 20:52
shunting314 marked this pull request as ready for review March 27, 2026 20:52
shunting314 marked this pull request as draft March 27, 2026 21:20
shunting314 changed the base branch from shunting314/stack/22 to main March 27, 2026 21:20
shunting314 changed the base branch from main to shunting314/stack/22 March 27, 2026 21:20
shunting314 marked this pull request as ready for review March 27, 2026 21:20
shunting314 marked this pull request as draft March 27, 2026 22:05
shunting314 changed the base branch from shunting314/stack/22 to main March 27, 2026 22:05
shunting314 force-pushed the shunting314/stack/21 branch from 161d0a0 to 7db7a82 March 27, 2026 22:05
shunting314 marked this pull request as ready for review March 27, 2026 22:05
shunting314 marked this pull request as draft April 1, 2026 19:16
shunting314 force-pushed the shunting314/stack/21 branch from 7db7a82 to 4ec8d7c April 1, 2026 19:16
shunting314 marked this pull request as ready for review April 1, 2026 19:17
shunting314 mentioned this pull request Apr 1, 2026
shunting314 marked this pull request as draft April 1, 2026 19:19
shunting314 force-pushed the shunting314/stack/21 branch from 4ec8d7c to 33392f1 April 1, 2026 19:19
shunting314 marked this pull request as ready for review April 1, 2026 19:19
stack-info: PR: #1823, branch: shunting314/stack/21
shunting314 marked this pull request as draft April 1, 2026 22:37
shunting314 force-pushed the shunting314/stack/21 branch from 33392f1 to 53488e5 April 1, 2026 22:37
shunting314 marked this pull request as ready for review April 1, 2026 22:37
shunting314 merged commit 061553f into main Apr 2, 2026
21 checks passed

Labels

CLA Signed This label is managed by the Meta Open Source bot.


2 participants