Support enable_gqa and only support 4D Q, K, and V
#2558
+114
−5
Merged
GitHub Advanced Security / lintrunner
succeeded
Sep 11, 2025 in 2s
No new alerts in code changed by this pull request
Loading