[ROCm] Optimize kgemm_4bit_inference_naive for ROCm, use it for batch sizes other than 1 #2634

Annotations

1 warning

build-cuda (ubuntu-22.04-arm, 12.6.3)

succeeded Apr 22, 2026 in 5m 40s