[ROCm] Optimize kgemm_4bit_inference_naive for ROCm, use it for batch sizes other than 1 #253
This workflow is awaiting approval from a maintainer in #1920
This workflow is awaiting approval from a maintainer in #1920
tests-pr.yml
on: pull_request
Matrix: CPU
Waiting for pending jobs
Matrix: CUDA
Waiting for pending jobs