Add k-bit blockwise quantization (K=2-5) with warp-level CUDA kernels #1858
+4,759
−4
We went looking everywhere, but couldn’t find those commits.
Sometimes commits can disappear after a force-push. Head back to the latest changes here.