Commit e2dc832
Remove dead quant_state.dtype mutation in matmul_4bit CPU path (#1917)
The mutation `quant_state.dtype = A.dtype` is unnecessary: MatMul4Bit.forward
already casts via `.to(A.dtype)`, and gemv_4bit doesn't read state.dtype.
Removing it eliminates the Dynamo graph break on CPU under activation
checkpointing, so the regression test no longer needs a CPU skip.
Co-Authored-By: Claude Opus 4.6 (1M context) <noreply@anthropic.com>
1 parent 1869cd8 commit e2dc832
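To illustrate why the removed assignment is dead code, here is a minimal pure-Python sketch. `FakeTensor`, `FakeQuantState`, `dequantize`, and `forward` are hypothetical stand-ins invented for this example, not the bitsandbytes classes or functions, and dtypes are modelled as plain strings. The sketch only assumes what the commit message states: the forward path casts with `.to(A.dtype)`, so the result does not depend on `quant_state.dtype` being overwritten first.

```python
from dataclasses import dataclass, replace


# Hypothetical stand-ins for illustration only; NOT the bitsandbytes API.
@dataclass(frozen=True)
class FakeTensor:
    values: tuple
    dtype: str

    def to(self, dtype: str) -> "FakeTensor":
        # A real tensor would also round its values; only the dtype tag matters here.
        return replace(self, dtype=dtype)


@dataclass
class FakeQuantState:
    dtype: str  # dtype recorded when the weight was quantized


def dequantize(packed: FakeTensor, state: FakeQuantState) -> FakeTensor:
    # Stand-in for 4-bit dequantization: yields a tensor in the state's dtype.
    return FakeTensor(packed.values, state.dtype)


def forward(a: FakeTensor, packed: FakeTensor, state: FakeQuantState) -> FakeTensor:
    # The weight is cast to the activation dtype explicitly, so the output
    # does not depend on whatever state.dtype currently holds.
    w = dequantize(packed, state).to(a.dtype)
    return FakeTensor(tuple(x * y for x, y in zip(a.values, w.values)), w.dtype)


def matmul_with_mutation(a, packed, state):
    state.dtype = a.dtype  # the side effect this commit removes
    return forward(a, packed, state)


def matmul_without_mutation(a, packed, state):
    return forward(a, packed, state)


a = FakeTensor((1.0, 2.0), "bfloat16")
packed = FakeTensor((3.0, 4.0), "uint8")

state_old = FakeQuantState("float16")
state_new = FakeQuantState("float16")
out_old = matmul_with_mutation(a, packed, state_old)
out_new = matmul_without_mutation(a, packed, state_new)

print(out_old == out_new)                # True: the output is the same either way
print(state_old.dtype, state_new.dtype)  # bfloat16 float16: only the old path mutates shared state
```

In this toy model the only observable difference between the two paths is the write to the shared state object. Per the commit message, that attribute write is what caused the Dynamo graph break on CPU under activation checkpointing.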
2 files changed
Lines changed: 0 additions & 6 deletions
| Original file line number | Diff line number | Diff line change |
|---|---|---|
| 382 | 382 | |
| 383 | 383 | |
| 384 | 384 | |
| 385 | | - |
| 386 | 385 | |
| 387 | | - |
| 388 | | - |
| 389 | 386 | |
| 390 | 387 | |
| 391 | 388 | |

| Original file line number | Diff line number | Diff line change |
|---|---|---|
| 453 | 453 | |
| 454 | 454 | |
| 455 | 455 | |
| 456 | | - |
| 457 | | - |
| 458 | | - |
| 459 | 456 | |
| 460 | 457 | |
| 461 | 458 | |