Skip to content

Commit da56a0d

Browse files
committed
Fix CI: Install CUDA toolkit for flashinfer JIT compilation
vLLM 0.13.0 uses flashinfer as its default attention backend, which requires nvcc for JIT compilation of CUDA kernels at runtime. The CI runner has GPU runtime libraries but not the CUDA development toolkit, causing the error: RuntimeError: Could not find nvcc and default cuda_home='/usr/local/cuda' doesn't exist This adds cuda-toolkit-12-8 installation and sets CUDA_HOME environment variable before installing torchforge. Note: Temporarily added branch trigger to test this fix - remove after merging.
1 parent 1ad7e7c commit da56a0d

File tree

1 file changed

+7
-1
lines changed

1 file changed

+7
-1
lines changed

.github/workflows/integration_test.yaml

Lines changed: 7 additions & 1 deletion
Original file line numberDiff line numberDiff line change
@@ -2,7 +2,7 @@ name: Integration Tests (8 card)
22

33
on:
44
push:
5-
branches: [ main ]
5+
branches: [ main, fix-ci-cuda-toolkit ] # TODO: remove fix-ci-cuda-toolkit after testing
66
workflow_dispatch:
77

88
concurrency:
@@ -33,6 +33,12 @@ jobs:
3333
python-version: '3.12'
3434
- name: Update pip
3535
run: python -m pip install --upgrade pip
36+
- name: Install CUDA toolkit
37+
run: |
38+
# flashinfer (used by vLLM 0.13.0) requires nvcc for JIT compilation
39+
sudo dnf install -y cuda-toolkit-12-8
40+
echo "CUDA_HOME=/usr/local/cuda-12.8" >> $GITHUB_ENV
41+
echo "/usr/local/cuda-12.8/bin" >> $GITHUB_PATH
3642
- name: Install torchforge
3743
run: pip install uv && uv pip install . && uv pip install .[dev]
3844
- name: Run weight sync integration test

0 commit comments

Comments
 (0)