Question
Hi, I ran into a training issue when using rLLM with a VL model on the vLLM backend stack, and wanted to check whether this is a known compatibility problem on the rLLM side.
RuntimeError: This flash attention build does not support headdim not being a multiple of 32.
Context
I have read https://github.com/vllm-project/vllm/issues/26989, but what is suggested there does not seem to work with rLLM.
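In case it helps to clarify what I mean: my reading of that issue is that the workaround amounts to forcing vLLM onto a non-FlashAttention backend so the head-dim restriction never applies. The sketch below is only illustrative; the backend value and the model name are assumptions, not my actual rLLM training config.

```python
import os

# Sketch only: select a non-FlashAttention backend before vLLM is imported,
# so the attention kernel that rejects head dims not divisible by 32 is never
# chosen. "XFORMERS" is an assumed value; other backends may also work.
os.environ["VLLM_ATTENTION_BACKEND"] = "XFORMERS"

from vllm import LLM  # imported after setting the variable so it takes effect

# Placeholder VL model, not the model from my actual training run.
llm = LLM(model="Qwen/Qwen2-VL-7B-Instruct", trust_remote_code=True)
```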
Relevant Code / Config
Environment
No response