[OpenVINO] Support Qwen3.5, Qwen3.5-MoE and Qwen3.6 by rkazants · Pull Request #1689 · huggingface/optimum-intel

rkazants · 2026-04-15T19:44:39Z

What does this PR do?

Re-created PR #1634

Fixes 181271, 181280, 182003

Installation instructions:

pip install git+https://github.com/rkazants/optimum-intel.git@support_qwen3_5
pip install --pre -U openvino openvino-tokenizers nncf --extra-index-url https://storage.openvinotoolkit.org/simple/wheels/nightly
pip install transformers==5.2.0
pip install requests torchvision opencv-python

Exporting cmd-line:

optimum-cli export openvino -m Qwen/Qwen3.5-0.8B Qwen3.5-0.8B

Inference script:

from transformers import AutoProcessor
from transformers.video_utils import load_video
from huggingface_hub import hf_hub_download
from optimum.intel.openvino import OVModelForVisualCausalLM

model_dir = "Qwen/Qwen3.5-0.8B"

processor = AutoProcessor.from_pretrained(model_dir)
model = OVModelForVisualCausalLM.from_pretrained(model_dir)

# Prepare video input
video_path = hf_hub_download(
                repo_id="raushan-testing-hf/videos-test",
                filename="sample_demo_1.mp4",
                repo_type="dataset",
            )
input_video, _ = load_video(video_path, num_frames=10, backend="opencv")

messages = [
    {"role": "user", "content": [
        {"type": "video"},
        {"type": "text", "text": "Why is this video funny?"},
    ]}
]
text = processor.apply_chat_template(messages, tokenize=False, add_generation_prompt=True)
inputs = processor(text=[text], videos=[input_video], return_tensors="pt")

# Run inference
output_ids = model.generate(**inputs, max_new_tokens=100)
output_text = processor.decode(output_ids[0], skip_special_tokens=True)

print(output_text)

Before submitting

[N/A] This PR fixes a typo or improves the docs (you can dismiss the other checks if that's the case).
Did you make sure to update the documentation with your changes?
[] Did you write any new necessary tests?

… models

…qwen3_5

Signed-off-by: Kazantsev, Roman <roman.kazantsev@intel.com>

…qwen3_5

Signed-off-by: Kazantsev, Roman <roman.kazantsev@intel.com>

HuggingFaceDocBuilderDev · 2026-04-15T19:50:22Z

The docs for this PR live here. All of your documentation changes will be reflected on that endpoint. The docs are available until 30 days after the last update.

Co-authored-by: Roman Kazantsev <roman.kazantsev@intel.com>

Signed-off-by: Kazantsev, Roman <roman.kazantsev@intel.com>

savvadesogle · 2026-04-17T05:36:22Z

Qwen3.6
Hooray!🎉

echarlaix added 30 commits January 19, 2026 10:39

Transformers v5

53d19b9

fix loading for llava_next_video

5205434

Remove deprecated transformers.onnx

e8feb0c

Merge branch 'main' into transformers-v5

55e4b3d

remove deprecated transformers.onnx from tests

bb54f64

remove huggingface_hub deprecated

71aa34e

relative to absolute import

0954015

update workflow to v5

1ba9789

remove redundant

f158656

update loading given transformers version

9345143

remove deprecated AutoModelForVision2Seq

b290ae3

update workflow

a4d1dc0

style

ac953ba

update setup

8001884

deprecated is_offline_mode

5f2a007

remove incompatible neural-compressor installation

ad477fe

remove documentation reference

42e98b8

add install transformers step

4ee3f51

Merge branch 'main' into transformers-v5

64c2022

transformers v5

8204264

install diffusers from source for v5

b319d19

remove deprecated CLIPFeatureExtractor

42300e4

openvino 2025.3.0

2a76102

add ov cache classes

f38703a

merge main in branch

46144d1

openvino nightly for modeling tests

2d3c734

openvino 2025.3 for modeling tests

b6dcefd

stop moving misplaced parameters from config to generation_config

ea24727

fix transformers version for doc building

07ff06b

fix transformers version for doc building

1270db0

echarlaix and others added 13 commits March 18, 2026 10:04

ix mamba expected int8

61d85b3

Fix _DEFAULT_IGNORED_SCOPE_CONFIGS for __make_16bit_traceable patched…

55c0d46

… models

add test to ensure dtype

2f38fd8

style

c925a79

Merge remote-tracking branch 'upstream/transformers-v5' into support_…

057ce12

…qwen3_5

Correct patching for vlm

934b32e

Signed-off-by: Kazantsev, Roman <roman.kazantsev@intel.com>

check openvino model expected dtype in test_export_dtype

bf1f377

Fix bf16 patching

e1f8c28

Signed-off-by: Kazantsev, Roman <roman.kazantsev@intel.com>

fix qwen3vl vision embeddings pos

5033df2

Merge remote-tracking branch 'upstream/transformers-v5' into support_…

ea94354

…qwen3_5

Support Qwen3.5-MoE

4602e00

Signed-off-by: Kazantsev, Roman <roman.kazantsev@intel.com>

Add position_ids input and its preparation for inference

cbe127e

Signed-off-by: Kazantsev, Roman <roman.kazantsev@intel.com>

Merge remote-tracking branch 'upstream/main' into support_qwen3_5

8eef963

Signed-off-by: Kazantsev, Roman <roman.kazantsev@intel.com>