feat: add support for single-scale ViT models via adapter by ha405 · Pull Request #1265 · qubvel-org/segmentation_models.pytorch

ha405 · 2026-02-09T02:14:39Z

This PR adds support for timm Vision Transformer (ViT) models by implementing a ViTFeatureAdapter (#1244) that generates the multi-scale features required by SMP decoders.

Key Changes:

ViTFeatureAdapter: Converts single-scale features (e.g., 1/16 of the input resolution) into a feature hierarchy at four scales (1/4, 1/8, 1/16, 1/32) using learnable up/down-sampling.
Auto-Detection: TimmUniversalEncoder now automatically detects and adapts ViT-style models.
Testing: Added tests/encoders/test_vit_adapter.py with full coverage for ViT architectures.
All existing and new encoder tests passed.
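
For readers unfamiliar with the idea, here is a minimal sketch of how a single 1/16-scale feature map could be expanded into the four scales listed above. It is not the code from this PR: the class name `ViTFeatureAdapterSketch`, the channel widths, and the specific layer choices (transposed convolutions for upsampling, a strided convolution for downsampling) are assumptions made for illustration only.

```python
from typing import List

import torch
import torch.nn as nn


class ViTFeatureAdapterSketch(nn.Module):
    """Hypothetical adapter: maps one 1/16-scale feature map to 1/4, 1/8, 1/16, 1/32."""

    def __init__(self, in_channels: int, out_channels: int = 256):
        super().__init__()
        # 1/16 -> 1/4: two stride-2 transposed convolutions (learnable 4x upsampling).
        self.up4 = nn.Sequential(
            nn.ConvTranspose2d(in_channels, out_channels, kernel_size=2, stride=2),
            nn.GELU(),
            nn.ConvTranspose2d(out_channels, out_channels, kernel_size=2, stride=2),
        )
        # 1/16 -> 1/8: one stride-2 transposed convolution (learnable 2x upsampling).
        self.up2 = nn.ConvTranspose2d(in_channels, out_channels, kernel_size=2, stride=2)
        # 1/16 -> 1/16: 1x1 projection, spatial resolution unchanged.
        self.same = nn.Conv2d(in_channels, out_channels, kernel_size=1)
        # 1/16 -> 1/32: stride-2 convolution (learnable 2x downsampling).
        self.down2 = nn.Conv2d(in_channels, out_channels, kernel_size=3, stride=2, padding=1)

    def forward(self, x: torch.Tensor) -> List[torch.Tensor]:
        # x: (batch, in_channels, H/16, W/16), i.e. patch tokens reshaped to a 2D grid.
        return [self.up4(x), self.up2(x), self.same(x), self.down2(x)]


# Usage: a 224x224 input with patch size 16 yields a 14x14 token grid.
adapter = ViTFeatureAdapterSketch(in_channels=768, out_channels=64)
tokens_2d = torch.randn(1, 768, 14, 14)
features = adapter(tokens_2d)
shapes = [tuple(f.shape) for f in features]
```

Under these assumptions the four outputs have spatial sizes 56, 28, 14, and 7, which correspond to strides 4, 8, 16, and 32 for a 224x224 input. That matches the kind of pyramid a CNN encoder would hand to the decoder.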

feat: add support for single-scale ViT models via adapter

ae3e390

ha405 wants to merge 1 commit into qubvel-org:main from ha405:feature/vit-adapter-support
