rope argument inconsistent #167

@yfcAzul

In Qwen3Attention, the code appears to support RoPE scaling by forwarding a rope_scaling argument into get_rope:
self.rotary_emb = get_rope(
    self.head_dim,
    rotary_dim=self.head_dim,
    max_position=max_position,
    base=rope_theta,
    rope_scaling=rope_scaling,
)

However, in layers/rotary_embedding.py, get_rope immediately enforces:

assert rope_scaling is None

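The two pieces of code seem to encode contradictory assumptions: Qwen3Attention forwards rope_scaling as if scaling were supported, while get_rope rejects any value other than None.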
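
A minimal sketch of how the mismatch could be reproduced in isolation. The get_rope below is a hypothetical stand-in, not the real function from layers/rotary_embedding.py: only the assertion is taken from the code quoted in this issue. The parameter name head_size, the helpers build_rotary_emb and accepts, and the example values are assumptions made for illustration.

```python
# Hypothetical stand-in for get_rope; only the assertion comes from the issue.
def get_rope(head_size, rotary_dim, max_position, base, rope_scaling=None):
    assert rope_scaling is None
    # Placeholder return value; the real function builds a rotary embedding.
    return {
        "head_size": head_size,
        "rotary_dim": rotary_dim,
        "max_position": max_position,
        "base": base,
    }


def build_rotary_emb(head_dim, max_position, rope_theta, rope_scaling):
    # Mirrors the call shape used in Qwen3Attention: rope_scaling is forwarded as-is.
    return get_rope(
        head_dim,
        rotary_dim=head_dim,
        max_position=max_position,
        base=rope_theta,
        rope_scaling=rope_scaling,
    )


def accepts(rope_scaling):
    # Returns True if the forwarded value survives the assertion in get_rope.
    try:
        build_rotary_emb(128, 4096, 10000.0, rope_scaling)
        return True
    except AssertionError:
        return False


print(accepts(None))
# Illustrative scaling config; the keys are an assumption, not taken from the repo.
print(accepts({"rope_type": "linear", "factor": 2.0}))
```

With this stand-in, the first call passes and the second trips the assertion, so the forwarded argument only ever works when it is None.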
