Skip to content

Commit 5dff8ec

Browse files
committed
Fix jagged_hstu_attn mapping and use quick autotuning in TPU benchmark
- Fix wrong kernel function name: jagged_hstu_attn -> _helion_jagged_attention_kernel - Add HELION_AUTOTUNE_EFFORT=quick to CI workflow — full effort times out for 5/11 kernels (welford, attention, geglu, grpo_loss, swiglu)
1 parent 02989ff commit 5dff8ec

File tree

2 files changed

+2
-1
lines changed

2 files changed

+2
-1
lines changed

.github/workflows/benchmark_tpu.yml

Lines changed: 1 addition & 0 deletions
Original file line numberDiff line numberDiff line change
@@ -17,6 +17,7 @@ jobs:
1717
env:
1818
HELION_BACKEND: pallas
1919
HELION_AUTOTUNE_LOG_LEVEL: INFO
20+
HELION_AUTOTUNE_EFFORT: quick
2021

2122
runs-on: linux.google.tpuv7x.1
2223

benchmarks/run_tpu.py

Lines changed: 1 addition & 1 deletion
Original file line numberDiff line numberDiff line change
@@ -111,7 +111,7 @@ def _softmax_shapes() -> list[tuple[str, tuple[Any, ...]]]:
111111
"bmm": ("bmm", "bmm", None, None),
112112
"geglu": ("geglu", "geglu", None, None),
113113
"grpo_loss": ("grpo_loss", "grpo_loss_forward", None, None),
114-
"jagged_hstu_attn": ("jagged_hstu_attn", "jagged_hstu_attn", None, None),
114+
"jagged_hstu_attn": ("jagged_hstu_attn", "_helion_jagged_attention_kernel", None, None),
115115
"low_mem_dropout": ("low_mem_dropout", "low_mem_dropout", None, None),
116116
"swiglu": ("swiglu", "swiglu_fwd", None, None),
117117
}

0 commit comments

Comments
 (0)