align RTL: tage window method, block resolveQ if update failed due to bank conflict. by jensen-yan · Pull Request #644 · OpenXiangShan/GEM5

jensen-yan · 2025-12-10T07:47:17Z

Align with RTL, use an 8-window approach to block resolveQ.
When there is a conflict in tage updates, resolveQ cannot dequeue and fails to update the entire BPU.
After accumulating 8 times, it blocks BPU prediction to allow tage to update successfully once.

Summary by CodeRabbit

New Features
- Configurable prediction-blocking during branch-predictor updates with a threshold to temporarily pause predictions.
- Two-phase update flow: probe for bank conflicts, defer or apply updates accordingly.
Improvements
- Added resolve success/failure notifications to drive update flow.
- New metrics for deferred updates and blocked predictions to aid observability.
- Deferral replaces immediate dropping for bank-conflict cases, improving robustness.

_{✏️ Tip: You can customize this high-level summary in your review settings.}

otherwise main BTB may do resolve udpate twice if tage resolveUpdate failed Change-Id: I660cb52b7f70a4202ca7741d8c7ed564fa7cd5b8

coderabbitai · 2025-12-10T07:47:43Z

Walkthrough

Implements a probe-then-apply two‑phase resolve mechanism for predictor updates: components are probed for bank/conflict readiness, updates are applied only if all components can resolve, and repeated probe failures can temporarily block prediction. Changes touch fetch stage, decoupled BPU, BTBTAGE, and base predictor interfaces.

Changes

Cohort / File(s)	Summary
Fetch stage resolve handling `src/cpu/o3/fetch.cc`	`resolveUpdate()` call site changed to treat a boolean return: calls `notifyResolveSuccess()` on true and `notifyResolveFailure()` on false; only pops resolve queue on success.
Decoupled BPU + stats `src/cpu/pred/btb/decoupled_bpred.cc`, `src/cpu/pred/btb/decoupled_bpred.hh`, `src/cpu/pred/btb/decoupled_bpred_stats.cc`	`DecoupledBPUWithBTB::resolveUpdate` changed from `void`→`bool`. Added `notifyResolveSuccess()`, `notifyResolveFailure()`, `blockPredictionOnce()`, `blockPredictionPending`, `resolveDequeueFailCounter`, `resolveBlockThreshold` and `predictionBlockedForUpdate` stat; prediction can be blocked after thresholded failures.
BTBTAGE predictor `src/cpu/pred/btb/btb_tage.cc`, `src/cpu/pred/btb/btb_tage.hh`	Introduced `canResolveUpdate(const FetchStream&)` (bank‑conflict probe) and `doResolveUpdate(const FetchStream&)` (deferred apply). Refactored update flow into probe/apply phases and renamed stat `updateDroppedDueToConflict` → `updateDeferredDueToConflict`.
Base BTB interface `src/cpu/pred/btb/timed_base_pred.hh`	Added virtual `canResolveUpdate(const FetchStream&)` (default true) and `doResolveUpdate(const FetchStream&)` (default delegates to `update()`), enabling two‑phase resolve across predictors.
Predictor config `src/cpu/pred/BranchPredictor.py`	Added `resolveBlockThreshold` Param to `DecoupledBPUWithBTB` to configure consecutive resolve dequeue failures before blocking predictions.
Tests `src/cpu/pred/btb/test/btb_tage.test.cc`	Tests updated to use `canResolveUpdate()`/`doResolveUpdate()` two‑step flow and assert deferred vs applied behavior for bank conflicts and non‑conflicts.

Sequence Diagram

sequenceDiagram
    participant Fetch as Fetch Stage
    participant DBPU as DecoupledBPU
    participant TAGE as BTBTAGE
    participant Other as Other Predictors

    Fetch->>DBPU: resolveUpdate(stream)
    activate DBPU
    Note over DBPU: Phase 1 — Probe all components
    DBPU->>TAGE: canResolveUpdate(stream)
    TAGE-->>DBPU: true/false
    DBPU->>Other: canResolveUpdate(stream)
    Other-->>DBPU: true/false

    alt Any probe returned false
        DBPU->>DBPU: resolveDequeueFailCounter++
        alt Counter >= resolveBlockThreshold
            DBPU->>DBPU: blockPredictionOnce()
        end
        DBPU-->>Fetch: false
        Fetch->>DBPU: notifyResolveFailure()
        Fetch->>Fetch: keep stream in queue
    else All probes true
        Note over DBPU: Phase 2 — Apply updates
        DBPU->>TAGE: doResolveUpdate(stream)
        TAGE-->>DBPU: done
        DBPU->>Other: doResolveUpdate(stream)
        Other-->>DBPU: done
        DBPU-->>Fetch: true
        Fetch->>DBPU: notifyResolveSuccess()
        Fetch->>Fetch: pop stream from queue
    end
    deactivate DBPU

Estimated code review effort

🎯 4 (Complex) | ⏱️ ~60 minutes

Pay special attention to:
- decoupled_bpred.cc — two‑phase probe/apply logic, counter and block state transitions.
- btb_tage.cc — bank‑conflict detection semantics, deferred vs dropped updates, and interaction with snapshot/recompute paths.
- fetch.cc — correct handling of queue popping and notify calls to avoid losing or reordering resolve entries.

Possibly related PRs

Train potential entry correctly in resolve update #637 — overlaps resolve/update flow and resolved‑flag handling in decoupled BPU and fetch stage.
Train ITTAGE at resolve stage #630 — related changes to resolve‑stage update behavior (IT‑TAGE/ITTAGE training and conditional updates).
cpu-o3: move resolve queue merging from iew to fetch stage #635 — touches src/cpu/o3/fetch.cc resolve‑queue processing and resolve outcome handling.

Suggested labels

perf, align-kmhv3

Suggested reviewers

Yakkhini
CJ362ff

Poem

🐰 A probe, a pause before the leap,

I check each bank so updates keep,
If conflicts stack, I count and wait,
Then let predictions leap — or hesitate.

Pre-merge checks and finishing touches

❌ Failed checks (1 warning)

Check name	Status	Explanation	Resolution
Docstring Coverage	⚠️ Warning	Docstring coverage is 43.48% which is insufficient. The required threshold is 80.00%.	You can run `@coderabbitai generate docstrings` to improve docstring coverage.

✅ Passed checks (2 passed)

Check name	Status	Explanation
Description Check	✅ Passed	Check skipped - CodeRabbit’s high-level summary is enabled.
Title check	✅ Passed	The title directly describes the main implementation change: introducing an 8-window mechanism (RTL alignment) to block resolveQ when TAGE bank conflict updates fail.

✨ Finishing touches

📝 Generate docstrings

🧪 Generate unit tests (beta)

Create PR with unit tests
Post copyable unit tests in a comment
Commit unit tests in branch tage-bank1-align

Thanks for using CodeRabbit! It's free for OSS, and your support helps us grow. If you like it, consider giving us a shout-out.

❤️ Share

_{Comment @coderabbitai help to get the list of available commands and usage tips.}

XiangShanRobot · 2025-12-10T07:51:59Z

[Generated by GEM5 Performance Robot]
commit: e685026
workflow: gem5 Ideal BTB Performance Test

Ideal BTB Performance

Overall Score

	PR	Master	Diff(%)
Score	15.82	15.63	+1.23 🟢

[Generated by GEM5 Performance Robot]
commit: e685026
workflow: gem5 Ideal BTB Performance Test

Ideal BTB Performance

Overall Score

	PR	Previous Commit	Diff(%)
Score	15.82	15.82	-0.01 🔴

Change-Id: I44f3e6433d110d7625f4fbdb29df0e95d5ab9181

coderabbitai

Actionable comments posted: 0

🧹 Nitpick comments (3)

src/cpu/pred/btb/test/btb_tage.test.cc (3)

891-910: Same-bank conflict path correctly exercises canResolveUpdate; optionally tighten invariants

The new canResolveUpdate(stream) usage plus the conflict-counter and predBankValid checks look consistent with the intended “detect & defer” behavior. If you want this test to guard more strongly against regressions, you could additionally assert that no update-related stats (e.g., updateAllocSuccess) change in this path, to prove the update was fully skipped rather than partially applied.

913-928: No-conflict path two-phase flow looks good; consider checking post-update state if it matters

Using canResolveUpdate(stream) with ASSERT_TRUE before doResolveUpdate(stream) cleanly exercises the non-conflict path and confirms the conflict counter stays unchanged. If doResolveUpdate is also expected to clear or update any state such as predBankValid after a successful resolve, you might add an assertion here to encode that contract in the test.

931-948: Conflict-disabled path is correct; optional: isolate subcases with a fresh predictor

With enableBankConflict = false, the canResolveUpdate / doResolveUpdate pair correctly bypasses conflict accounting even for same-bank accesses. Since all three subcases share the same bankTage instance, consider (optionally) constructing a fresh BTBTAGE for each sub-block or explicitly resetting any relevant state if you ever tighten the implementation semantics; this would make the test more robust to future changes in internal bookkeeping.

📜 Review details

Configuration used: CodeRabbit UI

Review profile: CHILL

Plan: Pro

📥 Commits

Reviewing files that changed from the base of the PR and between e685026 and 3135da1.

📒 Files selected for processing (1)

src/cpu/pred/btb/test/btb_tage.test.cc (3 hunks)

⏰ Context from checks skipped due to timeout of 90000ms. You can increase the timeout in your CodeRabbit configuration to a maximum of 15 minutes (900000ms). (2)

GitHub Check: Quick Build, Unit Tests & Smoke Test
GitHub Check: perf_test / XS-GEM5 - Run performance test (spec06-0.3c)

github-actions · 2025-12-10T10:50:46Z

🚀 Coremark Smoke Test Results

Branch	IPC	Change
Base (`xs-dev`)	`1.6750`	-
This PR	`1.8606`	📈 `+0.1856` (`+11.08%`)

✅ Difftest smoke test passed!

XiangShanRobot · 2025-12-10T12:36:50Z

[Generated by GEM5 Performance Robot]
commit: 3135da1
workflow: gem5 Ideal BTB Performance Test

Ideal BTB Performance

Overall Score

	PR	Previous Commit	Diff(%)
Score	15.83	15.82	+0.08 🟢

jensen-yan added 2 commits December 9, 2025 19:01

cpu-o3: test tage bank conflict to block resolveQ

e75f511

cpu-o3: probe if can resolveUpdate, then update

e685026

otherwise main BTB may do resolve udpate twice if tage resolveUpdate failed Change-Id: I660cb52b7f70a4202ca7741d8c7ed564fa7cd5b8

cpu-o3: update tage unit test

3135da1

Change-Id: I44f3e6433d110d7625f4fbdb29df0e95d5ab9181

coderabbitai bot reviewed Dec 10, 2025

View reviewed changes

jensen-yan requested review from CJ362ff and Yakkhini December 11, 2025 02:17

CJ362ff approved these changes Dec 11, 2025

View reviewed changes

jensen-yan merged commit 3e6a5f6 into xs-dev Dec 12, 2025
3 checks passed

jensen-yan deleted the tage-bank1-align branch December 12, 2025 02:21

This was referenced Dec 23, 2025

ITTAGE Bank Conflict Implementation #671

Closed

bpu: add stats for recomputed vs actual/original prediction diff #685

Merged

This was referenced Jan 6, 2026

Split microtage perf #694

Closed

Utage align with rtl perf #699

Closed

Utage usfulbit align #704

Closed

This was referenced Jan 14, 2026

Ahead utage history align #709

Closed

Ahead utage history perf #715

Closed

This was referenced Jan 21, 2026

cpu-o3: simplify fetch， only support decoupled BTB mode #721

Merged

Split microtage align #727

Closed

Ai give ahead perf #728

Closed

Split microtage1 align #734

Closed

Ahead microtage index perf #741

Closed

This was referenced Jan 30, 2026

Refs/heads/split utage perf #746

Closed

Split utage align #747

Closed

coderabbitai bot mentioned this pull request Feb 27, 2026

cpu: Align BTBTAGE update reread and bank conflicts #764

Closed

coderabbitai bot mentioned this pull request Mar 6, 2026

Utage check rtl align #773

Open

coderabbitai bot mentioned this pull request Mar 13, 2026

bpu,configs: Add optional BTBTAGE upper-bound mode for kmhv3 #770

Merged

This was referenced Mar 18, 2026

Btb tage rtl useful sticky align #801

Open

cpu,arch-riscv,cpu-o3,bpu: align control-PC semantics, fetch coverage, and owner migration #805

Open

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

align RTL: tage window method, block resolveQ if update failed due to bank conflict.#644

align RTL: tage window method, block resolveQ if update failed due to bank conflict.#644
jensen-yan merged 3 commits intoxs-devfrom
tage-bank1-align

jensen-yan commented Dec 10, 2025 •

edited by coderabbitai bot

Loading

Uh oh!

coderabbitai bot commented Dec 10, 2025 •

edited

Loading

Uh oh!

XiangShanRobot commented Dec 10, 2025

Uh oh!

coderabbitai bot left a comment

Uh oh!

github-actions bot commented Dec 10, 2025

Uh oh!

XiangShanRobot commented Dec 10, 2025

Uh oh!

Uh oh!

Reviewers

Assignees

Labels

Projects

Milestone

Development

Uh oh!

3 participants

Conversation

jensen-yan commented Dec 10, 2025 • edited by coderabbitai bot Loading Uh oh! There was an error while loading. Please reload this page.

Uh oh!

Summary by CodeRabbit

Uh oh!

coderabbitai bot commented Dec 10, 2025 • edited Loading Uh oh! There was an error while loading. Please reload this page.

Uh oh!

Walkthrough

Changes

Sequence Diagram

Estimated code review effort

Possibly related PRs

Suggested labels

Suggested reviewers

Poem

Pre-merge checks and finishing touches

Uh oh!

XiangShanRobot commented Dec 10, 2025

Ideal BTB Performance

Overall Score

Ideal BTB Performance

Overall Score

Uh oh!

coderabbitai bot left a comment

Choose a reason for hiding this comment

Uh oh!

github-actions bot commented Dec 10, 2025

🚀 Coremark Smoke Test Results

Uh oh!

XiangShanRobot commented Dec 10, 2025

Ideal BTB Performance

Overall Score

Uh oh!

Uh oh!

Reviewers

Assignees

Labels

Projects

Milestone

Development

Uh oh!

3 participants

jensen-yan commented Dec 10, 2025 •

edited by coderabbitai bot

Loading

coderabbitai bot commented Dec 10, 2025 •

edited

Loading