use startPC instead of alignedPC, to calculate index, tag, bank easier Change-Id: Id76bf1658e4286aad2cd4580943ff724bfb7e1e7
Walkthrough
Refactors BTBTAGE to use the fetch-block start PC (startPC) instead of an explicitly aligned PC when calculating index, tag, and bank.
Estimated code review effort: 🎯 3 (Moderate) | ⏱️ ~25 minutes
Pre-merge checks: ❌ Failed checks (1 warning), ✅ Passed checks (2 passed)
[Generated by GEM5 Performance Robot] Ideal BTB Performance: Overall Score
Actionable comments posted: 0
🧹 Nitpick comments (6)
src/cpu/pred/btb/btb_tage.hh (1)
167-195: StartPC-based helper signatures are consistent; consider documenting the implied contract.

The switch to `const Addr &startPC` for `lookupHelper` and the updated `getBranchIndexInBlock(Addr pc, Addr startPC)` signatures match the implementations and simplify call sites. Semantics remain clear as long as callers pass a `startPC` whose block contains all BTB entries being processed. A brief comment on that precondition next to `getBranchIndexInBlock` would help future maintainers avoid misuse.

src/cpu/pred/btb/btb_tage.cc (5)
195-294: Use of startPC in generateSinglePrediction is coherent with the index/tag math.

Switching to `startPC` in `generateSinglePrediction` for `getBranchIndexInBlock(btb_entry.pc, startPC)`, `getTageIndex(startPC, …)`, and `getTageTag(startPC, …, position)` is semantically safe because both `getTageIndex` and `getTageTag` shift by at least `floorLog2(blockSize)`, so the low block-offset bits (where `startPC` and an explicitly aligned PC can differ) are discarded before masking/xor. Using `getBaseTableIndex(startPC)` likewise preserves the original indexing for the base table.

Minor nit: you recompute the branch position later as `branch_idx = getBranchIndexInBlock(btb_entry.pc, startPC);` – you could reuse the earlier `position` to avoid a second subtract/shift, but it's purely cosmetic.
340-379: putPCHistory cleanly separates startPC for indexing from alignedPC for banking.

`putPCHistory` now:

- Accepts `Addr startPC` (consistent with the base `TimedBaseBTBPredictor` interface).
- Derives `alignedPC = startPC & ~(blockSize - 1)` only for bank ID computation and logging.
- Passes `startPC` to `lookupHelper`, which is then used for TAGE index/tag computation.

Given that `getTageIndex`/`getTageTag` shift away the block-offset bits, using `startPC` instead of `alignedPC` there preserves behavior while simplifying call signatures. The comment "use 32byte(blockSize) aligned PC for prediction" is now conceptually true via the shifts rather than via the parameter itself; if you touch this again, you might slightly reword it to reflect that.
560-641: handleNewEntryAllocation's startPC usage is consistent with prediction.

`handleNewEntryAllocation` now:

- Computes `position` via `getBranchIndexInBlock(entry.pc, startPC)`, and
- Uses `getTageIndex(startPC, ti, …)` and `getTageTag(startPC, ti, …, position)`

so that new allocations are keyed by the same index and tag basis as the lookup path. This keeps provider selection and later lookups aligned.

If you want a tiny performance/clarity tweak, you could take `const std::shared_ptr<TageMeta> &meta` since the function doesn't modify or retain the pointer, but the current signature is perfectly acceptable.
805-821: Tag computation from PC and history remains compatible with startPC.

`getTageTag` now forms `pcBits` as `Addr pcBits = (pc >> floorLog2(blockSize)) & mask;`.

When callers pass `startPC`, this effectively uses the bits above the block offset, which carry the same information content as an explicitly aligned PC. Combined with `foldedHist`, `altFoldedHist`, and `position`, this keeps tags stable under the new startPC-based API.

If in future you decide to use per-table PC shifts (e.g., from `tablePcShifts`), this would be the natural place to integrate them, but nothing here blocks that.
891-899: Deriving alignedPC inside getBranchIndexInBlock simplifies callers; assumptions unchanged.

`getBranchIndexInBlock` now takes `(Addr pc, Addr startPC)` and derives:

```cpp
Addr alignedPC = startPC & ~(blockSize - 1);
Addr offset = (pc - alignedPC) >> 1;
assert(offset < maxBranchPositions);
```

This removes the need to compute an aligned PC at each call site while preserving the original assumption that all relevant branch PCs reside within the same block window anchored by `startPC`. Given that this was already required by the previous API, behavior is unchanged.

If you ever worry about this being called with a mismatched `startPC`, you could strengthen the debug check (e.g., additionally asserting `pc >= alignedPC`) or guard against out-of-range offsets in non-assert builds, but that's not strictly necessary for this refactor.
📜 Review details
Configuration used: CodeRabbit UI
Review profile: CHILL
Plan: Pro
📒 Files selected for processing (2)
- src/cpu/pred/btb/btb_tage.cc (14 hunks)
- src/cpu/pred/btb/btb_tage.hh (4 hunks)
🧰 Additional context used
🧬 Code graph analysis (2)
src/cpu/pred/btb/btb_tage.cc (5)
src/cpu/pred/btb/btb_tage.hh (19)
btb_entry(403-405), startPC(168-169), startPC(417-424), pred(370-370), history(130-130), history(134-135), history(139-142), history(139-139), history(144-148), history(144-145), history(161-161), history(202-202), history(382-382), stream(155-155), stream(408-408), entry(88-90), entry(152-152), entry(411-414), startAddr(122-124)
src/cpu/pred/btb/btb_mgsc.hh (6)
btb_entry(254-255), stream(162-162), stream(258-258), entry(160-160), entry(260-261), startAddr(134-135)
src/cpu/pred/btb/stream_struct.hh (4)
startPC(380-382), startPC(704-734), startPC(704-709), entry(489-511)
src/cpu/pred/btb/timed_base_pred.hh (6)
entry(75-75), entry(75-75), entry(80-80), entry(80-80), startAddr(59-61), startAddr(59-61)
src/cpu/pred/btb/uras.hh (1)
startAddr(43-44)
src/cpu/pred/btb/btb_tage.hh (2)
src/cpu/pred/btb/stream_struct.hh (9)
startPC(380-382), startPC(704-734), startPC(704-709), pc(738-758), pc(738-741), pc(770-780), pc(770-771), pc(785-793), pc(785-785)
src/cpu/pred/btb/jump_ahead_predictor.hh (2)
startPC(155-164), startPC(155-155)
🔇 Additional comments (4)
src/cpu/pred/btb/btb_tage.hh (1)
400-425: Unified startPC API for prediction/allocation helpers looks good.

Updating `generateSinglePrediction` and `handleNewEntryAllocation` to take `startPC` keeps the prediction and allocation paths aligned with each other and with the rest of the BTB predictor interfaces. The declarations match the definitions in the `.cc` file; no issues from an interface or correctness perspective.

src/cpu/pred/btb/btb_tage.cc (3)
296-330: lookupHelper's startPC threading aligns with the new API.

`lookupHelper` now takes `startPC` and forwards it into `generateSinglePrediction`, which is exactly what you want after the signature change. The metadata population (`meta->preds[...]`) and stats updates are unchanged and remain consistent with the prediction/update paths.
429-555: Update path now uses the same startPC-derived indices as prediction.

In `updatePredictorStateAndCheckAllocation`, pulling `Addr startPC = stream.getRealStartPC();` and then using `getBaseTableIndex(startPC)` and `getBranchIndexInBlock(entry.pc, startPC)` ensures the base-table index and branch position used on update match those used during prediction. That keeps the fallback counter training consistent with how predictions were computed. No correctness issues spotted here.
648-789: update() correctly reuses the real start PC for recompute and allocation.

The update routine now consistently uses `startAddr = stream.getRealStartPC()` when:

- Recomputing providers via `generateSinglePrediction(btb_entry, startAddr, predMeta)` when `updateOnRead` is true, and
- Calling `handleNewEntryAllocation(startAddr, …)`.

This keeps recomputation and allocation fully in sync with the prediction-time use of `startPC` without affecting the existing bank-conflict logic, which still relies on `alignedPC` for `getBankId`. Looks solid.
🚀 Coremark Smoke Test Results
✅ Difftest smoke test passed!
Change-Id: I608a50da30297fbc90de925374de5b34c2bdbf9d
Actionable comments posted: 0
🧹 Nitpick comments (3)
src/cpu/pred/btb/btb_tage.cc (3)
37-51: Document/guard geometry assumptions for `blockWidth`, `bankBaseShift`, and `indexShift`.

The new members:

- `blockWidth` (`floorLog2(blockSize)`)
- `bankBaseShift` (`instShiftAmt`)
- `indexShift` (`bankBaseShift + ceilLog2(numBanks)`)

implicitly assume:

- `blockSize` is a power of two (used via `floorLog2` and `startPC & ~(blockSize - 1)`),
- `baseTableSize` and `useAltOnNaSize` are powers of two (used with `- 1` masks),
- `numBanks > 0`, and in practice often a power of two.

These match current defaults, but custom configs that violate them will get non-obvious behavior (e.g., unreachable bank IDs or uneven base-table coverage).

Consider adding debug-time asserts (or at least comments) in the ctor to state these invariants explicitly, e.g. `assert(isPowerOf2(blockSize))`, `assert(isPowerOf2(baseTableSize))`, `assert(isPowerOf2(useAltOnNaSize))`, and `assert(numBanks > 0)`.

Also applies to: 70-91
196-228: Ensure `startPC`/`startAddr` is exactly the same value on the prediction and update paths.

The predictor is now keyed off `startPC`/`startAddr` rather than an explicit `alignedPC`:

- Prediction: `putPCHistory(Addr startPC, ...)` records `lastPredBankId = getBankId(startPC)` and calls `lookupHelper(startPC, ...)`; `lookupHelper` calls `generateSinglePrediction(btb_entry, startPC, ...)`.
- Update: `update(const FetchStream &stream)` derives `Addr startAddr = stream.getRealStartPC();`, uses `getBankId(startAddr)` for conflict detection, and, when `updateOnRead`, recomputes with `generateSinglePrediction(btb_entry, startAddr, predMeta)`. State updates use `getBaseTableIndex(startPC)`/`getBranchIndexInBlock(entry.pc, startPC)` with `startPC` taken from the stream.

This is logically consistent only if the `startPC` passed into `putPCHistory` is guaranteed to equal `stream.getRealStartPC()` later for the same stream entry. If they can ever diverge (e.g., due to alignment adjustments, replay, or front-end steering), you'll get:

- bank-ID mismatches (`lastPredBankId` vs `updateBank`),
- base-table and TAGE table updates landing in different sets than the original lookup, degrading accuracy and possibly explaining some of the observed regression.

Suggestion: either

- enforce `startPC == stream.getRealStartPC()` via an assert in debug builds, or
- store the exact `startPC` used for prediction in `TageMeta` (or the stream) and consume that in `update()` instead of recomputing from `getRealStartPC()`.

At minimum, it's worth double-checking the call sites in the front-end pipeline to confirm this invariant holds after the `alignedPC` → `startPC` refactor.

Also applies to: 271-276, 305-321, 347-376, 428-444, 566-587, 648-734
800-809: Double-check PC shifting and bank/index geometry against the intended RTL mapping.

The helper set:

- `getTageTag`: `pcBits = (pc >> bankBaseShift) & mask;`
- `getTageIndex`: `pcShift = enableBankConflict ? indexShift : bankBaseShift;`
- `getUseAltIdx`: `shiftedPc = pc >> instShiftAmt;`
- `getBaseTableIndex`: `(pc >> blockWidth) & (baseTableSize - 1);`
- `getBranchIndexInBlock`: `alignedPC = startPC & ~(blockSize - 1); offset = (branchPC - alignedPC) >> instShiftAmt;`
- `getBankId`: `(pc >> bankBaseShift) & ((1 << bankIdWidth) - 1);`

introduces three different shift domains:

- instruction granularity via `instShiftAmt`/`bankBaseShift`,
- block granularity via `blockWidth = floorLog2(blockSize)`,
- bank interleaving via `indexShift = bankBaseShift + ceilLog2(numBanks)`.

This is a reasonable factoring, but it subtly changes which PC bits feed tags/index vs. banks when compared to an `alignedPC`-based scheme. Given the sizeable Ideal BTB regression reported on this PR, this is one of the likely hot spots:

- With `enableBankConflict == true`, `getTageIndex` starts from `indexShift`, so TAGE indices ignore the bank-select bits used by `getBankId`.
- With `enableBankConflict == false`, indices start from `bankBaseShift`, sharing low bits with bank IDs.
- `getBranchIndexInBlock` now derives alignment from `blockSize`, while `maxBranchPositions` still assumes a 64-byte logical block in the comment, so the effective range depends on the relation between `blockSize`, `instShiftAmt`, and your actual fetch width.

I strongly recommend:

- Cross-checking these shifts and masks against the RTL/TAGE spec (especially the intended `cfiPosition`/block layout).
- Verifying that for the default config (`blockSize=32`, `instShiftAmt` for your ISA, `numBanks=4`) the new derived indices and tags for a few sample PCs match what the hardware model expects.
- Optionally adding comments near these helpers summarizing which PC bits they consume (e.g., "bankId uses bits [bankBaseShift .. bankBaseShift+bankIdWidth-1]").

This will help ensure the refactor is behaviorally equivalent to the intended design and not inadvertently degrading prediction quality.

Also applies to: 826-837, 869-879, 881-895
📜 Review details
Configuration used: CodeRabbit UI
Review profile: CHILL
Plan: Pro
📒 Files selected for processing (3)
- src/cpu/pred/BranchPredictor.py (1 hunks)
- src/cpu/pred/btb/btb_tage.cc (17 hunks)
- src/cpu/pred/btb/btb_tage.hh (5 hunks)
🚧 Files skipped from review as they are similar to previous changes (1)
- src/cpu/pred/btb/btb_tage.hh
🧰 Additional context used
🧬 Code graph analysis (1)
src/cpu/pred/btb/btb_tage.cc (2)
src/cpu/pred/btb/btb_tage.hh (28)
btb_entry(403-405), startPC(168-169), startPC(417-424), pred(370-370), history(130-130), history(134-135), history(139-142), history(139-139), history(144-148), history(144-145), history(161-161), history(201-201), history(382-382), stream(155-155), stream(408-408), entry(88-90), entry(152-152), entry(411-414), pc(172-172), pc(175-175), pc(179-179), pc(183-183), pc(186-188), pc(186-186), pc(191-191), pc(198-198), pc(286-286), branchPC(194-194)
src/cpu/pred/btb/stream_struct.hh (10)
startPC(380-382), startPC(704-734), startPC(704-709), entry(489-511), pc(738-758), pc(738-741), pc(770-780), pc(770-771), pc(785-793), pc(785-785)
🪛 Ruff (0.14.6)
src/cpu/pred/BranchPredictor.py
1060-1060: Param may be undefined, or defined from star imports
(F405)
🔇 Additional comments (1)
src/cpu/pred/BranchPredictor.py (1)
1037-1062: Reconsider defaulting `BTBTAGE.enableBankConflict` to `True` (likely cause of performance regression).

Changing `enableBankConflict` from the previous default `False` to `True` is a semantic change affecting every config instantiating `BTBTAGE`, including `MicroTAGE` and `DecoupledBPUWithBTB.tage`.

With `enableBankConflict` now on by default:

- `getTageIndex` uses `pcShift = indexShift` instead of `bankBaseShift`, altering which PC bits are used for indexing.
- The update path drops updates when `updateBank == lastPredBankId`, reducing effective training frequency.
- This behavior change would show up as a performance regression in benchmarks expecting conflict-free indexing.

Given that this alters indexing semantics for all BTBTAGE-based configs:

- Verify the regression impact by re-running the workflow with `enableBankConflict` forced to `False`, to isolate how much of any performance change comes from the default flip vs. the other index-logic changes, and
- Consider keeping the default at `False` and requiring configs that want realistic bank-conflict simulation to opt in explicitly.

If the goal is to validate bank conflicts in specific studies, making this opt-in (or at least clearly documented) will avoid surprising users who expect the previous ideal-case behavior from the default BTBTAGE configuration.
[Generated by GEM5 Performance Robot] Ideal BTB Performance: Overall Score
Change-Id: I41edcd55edfffd5225319f8d49dc6b985fbbdec9
Actionable comments posted: 0
Caution
Some comments are outside the diff and can’t be posted inline due to platform limitations.
⚠️ Outside diff range comments (1)
docs/Gem5_Docs/frontend/TAGE_BANK_IMPLEMENTATION_PLAN.md (1)
59-59: Update stale API reference in code example.

The code snippet at line 59 shows `getBankId(alignedPC)`, but the PR objective indicates the change replaces `alignedPC` with `startPC` throughout. This should be updated for consistency with the implemented changes.

```diff
- lastPredBankId = getBankId(alignedPC);
+ lastPredBankId = getBankId(startPC);
```
🧹 Nitpick comments (2)
src/cpu/pred/btb/test/btb_tage.test.cc (2)
251-271: Clarify and guard `findTableWithEntry`'s dependence on prediction meta

The new helper correctly uses `(startPC, branchPC)` and `indexFoldedHist` to mirror how the predictor indexed the last fetch block. However, it silently assumes:

- `tage->getPredictionMeta()` is non-null and still corresponds to a prior `putPCHistory` for this `startPC`, and
- `meta->indexFoldedHist[t]` is valid for all `t < numPredictors`.

Consider adding a brief comment or a GTest `ASSERT_NE(meta, nullptr)` (and, if practical, asserts on `indexFoldedHist[t]`) to make these preconditions explicit and fail fast if the helper is ever called out of sequence in future tests.
887-919: BankConflict test: good bank mapping; consider centralizing bank computation

The added comment documenting bank IDs from `(pc >> 1) & 0x3` with `instShiftAmt == 1`, and the use of `0x100` vs `0x104` in Test 2, correctly exercise a "different bank" case under the new startPC-based banking scheme.

To avoid future drift between comments, magic constants, and BTBTAGE's actual bank function, consider introducing a small `getBank(Addr)` helper in the test (or reusing an existing API) and deriving the test PCs from that, instead of hard-coding specific values in comments and calls.
📜 Review details
Configuration used: CodeRabbit UI
Review profile: CHILL
Plan: Pro
📒 Files selected for processing (2)
- docs/Gem5_Docs/frontend/TAGE_BANK_IMPLEMENTATION_PLAN.md (2 hunks)
- src/cpu/pred/btb/test/btb_tage.test.cc (5 hunks)
🧰 Additional context used
🧬 Code graph analysis (1)
src/cpu/pred/btb/test/btb_tage.test.cc (2)
src/cpu/pred/btb/btb_ubtb.hh (1)
meta(144-147)
src/cpu/pred/btb/btb_ittage.hh (2)
entry(66-66), entry(107-107)
🪛 LanguageTool
docs/Gem5_Docs/frontend/TAGE_BANK_IMPLEMENTATION_PLAN.md
[uncategorized] ~25-~25: Did you mean 位"于"? (Chinese typo suggestion)
Context: ... the low `ceilLog2(numBanks)` bits serve as the bank id, and the remaining higher bits are combined with the folded history to obtain the index / tag. This allows directly ...
(YU7_YU8)
⏰ Context from checks skipped due to timeout of 90000ms. You can increase the timeout in your CodeRabbit configuration to a maximum of 15 minutes (900000ms). (2)
- GitHub Check: Quick Build, Unit Tests & Smoke Test
- GitHub Check: perf_test / XS-GEM5 - Run performance test (spec06-0.3c)
🔇 Additional comments (2)
src/cpu/pred/btb/test/btb_tage.test.cc (2)
300-306: BasicPrediction's `findTableWithEntry` call matches new API semantics

Passing `0x1000` as both `startPC` and `branchPC` is consistent with the single-branch-per-block scenario under test and with the updated `(startPC, branchPC)` helper. No issues from a semantics perspective.
532-535: MultipleBranchSequence correctly distinguishes per-branch allocation

Using a shared `startPC` (`0x1000`) with different `branchPC` values (`0x1000` vs `0x1004`) when calling `findTableWithEntry` is a good fit for the new indexing model: you probe the same fetch-block index while discriminating entries by `entry.pc`. This aligns with the test's intention that only the mispredicted second branch should allocate.
🚀 Coremark Smoke Test Results
✅ Difftest smoke test passed!