Standardize opcode tracer behavior for debug_traceBlockByNumber and debug_traceTransaction by MysticRyuujin · Pull Request #762 · ethereum/execution-apis

MysticRyuujin · 2026-03-03T23:17:22Z

Note: PR is in draft to gather feedback, and so I can continue testing go-ethereum via rpctestgen

Summary

This PR adds OpenRPC method and schema definitions for:

debug_traceTransaction
debug_traceBlockByNumber
debug_traceBlockByHash

Scope is intentionally limited to the default opcode (struct) logger output, while still allowing named tracers through TraceConfig.tracer.

It is also based on the investigation done by ethPandaOps on trace-comparisons

Goal

The goal of this PR is to standardize the existing live debug_trace* opcode tracer contract as one canonical cross-client output:

preserve the widely used geth-shaped RPC surface
define clear field presence/absence rules
define clear encoding rules for stack, memory, storage, and return data
define exact semantics for when fields appear
add conformance tests that drive clients toward the same default opcode tracer response

Non-goals

This PR does not:

attempt literal EIP-3155 conformance
redesign the trace format from scratch
standardize named tracer result schemas

If the ecosystem wants a cleaner or more opinionated trace format than the current live RPC methods provide, that should be a separate effort.

Preserve vs normalize

Preserved from current live RPC practice

top-level opcode trace shape: gas, failed, returnValue, structLogs
op as a human-readable opcode name string
gas and gasCost as JSON integers
memory as chunked 32-byte words
named tracers remain supported but out of scope here

Standardized by this PR

stack values use canonical 0x-prefixed uint256 quantities
memory words use 0x-prefixed bytes32
storage keys and values use 0x-prefixed bytes32
error must be absent when no error occurred
returnData must be absent when disabled and valid hex when enabled
EOA-to-EOA transfers must return structLogs: []
block traces must return ordered { txHash, result } entries
storage snapshots must be emitted only at SLOAD and SSTORE

Relationship to `EIP-3155` / `EIP-7756`

This PR borrows the useful parts of the EIP work, but it does not adopt those specs literally.

The main intentional differences are:

keep geth-style top-level field names instead of output / gasUsed / pass
keep integer gas / gasCost
keep op as a string name
keep memory as chunked words instead of a single blob
leave stateRoot, time, fork, memSize, and named tracer output schemas out of scope

Key field-level differences

Area	`EIP-3155` / `EIP-7756`	This PR
Top-level summary fields	`output`, `gasUsed`, `pass`	`returnValue`, `gas`, `failed`
`op`	opcode byte, optionally with `opName`	opcode name string only
`gas` / `gasCost`	hex string or number, depending on EIP/version	JSON integers
`memory`	contiguous blob	chunked `bytes32[]`
Additional fields	may include `stateRoot`, `time`, `fork`, `memSize`	out of scope here

Storage semantics

This PR standardizes storage behavior as part of the canonical output:

storage is a cumulative snapshot
storage keys and values are 0x-prefixed bytes32
storage is emitted only at SLOAD and SSTORE
storage must be absent, not null or {}, at all other opcodes

Likely client changes

Based on the current tests, the existing PR investigation, and the ethPandaOps comparison work, the most likely alignment work is:

Client	Likely changes to align
`Geth`	Add `0x` prefix to memory chunks and storage keys/values in opcode traces
`Erigon`	Verify memory/storage encoding and block-trace `txHash` pairing match the spec
`Besu`	Align stack/memory/storage encoding and field presence behavior with the canonical trace shape
`Nethermind`	Omit empty `error` / `storage` fields where required and align storage timing/output
`Reth`	Align `returnData` gating and verify storage timing/encoding behavior

This table is a concrete starting point for client review, not a claim of fully validated implementation diffs across every client version.

Tests

This PR adds focused tests for:

debug_traceTransaction
- EOA transfer returns structLogs: []
- contract call trace shape is spec-compliant
- unknown transaction returns an error
debug_traceBlockByNumber
- ordered per-transaction { txHash, result } responses
- memory encoding
- storage encoding
- returnData gating behavior
- genesis trace error
- invalid block number error
debug_traceBlockByHash
- basic block with transactions
- genesis trace error
- invalid block hash error

Most large trace fixtures are SpecOnly, meaning they validate the standardized wire contract rather than replaying a recorded byte-for-byte dump from one client.

That is intentional: the purpose of this PR is to standardize one canonical response shape and semantics for the default opcode tracer, while avoiding overfitting the tests to incidental formatting outside the specified contract.

Recorded fixtures are still used where the expected behavior is small and unambiguous, such as:

unknown transaction errors
genesis trace errors
invalid block number errors

Coverage still incomplete

Current coverage still does not fully prove:

refund presence when non-zero
broader storage snapshot timing patterns beyond the focused cases in this PR
some revert/failure-path variants where clients have historically diverged

The proposal here is still to define one canonical target; this section simply identifies the areas where additional fixtures or client feedback may still be useful.

Current validation status

make build: pass
go test ./... in tools/: pass
make fill: fail (need go-ethereum to conform to the spec as written here)- - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - -

src/debug/trace.yaml

chfast · 2026-03-12T14:03:10Z

src/schemas/opcode-tracer.yaml

+      minimum: 0
+    gasCost:
+      title: gas cost
+      description: The gas cost charged for executing this opcode.


This is not well specified. Does it mean the amount of gas to be burned by successful execution of the instruction?

I updated the description to reflect the intent, though I'm not 100% sure if this wording is correct? I think it is.

In geth, OnOpcode is called before the opcode executes, receiving the pre-computed cost as a parameter. gasCost is that value — the amount that will be deducted from the remaining gas assuming successful execution, including dynamic costs such as memory expansion and cold/warm storage access charges (EIP-2929). So yeah, I think the updated description is accurate.

I think the main issue is in CALL instructions. I think gasCost should exclude the amount of gas passed down to the callee and only report the amount being burned as the instruction cost. This way the sum of all gasCost entries should give you the final gas used amount. cc @MariusVanDerWijden.

Idea about sum being equal to the final used amount looks interesting, but want to highlight a few points:

For formula to hold CALL gasCost won't include any "execution" gas, as it will already be accounted for in "inner" opcodes. Only base call gas, memory expansion, value transfer, and account creation fees. This may be a bit counterintuitive.

From a brief check none of the top clients have such exact implementation, so this will require a change from all of them. Also most clients take full forwarded gas as gasCost (without deducting amount returned by the callee). May be wrong here so.

MysticRyuujin · 2026-03-13T22:02:22Z

This could be updated to support EIP-7708 if it ever actually makes it into a fork 😅

src/debug/trace.yaml

src/schemas/opcode-tracer.yaml

alexb5dh

No corcerns from Nethermind side.
Just not sure about reexec being in the standard, as doesn't seem like it's needed or widely used.

src/schemas/opcode-tracer.yaml

canepat · 2026-03-19T10:03:41Z

Just not sure about reexec being in the standard, as doesn't seem like it's needed or widely used.

+1 from Erigon

0000c-0c

run

all ok

mattsse

no concerns from reth

…nd debug_traceTransaction Made-with: Cursor

Made-with: Cursor

The reexec field is an implementation detail of geth's state reconstruction and does not belong in the standardized trace configuration.

Update the replace directive to use the fix-legacy-struct-log-json branch commit (c787778ed44a) from the MysticRyuujin fork, which fixes legacy structLog JSON encoding in the struct logger tracer.

macfarla

the execution timeout is non-trivial to implement in Besu. would prefer this to be a nice to have rather than a must-have. everything else is fine.

src/schemas/opcode-tracer.yaml

macfarla · 2026-03-27T00:38:35Z

src/schemas/opcode-tracer.yaml

+        each StructLog entry. Ignored when tracer is set.
+        Default: false.
+      type: boolean
+    limit:


medium effort to implement this in Besu since we don't have this currently

s1na · 2026-03-27T17:17:03Z

src/debug/trace.yaml

+        $ref: '#/components/schemas/TraceConfig'
+  errors:
+    - code: 4444
+      message: Pruned history unavailable


probably also an error for "block not found" and parent not found

Same comment as here

src/debug/trace.yaml

…pdate go-ethereum - Add debug_traceBlockByHash method spec (mirrors debug_traceBlockByNumber but takes a block hash; clients MUST error when block not found) - Add timeout error-on-timeout behavior to TraceConfig and method descriptions - Update OpcodeBlockTransactionTrace description to reference both block trace methods - Add DebugTraceBlockByHash test generator (trace-block-with-transactions, trace-genesis, trace-block-not-found) - Update tools/go.mod replace to MysticRyuujin fork commit 957e1ed413ed (includes gofmt fix for formatMemoryWord) - Regenerate tests/debug_traceBlockByHash/ via make fill Made-with: Cursor

Clients that do not support execution timeouts (e.g. streaming JSON-RPC implementations) MAY ignore the timeout field. When a timeout is supported and reached, the client MUST still return an error with no partial results. Made-with: Cursor

This is a breaking change in the opcode (structLog) tracer. Several fields will have a slight formatting difference to conform to the newly established spec at: ethereum/execution-apis#762. The differences include: - `memory`: words will have the 0x prefix. Also last word of memory will be padded to 32-bytes. - `storage`: keys and values will have the 0x prefix. --------- Co-authored-by: Sina M <1591639+s1na@users.noreply.github.com>

MysticRyuujin · 2026-04-06T14:16:29Z

go-ethereum PR was merged but there is no release, go.mod needs to be updated

daniellehrner · 2026-04-07T08:13:28Z

src/schemas/opcode-tracer.yaml

+      description: >-
+        The contents of EVM memory at this step, divided into 32-byte chunks.
+        Each element is a 0x-prefixed, zero-padded 32-byte hex word representing
+        a consecutive 32-byte memory slot starting at offset (index * 32).


When stack, storage and memory is enabled, we've seen block traces on mainnet which were bigger than 47GB and that was even without padding. I think we should try to reduce the trace size as much as possible and not do any padding

Isn't the padding in practice only for the last word, which may be partial? The overhead seems low to me.

daniellehrner · 2026-04-07T08:15:07Z

src/schemas/opcode-tracer.yaml

+        The contents of EVM memory at this step, divided into 32-byte chunks.
+        Each element is a 0x-prefixed, zero-padded 32-byte hex word representing
+        a consecutive 32-byte memory slot starting at offset (index * 32).
+        This field is absent (not null) when enableMemory is false.


Similar to storage we should only populate this for op codes that touch memory to reduce the trace size

Looking through the Besu codebase the following op codes touch memory:

MLOAD, KECCAK256, LOG0–LOG4, RETURN, REVERT, MSIZE, MSTORE, MSTORE8, CALLDATACOPY, CODECOPY, EXTCODECOPY, RETURNDATACOPY, CALL, CALLCODE, DELEGATECALL, STATICCALL, CREATE, CREATE2, MCOPY

We discussed a similar idea which didn't end up happening because streaming traces alleviated some of the issue.

It is the same in spirit as to what you're suggesting with the difference that we focus on opcodes that change the memory (and anytime you enter a new scope) and at that point output the full memory.

I just run tests on debug_traceBlockByNumber with memory tracing enabled, on mainnet, and saw a block that generated 643 GiB of json output :

0x17AEF7D 658854.28MB

this is just to show case that we need to optimize memory tracing, and to support @daniellehrner's idea.

Maybe it's not worth to standardize this part then.

I think it makes even more sense to standardize it to be sure that debug_trace* calls generate reasonable amount of data. So basically, we need to reduce the impact of memory tracing on the json output size.

I'd like to point out that: 1) it is disabled by default and won't bloat the trace, 2) there are few use-cases that require memory.

That said, I fully support optimizing the impact of memory, either in this or a future PR. I'd propose a more granular flag than enableMemory for it. An enum which can support a range of values like none, full, onChange. Because there are multiple ways to optimise it, you can even go as far as only outputting the diff compared to last change.

There is a way to do this in a backwards compatible way too. If we pick full as the default value, then when enableMemory is toggled it behaves the same way it does today.

daniellehrner reviewed Mar 12, 2026

View reviewed changes

src/debug/trace.yaml Show resolved Hide resolved

chfast reviewed Mar 12, 2026

View reviewed changes

alexb5dh reviewed Mar 18, 2026

View reviewed changes

src/debug/trace.yaml Show resolved Hide resolved

src/schemas/opcode-tracer.yaml Outdated Show resolved Hide resolved

canepat reviewed Mar 19, 2026

View reviewed changes

src/schemas/opcode-tracer.yaml Show resolved Hide resolved

alexb5dh approved these changes Mar 19, 2026

View reviewed changes

canepat reviewed Mar 19, 2026

View reviewed changes

src/schemas/opcode-tracer.yaml Show resolved Hide resolved

0000c-0c approved these changes Mar 26, 2026

View reviewed changes

mattsse approved these changes Mar 26, 2026

View reviewed changes

MysticRyuujin mentioned this pull request Mar 26, 2026

eth/tracers/logger: fix legacy structLog JSON encoding ethereum/go-ethereum#34093

Merged

MysticRyuujin added 6 commits March 26, 2026 10:40

debug, schemas: add opcode tracer spec for debug_traceBlockByNumber a…

965572a

…nd debug_traceTransaction Made-with: Cursor

wordlist: add SLOAD and SSTORE for spellcheck

acb03e5

Made-with: Cursor

update

41f8509

debug: reorder example fields, clarify gasCost description

4d2f5ca

debug: remove reexec from TraceConfig

120cd65

The reexec field is an implementation detail of geth's state reconstruction and does not belong in the standardized trace configuration.

tools: point go-ethereum replace to MysticRyuujin fork fix branch

4805dab

Update the replace directive to use the fix-legacy-struct-log-json branch commit (c787778ed44a) from the MysticRyuujin fork, which fixes legacy structLog JSON encoding in the struct logger tracer.

MysticRyuujin force-pushed the opcode-tracer-spec branch from b2f7439 to 4805dab Compare March 26, 2026 15:41

macfarla mentioned this pull request Mar 27, 2026

debug RPC endpoints standardization besu-eth/besu#10115

Open

macfarla approved these changes Mar 27, 2026

View reviewed changes

s1na reviewed Mar 27, 2026

View reviewed changes

src/debug/trace.yaml Show resolved Hide resolved

s1na reviewed Mar 27, 2026

View reviewed changes

src/debug/trace.yaml Show resolved Hide resolved

MysticRyuujin added 2 commits March 27, 2026 13:29

MysticRyuujin mentioned this pull request Apr 1, 2026

RPC Standards # 24 | April 6, 2026, 15:00 UTC ethereum/pm#2001

Closed

4 tasks

This was referenced Apr 3, 2026

feat(debug): add enableReturnData param and returnData field besu-eth/besu#10172

Open

Feat-11 : Add limit param to stop opcode trace capture after N steps besu-eth/besu#10173

Open

daniellehrner reviewed Apr 7, 2026

View reviewed changes

This was referenced Apr 7, 2026

core/state, eth/tracers: add access list to struct logger ethereum/go-ethereum#34656

Closed

feature request: add 'access list' into tracing api responses ethereum/go-ethereum#25278

Closed

lupin012 mentioned this pull request Apr 10, 2026

rpc: tracing logger memory encoding erigontech/erigon#20478

Draft

Conversation

MysticRyuujin commented Mar 3, 2026 • edited Loading Uh oh! There was an error while loading. Please reload this page.

Uh oh!

Summary

Goal

Non-goals

Preserve vs normalize

Preserved from current live RPC practice

Standardized by this PR

Relationship to EIP-3155 / EIP-7756

Key field-level differences

Storage semantics

Likely client changes

Tests

Coverage still incomplete

Current validation status

Uh oh!

Uh oh!

Choose a reason for hiding this comment

Uh oh!

Choose a reason for hiding this comment

Uh oh!

Choose a reason for hiding this comment

Uh oh!

Choose a reason for hiding this comment

Uh oh!

alexb5dh Mar 26, 2026 • edited Loading Uh oh! There was an error while loading. Please reload this page.

Uh oh!

Choose a reason for hiding this comment

Uh oh!

MysticRyuujin commented Mar 13, 2026 • edited Loading Uh oh! There was an error while loading. Please reload this page.

Uh oh!

Uh oh!

Uh oh!

Uh oh!

Uh oh!

alexb5dh left a comment • edited Loading Uh oh! There was an error while loading. Please reload this page.

Uh oh!

Choose a reason for hiding this comment

Uh oh!

Uh oh!

canepat commented Mar 19, 2026 • edited Loading Uh oh! There was an error while loading. Please reload this page.

Uh oh!

Uh oh!

0000c-0c left a comment

Choose a reason for hiding this comment

Uh oh!

mattsse left a comment

Choose a reason for hiding this comment

Uh oh!

macfarla left a comment

Choose a reason for hiding this comment

Uh oh!

Uh oh!

Choose a reason for hiding this comment

Uh oh!

s1na Mar 27, 2026 • edited Loading Uh oh! There was an error while loading. Please reload this page.

Uh oh!

Choose a reason for hiding this comment

Uh oh!

Choose a reason for hiding this comment

Uh oh!

Uh oh!

Uh oh!

MysticRyuujin commented Apr 6, 2026

Uh oh!

Choose a reason for hiding this comment

Uh oh!

Choose a reason for hiding this comment

Uh oh!

daniellehrner Apr 7, 2026 • edited Loading Uh oh! There was an error while loading. Please reload this page.

Uh oh!

Choose a reason for hiding this comment

Uh oh!

Choose a reason for hiding this comment

Uh oh!

s1na Apr 7, 2026 • edited Loading Uh oh! There was an error while loading. Please reload this page.

Uh oh!

Choose a reason for hiding this comment

Uh oh!

MysticRyuujin commented Mar 3, 2026 •

edited

Loading

Relationship to `EIP-3155` / `EIP-7756`

alexb5dh Mar 26, 2026 •

edited

Loading

MysticRyuujin commented Mar 13, 2026 •

edited

Loading

alexb5dh left a comment •

edited

Loading

canepat commented Mar 19, 2026 •

edited

Loading

s1na Mar 27, 2026 •

edited

Loading

daniellehrner Apr 7, 2026 •

edited

Loading

s1na Apr 7, 2026 •

edited

Loading

ahamlat Apr 8, 2026 •

edited

Loading