perf: replace protobuf metric reporting with pre-allocated flat long[] array by andygrove · Pull Request #3614 · apache/datafusion-comet

andygrove · 2026-02-28T19:18:00Z

Which issue does this PR close?

N/A

Rationale for this change

I have been profiling with async-profiler and saw a surprising amount of time spent in metrics handling.

What changes are included in this PR?

Replace protobuf-based metric updates with a flat long[] array approach for faster native-to-JVM metric transfer
Pre-allocate parallel arrays (metric names, SQLMetrics, node offsets) at plan creation via DFS walk of CometMetricNode tree
Native side fills a Vec<i64> and copies to JVM via single SetLongArrayRegion bulk JNI call instead of protobuf encode/decode
JVM reads linearly from the array — no protobuf, no tree walking, no map lookups, no string allocations per update cycle
Remove metric.proto and all protobuf metric infrastructure

How are these changes tested?

…] array Replace the protobuf-based metric update path with a flat array approach for significantly faster metric transfers from native to JVM. Previously, every metric update built a HashMap<String, i64> and NativeMetricNode protobuf tree, encoded it, decoded on JVM side, then walked the tree with Map.get() lookups per metric. Now, metrics are flattened into parallel arrays at plan creation time, native fills a Vec<i64> and copies to JVM via a single SetLongArrayRegion bulk JNI call, and JVM reads linearly — no protobuf, no tree walking, no map lookups, no string allocations per update cycle.

Metrics like conversionTime are accumulated on the JVM side and have no native counterpart. Initialize the values array to -1 (sentinel) and only update SQLMetrics where native actually wrote a value (>= 0), preventing native metric updates from zeroing out JVM-accumulated metrics.

kazuyukitanimura · 2026-04-06T16:36:29Z

Thanks @andygrove Would you mind resolve the conflicts?

andygrove · 2026-04-24T19:14:35Z

Closing in favor of a fresh implementation. The flat-array approach matched metrics by DFS position between two independent tree walks (Rust SparkPlan vs JVM CometMetricNode), which is brittle when the trees diverge (e.g., hash-join build/probe swap in planner.rs:1725). A new attempt will key slots by a stable identifier so positional mismatches can't cross-apply metric values.

andygrove added 4 commits February 28, 2026 12:13

style: apply cargo fmt formatting

eef6468

Merge branch 'main' into flat-array-metrics

9f41929

andygrove marked this pull request as ready for review March 2, 2026 15:11

andygrove added 2 commits March 3, 2026 09:34

Merge branch 'main' into flat-array-metrics

06da156

Merge remote-tracking branch 'apache/main' into flat-array-metrics

16dbf4c

andygrove added the performance label Apr 24, 2026

Merge remote-tracking branch 'apache/main' into flat-array-metrics

9d77069

andygrove closed this Apr 24, 2026

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

perf: replace protobuf metric reporting with pre-allocated flat long[] array#3614

perf: replace protobuf metric reporting with pre-allocated flat long[] array#3614
andygrove wants to merge 7 commits intoapache:mainfrom
andygrove:flat-array-metrics

andygrove commented Feb 28, 2026 •

edited

Loading

Uh oh!

kazuyukitanimura commented Apr 6, 2026

Uh oh!

andygrove commented Apr 24, 2026

Uh oh!

Reviewers

Assignees

Labels

Projects

Milestone

Development

Uh oh!

2 participants

Conversation

andygrove commented Feb 28, 2026 • edited Loading Uh oh! There was an error while loading. Please reload this page.

Uh oh!

Which issue does this PR close?

Rationale for this change

What changes are included in this PR?

How are these changes tested?

Uh oh!

kazuyukitanimura commented Apr 6, 2026

Uh oh!

andygrove commented Apr 24, 2026

Uh oh!

Reviewers

Assignees

Labels

Projects

Milestone

Development

Uh oh!

2 participants

andygrove commented Feb 28, 2026 •

edited

Loading