Serialize dynamic filters on execution plan nodes (HashJoin, Aggregate, Sort) #2

Draft
jayshrivastava wants to merge 2 commits into js/dedupe-dynamic-filter-inner-state from js/serialize-dynamic-filters-in-execution-plans-2

Conversation


@jayshrivastava jayshrivastava commented Feb 20, 2026

Which issue does this PR close?

Informs: datafusion-contrib/datafusion-distributed#180
Follow up for: apache#20416

Rationale for this change

I'm interested in serializing a physical plan (post-physical-optimizer) and executing it on a remote node. To do so, dynamic filters and references/pointers to dynamic filters need to be preserved in the plan. Currently, nodes that produce filters, such as HashJoinExec, AggregateExec, and SortExec, do not serialize their dynamic filters.

This change updates the above nodes to serialize their dynamic filters and adds tests for the scenario above.

What changes are included in this PR?

Proto schema (datafusion.proto)

Added a PhysicalExprNode field named dynamic_filter to:

  • HashJoinExecNode (tag 11)
  • AggregateExecNode (tag 13)
  • SortExecNode (tag 5)
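Sketched as a proto fragment, the schema change looks roughly like the following (only the dynamic_filter fields and their tags are from this PR; the surrounding fields are elided):

```proto
// Sketch of the datafusion.proto change; existing fields elided.
message HashJoinExecNode {
  // ... existing fields ...
  PhysicalExprNode dynamic_filter = 11;
}

message AggregateExecNode {
  // ... existing fields ...
  PhysicalExprNode dynamic_filter = 13;
}

message SortExecNode {
  // ... existing fields ...
  PhysicalExprNode dynamic_filter = 5;
}
```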
Plan node public API

Added with_dynamic_filter() and dynamic_filter() to HashJoinExec, AggregateExec, and SortExec.

with_dynamic_filter() always:

  • validates that the filter is valid for the plan node's schema
  • resets any internal state related to the dynamic filter
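As a sketch of that contract (all types below are hypothetical stand-ins, not the real DataFusion structs; the actual methods live on HashJoinExec/AggregateExec/SortExec), a with_dynamic_filter() that validates against the node's schema and resets derived state might look like:

```rust
use std::sync::Arc;

// Hypothetical stand-in types for illustration only.
#[derive(Clone)]
pub struct Schema { pub fields: Vec<String> }

pub struct DynamicFilter { pub column: String }

pub struct PlanNode {
    pub schema: Schema,
    pub dynamic_filter: Option<Arc<DynamicFilter>>,
    // Internal state derived from the filter (e.g. cached bounds) that must
    // be reset whenever the filter is replaced.
    pub cached_bounds: Option<Vec<i64>>,
}

impl PlanNode {
    pub fn with_dynamic_filter(mut self, filter: Arc<DynamicFilter>) -> Result<Self, String> {
        // 1. Validate that the filter is valid for this node's schema.
        if !self.schema.fields.contains(&filter.column) {
            return Err(format!("column '{}' not in schema", filter.column));
        }
        // 2. Reset any internal state related to the dynamic filter.
        self.cached_bounds = None;
        self.dynamic_filter = Some(filter);
        Ok(self)
    }

    pub fn dynamic_filter(&self) -> Option<&Arc<DynamicFilter>> {
        self.dynamic_filter.as_ref()
    }
}
```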
Serde

Using the new plan node public APIs above:

  • Each node's try_from_* serialization function now reads dynamic_filter()
    and serializes it via the proto converter
  • Each node's try_into_* deserialization function deserializes the field,
    downcasts to DynamicFilterPhysicalExpr, and sets it on the node
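The downcast step during deserialization follows the usual as_any() pattern (DataFusion's PhysicalExpr trait exposes as_any() for this purpose). A minimal self-contained sketch with hypothetical stand-in types:

```rust
use std::any::Any;

// Hypothetical stand-ins for PhysicalExpr and its implementors.
trait Expr {
    fn as_any(&self) -> &dyn Any;
}

struct DynamicFilterExpr { pub current: String }
impl Expr for DynamicFilterExpr {
    fn as_any(&self) -> &dyn Any { self }
}

struct LiteralExpr;
impl Expr for LiteralExpr {
    fn as_any(&self) -> &dyn Any { self }
}

// During deserialization, the proto field yields a generic expression; the
// node accepts it only if it is actually a dynamic filter.
fn as_dynamic_filter(expr: &dyn Expr) -> Option<&DynamicFilterExpr> {
    expr.as_any().downcast_ref::<DynamicFilterExpr>()
}
```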

Are these changes tested?

  1. Added tests which create the plan below and perform round-trip serialization on it (one test each for HashJoinExec, AggregateExec, and SortExec).
    HashJoinExec ─── dynamic filter
         │               │
         ▼               │ (the optimizer pushes
    FilterExec           │  this filter down)
         │               │
         ▼               ▼
    DataSourceExec  ─ dynamic filter
  2. Added tests for with_dynamic_filter() and dynamic_filter() on HashJoinExec, AggregateExec, and SortExec.

@jayshrivastava jayshrivastava changed the title wip Serialize dynamic filters on execution plan nodes (HashJoin, Aggregate, Sort) Feb 23, 2026
@jayshrivastava jayshrivastava force-pushed the js/serialize-dynamic-filters-in-execution-plans-2 branch from ff17e8a to 00b2a63 Compare February 23, 2026 19:44
@jayshrivastava

Note for reviewers: I'm unsure if I should be using apply_expressions() or expressions() (see apache#20337) instead of with_dynamic_filter() and dynamic_filter()

Comment on lines +1039 to +1042
/// Returns the dynamic filter expression for this aggregate, if set.
pub fn dynamic_filter(&self) -> Option<&Arc<DynamicFilterPhysicalExpr>> {
self.dynamic_filter.as_ref().map(|df| &df.filter)
}

I think it would be cleaner to use apply_expressions (apache#20337), mainly because it's more generic: you can do basically anything with PhysicalExprs inside a plan, including detecting dynamic filters, and you wouldn't need to know beforehand which nodes are producers and consumers -- any custom logic can be done separately in the proto crate. It would also reduce the overhead for people who want to add a new ExecutionPlan that holds a DynamicFilterPhysicalExpr; otherwise they'd have to remember to also add the manual dynamic_filter() call here. The apply_expressions implementation is part of ExecutionPlan and is not optional, so users cannot forget to implement it for every node.

Comment on lines +1057 to +1060
pub fn with_dynamic_filter(
mut self,
filter: Arc<DynamicFilterPhysicalExpr>,
) -> Result<Self> {

I see we do something similar for every producer/consumer. A more generic way to modify the expressions would probably be implementing map_expressions in ExecutionPlan, as suggested here, to make this more generic?

Informs: datafusion-contrib/datafusion-distributed#180
Closes: apache#20418

Consider this scenario:
1. You have a plan with a `HashJoinExec` and `DataSourceExec`
2. You run the physical optimizer and the `DataSourceExec` accepts `DynamicFilterPhysicalExpr` pushdown from the `HashJoinExec`
3. You serialize the plan, deserialize it, and execute it

What should happen is that the dynamic filter "works", meaning:
1. When you deserialize the plan, both the `HashJoinExec` and `DataSourceExec` should have pointers to the same `DynamicFilterPhysicalExpr`
2. The `DynamicFilterPhysicalExpr` should be updated during execution by the `HashJoinExec`  and the `DataSourceExec` should filter out rows

This does not happen today for a few reasons, a couple of which this PR aims to address:
1. `DynamicFilterPhysicalExpr` does not survive round-tripping. The internal exprs get inlined (e.g. it may be serialized as a `Literal`)
2. Even if `DynamicFilterPhysicalExpr` survives round-tripping, it is often the case that the `DynamicFilterPhysicalExpr` is rewritten during pushdown. In this case, you have two `DynamicFilterPhysicalExpr`s which are different `Arc`s but share the same `Inner` dynamic filter state. The current `DeduplicatingProtoConverter` does not handle this specific form of deduping.

This PR aims to fix those problems by adding serde for `DynamicFilterPhysicalExpr` and deduping logic for the inner state of dynamic filters.
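The deduping idea can be sketched with plain Arcs (hypothetical stand-in types, not the real DynamicFilterPhysicalExpr): key on the pointer identity of the shared inner state rather than the outer wrapper, so two rewritten wrappers that share one Inner map to a single shared state.

```rust
use std::collections::HashMap;
use std::sync::{Arc, Mutex};

// Hypothetical stand-ins: a dynamic-filter wrapper whose mutable state lives
// in a shared Inner, as with DynamicFilterPhysicalExpr.
#[derive(Default)]
struct Inner { current: Mutex<String> }

struct DynamicFilter { inner: Arc<Inner> }

// Assign stable ids keyed on the *inner* state's pointer identity. Wrappers
// that are different Arcs but share one Inner get the same id, so serde can
// emit (and later restore) a single shared state for them.
fn assign_ids(filters: &[&DynamicFilter]) -> Vec<usize> {
    let mut ids: HashMap<*const Inner, usize> = HashMap::new();
    filters
        .iter()
        .map(|f| {
            let next = ids.len();
            *ids.entry(Arc::as_ptr(&f.inner)).or_insert(next)
        })
        .collect()
}
```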

It does not yet add a test for the `HashJoinExec` and `DataSourceExec` filter pushdown case, but that is relevant follow-up work. I tried to keep the PR small for reviewers.

Yes, via unit tests.

`DynamicFilterPhysicalExpr` is now serialized by the default codec
@jayshrivastava jayshrivastava force-pushed the js/dedupe-dynamic-filter-inner-state branch from c5d0e2f to fef4259 Compare February 26, 2026 18:48
@jayshrivastava jayshrivastava force-pushed the js/serialize-dynamic-filters-in-execution-plans-2 branch from 6cbe847 to 4889d13 Compare February 26, 2026 18:52
@jayshrivastava jayshrivastava force-pushed the js/serialize-dynamic-filters-in-execution-plans-2 branch from 4889d13 to ed4c611 Compare February 26, 2026 18:53
jayshrivastava added a commit that referenced this pull request Feb 26, 2026
Fixups for the cherry-picked commits from PRs apache#19437, apache#20037, apache#20416,
and #2 to work with branch-52's partition-index APIs:

- Update remap_children callers to use instance method signature
- Adapt DynamicFilterUpdate::Global enum for new code paths
- Add missing partitioned_exprs/runtime_partition fields to new constructors
- Remove null_aware field (not on branch-52)
- Replace FilterExecBuilder with FilterExec::try_new
- Remove non-compiling tests that depend on upstream-only APIs
- Fix duplicate imports in roundtrip test file

Co-Authored-By: Claude Opus 4.6 <noreply@anthropic.com>
gene-bordegaray pushed a commit to DataDog/datafusion that referenced this pull request Feb 26, 2026
@jayshrivastava jayshrivastava force-pushed the js/dedupe-dynamic-filter-inner-state branch from cb23b01 to 18b0289 Compare March 19, 2026 15:04
@jayshrivastava jayshrivastava force-pushed the js/dedupe-dynamic-filter-inner-state branch 3 times, most recently from d75e7f8 to e0ec773 Compare April 14, 2026 17:21
@jayshrivastava jayshrivastava force-pushed the js/dedupe-dynamic-filter-inner-state branch 2 times, most recently from b419d4c to dc683d3 Compare April 22, 2026 03:53
jayshrivastava pushed a commit that referenced this pull request Apr 22, 2026
…messages (apache#20387)

## Which issue does this PR close?
- Closes apache#20386.

## Rationale for this change
The `memory_limit` configuration (`RuntimeEnvBuilder::new().with_memory_limit()`) uses the `greedy` memory pool by default. However, if `memory_pool` (`RuntimeEnvBuilder::new().with_memory_pool()`) is set, it overrides the default with the configured pool, such as `fair`. Also, if neither `memory_limit` nor `memory_pool` is set, the `unbounded` memory pool is used. It is therefore useful to expose the ultimately used/selected pool as part of the `ResourcesExhausted` error message, both for end-user awareness and because the user may need to switch the memory pool (`greedy`, `fair`, `unbounded`).
- Also, [this comparison
table](lance-format/lance#3601 (comment))
is an example use case for comparing the runtime behavior of the `greedy` and `fair` memory pools; this addition can help build such comparison tables by exposing the used memory pool info in the native logs.
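A minimal sketch of the idea (hypothetical types, not DataFusion's actual MemoryPool trait): the pool carries a name that is interpolated into the resources-exhausted error, so users can see which pool was ultimately selected.

```rust
// Hypothetical sketch only: a greedy pool that reports its name and usage in
// the error it returns when the allocation would exceed the limit.
struct GreedyPool {
    name: &'static str,
    used: usize,
    limit: usize,
}

impl GreedyPool {
    fn with_limit(limit: usize) -> Self {
        Self { name: "greedy", used: 0, limit }
    }

    fn try_grow(&mut self, bytes: usize) -> Result<(), String> {
        if self.used + bytes > self.limit {
            // Include the pool name so the user knows which pool was used.
            return Err(format!(
                "Resources exhausted: failed to allocate {} B - pool: {}(used: {} B, pool_size: {} B)",
                bytes, self.name, self.used, self.limit
            ));
        }
        self.used += bytes;
        Ok(())
    }
}
```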

The following example use cases use `datafusion-cli`:
**Case1**: datafusion-cli result when `memory-limit` and
`top-memory-consumers > 0` are set:
```
eren.avsarogullari@AWGNPWVK961 debug % ./datafusion-cli --memory-limit 10M --command 'select * from generate_series(1,500000) as t1(v1) order by v1;' --top-memory-consumers 3

DataFusion CLI v53.0.0
Error: Not enough memory to continue external sort. Consider increasing the memory limit config: 'datafusion.runtime.memory_limit', or decreasing the config: 'datafusion.execution.sort_spill_reservation_bytes'.
caused by
Resources exhausted: Additional allocation failed for ExternalSorter[0] with top memory consumers (across reservations) as:
  ExternalSorterMerge[0]#2(can spill: false) consumed 10.0 MB, peak 10.0 MB,
  DataFusion-Cli#0(can spill: false) consumed 0.0 B, peak 0.0 B,
  ExternalSorter[0]#1(can spill: true) consumed 0.0 B, peak 0.0 B.
Error: Failed to allocate additional 128.0 KB for ExternalSorter[0] with 0.0 B already allocated for this reservation - 0.0 B remain available for the total memory pool: greedy(used: 10.0 MB, pool_size: 10.0 MB)
```
**Case2**: datafusion-cli result when `memory-limit` and
`top-memory-consumers = 0` (disabling top memory consumers logging) are
set:
```
eren.avsarogullari@AWGNPWVK961 debug % ./datafusion-cli --memory-limit 10M --command 'select * from generate_series(1,500000) as t1(v1) order by v1;' --top-memory-consumers 0

DataFusion CLI v53.0.0
Error: Not enough memory to continue external sort. Consider increasing the memory limit config: 'datafusion.runtime.memory_limit', or decreasing the config: 'datafusion.execution.sort_spill_reservation_bytes'.
caused by
Resources exhausted: Failed to allocate additional 128.0 KB for ExternalSorter[0] with 0.0 B already allocated for this reservation - 0.0 B remain available for the total memory pool: greedy(used: 10.0 MB, pool_size: 10.0 MB)
```
**Case3**: datafusion-cli result when only `memory-limit`, `memory-pool`
and `top-memory-consumers > 0` are set:
```
eren.avsarogullari@AWGNPWVK961 debug % ./datafusion-cli --memory-limit 10M --mem-pool-type fair --top-memory-consumers 3 --command 'select * from generate_series(1,500000) as t1(v1) order by v1;'

DataFusion CLI v53.0.0
Error: Not enough memory to continue external sort. Consider increasing the memory limit config: 'datafusion.runtime.memory_limit', or decreasing the config: 'datafusion.execution.sort_spill_reservation_bytes'.
caused by
Resources exhausted: Additional allocation failed for ExternalSorter[0] with top memory consumers (across reservations) as:
  ExternalSorterMerge[0]#2(can spill: false) consumed 10.0 MB, peak 10.0 MB,
  ExternalSorter[0]#1(can spill: true) consumed 0.0 B, peak 0.0 B,
  DataFusion-Cli#0(can spill: false) consumed 0.0 B, peak 0.0 B.
Error: Failed to allocate additional 128.0 KB for ExternalSorter[0] with 0.0 B already allocated for this reservation - 0.0 B remain available for the total memory pool: fair(pool_size: 10.0 MB)
```

## What changes are included in this PR?
- Add a `name` property to `MemoryPool` instances
- Expose the used `MemoryPool` info in `ResourcesExhausted` error messages

## Are these changes tested?
Yes; existing test cases are also updated.

## Are there any user-facing changes?
Yes, the `ResourcesExhausted` error messages are updated.