What is the problem the feature request solves?
Datafusion optimizer wraps some operators which might produce multiple small batches with CoalesceBatchesExec node. In Comet, this technique is already applied to SMJ with filter.
Describe the potential solution
Shouldn't it be extended to other plans (Filter/HashJoin)? It might be especially beneficial for workloads where sort merge join -> hash join replacement is enforced.
Additional context
No response
What is the problem the feature request solves?
Datafusion optimizer wraps some operators which might produce multiple small batches with CoalesceBatchesExec node. In Comet, this technique is already applied to SMJ with filter.
Describe the potential solution
Shouldn't it be extended to other plans (Filter/HashJoin)? It might be especially beneficial for workloads where sort merge join -> hash join replacement is enforced.
Additional context
No response