
perf(parquet/compress): set zstd pool encoder concurrency to 1#717

Merged
zeroshade merged 1 commit into apache:main from dimakuz:perf/zstd-pool-concurrency on Mar 17, 2026

Conversation

@dimakuz (Contributor) commented Mar 16, 2026

The zstdEncoderPool is used exclusively by EncodeAll(), which is a single-shot synchronous call that uses exactly one inner block encoder. However, zstd.NewWriter defaults its concurrency to runtime.GOMAXPROCS(0), pre-allocating that many inner block encoders, each with its own ~1 MiB history buffer (ensureHist). On a 10-core machine, each pooled Encoder therefore allocates 10 inner encoders when only one is ever used by EncodeAll.

With WithEncoderConcurrency(1), each pooled encoder creates a single inner encoder, matching actual usage. The streaming Write/Close path is unaffected — it does not use the pool.

Benchmark results (Apple M4 Pro, arm64, 256 KiB semi-random data):

    BenchmarkZstdPooledEncodeAll/Default-14        11000 B/op   5250 MB/s
    BenchmarkZstdPooledEncodeAll/Concurrency1-14     810 B/op   5500 MB/s

That is ~14x less memory allocated per operation and ~5% higher throughput, attributable to reduced GC pressure.

In a parquet write workload (1 GiB Arrow data, ZSTD level 3), this reduced ensureHist allocations from 22 GiB to 7 GiB and madvise kernel CPU from 4.6s to 2.3s (10% wall-time improvement).

Rationale for this change

High memory churn during parquet encoding

What changes are included in this PR?

A change to the zstd encoder concurrency, plus a benchmark to reproduce the results.

Are these changes tested?

Yes

Are there any user-facing changes?

No

@dimakuz dimakuz requested a review from zeroshade as a code owner March 16, 2026 19:52
@zeroshade (Member) commented

Should this be a configurable setting that we just default to 1?

@zeroshade (Member) left a review

just the one question

@dimakuz (Contributor, Author) commented Mar 17, 2026

Thanks for the prompt review!
I think that with the current implementation of the zstd codec and the zstd library there's no real reason to make it configurable: we encode as a one-shot (EncodeAll), which internally uses a single inner encoder and never benefits from parallel encoders being available.
If you prefer to plumb it out in some way so it's more controllable from the outside, I can try.

@zeroshade (Member) commented

I think this is fine for now, we can look into making it more controllable in a follow-up. Thanks!

@zeroshade zeroshade merged commit 5a94422 into apache:main Mar 17, 2026
39 of 41 checks passed