feat: add method for sync Parquet reader read bloom filter #8024

Merged
alamb merged 2 commits into apache:main from mapleFU:GH-8023 on Aug 1, 2025

mapleFU commented Jul 30, 2025

Which issue does this PR close?

Rationale for this change

Add a way to read bloom filters with the sync Parquet reader.

What changes are included in this PR?

Add a sync `get_row_group_column_bloom_filter` method.
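
For illustration, here is a minimal, self-contained sketch of how a caller might use a per-row-group, per-column bloom filter lookup to skip row groups. `ToyBloomFilter`, `ToyReader`, `candidate_row_groups`, and `demo` are hypothetical stand-ins, not the parquet crate's types. The call shape shown for `get_row_group_column_bloom_filter` (row group index and column index in, optional filter out) is an assumption based on the method name, not copied from this PR's diff.

```rust
use std::collections::hash_map::DefaultHasher;
use std::hash::{Hash, Hasher};

/// Hypothetical toy bloom filter; NOT the parquet crate's filter type.
struct ToyBloomFilter {
    bits: Vec<bool>,
}

impl ToyBloomFilter {
    fn new(num_bits: usize) -> Self {
        assert!(num_bits > 0, "filter needs at least one bit");
        Self { bits: vec![false; num_bits] }
    }

    /// Derive two bit positions from a single 64-bit hash of the value.
    fn positions(&self, value: &str) -> [usize; 2] {
        let mut hasher = DefaultHasher::new();
        value.hash(&mut hasher);
        let h = hasher.finish();
        let n = self.bits.len() as u64;
        [(h % n) as usize, ((h >> 32) % n) as usize]
    }

    fn insert(&mut self, value: &str) {
        let [a, b] = self.positions(value);
        self.bits[a] = true;
        self.bits[b] = true;
    }

    /// `false` means the value is definitely absent; `true` means it may be present.
    fn check(&self, value: &str) -> bool {
        self.positions(value).iter().all(|&p| self.bits[p])
    }
}

/// Hypothetical stand-in for a sync reader: one optional filter per
/// (row group, column) pair.
struct ToyReader {
    filters: Vec<Vec<Option<ToyBloomFilter>>>,
}

impl ToyReader {
    /// Assumed call shape: `Err` for an out-of-range index, `Ok(None)` when
    /// the column chunk has no bloom filter.
    fn get_row_group_column_bloom_filter(
        &self,
        row_group_idx: usize,
        column_idx: usize,
    ) -> Result<Option<&ToyBloomFilter>, String> {
        let row_group = self
            .filters
            .get(row_group_idx)
            .ok_or_else(|| format!("row group {} out of range", row_group_idx))?;
        let column = row_group
            .get(column_idx)
            .ok_or_else(|| format!("column {} out of range", column_idx))?;
        Ok(column.as_ref())
    }
}

/// Return the indices of row groups that may contain `value` in `column_idx`.
fn candidate_row_groups(
    reader: &ToyReader,
    column_idx: usize,
    value: &str,
) -> Result<Vec<usize>, String> {
    let mut keep = Vec::new();
    for rg in 0..reader.filters.len() {
        match reader.get_row_group_column_bloom_filter(rg, column_idx)? {
            // The filter rules the value out: this row group can be skipped.
            Some(filter) if !filter.check(value) => {}
            // No filter, or a possible match: the row group must still be read.
            _ => keep.push(rg),
        }
    }
    Ok(keep)
}

/// Small usage example with three single-column row groups.
fn demo() -> Vec<usize> {
    let mut with_apple = ToyBloomFilter::new(1024);
    with_apple.insert("apple");
    let reader = ToyReader {
        filters: vec![
            vec![Some(with_apple)],                // row group 0: filter containing "apple"
            vec![None],                            // row group 1: no filter written
            vec![Some(ToyBloomFilter::new(1024))], // row group 2: empty filter
        ],
    };
    candidate_row_groups(&reader, 0, "apple").expect("column 0 exists in every row group")
}
```

In this sketch `demo()` keeps row groups 0 and 1 and skips row group 2. A missing filter never justifies skipping a row group, and a positive check only means the value may be present.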

Are these changes tested?

Yes, by unit tests.

Are there any user-facing changes?

Yes, a new API is added.

github-actions bot added the parquet label (Changes to the parquet crate) on Jul 30, 2025

mapleFU commented Jul 31, 2025

@alamb would you mind taking a look (or just closing this)? I use this in a PoC and might switch to async_reader when I go to the production version.

alamb left a comment

Thank you @mapleFU -- I think this code looks good to me and it provides parity with the async reader.

As I mentioned on #8023 (comment), I have a plan / idea of how to avoid having to implement features twice for two different I/O models, but it isn't quite done yet (and even so this API is still nice to have)

alamb merged commit a535d3b into apache:main on Aug 1, 2025
16 checks passed
mapleFU deleted the GH-8023 branch on August 2, 2025 07:00
Labels

parquet Changes to the parquet crate

Development

Successfully merging this pull request may close these issues.

Parquet reader: add method for sync reader read bloom filter
