fix: make BloomFilter intermediate buffer Spark-compatible #4390
Open
andygrove wants to merge 24 commits into apache:main from andygrove:feat/bloom-filter-intermediate-buffer-compat
Commits
f7fa33c (andygrove) fix: allow safe mixed Spark/Comet partial/final aggregate execution
f2a8207 (andygrove) fix: address review feedback on mixed partial/final aggregate guard
9826403 (andygrove) fix: skip partial aggregate tag when partial itself cannot be converted
753a9a5 (andygrove) fix: narrow partial aggregate tag lookup and regenerate TPC-DS golden…
6ae483d (andygrove) fix: reject grouping on nested map types in hash aggregate conversion
53405f6 (andygrove) fix: remove COUNT from mixed-safe aggregates to fix AQE/count-bug reg…
9e2c25a (andygrove) spotless
f53e3c1 (andygrove) test: ignore SPARK-33853 explain codegen subquery test under Comet
3285485 (andygrove) Merge remote-tracking branch 'apache/main' into fix/safe-mixed-partia…
671afa6 (andygrove) Merge remote-tracking branch 'apache/main' into fix/safe-mixed-partia…
4322852 (andygrove) test: regenerate Spark 4.2 TPC-DS golden files after merge from main
12018c3 (andygrove) Merge remote-tracking branch 'apache/main' into fix/safe-mixed-partia…
43e0c0b (andygrove) fix: address review feedback on safe mixed aggregate guard
4bbfe74 (andygrove) fix: drop unused StructType import and regenerate TPC-DS golden files
56e5da6 (andygrove) chore: revert .gitignore change
08b3924 (andygrove) test: ignore SPARK-33853 explain codegen test on Spark 4.1.1
64575f2 (andygrove) test: use descriptive reason for SPARK-33853 IgnoreComet tag
8db42b0 (andygrove) fix: emit Spark-compatible BloomFilter intermediate buffer
36cf0e8 (andygrove) feat: enable BloomFilter for mixed Spark/Comet partial/final aggregate
9bab432 (andygrove) Merge remote-tracking branch 'apache/main' into feat/bloom-filter-int…
2406272 (andygrove) refactor: move SparkBitArray test-only methods into the test module
16299be (andygrove) refactor: address review feedback on SparkBloomFilter::merge_filter
d51c9d6 (andygrove) Merge remote-tracking branch 'apache/main' into feat/bloom-filter-int…
264510c (andygrove) fix: cap bloom_filter_agg numItems/numBits and skip null inputs
The body here is identical to `CometHashAggregateExec.getSupportLevel` at `operators.scala:1658-1670`, including the conf names. That is fine for the test-knob purpose called out in the comment, but `COMET_ENABLE_PARTIAL_HASH_AGGREGATE` and `COMET_ENABLE_FINAL_HASH_AGGREGATE` now gate both `HashAggregateExec` and `ObjectHashAggregateExec`. As a follow-up, consider renaming them to `COMET_ENABLE_PARTIAL_AGGREGATE` / `COMET_ENABLE_FINAL_AGGREGATE` so the conf names match the scope.
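
For illustration only, here is one way such a rename could keep existing settings working: read the new key first and fall back to the old one. This is a standalone sketch, not Comet's config API. The object name, the `isEnabled` helper, and both key strings are hypothetical placeholders, since the actual key strings behind these constants are not shown in this thread.

```scala
// Hypothetical sketch: not Comet's CometConf API. All names and key strings
// below are placeholders invented for illustration.
object AggregateConfSketch {
  // Assumed key strings; the real keys are not shown in this review thread.
  val NewPartialKey = "example.comet.partialAggregate.enabled"
  val OldPartialKey = "example.comet.partialHashAggregate.enabled"

  /** Read a boolean flag, preferring the new key and falling back to the legacy key. */
  def isEnabled(
      settings: Map[String, String],
      newKey: String,
      oldKey: String,
      default: Boolean): Boolean =
    settings
      .get(newKey)                   // the renamed key wins when it is set
      .orElse(settings.get(oldKey))  // otherwise honor the legacy key
      .map(_.toBoolean)              // throws on values other than true/false
      .getOrElse(default)            // neither key set: use the default
}
```

With a fallback like this, a test that still sets the old key keeps its meaning after the rename, while the new name reflects that the flag covers both aggregate operators.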