@gene-bordegaray gene-bordegaray commented Nov 18, 2025

Draft for broadcast join
Closes #223

Benchmark results are volatile, and there don't seem to be drastic improvements, due to the TPC-H data.

A follow-up could be some more benchmarking.
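
For readers unfamiliar with the term: a broadcast join copies the small (build) side of a join to every worker, so each partition of the large (probe) side can be joined locally without repartitioning. The following is a minimal single-process sketch of that idea and not the operator added in this PR; the function names (`build_lookup`, `probe_partition`, `broadcast_join`) and the tuple row types are assumptions made up for illustration.

```rust
use std::collections::HashMap;

/// Build a hash table from the small (build) side. In a distributed setting,
/// this is the data that would be sent ("broadcast") to every worker.
/// Hypothetical helper, for illustration only.
fn build_lookup(small: &[(u32, &'static str)]) -> HashMap<u32, Vec<&'static str>> {
    let mut table: HashMap<u32, Vec<&'static str>> = HashMap::new();
    for &(key, value) in small {
        table.entry(key).or_default().push(value);
    }
    table
}

/// Inner-join one partition of the large (probe) side against the broadcast table.
fn probe_partition(
    table: &HashMap<u32, Vec<&'static str>>,
    partition: &[(u32, i64)],
) -> Vec<(u32, i64, &'static str)> {
    let mut out = Vec::new();
    for &(key, amount) in partition {
        // Rows without a matching key on the build side are dropped (inner join).
        if let Some(matches) = table.get(&key) {
            for &name in matches {
                out.push((key, amount, name));
            }
        }
    }
    out
}

/// Join every probe partition independently against the same broadcast table.
/// The probe side is never shuffled; only the small side is replicated.
fn broadcast_join(
    small: &[(u32, &'static str)],
    partitions: &[Vec<(u32, i64)>],
) -> Vec<Vec<(u32, i64, &'static str)>> {
    let table = build_lookup(small);
    partitions.iter().map(|p| probe_partition(&table, p)).collect()
}
```

Each output partition depends only on its own probe rows plus the shared table, which is why the approach tends to pay off mainly when the build side is small relative to the probe side.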

@gene-bordegaray gene-bordegaray force-pushed the gene.bordegaray/2025/11/add_broadcast_join_operator branch from 2b0f8ad to 1a31b0d Compare December 9, 2025 17:37
@gene-bordegaray gene-bordegaray marked this pull request as ready for review December 9, 2025 18:10
@gene-bordegaray gene-bordegaray marked this pull request as draft December 10, 2025 03:08
@gene-bordegaray gene-bordegaray force-pushed the gene.bordegaray/2025/11/add_broadcast_join_operator branch from b2292dd to 718808b Compare December 21, 2025 20:00
@gene-bordegaray gene-bordegaray marked this pull request as ready for review December 23, 2025 03:34
Author

gene-bordegaray commented Dec 23, 2025

In CI, the TPC-DS tests for queries 10, 35, and 69 seemed to fail, but they pass repeatedly locally. Are these known to be flaky in CI?

if !partition_group.contains(&i) {
    style = "[style=invis]";
}
} else if let Some(partition_group) = &isolator_partition_group
Author

clippy wouldn't leave me alone 😄

if let Some(stage) = plan.input_stage() {
    result.push(stage);
}
if let Some(plan) = plan.as_network_boundary()
Author

same here

Collaborator

gabotechs commented Dec 23, 2025

> In CI, test tpcd queries 10, 35, and 69 seemed to fail but pass repetitively. Are these known as flaky tests in CI?

They are flaky, yes (see #267), although the failures I see in this PR look different.

I think the issue is that the queries in those tests sometimes get distributed and sometimes don't, because the number of files produced by the TPC-DS generator is random.

Collaborator

gabotechs commented Dec 23, 2025

With the current setup, almost no TPC-DS query is actually getting distributed; I needed to rework those tests in #264. It looks like they do get distributed at random points, though.

Development

Successfully merging this pull request may close these issues.

Add Network Broadcast Join
