Hygor Knust
06/03/2023, 8:58 PMtable.exec.resource.default-parallelism=16
. It’s performing well generally, but there’s a join operation at the end of the job graph which seems to be causing a bottleneck.
Upon retrieving the job INFO
, I noticed that it’s using the GLOBAL
ship strategy and it seems to be forcing the operator to maintain a parallelism of 1.
{
"id": 96,
"type": "Join[93]",
"pact": "Operator",
"contents": "...",
"parallelism": 1,
"predecessors": [
{
"id": 88,
"ship_strategy": "GLOBAL",
"side": "second"
},
{
"id": 94,
"ship_strategy": "GLOBAL",
"side": "second"
}
]
}
I’ve tried searching through the documentation for more information on this, but I haven’t found anything yet.
I would like to know why is it doing this, and if there is a way to configure it.
Thank you!David Anderson
06/03/2023, 9:31 PMHygor Knust
06/03/2023, 9:33 PMON tbl1.id = tbl2.id_1 OR tbl1.id = tbl2.id_2
Raghunadh Nittala
06/04/2023, 6:39 AM/*+ BROADCAST(t1) */
Ari Huttunen
06/04/2023, 9:02 PMDavid Anderson
06/05/2023, 8:58 AMHygor Knust
06/05/2023, 10:43 AMleft
table and reprocessing it for each part of the union, but at least it is not bottlenecking the application
Thank you!Raghunadh Nittala
06/05/2023, 11:40 AMHygor Knust
06/05/2023, 11:43 AMRaghunadh Nittala
06/05/2023, 12:03 PM