Join the Kedro community

Updated 2 months ago

Optimizing spark code within kedro pipelines

Hi team, are there any best practices for optimizing spark code within Kedro pipelines? I have a large pipeline that executes at the last node due to lazy eval. I would like to look at execution plans, etc.

Any suggestions? I suppose this would apply to Polars/Ibis/other similar frameworks.

m
N
3 comments

You can analyze execution plan in Spark UI, once you run the job.

This is up to the execution engine - i.e. Polars / Spark is gonna to have completely different execution plan. Ibis is different in that catagory

So, agree with marrrcin, try Spark UI

Add a reply
Sign up and join the conversation on Slack