Join the Kedro community

Home
Members
Matthias Roels
M
Matthias Roels
Offline, last seen 3 weeks ago
Joined October 21, 2024

Regarding https://github.com/kedro-org/kedro/issues/4322, I am working on upgrading a big project from kedro 0.18.13 to the latest version. While doing so, I am also removing a custom ConfigLoader as I want to use OmegaConf. However, I do see some performance issues here too compared to the custom implementation we had. Did some debugging (using logging in my hooks) and found the following:

  • project has 1500 catalog entries with most of the filepath combining info from globals (bucket, prefix, data version,…)
  • With kedro 0.18, I was able to load the project in a notebook in around 25sec
  • In the new version, it takes 100sec
  • Most of the load times happens after my after context created hooks (potentially when creating the catalog?)

I would like to see what I can do to improve load times or, at least figure out for sure what’s causing it. Any help would be nice (I cannot give access to the full project, but I will provide any info I can provide)

9 comments
M
M
J

Question about configuration. How nested can you organise your config environment? Is it possible to have something like the following file structure?

conf/
  base/
    crm/
      prm/
        parameters.yaml
        catalog.yaml
      feat/
        …
And still allow OmegaConf to read all files?

1 comment
d