Join the Kedro community

Home
Members
Thiago José Moser Poletto
T
Thiago José Moser Poletto
Offline, last seen 2 days ago
Joined October 17, 2024

Guys, are there any built-in solution to handle large databases, so that the nodes run them partially, like, lets say, a 100k rows will be running in batches of 10k each. Instead of doing by hand with for loop or something like it...

7 comments
L
i
T
D

Guys, I would like to check with you if theres a simpler way to use a run_identifier on the path into the catalog:

I'm loading a base from BigQuery and spliting each row to run in another pipeline, where I load and save dynamically the inputs/outputs.

I would like to get a value from a column and use as run_identifier in the path on catalog:

filepath: ${root_folder}/${current_datetime}/${run_identifier}/data/model/{placeholder:name}.pt

is there a way known to do something like that? I open to suggestions...

51 comments
d
N
T

Guys I do have a problem I'm trying to use the kedro vertex ai plugin, but every time I try to do install, it also update kedro to 0.19.9, but the versions limit is "kedro>=0.18.1,<0.19.0". Any suggestions on how to workaround?

16 comments
T
J

Hey guys I would like to know if theres anyone that have tested the Kedro Vertex AI Plugin, on its latest version. I'm having some issues with async node runs, for some reason it is taking a lot longer than when run locally. It might be because I'm allocanting a GPU to parto of the process, but it shouldn't, in my perspective, so if anyone have any ideas or suggestions, I'll appreciate that...

12 comments
R
T