Hello!
I am having problems with kedro-mlflow. I am running a pipeline (pipeline-name) which finishes without raising any errors. The problem appears when I open the mlflow UI, where two runs are shown: the pipeline-name run and a run with a random name. In the pipeline-name run the model is logged but no parameters are shown, whereas in the run with the random name the hyperparameters of the model are logged. Moreover, this second run never ends, even after the pipeline has finished executing.
Does anyone know what could be happening?
Thank you!
Hey @Martin Gonzalez, it would be helpful if you could share some error logs/screenshots or a repo with which I could reproduce this.
@Ankita Katiyar It seems to be a problem with the parameters passed to the node through the pipeline configuration file. When the node does not receive the parameters as input, it does not generate two runs.
Could you explain a bit more about the parameters? They're not coming from the conf/base/parameters.yml but somewhere else?
node(
    func=fit_xgboost,
    name='fit_xgboost',
    inputs=['X_train', 'y_train', 'params:xgboost_conf'],
    outputs='XGBRegressor',
)
No, the parameters are from conf/base/parameters.yml
So if there's a params:something in the pipeline inputs, it's creating a spurious run but not otherwise?
Okay, I'll try this out and get back to you. CCing @Yolan Honoré-Rougé in case you have any insights
i have the same issue
kedro==0.19.11 kedro-mlflow==0.14.0 mlflow==2.20.1
kedro==0.19.10 and it works fine
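For anyone else hitting this, posting the exact installed versions as above makes it much easier to narrow down. A minimal sketch using only the Python standard library; the helper name `installed_versions` is made up for illustration and is not part of kedro or kedro-mlflow:

```python
from importlib.metadata import PackageNotFoundError, version


def installed_versions(packages):
    """Return {package name: installed version string, or None if missing}."""
    found = {}
    for name in packages:
        try:
            found[name] = version(name)
        except PackageNotFoundError:
            # Package is not installed in the current environment.
            found[name] = None
    return found


if __name__ == "__main__":
    report = installed_versions(["kedro", "kedro-mlflow", "mlflow"])
    for name, ver in report.items():
        print(f"{name}=={ver}" if ver else f"{name}: not installed")
```

Run it inside the same virtual environment that runs the pipeline, otherwise the reported versions may not be the ones actually in use.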
Wow very surprising, this type of error usually comes from mlflow or kedro-mlflow, not kedro.
The bug lies somewhere here: https://github.com/kedro-org/kedro/compare/0.19.10...0.19.11, but I don't see anything major, so this is likely unexpected. It seems to affect the hooks workflow.
Three related yet different bugs in the last 2 days on kedro-mlflow: https://github.com/Galileo-Galilei/kedro-mlflow/issues/624, https://github.com/Galileo-Galilei/kedro-mlflow/issues/623, https://github.com/Galileo-Galilei/kedro-mlflow/issues/622
@Ankita Katiyar I strongly suspect that the runner executor messes up pluggy https://github.com/kedro-org/kedro/compare/0.19.10...0.19.11#diff-6cfc8b43afcb8bfb74e8fe5f8ce1403285a191edc6abf63d763e000c572d8c5f
For those having the issue, can you confirm you use the standard runner, and not ThreadRunner or ParallelRunner?
The pipeline does not seem to be properly closed on error, and metrics create new runs.
Thanks for looking into this @Yolan Honoré-Rougé, I'll try to figure out what's going on on the Kedro side today!
@Martin Gonzalez Can you try with pip install kedro-mlflow==0.14.3? It should be fixed