Sudip Bhandari
04/22/2025, 2:52 PMmykedroproject/
), as specified in my catalog.yml
. However, I've noticed that when I implement MLflow, artifacts and metrics are logged in a different location (under the mlruns
directory). This results in the same outputs being stored twice: once through Kedro and again via MLflow.
Do you have any advice on how to address this issue so that I store results only once? Ideally, I would like to have specific artifacts displayed in the MLflow UI, sourced directly from the mykedroproject/
folder.
Thanks in advance!!Jitendra Gundaniya
04/22/2025, 3:57 PMkedro-mlflow
plugin? You can follow the official guide here: Kedro-MLflow Plugin Guide.
This guide explains how to configure and use the plugin effectively. If you need advanced customisation, refer to the last chapter of the guide, which details how to use hooks.Yolan Honoré-Rougé
04/22/2025, 6:13 PMkedro run
. If you have kedro versioning enable you can turn it off, but if you are using dataset without versioning, this is the intended behaviour.Yolan Honoré-Rougé
04/22/2025, 6:14 PMSudip Bhandari
04/22/2025, 8:02 PMregressor.pickle
, exclusively in the mlruns
directory. However, I am facing challenges with the reverse process: I want MLflow to retrieve artifacts directly from my Kedro project directory, ensuring that my project structure remains intact (without having the same artifact be duplicated on mlruns
)
Specifically, I want to maintain my organized subfolders within mykedroproject
(e.g., raw
, features
, etc.) that adhere to the Kedro layer nomenclature. This arrangement makes debugging more straightforward, as I can avoid using run IDs and UUIDs assigned by MLflow.
And yes, I am currently versioning my Kedro run artifacts.
Any thoughts/advice on this?Laurens Vijnck
04/23/2025, 8:58 AM