# questions
c
Hey all, we're currently running MLflow experiments using a Kedro pipeline. The pipeline produces intermediate datasets. I'd like to run multiple experiments concurrently while avoiding file collisions. What is the best approach for doing this in Kedro? Does anyone know whether we can refer to
params
in the
catalog.yml
in order to make the paths dynamic?
e
you could use Jinja + namespaces
e
Following to learn 🙂 not sure how Jinja + namespaces work.
c
Hey @Erwin do you have an example of using Jinja in the catalog by chance?
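[Editor's note: for reference, a minimal sketch of Jinja in the catalog. It assumes a config loader with Jinja2 support enabled (e.g. Kedro's TemplatedConfigLoader); the dataset names, type, and paths are illustrative, not from this thread:]

```yaml
# catalog.yml -- hypothetical sketch; assumes the project's config
# loader has Jinja2 support enabled (e.g. TemplatedConfigLoader)
{% for exp in ["exp_a", "exp_b"] %}
{{ exp }}.intermediate_data:
  type: pandas.ParquetDataSet
  # each experiment writes under its own directory, avoiding collisions
  filepath: data/02_intermediate/{{ exp }}/intermediate_data.parquet
{% endfor %}
```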
d
Are you using the
kedro-mlflow
plug-in?
and I think hooks are the best way to do this if you're not
l
@datajoely small typo but surely you meant
kedro-mlflow
plugin 😉
👍 1
Regarding referring to
params
to make paths dynamic, using namespaces: few days ago a similar question was asked and in the reply thread we might have a possible implementation for you https://kedro-org.slack.com/archives/C03RKP2LW64/p1692872422344789 Without any additional work, no you cannot refer to
params
in the
catalog.yml
at the moment. -- However, assuming you are indeed using the
kedro-mlflow
plugin and logging mlflow artifacts using the
kedro_mlflow.io.artifacts.MlflowArtifactDataSet
- then I wouldn't think you need to refer to your params.yml to make the filepath dynamic, since the dataset would be logged as an artifact to the mlflow run with the params you need 🤔 If you share some more about how you are setting up your concurrent runs, I suspect you could get away with just using
namespaces
by following the example in the docs here: https://docs.kedro.org/en/stable/data/data_catalog.html#example-3-generalise-datasets-using-namespaces-into-one-dataset-factory
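[Editor's note: the namespace/dataset-factory pattern that docs page describes looks roughly like this; the dataset type and filepath below are illustrative:]

```yaml
# catalog.yml -- one dataset factory pattern serving every namespace
# (dataset type and filepath are illustrative)
"{namespace}.intermediate_data":
  type: pandas.ParquetDataSet
  # {namespace} is substituted per pipeline namespace at resolution time
  filepath: data/02_intermediate/{namespace}/intermediate_data.parquet
```

Each namespaced dataset (e.g. `exp_a.intermediate_data`) then resolves to its own filepath, so concurrent runs write to separate directories.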
d
yes, kedro-mlflow 🤦