Is it possible to load a versioned pipeline run versioned wi Kedro #questions

fmfreeze

06/07/2023, 10:29 AM

Is it possible to load a "versioned" pipeline run (versioned with experiment tracking)? I have a couple of `MemoryDataSet`s flowing around a pipeline, and I want to inspect them for individual tracked pipeline runs after the run (e.g. load them again like with `session.run(to_outputs...)`, but for a specific experiment run from the past).

Nok Lam Chan

06/07/2023, 10:45 AM

Are you thinking some sort of time-travel feature?

Nok Lam Chan

06/07/2023, 10:48 AM

Memory data is ephemeral. If you want to inspect it after a run, you need to persist it to disk by registering it as a dataset in the catalog.

fmfreeze

06/07/2023, 10:51 AM

Well, more of a way to inspect parts of an already-run pipeline captured by experiment tracking. That is already possible with versioned datasets, but not for `MemoryDataSet`s, which (I think?!) would need some sort of re-run of that tracked pipeline.

Nok Lam Chan

06/07/2023, 11:00 AM

You have to persist it instead of using `MemoryDataSet`. Once the run finishes, everything held in memory is gone, and it's impossible to extract it afterwards.
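To illustrate why persisting makes a past run inspectable, here is a minimal stdlib-only sketch. It is not Kedro's API: `save_versioned` and `load_versioned` are hypothetical helpers, and the `<name>/<version>/<name>.pkl` layout only loosely mirrors how a versioned dataset keeps one copy per run. In Kedro itself the equivalent is to register the intermediate output in the catalog as a file-backed dataset with `versioned: true`.

```python
import pickle
from datetime import datetime, timezone
from pathlib import Path


def save_versioned(obj, base_dir, name, version=None):
    """Pickle obj to <base_dir>/<name>/<version>/<name>.pkl and return the version."""
    # Hypothetical helper: a timestamp string stands in for a run/version ID.
    version = version or datetime.now(timezone.utc).strftime("%Y-%m-%dT%H.%M.%S.%fZ")
    target = Path(base_dir) / name / version / f"{name}.pkl"
    target.parent.mkdir(parents=True, exist_ok=True)
    with target.open("wb") as fh:
        pickle.dump(obj, fh)
    return version


def load_versioned(base_dir, name, version=None):
    """Load a specific version, or the latest one if version is None."""
    root = Path(base_dir) / name
    if version is None:
        # Timestamp-style names sort lexicographically, so max() is the newest.
        version = max(p.name for p in root.iterdir() if p.is_dir())
    with (root / version / f"{name}.pkl").open("rb") as fh:
        return pickle.load(fh)
```

Because every run writes to its own version directory, an intermediate result from an earlier run can be reloaded by its version string without re-running anything, which is exactly what an in-memory dataset cannot offer.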

fmfreeze

06/07/2023, 11:02 AM

Thank you @Nok Lam Chan for making that clear. One related follow-up question: is there a flag or another supported way to exclude a single `kedro run` from the experiment tracking?

Tynan

06/07/2023, 11:35 AM

Not at the moment, no. If you want to exclude a run you'd have to delete it from the session.db file

👍 1
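A rough sketch of that manual cleanup, assuming session.db is a SQLite file. The table name `runs` and key column `id` used as defaults below are assumptions, not confirmed in this thread, so list the tables first and back up the file before deleting anything. `list_tables` and `delete_run` are hypothetical helper names.

```python
import sqlite3


def list_tables(db_path):
    """Return the table names in the SQLite file, to check the real schema first."""
    conn = sqlite3.connect(db_path)
    try:
        rows = conn.execute(
            "SELECT name FROM sqlite_master WHERE type = 'table' ORDER BY name"
        ).fetchall()
        return [name for (name,) in rows]
    finally:
        conn.close()


def delete_run(db_path, run_id, table="runs", id_column="id"):
    """Delete one run row and return how many rows were removed."""
    # ASSUMPTION: the default table and column names may not match your schema.
    # Identifiers cannot be bound as parameters, so only pass trusted names here.
    conn = sqlite3.connect(db_path)
    try:
        cur = conn.execute(
            f'DELETE FROM "{table}" WHERE "{id_column}" = ?', (run_id,)
        )
        conn.commit()
        return cur.rowcount
    finally:
        conn.close()
```

A return value of 0 from `delete_run` means no row matched the given ID, which is a hint that either the ID or the assumed table and column names are wrong.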
