Hi team So I am running a notebook as part of a kedro pipeli Kedro #questions

Hi team, So I am running a notebook as part of a ...

Giovanna Cavali

05/09/2024, 1:50 PM

Hi team, So I am running a notebook as part of a kedro pipeline (using nbconvert) and the notebook loads kedro context and saves metrics to catalog. It works well but the metrics are not showing in kedro viz experiment tracking. I think it is because the timestamp is different when we compare the entries of the regular kedro nodes and the metrics saved in the notebook. Any ideas here on how to solve this?

Nok Lam Chan

05/09/2024, 1:52 PM

It need to be the same Kedro session in order to use Kedro's experiment tracking. Did you create a separate session to run the notebook?

Giovanna Cavali

05/09/2024, 1:55 PM

how can we make sure we have the same session?

Giovanna Cavali

05/09/2024, 1:55 PM

I am actually just using

%load_ext kedro.ipython

Nok Lam Chan

05/09/2024, 2:07 PM

Is it possible to pass in anything to nbconvert instead of re-creating a new session?

Giovanna Cavali

05/09/2024, 2:30 PM

Hum, not sure. If we use papermill, we can input parameters.

Giovanna Cavali

05/09/2024, 2:30 PM

Is there a way to load a specific session in the jupyter notebook?

Nok Lam Chan

05/09/2024, 2:37 PM

not that I am aware of

Nok Lam Chan

05/09/2024, 2:39 PM

You may be able to create the KedroSession manually and force the session_id to be the same

Nok Lam Chan

05/09/2024, 2:40 PM

But either way you need to pass in some information into the notebook

Giovanna Cavali

05/09/2024, 2:46 PM

yes, makes sense!!

Giovanna Cavali

05/09/2024, 2:50 PM

any documentation on how we can force the session_id? once we are loading the context in the notebook?

Giovanna Cavali

05/09/2024, 2:50 PM

and actually how we can extract the session_id from the current run?

Nok Lam Chan

05/09/2024, 2:58 PM

I don't think there is any documentation on this, it's is more like a hack than an official API.

Nok Lam Chan

05/09/2024, 2:58 PM

The session is protected by design and ensure that it's always unique.

Nok Lam Chan

05/09/2024, 3:00 PM

you can get the session_id from hooks, https://docs.kedro.org/en/stable/api/kedro.framework.hooks.specs.PipelineSpecs.html#kedro.framework.hooks.specs.PipelineSpecs

Giovanna Cavali

05/09/2024, 3:36 PM

Got it!! thank you for the help!!! let's see what we can do

Giovanna Cavali

07/02/2024, 5:11 PM

Hello team, Going back to this issue where I want to run a notebook (kedro ipython) with the same

session_id

as the rest of the pipeline, I was able to • extract

session_id

using

hooks

• pass

session_id

as a parameter to notebook using

papermill

• and then creating a Kedro Session with a specific

session_id

with the code bellow:

Copy code

# Creating Kedro Session, Context and Catalog
from kedro.framework.session import KedroSession
from kedro.framework.startup import bootstrap_project
from pathlib import Path
import logging, sys

bootstrap_project(Path(".."))
session = KedroSession(session_id=session_id)
context = session.load_context()
catalog = context._get_catalog(save_version=session_id)

I wanted to check if there's any risk on forcing a

session_id

, anything we should watch out for?

Merel

07/08/2024, 4:47 PM

@Giovanna Cavali what you're doing here is outside of any recommended use of Kedro or in fact outside of any publicly exposed API. The session should be created through

KedroSession.create()

which doesn't take the

session_id

argument. There's an open issue exactly for allowing this: https://github.com/kedro-org/kedro/issues/1731 There's no clear view on all the consequences of doing this, but the most important part is that the

save_version

and session_id are connected.

context._get_catalog(save_version=session_id)

here you're also using private API. Long story short: you can do all of the above, none of it is recommended and you have to take responsibility to make sure you're aware of the consequences of using private APIs (never do this in a production system if you can avoid it) 😅

12 Views

Open in Slack

Previous Next