Simon Wolf
03/24/2023, 9:27 AMTomás Rojas
03/24/2023, 1:33 PMkedro run
? I have a model training pipeline which I want to skip since I have a model already workingJulien Witty
03/24/2023, 4:00 PMTomás Rojas
03/26/2023, 9:45 PMMaxime Steinmetz
03/27/2023, 1:06 PMSanjeev
03/27/2023, 4:09 PMDotun O
03/27/2023, 4:58 PMAlfonso Licir
03/27/2023, 10:17 PMAlfonso Licir
03/27/2023, 10:17 PMAlfonso Licir
03/27/2023, 10:17 PMSergei Benkovich
03/28/2023, 8:29 AMAlexandre Ouellet
03/28/2023, 2:09 PMZiren Lin
03/28/2023, 3:44 PM#globals.yml
order_number: 'abc'
#catalog.yml
sql:
type: pandas.SQLQueryDataset
sql: "SELECT * FROM table WHERE column = ${order_number}"
Filip Panovski
03/29/2023, 8:46 AMValueError: The following datasets are used with transcoding, but were referenced without the separator: typed_invoices
Please specify a transcoding option or rename the datasets.
Details within thread.Christianne Rio Ortega
03/29/2023, 10:02 AMAna Man
03/29/2023, 10:31 AMSj
03/29/2023, 2:04 PMZoran
03/29/2023, 6:10 PMMiguel Angel Ortiz Marin
03/29/2023, 8:10 PMMiguel Angel Ortiz Marin
03/29/2023, 8:12 PMJuan Luis
03/30/2023, 8:40 AMNok Lam Chan
03/30/2023, 9:09 AMIñigo Hidalgo
03/30/2023, 9:58 AMAndreas Zeitler
03/30/2023, 11:47 AMnode(
modeloutput.predict,
inputs=["estimator", "modelinput_x_" + name],
outputs="modeloutput_" + name,
tags=["output", "output_" + name] + tags,
name="predict_" + name
)
The output is configured in the data catalog.
hook after_node_run_run:
if node.name == 'predict_dach_testsplit_test':
//This is the output of the node:
y_pred_prob_comb_test = catalog.load('modeloutput_dach_testsplit_test')
[..]
In the logs, it seems like kedro tries to load the data in the hook, before is was written by the catalog.
Is this possible and is it meant to act like that?
Could be fixed by using "outputs['modeloutput_dach_testsplit_test'] instead of catalog.load., but in my understanding it should not be necessary.
Thanks in advance!Priyanka Patil
03/30/2023, 12:08 PMNikola Shahpazov
03/30/2023, 12:28 PMAndrej Zachar
03/30/2023, 6:10 PMMassinissa Saïdi
03/31/2023, 12:29 PMDataSetError: <class 'sklearn.pipeline.Pipeline'> was not serialised due to: Can't pickle local object 'fit_best_model.<locals>.<lambda>'
I just return a partitioned pickle dataset like that return {'model_' + parameters['model']: pipeline}
and I define the dataset in catalog.yml like that
models_partionned:
type: PartitionedDataSet
path: data/06_models/${date}/${target}/
filename_suffix: ".pkl"
dataset:
type: pickle.PickleDataSet
Olivier Ho
03/31/2023, 1:14 PMPartitionedDataSet
return a dictionary of callable that enable lazy loading and IncrementalDataSet
which inherit for PartitionedDataSet
return a dictionary of the content?
• how does the IncrementalDataSet
work if you use it as an input of node? I do not see the call to the confirm
so I don't understand when is the checkpoint createdSebastian Cardona Lozano
04/01/2023, 2:31 PM