Hugo Evers
06/27/2023, 2:01 PM
`sample_size` to a config):
node(
    func=train_test_split,
    inputs={"df": "input", "sample_size": 50},
    ...
),
However, this doesn’t seem to work, and I get an error referring to a separator. I noticed that in the modular pipeline, a similar syntax is allowed.
Is that on purpose?
What does work is:
node(
    func=lambda df: train_test_split(df, sample_size=50),
    inputs="input",
    ...
)
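For reference, the lambda workaround above simply binds `sample_size` before the function is handed to the node. A minimal sketch in plain Python, assuming a hypothetical stand-in for `train_test_split` (the real function is not shown in the thread), shows that `functools.partial` expresses the same binding:

```python
from functools import partial


def train_test_split(df, sample_size):
    """Hypothetical stand-in: split a list of rows into a sample and the rest."""
    return df[:sample_size], df[sample_size:]


# The workaround from the message above: a lambda that fixes sample_size.
split_with_lambda = lambda df: train_test_split(df, sample_size=50)

# The same binding expressed with functools.partial.
split_with_partial = partial(train_test_split, sample_size=50)

rows = list(range(200))
sample, rest = split_with_partial(rows)
print(len(sample), len(rest))  # 50 150
```

Either callable takes only `df`, so the value 50 never has to appear in `inputs`.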
Nok Lam Chan
06/27/2023, 2:06 PM
`sample_size` does not exist. You need to have `sample_size` in your `parameters.yml` and use `params:sample_size` instead.
> I noticed that in the modular pipeline, a similar syntax is allowed.
Can you share the example?
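To illustrate why a name such as `params:sample_size` works where a literal 50 does not, here is a simplified sketch of name-based input resolution. It is not Kedro's actual implementation: `resolve_inputs`, `lookup_parameter`, the stand-in `train_test_split`, and the dictionaries are all hypothetical, and the error Kedro itself raises may differ from the `TypeError` used here.

```python
def train_test_split(df, sample_size):
    """Hypothetical stand-in: split a list of rows into a sample and the rest."""
    return df[:sample_size], df[sample_size:]


def lookup_parameter(parameters, dotted_key):
    """Walk a nested dict using a dotted key such as 'finetuner.sample_size'."""
    value = parameters
    for part in dotted_key.split("."):
        value = value[part]
    return value


def resolve_inputs(inputs, datasets, parameters):
    """Turn {argument name: reference string} into {argument name: value}."""
    resolved = {}
    for arg_name, reference in inputs.items():
        # Every reference must be a name to look up, never a raw value.
        if not isinstance(reference, str):
            raise TypeError(f"{arg_name!r} must be a name (str), got {reference!r}")
        if reference.startswith("params:"):
            key = reference[len("params:"):]
            resolved[arg_name] = lookup_parameter(parameters, key)
        else:
            resolved[arg_name] = datasets[reference]
    return resolved


datasets = {"input": list(range(10))}
parameters = {"sample_size": 3}  # stands in for the contents of parameters.yml

kwargs = resolve_inputs(
    {"df": "input", "sample_size": "params:sample_size"}, datasets, parameters
)
sample, rest = train_test_split(**kwargs)
print(sample)  # [0, 1, 2]
```

In this sketch, passing `{"df": "input", "sample_size": 50}` raises a `TypeError`, because 50 is a value rather than a name that can be looked up.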
Hugo Evers
06/27/2023, 2:07 PM
inputs={
    "df": "input",
    "sample_size": "params:finetuner.sample_size",
},
`sample_size` does not exist, right? The issue is that 50 is not a valid input.
Nok Lam Chan
06/27/2023, 2:10 PM
`pipeline` is a string literal, which will be a reference to a `dataset` or `parameter`; you cannot pass a value directly there.
Hugo Evers
06/27/2023, 2:10 PM
Nok Lam Chan
06/27/2023, 2:11 PM
Hugo Evers
06/27/2023, 2:14 PM
Nok Lam Chan
06/27/2023, 2:33 PM
Hugo Evers
06/27/2023, 2:33 PM
Nok Lam Chan
06/27/2023, 2:41 PM
Hugo Evers
06/27/2023, 2:42 PM
Nok Lam Chan
06/27/2023, 3:01 PM