Hugo Evers
06/27/2023, 2:01 PMsample_size to a config):
node(
func=train_test_split,
inputs={"df": "input", "sample_size": 50},
...
),
However, this doesn’t seem to work and I get an error refering to a separator error.. I noticed that in the modular pipeline, a similar syntax is allowed.
Is that on purpose?
What does work is:
node(
func=lambda df: train_test_split(df, sample_size=50),
inputs="input",
...
)Nok Lam Chan
06/27/2023, 2:06 PMsample_size does not exist. You need to have sample_size in your parameters.yml and use params:sample_size instead.Nok Lam Chan
06/27/2023, 2:06 PMI noticed that in the modular pipeline, a similar syntax is allowed.Can you share the example?
Hugo Evers
06/27/2023, 2:07 PMinputs={
"df": "input",
"sample_size": "params:finetuner.sample_size",
},Hugo Evers
06/27/2023, 2:08 PMHugo Evers
06/27/2023, 2:09 PMsample_size does not exist right? the issue is that 50 is not a valid inputHugo Evers
06/27/2023, 2:09 PMNok Lam Chan
06/27/2023, 2:10 PMpipeline is a string literal, which will be reference to a dataset or parameter, you cannot pass value directly there.Nok Lam Chan
06/27/2023, 2:10 PMHugo Evers
06/27/2023, 2:10 PMNok Lam Chan
06/27/2023, 2:11 PMHugo Evers
06/27/2023, 2:14 PMHugo Evers
06/27/2023, 2:16 PMNok Lam Chan
06/27/2023, 2:33 PMHugo Evers
06/27/2023, 2:33 PMNok Lam Chan
06/27/2023, 2:41 PMHugo Evers
06/27/2023, 2:42 PMNok Lam Chan
06/27/2023, 3:01 PM