Gauthier Pierard
01/16/2025, 11:52 AM
Hall
01/16/2025, 11:52 AM
Rashida Kanchwala
01/16/2025, 11:56 AM
versioned: True in the catalog entry. This will save your datasets with a timestamp-based version for each kedro run.
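A minimal sketch of the on-disk layout this produces, assuming the documented pattern <filepath>/<version>/<filename> and a UTC timestamp with millisecond precision as the version. The helpers make_version and versioned_path are hypothetical names that only imitate the behaviour with the standard library; they are not Kedro functions.

```python
from datetime import datetime, timezone
from pathlib import PurePosixPath


def make_version(now: datetime) -> str:
    """Format a timestamp as a version string (assumed format, millisecond precision)."""
    stamp = now.astimezone(timezone.utc).strftime("%Y-%m-%dT%H.%M.%S.%f")
    return stamp[:-3] + "Z"  # trim microseconds down to milliseconds


def versioned_path(filepath: str, version: str) -> PurePosixPath:
    """Return <filepath>/<version>/<filename>, the layout of a versioned dataset."""
    path = PurePosixPath(filepath)
    return path / version / path.name


version = make_version(datetime(2025, 1, 16, 11, 56, 0, tzinfo=timezone.utc))
print(versioned_path("data/01_raw/test.csv", version))
```

Each run gets its own version folder, so earlier outputs are kept rather than overwritten.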
https://docs.kedro.org/en/stable/data/data_catalog.html#dataset-versioning
Gauthier Pierard
01/16/2025, 12:01 PM
output_folder_<parameter>_<from_date>_<to_date>, where from_date and to_date are defined by a node and saved as MemoryDatasets in the catalog.
Is it possible to define other catalog entries whose name depends on previous entries?
Merel
01/16/2025, 1:08 PM
Gauthier Pierard
01/16/2025, 1:11 PM
CSVDataset(
    filepath="s3://test_bucket/data/02_intermediate/company/motorbikes.csv",
    load_args=dict(sep=",", skiprows=5, skipfooter=1, na_values=["#NA", "NA"]),
    credentials=dict(key="token", secret="key"),
)
and
# save the dataset to data/01_raw/test.csv/<version>/test.csv
catalog.save("test_dataset", data1)
correct?
Merel
01/16/2025, 1:18 PM
Merel
01/16/2025, 1:19 PM
Gauthier Pierard
01/16/2025, 1:25 PM
Rashida Kanchwala
01/16/2025, 1:35 PM
filepath: data/02_intermediate/pypi_kedro_demo_${now:}.csv
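The ${now:} part is a resolver, so it has to be registered before the catalog is loaded. A minimal sketch, assuming the project uses OmegaConfigLoader and registers resolvers through CONFIG_LOADER_ARGS in settings.py. The function name current_timestamp and the timestamp format are illustrative assumptions, and the linked example may do this differently.

```python
# settings.py (sketch)
from datetime import datetime


def current_timestamp() -> str:
    """Return a filename-safe timestamp such as 20250116_133500 (format is an assumption)."""
    return datetime.now().strftime("%Y%m%d_%H%M%S")


# OmegaConfigLoader picks up custom resolvers from this dict;
# the key "now" is what makes ${now:} usable in catalog.yml.
CONFIG_LOADER_ARGS = {
    "custom_resolvers": {
        "now": current_timestamp,
    }
}
```

With this in place, the filepath above would resolve to a new file name on each run.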
Here is some example code: https://github.com/kedro-org/kedro/issues/2355#issuecomment-2260512795
Gauthier Pierard
01/16/2025, 1:39 PM
datasets.py, with which I am not familiar. Thanks for the idea in any case.
Rashida Kanchwala
01/16/2025, 1:41 PM
Merel
01/16/2025, 1:42 PM