Júlio Resende
02/28/2024, 3:11 PMJuan Luis
02/28/2024, 3:21 PMJuan Luis
02/28/2024, 3:21 PMJúlio Resende
02/28/2024, 3:24 PMJúlio Resende
02/28/2024, 3:26 PMJuan Luis
02/28/2024, 4:04 PMds:
type: pandas.DeltaTableDataset
save_args:
mode: overwrite
partition_filters: "${globals:partition_filters}"
and then add your partition_filters
in conf/base/globals.yml
Juan Luis
02/28/2024, 4:04 PMJúlio Resende
02/28/2024, 9:17 PMJuan Luis
02/28/2024, 10:44 PMNok Lam Chan
02/29/2024, 12:28 PMJúlio Resende
02/29/2024, 4:20 PMJúlio Resende
02/29/2024, 4:34 PMname 'column_name' present in the specified schema is not found in the columns or index
, but column_name was defined as nullable in the specified schema.
I also had some problems related with __index_level_0__ column when no schema was specified (see this issue).
Using pyarrow.Table.from_pandas(df)
as node return fixed all these problems. Perhaps this function could be embedded into pandas.DeltaTableDataset in the next release of kedro datasets?Juan Luis
02/29/2024, 4:35 PMJúlio Resende
02/29/2024, 4:45 PMJuan Luis
02/29/2024, 4:52 PMJúlio Resende
02/29/2024, 5:15 PM