Richard Purvis
02/27/2024, 7:36 PM
pd.read_csv with a chunksize arg. I have seen the lazy save for a partitioned dataset article (link). However this requires a pre-defined dictionary with callable items, and if you are iterating through chunks you wouldn't be able to predefine keys.
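For context, here is a minimal sketch of the lazy-saving pattern mentioned above, assuming a node whose output is bound to a partitioned dataset. The node returns a dictionary that maps partition keys to zero-argument callables, and each callable only runs at save time. The function name `split_by_region` and the `region` column are hypothetical and chosen only for illustration.

```python
from typing import Callable, Dict

import pandas as pd


def split_by_region(df: pd.DataFrame) -> Dict[str, Callable[[], pd.DataFrame]]:
    """Return one lazily evaluated partition per region (hypothetical node)."""

    def make_loader(region: str) -> Callable[[], pd.DataFrame]:
        # Bind `region` now; the filtering only runs when the callable is invoked.
        return lambda: df[df["region"] == region].reset_index(drop=True)

    # The keys must be known when the dictionary is built.
    regions = sorted(df["region"].unique())
    return {f"region_{region}": make_loader(region) for region in regions}
```

With chunked reading, the number of chunks is not known until the file has been consumed, which seems to be the difficulty raised here.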
CC @Yury Fedotov
datajoely
02/28/2024, 2:16 AM
Juan Luis
02/28/2024, 6:45 AM
Juan Luis
02/28/2024, 6:48 AM
yield them? something else?
Richard Purvis
02/28/2024, 1:23 PM
enumerate() function?
datajoely
02/28/2024, 1:24 PM
Juan Luis
02/28/2024, 1:39 PM
yield and a custom dataset, please have a look at https://docs.kedro.org/en/stable/nodes_and_pipelines/nodes.html#saving-data-with-generators
If this isn't quite it, let's continue the conversation
Richard Purvis
02/29/2024, 5:06 PM
Juan Luis
02/29/2024, 5:13 PM
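
For readers of this thread, a rough sketch of the generator approach linked above, written with plain pandas so that it runs without a Kedro project. The node yields one processed chunk at a time, and the save step appends each chunk instead of overwriting the output. The names `process_chunks` and `save_appending`, the columns `a` and `b`, and the in-memory buffers are all assumptions made for illustration. In Kedro the appending would presumably live in a custom dataset's save logic, as the linked docs page describes, and `save_appending` is only a stand-in for that.

```python
import io
from typing import Iterator

import pandas as pd


def process_chunks(chunks: Iterator[pd.DataFrame]) -> Iterator[pd.DataFrame]:
    """Generator-style node: yield each processed chunk instead of returning them all."""
    for chunk in chunks:
        # Hypothetical per-chunk transformation.
        yield chunk.assign(total=chunk["a"] + chunk["b"])


def save_appending(chunks: Iterator[pd.DataFrame], buffer: io.StringIO) -> None:
    """Stand-in for a dataset whose save step appends rather than overwrites."""
    for i, chunk in enumerate(chunks):
        # Write the header only for the first chunk.
        chunk.to_csv(buffer, index=False, header=(i == 0))


# In-memory stand-ins for the input and output files.
source = io.StringIO("a,b\n1,2\n3,4\n5,6\n")
target = io.StringIO()

# With chunksize set, read_csv returns an iterator of DataFrames of up to 2 rows each.
reader = pd.read_csv(source, chunksize=2)
save_appending(process_chunks(reader), target)
result = target.getvalue()
```

Only one chunk is held in memory at a time, and no partition keys need to be known in advance. `enumerate()` is used here only to decide when to write the header.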