Jonathan Dekermanjian
01/09/2025, 3:39 PMHall
01/09/2025, 3:39 PMdatajoely
01/09/2025, 3:40 PMdatajoely
01/09/2025, 3:41 PM--async
flag?datajoely
01/09/2025, 3:41 PMJonathan Dekermanjian
01/09/2025, 3:48 PMdatajoely
01/09/2025, 4:04 PMdatajoely
01/09/2025, 4:05 PMNok Lam Chan
01/09/2025, 4:06 PMkedro run
, once you start the pipeline, what Kedro sees are a bunch of nodes (no more concept of pipeline), and the execution order are determined by solving the dependencies of nodes.
If the data is a persisted type,(i.e. CSVDataset), the memory of the data is released immediately after the node. If it's a memory type, it will be released once the data has no more downstream dependencies.Jonathan Dekermanjian
01/09/2025, 4:11 PMdatajoely
01/09/2025, 4:15 PMJonathan Dekermanjian
01/09/2025, 4:15 PMdatajoely
01/09/2025, 4:15 PMJonathan Dekermanjian
01/09/2025, 4:18 PMJonathan Dekermanjian
01/09/2025, 4:19 PMdatajoely
01/09/2025, 4:20 PMdatajoely
01/09/2025, 4:20 PMJonathan Dekermanjian
01/09/2025, 4:22 PMJonathan Dekermanjian
01/09/2025, 4:24 PMdatajoely
01/09/2025, 4:32 PMNok Lam Chan
01/09/2025, 4:52 PM@Nok Lam Chan interesting, but then I don’t know how to explain why the memory utilization never goes down?does it comes down when you are using SequentialRunner (the default)? Is this problem only appears with Parallerunner?
Jonathan Dekermanjian
01/09/2025, 8:34 PMNok Lam Chan
01/09/2025, 9:15 PM