Mate Scharnitzky
02/01/2024, 7:28 AMthe only useful code is production code
. Then the article wakes you up with `unfortunately, deploying the same data pipelines in production often doesn't work as well as one would hope`Don't get me wrong, I had my own experience with the solution doesn't scale
, but there can be many reasons behind that, e.g., spark config is rabbit hole, costly operations, joins between large vs small tables...etc. I'm a bit unclear on the problem statement, what part of Kedro makes a production-ready data pipeline not scalable? Was it production-ready at the first place? If it's not scalable shouldn't we just start directly in SQL? Just to make it clear, I'm not challenging the value of Kedro - Ibis integration at all, I love this project and a big supporter of it. But I'd like to better understand the source of the scalability problem. Thank you!Jo Stichbury
02/01/2024, 9:47 AMdatajoely
02/01/2024, 10:15 AMMate Scharnitzky
02/01/2024, 10:29 AMNok Lam Chan
02/01/2024, 11:01 AMJuan Luis
02/01/2024, 11:14 AMJuan Luis
02/01/2024, 11:14 AMMate Scharnitzky
02/01/2024, 11:23 AMDeepyaman Datta
02/01/2024, 3:39 PMIan Whalen
02/01/2024, 4:30 PM