Jamal Sealiti
05/20/2025, 11:54 AMJuan Luis
05/20/2025, 11:57 AMJamal Sealiti
05/20/2025, 11:57 AMJamal Sealiti
05/20/2025, 11:59 AMJamal Sealiti
05/20/2025, 12:00 PMJuan Luis
05/20/2025, 12:07 PMJuan Luis
05/20/2025, 12:07 PMJamal Sealiti
05/20/2025, 12:22 PMJuan Luis
05/20/2025, 1:08 PMJamal Sealiti
05/20/2025, 1:12 PMDeepyaman Datta
05/20/2025, 3:04 PMThen there are plans in the near future to further develop Kedro into a version that is more streaming-oriented?I'd say, again, this depends on what it entails/what the current gaps are. IMO the biggest issue with Kedro for streaming isn't defining the logic (if you can write it in Python, you can shoehorn it into Kedro 😉 to an extent); the bigger issue is that
kedro run
(or, more generally, running at a point in time) doesn't make sense for streaming, and you need to "deploy" the streaming application. In that regard, there's been little work so far, partially because there also haven't been users looking to do true streaming work.
Realistically, what you would do would probably be quite similar to what https://github.com/getindata/dbt-flink-adapter does for dbt. Another powerful approach with Kedro could be to use Ibis (with either a Flink or Spark Streaming backend), if you don't specifically want Spark Streaming.Jamal Sealiti
05/21/2025, 9:30 AMDeepyaman Datta
05/21/2025, 1:21 PMThen there are plans in the near future to further develop Kedro into a version that is more streaming-oriented?In that case, I highly doubt this would happen in the near future, unless it's driven by the community; Kedro is a fairly unopinionated structurer of Python code and this works fairly well for batch workflows. Agree streaming requires more things to be built in to work, and this could be a plugin or something, but I don't know how much demand there has been for this to be prioritized from the core team at this point.