Hugo Evers
08/07/2024, 11:10 AM
Deepyaman Datta
08/07/2024, 12:52 PM
dolt_commit
in the dataset implementation on save (as you say, somewhat similar to MLflow dataset).
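A minimal sketch of the commit-on-save idea, under stated assumptions: `DoltCommittingTable` and the `execute` callable are hypothetical names made up for illustration (not part of Kedro or Dolt), and `execute` is assumed to run one SQL statement against a Dolt SQL server. The only Dolt-specific piece is the `DOLT_COMMIT` stored procedure called at the end of `save`.

```python
from typing import Any, Callable, Sequence

# Hypothetical: any callable that runs one parameterised SQL statement,
# e.g. a thin wrapper around a DB-API cursor connected to a Dolt SQL server.
Execute = Callable[[str, Sequence[Any]], Any]


class DoltCommittingTable:
    """Illustrative stand-in for a custom dataset; not a Kedro or Dolt API."""

    def __init__(self, table: str, execute: Execute) -> None:
        self._table = table  # assumed to be a trusted identifier
        self._execute = execute

    def save(self, rows: Sequence[Sequence[Any]]) -> None:
        if not rows:
            return  # nothing written, so nothing to commit
        placeholders = ", ".join(["%s"] * len(rows[0]))
        for row in rows:
            self._execute(
                f"INSERT INTO {self._table} VALUES ({placeholders})", list(row)
            )
        # Stage all tables (-A) and create a Dolt commit for this save.
        self._execute(
            "CALL DOLT_COMMIT('-A', '-m', %s)", [f"kedro: save {self._table}"]
        )
```

In a real project the same `save` logic would presumably live in the `save` method of a custom Kedro dataset, with the commit message built from whatever run metadata is useful.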
Let me reach out to a colleague on the Ibis team who talked to the Dolt team recently, to see if they have any thoughts...
Hugo Evers
08/07/2024, 1:09 PM
Deepyaman Datta
08/07/2024, 1:55 PM
Hugo Evers
08/07/2024, 2:05 PM
Deepyaman Datta
08/07/2024, 2:36 PM
I’ve worked extensively with Kedro combined with SQL, either custom datasets to use SQLAlchemy, or very hacky solutions to work with SQLQueryDataset/SQLTableDataset, but I'm looking to clean up my act.
Highly recommend using Ibis for this (via https://docs.kedro.org/projects/kedro-datasets/en/kedro-datasets-4.1.0/api/kedro_datasets.ibis.TableDataset.html). I am biased as I work on Ibis; that said, I do believe it's the best way to do this now in Kedro, especially given the Python dataframe model fits well. Happy to share some more context/answer questions, if you'd like.
I don't know whether there are action items rn, but basically I searched for Dolt in this Slack channel and found nothing, so if there is interest in collecting some best practices wrt Dolt and Kedro, we can keep this thread alive and maybe move the discussion somewhere more structured if there is some momentum for it?
Yeah, haven't seen any recent activity on this front. If you do write up/share your implementation along the way, will be happy to see it!
Nok Lam Chan
08/09/2024, 9:56 PM