Deepyaman Datta
05/09/2024, 11:47 PMDeepyaman Datta
05/10/2024, 12:06 AMpd.Series
as outputs of nodes by convention in many examples, so basically your node graph counts as column-level lineage?
I do like bits and pieces of positioning around Hamilton, but I'm definitely put off by false advertising. 😂Thierry Jean
05/10/2024, 2:12 PMkedro run
is largely promoted as the principal way to execute code, which limits it's portability.
## LLMs
- we have multiple examples on GitHub and ready-to-use dataflows for vector search, RAG, text summarization, etc.
- the key difference here is how the Hamilton Driver
facilitates in-memory operations vs. the Kedro Runner
classes. As far as I understand, you wouldn't run Kedro within a FastAPI application, but it's a common pattern for Hamilton (online feature engineering example).
## Column-level operations
- it's a the core of Hamilton since it's inception.
- We have many utilities to create column-level nodes from dataframe-level nodes allowing for very granular lineage, data validation, and schema checking.
- Our Driver
is able to resolve these column-level nodes into a dataframe seamlessly for users running Pandas, Polars (regular and lazy), Dask, PySpark, Vaex
- It's a beloved feature and why many companies adopt Hamilton for feature engineering and powering their ML pipelines.Thierry Jean
05/10/2024, 2:13 PMNok Lam Chan
05/10/2024, 2:18 PMThierry Jean
05/10/2024, 2:23 PMWilliam Caicedo
05/12/2024, 8:52 PMkedro-boot
you can pretty much run a Kedro session anywhere, including a FastAPI service. I’ve been running Kedro sessions inside a Slack bot handler for a while now, and invoking pipeline steps from external code by distributing the project as a pypi package. That’s very portable if you ask me.Martin S
05/18/2024, 3:25 AMThierry Jean
05/18/2024, 7:11 PMfrom_nodes
and to_nodes
provide a declarative API and match the overrides
and the requested nodes in Hamilton
@Deepyaman Datta Let me know if there's any false advertising!Nok Lam Chan
05/18/2024, 7:48 PMThierry Jean
05/18/2024, 11:01 PM