Luis Chaves Rodriguez
02/03/2025, 12:23 PM
mytable:
  type: pandas.SQLQueryDataset
  credentials: postgres_dwh
  filepath: sql/mytable.sql
basically, I'd like to pass parameters to the SQL query
Hall
02/03/2025, 12:23 PM
Ankita Katiyar
02/03/2025, 1:59 PM
mytable_{table}:
  type: pandas.SQLQueryDataset
  credentials: <cred>
  sql: SELECT * from {table}
And in your pipeline_registry.py / pipeline.py, have a script that reads the queries from a file and generates the pipeline dynamically.
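A minimal sketch of that suggestion, written with the standard library only so it runs without Kedro installed. The helper names (`read_table_names`, `build_node_specs`, `resolve`), the `processed_{table}` output names and the one-name-per-line input format are hypothetical. `resolve` only imitates how a pattern such as `mytable_{table}` gets filled in; it is not Kedro's implementation. In a real project each spec would be passed, together with a function, to `kedro.pipeline.node(...)`.

```python
from __future__ import annotations

import re

# Catalog pattern and entry copied from the message above.
PATTERN = "mytable_{table}"
ENTRY = {
    "type": "pandas.SQLQueryDataset",
    "credentials": "<cred>",
    "sql": "SELECT * from {table}",
}


def read_table_names(text: str) -> list[str]:
    """Parse one table name per line, skipping blank lines and '#' comments."""
    names = []
    for line in text.splitlines():
        line = line.strip()
        if line and not line.startswith("#"):
            names.append(line)
    return names


def build_node_specs(tables: list[str]) -> list[dict]:
    """Build one node spec per table; the input name must match PATTERN."""
    return [
        {
            "inputs": f"mytable_{table}",
            "outputs": f"processed_{table}",  # hypothetical output name
            "name": f"process_{table}",
        }
        for table in tables
    ]


def resolve(dataset_name: str, pattern: str = PATTERN, entry: dict = ENTRY) -> dict | None:
    """Imitate factory resolution: match the name, then fill the placeholders."""
    # Splitting on "{name}" yields literal text at even indexes, names at odd ones.
    parts = re.split(r"\{(\w+)\}", pattern)
    regex = "".join(
        re.escape(part) if i % 2 == 0 else f"(?P<{part}>.+)"
        for i, part in enumerate(parts)
    )
    match = re.fullmatch(regex, dataset_name)
    if match is None:
        return None
    return {key: value.format(**match.groupdict()) for key, value in entry.items()}


# Stand-in for the contents of a file listing the tables to load.
tables = read_table_names("customers\norders\n")
specs = build_node_specs(tables)
resolved = resolve(specs[0]["inputs"])
print(resolved["sql"])  # SELECT * from customers
```

Because the table name is substituted straight into the SQL text, the list of names should come from a trusted source.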
Dataset factories work by reading the dataset name from the pipeline inputs/outputs and then filling in the placeholders in the catalog entry, so the dataset names might get crazy looking
Luis Chaves Rodriguez
02/03/2025, 2:00 PM
Ankita Katiyar
02/03/2025, 3:14 PM
Nok Lam Chan
02/04/2025, 11:08 AM
ibis would be a better fit for SQL parameterisation and for lazily evaluated SQL spread across multiple nodes
Luis Chaves Rodriguez
02/04/2025, 11:13 AM
Nok Lam Chan
02/04/2025, 11:15 AM
Nok Lam Chan
02/04/2025, 11:16 AM