Brandon Meek
06/17/2024, 1:59 PM
datajoely
06/17/2024, 2:02 PM
Juan Luis
06/17/2024, 2:21 PM
Deepyaman Datta
06/17/2024, 2:25 PM
cars:
  type: universal_catalog.UniversalCatalogDataset
  source_name: cars
  url: http://localhost:5000/
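For readers following along, here is a minimal sketch of what a dataset behind an entry like this might do: resolve `source_name` against `url` and load whatever the server returns. Everything in it is an assumption made for illustration: the class name `RemoteEntryDataset`, the `<url>/<source_name>` route, and the JSON payload are hypothetical and do not describe the real `UniversalCatalogDataset`. A real Kedro dataset would subclass `kedro.io.AbstractDataset`; the sketch skips that to stay dependency-free.

```python
import json
from typing import Any, Callable
from urllib.parse import urljoin
from urllib.request import urlopen


def http_get_json(url: str) -> Any:
    """Fetch a URL and decode its JSON body (default fetcher)."""
    with urlopen(url) as response:
        return json.load(response)


class RemoteEntryDataset:
    """Hypothetical stand-in for UniversalCatalogDataset, not the real class."""

    def __init__(
        self,
        source_name: str,
        url: str,
        fetch: Callable[[str], Any] = http_get_json,
    ) -> None:
        self.source_name = source_name
        self.url = url
        self._fetch = fetch  # injectable so the sketch runs without a server

    def endpoint(self) -> str:
        # Assumed route layout: <url>/<source_name>
        return urljoin(self.url.rstrip("/") + "/", self.source_name)

    def load(self) -> Any:
        return self._fetch(self.endpoint())


# Offline usage: a dict lookup plays the role of the remote server.
fake_server = {"http://localhost:5000/cars": [{"make": "Saab", "year": 1999}]}
cars = RemoteEntryDataset(
    "cars", "http://localhost:5000/", fetch=fake_server.__getitem__
)
rows = cars.load()
```

With this shape, every catalog entry has to repeat `type`, `source_name`, and `url`, which is the repetition the rest of the message objects to.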
I'd rather not have to redefine each catalog entry as a UniversalCatalogDataset (maybe there are cases where I have local overrides, but not for every dataset); is it possible to just replace the catalog object with a remote catalog?
Brandon Meek
06/17/2024, 2:27 PM
Juan Luis
06/17/2024, 2:29 PM
Juan Luis
06/17/2024, 2:39 PM
# settings.py
from kedro_universal_catalog import UniversalCatalog

DATA_CATALOG_CLASS = UniversalCatalog
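As a rough illustration of what a catalog class plugged in this way might do internally, the sketch below merges a remote catalog configuration with the project's local entries, letting local entries win. This is only a sketch under assumptions: `merge_catalog_config`, the whole-entry override rule, and the example file paths are hypothetical and are not taken from `kedro_universal_catalog`.

```python
from typing import Any, Dict

CatalogConfig = Dict[str, Dict[str, Any]]


def merge_catalog_config(remote: CatalogConfig, local: CatalogConfig) -> CatalogConfig:
    """Return the remote entries, with same-named local entries replacing them whole."""
    # Copy each entry so neither input is mutated.
    merged = {name: dict(entry) for name, entry in remote.items()}
    for name, entry in local.items():
        merged[name] = dict(entry)  # local override wins (assumed policy)
    return merged


# Made-up example data: the remote catalog defines two datasets,
# and the project overrides one of them locally.
remote_config = {
    "cars": {"type": "pandas.CSVDataset", "filepath": "s3://shared/cars.csv"},
    "trucks": {"type": "pandas.CSVDataset", "filepath": "s3://shared/trucks.csv"},
}
local_config = {
    "cars": {"type": "pandas.CSVDataset", "filepath": "data/01_raw/cars.csv"},
}
merged = merge_catalog_config(remote_config, local_config)
```

In a Kedro project, a merged dictionary like this could then be handed to `DataCatalog.from_config`; whether to override whole entries or merge them key by key is a design choice the sketch does not settle.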
and most of the rest should be the same, right?
Deepyaman Datta
06/17/2024, 2:43 PM
datajoely
06/17/2024, 2:46 PM
Brandon Meek
06/17/2024, 3:16 PM
UniversalCatalog that pulls the entire catalog, and then if any of the values need to be overwritten you could use the OmegaConfigLoader merge strategies
Brandon Meek
06/17/2024, 7:55 PM
settings.py but it's a subclass that returns a DataCatalog that merges your project's DataCatalog with the remote catalog
Deepyaman Datta
06/18/2024, 7:14 AM
Juan Luis
06/18/2024, 8:17 AM
kedro-airflow, but then tried to look for prior art in the MLOps world and I haven't really seen any "centralized config store". if anything, there are secrets managers, experiment tracking systems for ML, and the rest is roughly what @Yolan Honoré-Rougé called "external configuration" in his first "universal Kedro deployments" issue.
looking at Google's seminal paper "Hidden Technical Debt in Machine Learning Systems", "Configuration" there refers only to ML systems: features, data, hyperparameters.
and yet, configuration in Kedro is much broader.
Deepyaman Datta
06/18/2024, 8:38 AM
datajoely
06/18/2024, 8:44 AM
datajoely
06/18/2024, 10:09 AM
Juan Luis
06/18/2024, 10:12 AM
Deepyaman Datta
06/18/2024, 10:24 AM
datajoely
06/18/2024, 10:25 AM