Guillaume Tauzin
07/22/2025, 7:12 AMGuillaume Tauzin
07/22/2025, 7:54 AMconfig_resolver = catalog.config_resolver
config_resolver.resolve_pattern(dataset_name)
catalog.release(dataset_name)
df = catalog.load(dataset_name)datajoely
07/22/2025, 9:17 AMElena Khaustova
07/22/2025, 10:24 AMthe possibility of using dataset factories without a pipeline that define each single datatsetYou can use dataset factories without having a pipeline at all. If I understood the purpose correctly, you can just define factories in
catalog.yaml and then just do df = catalog.load(dataset_name) , the rest of the steps are not needed. The resolution happens under the hood.
This method allows you to see the resolved configuration without adding a dataset to the catalog:
ds_config = config_resolver.resolve_pattern(dataset_name)Guillaume Tauzin
07/22/2025, 11:17 AMload! I am not sure why I really thought you needed to resolve yourself specific dataset created from a factory before you could actually load it.
Thanks a lot @Elena Khaustova for explaining this :)