Elena Khaustova
10/11/2024, 11:57 AMPython 3.8
support.
• Implemented KedroDataCatalog
repeating DataCatalog
functionality with a few API enhancements.
• Moved development requirements from requirements.txt
to the dedicated section in pyproject.toml
for project template.
• Moved pattern resolution logic from DataCatalog
to a separate component - CatalogConfigResolver
.
• Made packaged Kedro projects return session.run()
output to be used when running it in the interactive environment.
• Enhanced OmegaConfigLoader
configuration validation to detect duplicate keys at all parameter levels, ensuring comprehensive nested key checking.
• Added two new experimental datasets - PyTorchDataset
and ProphetModelDataset
and one core dataset - HTMLDataset
.
• Replaced the GeoJSONDataset
with GenericDataset
to support parquet and feather file formats.
• Refactored the ManagedTableDataset
by separating the common table logic into a separate BaseTableDataset
.
Note: KedroDataCatalog
is an experimental feature and is under active development. Therefore, it is possible we’ll introduce breaking changes to this class, so be mindful of that if you decide to use it already. Let us know if you have any feedback about the KedroDataCatalog
or ideas for new features.
🪲 Bug fixes and other changes
• Fixed bug where using dataset factories breaks with ThreadRunner
.
• Fixed a bug where SharedMemoryDataset.exists
would not call the underlying MemoryDataset
.
• Fixed bug related to loading/saving models from/to remote storage using TensorFlowModelDataset
.
• Fixed deprecated load and save approaches of GBQTableDataset
and GBQQueryDataset
.
💥 Breaking changes to the API
• Kedro: Removed ShelveStore
to address a security vulnerability.
• kedro-datasets: Exposed load
and save
publicly for each dataset. This requires Kedro version 0.19.7 or higher.
Read the complete release notes for kedro and kedro-datasets on GitHub.
Thanks a lot to our community contributors - @Puneet Saini, @Ethan Knights, Manezki, @Miguel RodrÃguez, felixscherz, @Yu-Sheng Kevin Li, @Brandon Meek, @Yury Fedotov, @Nelson Zambrano, @Janick Spirig Spirig, @Galen Seilis, mariusz89016, @Matthias Harms and @Minura Punchihewa ✨
We welcome every community contribution, large or small. See what we're working on now and report bugs or suggest future features.
Until next time,
The Kedro Team 💛