Alex Shawn
10/24/2024, 3:08 AM
kedro-airflow-k8s plugin version 0.8.2. I've followed the documentation provided, but I'm unable to get it to function as expected.
Here's the output I'm getting when I try to use the commands:
(kedro-17.0) $ pip show kedro
Name: kedro
Version: 0.16.6
Summary: Kedro helps you build production-ready data and analytics pipelines
Home-page: https://github.com/quantumblacklabs/kedro
Author: QuantumBlack Labs
Author-email:
License: Apache Software License (Apache 2.0)
Requires: anyconfig, cachetools, click, cookiecutter, fsspec, jmespath, jupyter-client, pip-tools, pluggy, python-json-logger, PyYAML, setuptools, toposort
Required-by: kedro-airflow-k8s, kedro-docker, kedro-telemetry
(kedro-17.0) $ kedro install
Usage: kedro [OPTIONS] COMMAND [ARGS]...
Try 'kedro -h' for help.
Error: No such command 'install'.
(kedro-17.0) $ kedro airflow-k8s
Usage: kedro [OPTIONS] COMMAND [ARGS]...
Try 'kedro -h' for help.
Error: No such command 'airflow-k8s'.
(kedro-17.0)$ pip show kedro-airflow-k8s
Name: kedro-airflow-k8s
Version: 0.8.2
Summary: Kedro plugin with Airflow on Kubernetes support
Home-page: https://github.com/getindata/kedro-airflow-k8s/
Author: Michal Zelechowski, Mariusz Strzelecki, Mateusz Pytel
Author-email: mateusz@getindata.com
License: Apache Software License (Apache 2.0)
Requires: click, kedro, pip-tools, python-slugify, semver, tabulate
Required-by:
Another issue occurs when I include the configurations for X_test, y_train, and y_test in the catalog.yml file as follows:
X_train:
  type: pickle.PickleDataSet
  filepath: data/05_model_input/X_train.pickle
  layer: model_input
y_train:
  type: pickle.PickleDataSet
  filepath: data/05_model_input/y_train.pickle
  layer: model_input
X_test:
  type: pickle.PickleDataSet
  filepath: data/05_model_input/X_test.pickle
  layer: model_input
y_test:
  type: pickle.PickleDataSet
  filepath: data/05_model_input/y_test.pickle
  layer: model_input
When I then execute kedro run locally, I receive the following exception:
Class 'pickle.PickleDataset' not found, is this a typo?
However, I have confirmed that kedro-datasets is installed in my environment:
(kedro-17.0) $ pip show kedro-datasets
Name: kedro-datasets
Version: 5.0.0
Summary: Kedro-Datasets is where you can find all of Kedro's data connectors.
Home-page:
Author: Kedro
Author-email:
License: Apache Software License (Apache 2.0)
Requires: kedro, lazy-loader
Required-by:
Could someone please assist me in resolving this issue? I would greatly appreciate any guidance you can provide.
marrrcin
10/24/2024, 7:29 AM
kedro-airflow
Also a good point of reference for more advanced stuff:
https://getindata.com/blog/deploying-kedro-pipelines-gcp-composer-airflow-node-grouping-mlflow/
Yolan Honoré-Rougé
10/24/2024, 8:18 AM
pickle.PickleDataset instead of pickle.PickleDataSet (notice the correct version has a lowercase "s" at the beginning of "set")
Merel
10/24/2024, 9:12 AM
0.16.6, is that correct? That predates kedro-datasets, so it won't be using the datasets from there. Any chance you can update to a more recent Kedro version?
Alex Shawn
10/24/2024, 3:50 PM
Alex Shawn
10/25/2024, 2:36 PM
Alex Shawn
10/25/2024, 2:37 PM
Alex Shawn
10/25/2024, 2:47 PM
Merel
10/28/2024, 9:17 AM
Dmitry Sorokin
10/29/2024, 4:56 PM
kedro-docker plugin: https://github.com/kedro-org/kedro-plugins/tree/main/kedro-docker
2. Generate an Airflow DAG using the kedro-airflow plugin: https://github.com/kedro-org/kedro-plugins/tree/main/kedro-airflow
3. Take the generated DAG and replace the KedroOperator with KubernetesPodOperator(). Make sure to link the operator to the Docker container from step 1, instead of using the packaged project used by the kedro-airflow plugin.
4. Upload the DAG and Container to K8
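A rough sketch of what step 3 might look like, in case it helps. This is only an illustration, not the output of kedro-airflow: the image tag, namespace, and node names below are hypothetical placeholders, and it assumes the container can run `kedro run --nodes=<name>` (the exact flag spelling differs between Kedro versions, so check `kedro run -h` in your image). The helper only builds keyword arguments (task_id, name, namespace, image, cmds, arguments) for KubernetesPodOperator; the operator itself is not imported, so the snippet runs without Airflow installed.

```python
from typing import Any, Dict, List


def pod_task_kwargs(node_name: str, image: str, namespace: str = "airflow") -> Dict[str, Any]:
    """Build keyword arguments for one pod task that runs a single Kedro node."""
    # Kubernetes object names must be lowercase and cannot contain underscores.
    pod_name = node_name.lower().replace("_", "-")
    return {
        "task_id": node_name,
        "name": pod_name,
        "namespace": namespace,
        "image": image,  # the Docker image built in step 1
        "cmds": ["kedro"],
        "arguments": ["run", f"--nodes={node_name}"],  # assumed CLI flag
    }


# Hypothetical image tag and node names, for illustration only.
IMAGE = "my-registry/my-kedro-project:latest"
NODES: List[str] = ["split_data", "train_model"]

tasks = [pod_task_kwargs(node, IMAGE) for node in NODES]

for kwargs in tasks:
    command = " ".join(kwargs["cmds"] + kwargs["arguments"])
    print(f"{kwargs['task_id']}: {kwargs['image']} -> {command}")

# In the generated DAG file, each dict would be unpacked into the operator,
# e.g. KubernetesPodOperator(**kwargs), in place of the KedroOperator tasks.
```

Task dependencies between the pods would still come from the generated DAG; only the operator that executes each node changes.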