Alex Shawn
10/24/2024, 3:08 AM
kedro-airflow-k8s plugin version 0.8.2. I've followed the documentation provided, but I'm unable to get it to function as expected.
Here's the output I'm getting when I try to use the commands:
(kedro-17.0) $ pip show kedro
Name: kedro
Version: 0.16.6
Summary: Kedro helps you build production-ready data and analytics pipelines
Home-page: https://github.com/quantumblacklabs/kedro
Author: QuantumBlack Labs
Author-email:
License: Apache Software License (Apache 2.0)
Requires: anyconfig, cachetools, click, cookiecutter, fsspec, jmespath, jupyter-client, pip-tools, pluggy, python-json-logger, PyYAML, setuptools, toposort
Required-by: kedro-airflow-k8s, kedro-docker, kedro-telemetry
(kedro-17.0) $ kedro install
Usage: kedro [OPTIONS] COMMAND [ARGS]...
Try 'kedro -h' for help.
Error: No such command 'install'.
(kedro-17.0) $ kedro airflow-k8s
Usage: kedro [OPTIONS] COMMAND [ARGS]...
Try 'kedro -h' for help.
Error: No such command 'airflow-k8s'.
(kedro-17.0) $ pip show kedro-airflow-k8s
Name: kedro-airflow-k8s
Version: 0.8.2
Summary: Kedro plugin with Airflow on Kubernetes support
Home-page: https://github.com/getindata/kedro-airflow-k8s/
Author: Michal Zelechowski, Mariusz Strzelecki, Mateusz Pytel
Author-email: mateusz@getindata.com
License: Apache Software License (Apache 2.0)
Requires: click, kedro, pip-tools, python-slugify, semver, tabulate
Required-by:
Another issue occurs when I include the configurations for X_test, y_train, and y_test in the catalog.yml file as follows:
X_train:
  type: pickle.PickleDataSet
  filepath: data/05_model_input/X_train.pickle
  layer: model_input
y_train:
  type: pickle.PickleDataSet
  filepath: data/05_model_input/y_train.pickle
  layer: model_input
X_test:
  type: pickle.PickleDataSet
  filepath: data/05_model_input/X_test.pickle
  layer: model_input
y_test:
  type: pickle.PickleDataSet
  filepath: data/05_model_input/y_test.pickle
  layer: model_input
When I then execute kedro run locally, I receive the following exception:
Class 'pickle.PickleDataset' not found, is this a typo?
However, I have confirmed that kedro-datasets is installed in my environment:
(kedro-17.0) $ pip show kedro-datasets
Name: kedro-datasets
Version: 5.0.0
Summary: Kedro-Datasets is where you can find all of Kedro's data connectors.
Home-page:
Author: Kedro
Author-email:
License: Apache Software License (Apache 2.0)
Requires: kedro, lazy-loader
Required-by:
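The three pip show outputs above can also be collected in one step, which makes version mismatches between Kedro and its plugins easier to spot. This is only a minimal sketch using the standard library; the helper name installed_versions is hypothetical, and the package names are simply the ones shown in the pip output above.

```python
from importlib import metadata

# Package names taken from the pip output above.
PACKAGES = ("kedro", "kedro-datasets", "kedro-airflow-k8s")


def installed_versions(packages=PACKAGES):
    """Return {package: version or None} for each requested distribution."""
    versions = {}
    for name in packages:
        try:
            versions[name] = metadata.version(name)
        except metadata.PackageNotFoundError:
            versions[name] = None  # not installed in this environment
    return versions


if __name__ == "__main__":
    for name, version in installed_versions().items():
        print(f"{name}: {version or 'not installed'}")
```

Running it inside the same virtual environment prints one line per package, so the versions can be compared side by side.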
Could someone please assist me in resolving this issue? I would greatly appreciate any guidance you can provide.
marrrcin
10/24/2024, 7:29 AM
kedro-airflow
Also a good point of reference for more advanced stuff:
https://getindata.com/blog/deploying-kedro-pipelines-gcp-composer-airflow-node-grouping-mlflow/
Yolan Honoré-Rougé
10/24/2024, 8:18 AM
pickle.PickleDataset instead of pickle.PickleDataSet (notice the correct version has a lowercase "s" at the beginning of "set")
Merel
10/24/2024, 9:12 AM
0.16.6, is that correct? That predates kedro-datasets, so it won't be using the datasets from there. Any chance you can update to a more recent Kedro version?
Alex Shawn
10/24/2024, 3:50 PM
Alex Shawn
10/25/2024, 2:36 PM
Alex Shawn
10/25/2024, 2:37 PM
Alex Shawn
10/25/2024, 2:47 PM
Merel
10/28/2024, 9:17 AM
Dmitry Sorokin
10/29/2024, 4:56 PM
1. Build a Docker container for the project using the kedro-docker plugin: https://github.com/kedro-org/kedro-plugins/tree/main/kedro-docker
2. Generate an Airflow DAG using the kedro-airflow plugin: https://github.com/kedro-org/kedro-plugins/tree/main/kedro-airflow
3. Take the generated DAG and replace the KedroOperator with KubernetesPodOperator(). Make sure to link the operator to the Docker container from step 1, instead of using the packaged project used by the kedro-airflow plugin.
4. Upload the DAG and the container to Kubernetes.
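Step 3 above could look roughly like the sketch below. It only builds the keyword arguments for one pod task, so it runs without Airflow installed. The helper name pod_operator_kwargs, the image name, and the node name are hypothetical, and the --nodes flag is an assumption that depends on the Kedro version (older releases use --node).

```python
import re


def pod_operator_kwargs(node_name, image, namespace="default"):
    """Build KubernetesPodOperator keyword arguments for one Kedro node.

    Assumes the image from step 1 has the Kedro project installed and
    `kedro` on its PATH.
    """
    # Pod names must be lowercase alphanumerics and dashes.
    pod_name = re.sub(r"[^a-z0-9-]+", "-", node_name.lower()).strip("-")
    return {
        "task_id": node_name,
        "name": pod_name,
        "namespace": namespace,
        "image": image,
        # Run a single Kedro node inside the container.
        "cmds": ["kedro"],
        "arguments": ["run", f"--nodes={node_name}"],
    }


# In the generated DAG file, each KedroOperator(...) call would then become
# something like the following (the import path depends on the version of
# apache-airflow-providers-cncf-kubernetes that is installed):
#
#   from airflow.providers.cncf.kubernetes.operators.pod import KubernetesPodOperator
#   task = KubernetesPodOperator(
#       **pod_operator_kwargs("split_data", "my-registry/my-kedro-project:latest")
#   )
```

Keeping the arguments in a plain function means the node-to-pod mapping can be checked locally before the DAG is uploaded.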