Joseph McLeish
05/07/2025, 1:00 PM
When I run kedro run in the directory of the project (after having run uv pip install -r requirements.txt), I get the following error:
PySparkRuntimeError: [JAVA_GATEWAY_EXITED] Java gateway process exited before sending its port number.
It seems like I need to install Java for this to work, but there's no mention of Java anywhere in the docs, so this doesn't feel like the right option. I'm on Windows, running locally in VS Code, and didn't encounter any issues with the requirements installation. Is anyone able to help with this error? Thanks! 🙂
Laura Couto
05/07/2025, 1:29 PM
Note that PySpark requires Java 8 (except prior to 8u371), 11 or 17 with JAVA_HOME properly set. If using JDK 11, set -Dio.netty.tryReflectionSetAccessible=true for Arrow related features and refer to Downloading.
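A quick way to check whether Java is visible before running kedro run again. This is only a minimal sketch: it assumes Spark's launcher looks at JAVA_HOME first and otherwise falls back to a java executable on PATH, and find_java is a hypothetical helper name, not part of PySpark or Kedro.

```python
import os
import shutil
from pathlib import Path


def find_java(env=None):
    """Return the path to a java executable, or None if none is found.

    Hypothetical helper: checks JAVA_HOME/bin first, then PATH.
    """
    env = os.environ if env is None else env
    java_home = env.get("JAVA_HOME")
    if java_home:
        # On Windows the launcher is java.exe; elsewhere it is plain java.
        for name in ("java.exe", "java"):
            candidate = Path(java_home) / "bin" / name
            if candidate.is_file():
                return str(candidate)
    # Fall back to whatever java is on PATH (None if there is none).
    return shutil.which("java", path=env.get("PATH"))


if __name__ == "__main__":
    java = find_java()
    if java is None:
        print("No Java found: install a supported JDK and set JAVA_HOME.")
    else:
        print(f"Java found at: {java}")
```

If this prints that no Java was found, the JAVA_GATEWAY_EXITED error is expected. It only checks that an executable exists, not that its version is one of the supported ones.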
Hope this helps!
Joseph McLeish
05/07/2025, 1:31 PM
Nok Lam Chan
05/08/2025, 10:53 AM
Nok Lam Chan
05/08/2025, 10:55 AM
kedro new
This is the list of options. If you choose 6 or all, it will include pyspark in hooks.py, which sets up the Spark connection. If this is not necessary, you can skip the option.
Joseph McLeish
05/08/2025, 10:57 AM
I chose all, as I wanted to test out the functionality as fully as possible.
Joseph McLeish
05/08/2025, 10:59 AM
I didn't get the option 7) Kedro-Viz: Kedro's native visualisation tool, like in the tutorial. I'd installed kedro-viz, so I was surprised when I only had 1-6 as options during kedro project setup.