Join Slack
Powered by
Has anyone tried to customize Spark Dataset to rea...
# questions
t
Trọng Đạt Bùi
06/12/2025, 6:41 AM
Has anyone tried to customize Spark Dataset to read multiple folders in HDFS?
a
Ankita Katiyar
06/12/2025, 3:11 PM
Maybe the
PartitionedDataset
is useful?
https://docs.kedro.org/en/stable/data/partitioned_and_incremental_datasets.html#partitioned-dataset-load
n
Nok Lam Chan
06/13/2025, 8:05 AM
https://sites.google.com/site/hellobenchen/home/wiki/big-data/spark/read-data-files-from-multiple-sub-folders
does spark natively support reading subfolders already?
2
Views
Open in Slack
Previous
Next