Eric Bell
02/12/2024, 6:53 AM
kedro.io.core.DatasetError: Failed while saving data to data set ParquetDataset(filepath=D:/XXXXX/bmad-data-analysis/data/02_intermediate/preprocessed_companies.pq, load_args={}, protocol=file, save_args={}).
I/O operation on closed file.
I uninstalled fastparquet and installed pyarrow, and it works now.
Nok Lam Chan
02/12/2024, 4:56 PM
Nok Lam Chan
02/12/2024, 4:59 PM
(pip, kedro, kedro-datasets version)
Nok Lam Chan
02/12/2024, 6:12 PM
pyarrow already.
I can confirm this is an issue: I manually deleted pyarrow, installed fastparquet, and arrived at the same error.
For now I suggest using pyarrow, since it is increasingly becoming the community standard, but at the same time we will fix the bug.
datajoely
02/12/2024, 6:20 PM
Eric Bell
02/13/2024, 3:39 AM
Nok Lam Chan
02/13/2024, 12:13 PM
fastparquet?
Nok Lam Chan
02/13/2024, 12:14 PM
Eric Bell
02/13/2024, 11:50 PM
>>> import pandas
<stdin>:1: DeprecationWarning:
Pyarrow will become a required dependency of pandas in the next major release of pandas (pandas 3.0),
(to allow more performant data types, such as the Arrow string type, and better interoperability with other libraries)
but was not found to be installed on your system.
If this would cause problems for you,
please provide us feedback at https://github.com/pandas-dev/pandas/issues/54466
Wow now I'm really going insane ... I'm certain that when I first saw this message, it said "pyarrow or fastparquet" ... now it only says "pyarrow"
datajoely
02/14/2024, 11:41 AM
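For anyone landing on this thread with the same error, here is a minimal sketch for checking which Parquet engine is actually installed before deciding what to uninstall or pin. It only uses the standard library; the helper names `installed_parquet_engines` and `choose_engine` are made up for illustration and are not part of kedro or pandas.

```python
import importlib.util

# Engine names that pandas accepts for Parquet I/O.
CANDIDATE_ENGINES = ("pyarrow", "fastparquet")


def installed_parquet_engines(candidates=CANDIDATE_ENGINES):
    """Return the candidate engines that are importable in this environment."""
    # find_spec only looks the package up; it does not import it.
    return [name for name in candidates if importlib.util.find_spec(name) is not None]


def choose_engine(preferred="pyarrow", candidates=CANDIDATE_ENGINES):
    """Return `preferred` if installed, else the first installed candidate, else None."""
    found = installed_parquet_engines(candidates)
    if preferred in found:
        return preferred
    return found[0] if found else None


if __name__ == "__main__":
    print("installed:", installed_parquet_engines())
    print("chosen:", choose_engine())
```

If the chosen engine is `pyarrow`, it can be passed explicitly as `engine="pyarrow"` to `DataFrame.to_parquet`. Setting the same key in the dataset's `save_args` should have the same effect, assuming those arguments are forwarded to pandas in your kedro-datasets version.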