Hello everyone. I am new to kedro and I want to ex...
# questions
a
Hello everyone. I am new to Kedro and I want to execute a pipeline (which works well when run sequentially) in parallel using --runner=ParallelRunner. I am facing the following issue: every time I try it, the first node executes correctly and saves its output as a SharedMemoryDataset, but right after that I see that the other nodes "have not run", and a RecursionError is raised ("maximum recursion depth exceeded") from line 284 in set_project_logging in kedro\framework\project\__init__.py: if package_name not in self.data['loggers']. Does anyone know where this comes from?
r
Hi, can you share which version of Kedro you are on? Also, are you creating any custom datasets?
a
Thanks for your answer. I am using version 0.19.12. Yes, I am: I am loading Excel files from S3 and it didn't work, so following the advice given in an issue on the Kedro GitHub I created a NoFsspecProblemExcelDataset(ExcelDataset).
Do you want me to show it to you?
r
We have an existing ticket for this issue raised by another user: https://github.com/kedro-org/kedro/issues/4753. Let me know if it aligns with the problem you're facing. We're planning to address it in the upcoming 1.0 release. In the meantime, you could try using Kedro 0.19.10 as a workaround.
a
Ok thank you very much. I will try this
I tried it and I still got the recursion error. What I did was implement the _describe() method (which I had not changed previously) and try pretty-printing or using repr() instead. However, as you said, it worked when trying other versions (0.19.10 didn't work but 0.19.9 did). Maybe I did something wrong?
r
Thanks. Yes, apologies, I think 0.19.10 was the version in which we introduced this change, so it will work in the versions prior to it.
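For anyone landing here later, a minimal sketch of how one might check whether the installed Kedro predates 0.19.10, the release named above as the one that introduced the change. The helper names (parse_release, predates_change, installed_kedro_predates_change) are hypothetical and not part of Kedro, and the parser assumes plain numeric releases such as 0.19.9 (no rc or dev suffixes).

```python
from importlib.metadata import PackageNotFoundError, version
from typing import Optional, Tuple

# Per the thread: 0.19.10 is believed to be the first release with the change.
FIRST_AFFECTED = (0, 19, 10)


def parse_release(text: str) -> Tuple[int, ...]:
    """Turn '0.19.9' into (0, 19, 9). Assumes purely numeric components."""
    return tuple(int(part) for part in text.split("."))


def predates_change(kedro_version: str) -> bool:
    """True if the given version is older than FIRST_AFFECTED."""
    # Compare as integer tuples: as plain strings, '0.19.9' would sort
    # after '0.19.10', which is the wrong order.
    return parse_release(kedro_version) < FIRST_AFFECTED


def installed_kedro_predates_change() -> Optional[bool]:
    """Check the installed kedro package; None if kedro is not installed."""
    try:
        return predates_change(version("kedro"))
    except PackageNotFoundError:
        return None
```

This only reflects what was reported in this thread (0.19.9 worked, 0.19.10 and 0.19.12 did not); it does not detect the underlying problem itself.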