How to Dynamically Read Multiple Sheets from an Excel File Kedro #questions

How to Dynamically Read Multiple Sheets from an Ex...

Gabriel Aguiar

04/30/2024, 12:19 PM

How to Dynamically Read Multiple Sheets from an Excel File in Kedro? Hi everyone! I'm currently working with Kedro and I need to load multiple sheets from an Excel file. The challenge is that I do not know the sheet names in advance, so I need to dynamically identify and load all available sheets. Could someone please guide me on how to achieve this in Kedro? What would be the best practice for integrating this with the Kedro data catalog? Thanks in advance for your help! Using right now: "constraints_{name_us}": type: pandas.ExcelDataset filepath: data/10_optimization_inputs/constraints_{name_us}.xlsx load_args: engine: openpyxl decimal: "." sheet_name: None (Sheet_name = None don't work)

datajoely

04/30/2024, 12:19 PM

what did you get? an error or a single dataframe?

Gabriel Aguiar

04/30/2024, 12:20 PM

A error, the sheet with name None its not in my excel file

datajoely

04/30/2024, 12:34 PM

so I’ve just tested with a project I happened to have open and it works as expected

datajoely

04/30/2024, 12:35 PM

does your YAML have the right whitespace?

Copy code

"constraints_{name_us}":
    type: pandas.ExcelDataset
    filepath: data/10_optimization_inputs/constraints_{name_us}.xlsx
    load_args:
       engine: openpyxl
       decimal: "."
       sheet_name: None

👍 1

Gabriel Aguiar

04/30/2024, 12:45 PM

Worksheet named None not found I am current using kedro 0.19.3

datajoely

04/30/2024, 12:50 PM

So is there any way you are passing

"None"

not

None

? so is there any reason why the

Gabriel Aguiar

04/30/2024, 12:51 PM

I tried the both 😕

datajoely

04/30/2024, 12:51 PM

Can you try

null

🥳 1

Gabriel Aguiar

04/30/2024, 12:51 PM

Yes, i will try 🙂

Gabriel Aguiar

04/30/2024, 1:10 PM

Work with null 😄

Gabriel Aguiar

04/30/2024, 1:10 PM

Thank you @datajoely ❤️

datajoely

04/30/2024, 1:51 PM

🙏

datajoely

04/30/2024, 1:51 PM

nasty issue that!

Nok Lam Chan

04/30/2024, 2:52 PM

Or just leave the option? I think the default should be None anyway?

datajoely

04/30/2024, 2:53 PM

yeah I didn’t mention that technically this is valid:

Copy code

"constraints_{name_us}":
    type: pandas.ExcelDataset
    filepath: data/10_optimization_inputs/constraints_{name_us}.xlsx
    load_args:
       engine: openpyxl
       sheet_name: 
       decimal: "."

but it does feel like it breaks the

explicit is better than implicit

python mantra

Nok Lam Chan

04/30/2024, 2:54 PM

https://pandas.pydata.org/docs/reference/api/pandas.read_excel.html Alright I am wrong the default is 0 which is the first sheet

datajoely

04/30/2024, 2:54 PM

oh you’re talking about the pandas arg

datajoely

04/30/2024, 2:54 PM

yeah technically an empty key is null in yaml

👍🏼 1

👍 1

10 Views

Open in Slack

Previous Next