Thomas d'Hooghe
05/21/2024, 4:25 PMabfs://
protocol. I am providing it as followed:
raw_dataset:
type: pandas.CSVDataset
filepath: "<abfs://container/file.csv>"
credentials: azure_credentials
with azure credentials being:
azure_credentials:
account_name: "name_datastore" (gotten from Data-container page in Azure AI Machine Learning Studio)
account_key: "eyJ...." (gotten from `az account get-access-token` in a Compute instance in Azure AI Machine learning studioe)
I am getting the following error:
File "/anaconda/envs/py311/lib/python3.11/site-packages/azure/storage/blob/_shared/authentication.py", line 152, in on_request
self._add_authorization_header(request, string_to_sign)
File "/anaconda/envs/py311/lib/python3.11/site-packages/azure/storage/blob/_shared/authentication.py", line 135, in _add_authorization_header
raise _wrap_exception(ex, AzureSigningError) from ex
azure.storage.blob._shared.authentication.AzureSigningError: Invalid base64-encoded string: number of data characters (2049) cannot be 1 more than a multiple of 4
Am I doing the credentials wrong?Nok Lam Chan
05/21/2024, 4:39 PMNok Lam Chan
05/21/2024, 4:41 PMimport pandas as pd
storage_options={'account_name': ACCOUNT_NAME, 'account_key': ACCOUNT_KEY}
ddf = pd.read_csv('abfs://{CONTAINER}/{FOLDER}/*.csv', storage_options=storage_options)
I took this example and modify from adlfs README, can you try that and see if this works? This would help to diagnose whether the issue is coming from fsspec or kedroThomas d'Hooghe
05/22/2024, 7:32 AMNok Lam Chan
05/22/2024, 9:27 AMazure_credentials:
account_name: "name_datastore" (gotten from Data-container page in Azure AI Machine Learning Studio)
account_key: "eyJ...." (gotten from `az account get-access-token` in a Compute instance in Azure AI Machine learning studioe)
You show this snippets above so I thought you would have these account name and key already?Thomas d'Hooghe
05/22/2024, 11:08 AMNok Lam Chan
05/22/2024, 11:29 AM