Juan Luis
06/27/2025, 1:59 PM
Juan Luis
06/27/2025, 1:59 PM
The primary existing Python specification used for object storage is fsspec, which defines a filesystem-like interface based around Python file-like objects.
However this presents an impedance mismatch: object storage is not a filesystem and does not have the same semantics as filesystems. This leads to surprising behavior, poor performance, and integration complexity.
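To make the mismatch concrete, here is a minimal sketch. It is not fsspec code or any real library's API: `ToyObjectStore` and `NaiveObjectFile` are hypothetical names, and the store just counts simulated range requests. It assumes a file-like wrapper with no buffering or caching, which real libraries usually add.

```python
class ToyObjectStore:
    """Hypothetical in-memory object store that counts simulated GET requests."""

    def __init__(self):
        self._objects = {}
        self.get_requests = 0

    def put(self, key, data):
        self._objects[key] = bytes(data)

    def get_range(self, key, start, end):
        # One call stands in for one HTTP range request.
        self.get_requests += 1
        return self._objects[key][start:end]


class NaiveObjectFile:
    """Hypothetical file-like view of one object, with no buffering."""

    def __init__(self, store, key):
        self._store = store
        self._key = key
        self._pos = 0

    def seek(self, offset):
        # Seeking is free locally; the cost shows up on the next read.
        self._pos = offset

    def read(self, n):
        # Every read turns into its own range request.
        chunk = self._store.get_range(self._key, self._pos, self._pos + n)
        self._pos += len(chunk)
        return chunk


store = ToyObjectStore()
store.put("data.bin", bytes(range(256)) * 4)  # 1024 bytes

# Filesystem-style access: three small reads, three requests.
f = NaiveObjectFile(store, "data.bin")
header = f.read(4)
f.seek(512)
middle = f.read(4)
f.seek(1020)
footer = f.read(4)
file_like_requests = store.get_requests

# Object-style access: fetch the whole object in one request, slice locally.
store.get_requests = 0
blob = store.get_range("data.bin", 0, 1024)
direct_requests = store.get_requests
```

In this toy setup the seek-and-read pattern costs one request per read, while the object-style call gets the same bytes in a single request. How much that matters in practice depends on object size, latency, and whatever caching the real library does.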
Juan Luis
06/27/2025, 2:00 PM
Nok Lam Chan
06/27/2025, 2:37 PM
fsspec has always been a nice abstraction that gives a somewhat consistent interface, but with sub-par performance. For example, I used botocore directly a few years ago and could easily get a few-x speedup for S3.
Nok Lam Chan
06/27/2025, 2:38 PM
fsspec, while polars has its own thing in Rust and Spark has its own thing as well.
Juan Luis
06/27/2025, 3:41 PM