I’d like to start reaching out to different communities and giving them a high-level overview of the next steps for Python deltalake (the new name for delta-rs) as it pertains to their roadmaps. I’m pretty tight with the Polars and Dask communities and can start there. Here’s the messaging I have in mind:
“We’ve built a connector that makes it easy to read Delta tables into Dask DataFrames. It lets users query data faster than they could with a regular Parquet data lake, because the transaction log holds file-level metadata that lets readers skip irrelevant files and avoid inefficient file listing operations. We’re currently integrating the Arrow Database Connectivity (ADBC) library into delta-rs, which will make it easier to build Dask writers and to support delete, merge, and update transactions. We’ll let you know when the ADBC work is done so we can start working on the Dask writers and DML support.”
I plan to send Ritchie from Polars a similar message. Does this sound like a good plan?
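
In case it helps when explaining the speedup claim, here’s a toy sketch of the file-skipping idea in plain Python. It does not use the deltalake or Dask APIs: `AddAction`, `files_to_read`, the `id` column, and the file names are all made up for illustration, and the real transaction log format is richer than this. The point is only that the log already lists every file along with min/max column stats, so a reader can pick the relevant files without listing the storage directory or opening each Parquet footer.

```python
from dataclasses import dataclass


@dataclass
class AddAction:
    """Simplified, hypothetical stand-in for one 'add' entry in a transaction log."""
    path: str
    min_id: int  # smallest value of the `id` column in this file
    max_id: int  # largest value of the `id` column in this file


def files_to_read(log, lo, hi):
    """Return paths of files whose [min_id, max_id] range overlaps lo <= id <= hi."""
    return [a.path for a in log if a.max_id >= lo and a.min_id <= hi]


# A tiny in-memory "log"; a real reader would parse this from the table's log files.
log = [
    AddAction("part-000.parquet", 0, 99),
    AddAction("part-001.parquet", 100, 199),
    AddAction("part-002.parquet", 200, 299),
]

# Filter: 150 <= id <= 210. The first file cannot contain matches, so it is skipped.
selected = files_to_read(log, 150, 210)
print(selected)
```

Only the files that survive this check would need to be handed to the DataFrame reader, which is where the savings over a plain Parquet directory come from.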