Eero Lihavainen
07/19/2023, 1:38 PMdeltalake
library in AWS Lambda & S3. It works nicely, but I noticed that query latency varies quite a lot depending on how many files there are in the commit log. If I read the delta-rs
code correctly, the update_incremental
function reads and processes all files in sequence, is that correct? And if so, would it be possible to make concurrent requests to the storage instead and buffer the results? https://github.com/delta-io/delta-rs/blob/a7389d581aceeec7187ccdfd1cbb80e7ac0747f9/rust/src/delta.rs#L472Will Jones
07/19/2023, 2:42 PMEero Lihavainen
07/19/2023, 2:44 PMRobert
07/19/2023, 3:43 PMEero Lihavainen
07/19/2023, 3:47 PMRobert
07/19/2023, 3:55 PMEero Lihavainen
07/20/2023, 12:12 PMrtyler
07/20/2023, 1:20 PMEero Lihavainen
07/20/2023, 1:39 PMrtyler
07/20/2023, 1:43 PMEero Lihavainen
07/20/2023, 1:45 PM