Itai Yaffe
07/27/2023, 3:01 PMdelta.randomizeFilePrefixes
and delta.randomPrefixLength
).
However, this is a specific solution that's limited to AWS S3 (and does not apply to ADLS, for example, or object storages that provide S3-like API).
I've chatted with @Gilad Asulin and @Scott Haines, and It'll be great to get the community's insights w.r.t adding support for spreading a Delta table across multiple buckets/storage accounts (pros/cons, implications, etc.).
One option, for example, is to have the _delta_log
reside in the "main" bucket of the table, and the log entries will point to the fully qualified paths of the data files, which could reside in other buckets (in theory, as @Robert mentioned in this thread, the log and data files could reside in separate buckets ... the protocol allows for fully qualified paths in add actions
).
WDYT?Dominique Brezinski
07/27/2023, 7:45 PMItai Yaffe
07/31/2023, 9:18 AMDominique Brezinski
07/31/2023, 2:07 PMItai Yaffe
07/31/2023, 2:12 PMDominique Brezinski
07/31/2023, 2:25 PMItai Yaffe
07/31/2023, 2:34 PMDominique Brezinski
07/31/2023, 2:39 PM