https://delta.io logo
o

orsagiv

02/01/2023, 4:13 PM
Hey when running a daily Optimize job on a Delta table without adding a where clause (meaning on the entire table) i would think it will optimize only the non optimized files, but it seems like it go over all the partitions again even if they weren’t touched at all from the last run is that correct?
m

Martin

02/01/2023, 4:45 PM
Hi I had the same expectation, but I fear you are right. I made a feature request last year: https://github.com/delta-io/delta/issues/1260
g

Gerhard Brueckl

02/01/2023, 5:58 PM
On Databricks it will only optimize files/partitions that are not optimized yet. not sure about the OSS implementation
3 Views