Dinesh N

05/17/2023, 5:05 AM
#deltalake-questions I am pretty new to delta tables, tried Googling on differences with vacuum and optimise Could find any good info, can any one share me good documentation that explains technically please

Jordan Cuevas

05/19/2023, 1:05 PM
Optimize will either combine many small parquet files into a single file to make future reads more efficient (if you specify compaction), or order the data within various parquet files by some column(s) so that future reads can more easily know which file(s) it needs to load. Vacuum simply deletes old parquet files; old as defined by your specified retention period. Any file that is no longer being used and hasn't been used by an version in the last X hours/days would be deleted.