https://delta.io logo
a

Abolfazl karimian

07/02/2023, 8:57 AM
Hey everyone. I'm ambiguous about 2 configuration in delta lake usage. what is the difference between delta.deletedFileRetentionDuration and Retain. for example when we want to apply a retention job on your lake, we have 2 way i guess(I'm not sure they are the same!): First:
delta_table = DeltaTable.forPath(session , path)
delta_table.logRetentionDuration = "interval 10 days"
delta_table.deletes( col("date") < "specified_day" )
session.sql("vacuum delta.path RETAIN 48 HOURS")
session.sql("vacuum delta.path RETAIN 48 HOURS")
Second:
delta_table = DeltaTable.forPath(session , path)
delta_table.logRetentionDuration = "interval 10 days"
delta.deletedFileRetentionDuration = "interval 2 days"
delta_table.deletes( col("date") < "specified_day" )
session.sql("vacuum delta.path")
session.sql("vacuum delta.path")
Are these 2 ways different?!