Hello, I am using batch processing to load previous day data to write to Delta table which is partitioned by Folder Date and using Overwrite mode. Now my requirement is to run this Job every hour, what would be the best way to do it ?
Note: I should have re-run option for a particular date
02/10/2023, 6:30 PM
You can try Airflow to schedule your job
02/11/2023, 10:19 PM
Scheduling is not a problem I can do the same thing with Databricks workflow jobs but it’s about writing to the delta table with overwrite mode