guru moorthy
07/03/2023, 10:12 PM
chris fish
07/03/2023, 10:48 PM
Use sort or orderBy to ensure your data has a specific ordered layout. But this won't have the effect bucketing has on downstream operations that read that data (faster joins/aggregations). Delta supports min/max-based file skipping, which can speed up filters on sorted or co-located data. Z-ordering uses clustering algorithms to produce files that benefit more from this file skipping.
Sherlock Beard
07/04/2023, 3:13 AM
Jacek
07/04/2023, 8:55 AM
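
A note on the min/max file skipping mentioned in chris fish's reply: the sketch below is a toy model in plain Python, not Delta's actual implementation. It assumes each "file" is just a list of values with recorded min/max statistics; the helper names write_files and files_scanned are hypothetical and made up for illustration. It shows why a sorted layout lets a filter skip most files, while a shuffled layout usually cannot.

```python
import random

def write_files(values, rows_per_file):
    """Split values into fixed-size 'files' and record min/max stats for each.

    Hypothetical helper: a stand-in for per-file statistics, not a Delta API.
    """
    files = []
    for start in range(0, len(values), rows_per_file):
        rows = values[start:start + rows_per_file]
        files.append({"min": min(rows), "max": max(rows), "rows": rows})
    return files

def files_scanned(files, target):
    """Count files whose [min, max] range could contain target.

    Files outside that range can be skipped without reading their rows.
    """
    return sum(1 for f in files if f["min"] <= target <= f["max"])

random.seed(0)
shuffled = list(range(1000))
random.shuffle(shuffled)

# Same data, two layouts: written as-is vs. sorted before writing.
unsorted_files = write_files(shuffled, rows_per_file=100)
sorted_files = write_files(sorted(shuffled), rows_per_file=100)

# With the sorted layout, only the file covering 400..499 can hold 420.
print("files scanned, unsorted layout:", files_scanned(unsorted_files, 420))
print("files scanned, sorted layout:  ", files_scanned(sorted_files, 420))
```

In the sorted layout each file covers a narrow, non-overlapping range, so a point filter touches one file. In the shuffled layout nearly every file's min/max range spans the whole domain, so the statistics rule out little or nothing.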