Z Ordering is cool, but I don’t think it’s always necessary. Often, sorting the data is all you need.
Sorting works well for tables that are always filtered by a certain column, for example. Hierarchical sorting (sort by the first column, then break ties with the next) is also a good fit in many cases. I assume sorting would be easier to implement than Z Ordering, and perhaps easier to explain and use as well.
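To illustrate why sorting helps a table that is always filtered by one column, here is a rough sketch in plain Python. It does not use any Delta Lake API: the "files" are just lists of rows, and `split_into_files` and `files_to_scan` are hypothetical helpers that mimic skipping files based on per-file min/max values.

```python
def split_into_files(rows, rows_per_file):
    """Chunk rows into fixed-size 'files' (a stand-in for data files)."""
    return [rows[i:i + rows_per_file] for i in range(0, len(rows), rows_per_file)]


def files_to_scan(files, column, value):
    """Count files whose [min, max] range for `column` could contain `value`."""
    count = 0
    for file_rows in files:
        values = [row[column] for row in file_rows]
        if min(values) <= value <= max(values):
            count += 1
    return count


# Deterministic toy data: col1 cycles through 0..9, so every unsorted
# file ends up containing the full range of col1 values.
rows = [{"col1": i % 10, "col2": i} for i in range(100)]

unsorted_files = split_into_files(rows, 10)
sorted_files = split_into_files(sorted(rows, key=lambda r: r["col1"]), 10)

unsorted_scan = files_to_scan(unsorted_files, "col1", 3)
sorted_scan = files_to_scan(sorted_files, "col1", 3)
print(unsorted_scan, sorted_scan)
```

In this toy setup, a filter on `col1 == 3` has to look at every unsorted file but only one sorted file. Real tables will be less clean, but the effect is the same idea.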
What do folks think about a
delta_table.optimize.sort(["col1", "col2"])
method that hierarchically sorts the data? We can also add a
delta_table.optimize.z_order
method of course.
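For anyone comparing the two orderings, here is a minimal sketch of what each one does to row order, in plain Python. This is not the proposed implementation and not an existing API: `hierarchical_key` and `z_order_key` are hypothetical helpers, and the Z-order key assumes small non-negative integer columns.

```python
def hierarchical_key(row, columns):
    """Sort key for a hierarchical sort: compare col1 first, then col2, ..."""
    return tuple(row[c] for c in columns)


def z_order_key(row, columns, bits=8):
    """Morton (Z-order) key: interleave the bits of each column's value.

    Assumes every value is a non-negative integer below 2**bits.
    """
    key = 0
    for bit in range(bits - 1, -1, -1):  # most significant bit first
        for c in columns:
            key = (key << 1) | ((row[c] >> bit) & 1)
    return key


# A 4x4 grid of (col1, col2) values.
rows = [{"col1": x, "col2": y} for x in range(4) for y in range(4)]
columns = ["col1", "col2"]

by_sort = sorted(rows, key=lambda r: hierarchical_key(r, columns))
by_z = sorted(rows, key=lambda r: z_order_key(r, columns, bits=2))

first_sorted = [(r["col1"], r["col2"]) for r in by_sort[:4]]
first_z = [(r["col1"], r["col2"]) for r in by_z[:4]]
print(first_sorted)
print(first_z)
```

The hierarchical sort keeps all rows with the same `col1` together, which is ideal when queries filter on `col1`. The Z-order key groups rows that are close in both columns, which trades some locality on `col1` for locality on `col2`. The sort version is also just a tuple comparison, which supports the point that it is simpler to implement and explain.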