Ahmad Dorri
05/27/2023, 4:39 PMNick Karpov
05/27/2023, 6:22 PMAggregate pushdown into Delta scan for SELECT COUNT(*). Aggregation queries such as SELECT COUNT(*) on Delta tables are satisfied using file-level row counts in Delta table metadata rather than counting rows in the underlying data files. This significantly reduces the query time as the query just needs to read the table metadata and could make full table count queries faster by 10-100x.
Matthew Powers
05/27/2023, 10:50 PMAhmad Dorri
05/28/2023, 10:26 AMdataframe.count()
operation and noticed that it provided fast resultsJosephK (exDatabricks)
05/29/2023, 8:40 PM