Roy Green
01/22/2023, 11:47 AMLennart Skogmo
01/22/2023, 4:25 PMRoy Green
01/22/2023, 6:32 PMJosephK (exDatabricks)
01/22/2023, 6:36 PMRoy Green
01/22/2023, 6:39 PMJosephK (exDatabricks)
01/22/2023, 6:41 PMRoy Green
01/22/2023, 6:41 PMOmkar
01/23/2023, 12:12 PMRoy Green
01/23/2023, 12:14 PMOmkar
01/23/2023, 12:34 PMMin Idle
and Max Capacity
in your serverless pool config as required. Although please be careful to not set Min Idle
too high as it may cost you more and the workers may stay idle during off-peak hours.
More info here on best practices: https://docs.databricks.com/clusters/cluster-config-best-practices.html#autoscaling
This talk might also be useful: https://www.databricks.com/dataaisummit/session/scaling-your-workloads-databricks-serverless
Hoping this will be helpful to you! 😁Roy Green
01/23/2023, 12:50 PMOmkar
01/23/2023, 1:54 PMEXPLAIN
for your query? It may tell you which parts of the query are taking time.
Explain syntax here: https://docs.databricks.com/sql/language-manual/sql-ref-syntax-qry-explain.htmlRoy Green
01/23/2023, 1:54 PMJosephK (exDatabricks)
01/23/2023, 1:55 PMRoy Green
01/23/2023, 1:55 PMJosephK (exDatabricks)
01/23/2023, 1:58 PMRoy Green
01/23/2023, 1:58 PMJosephK (exDatabricks)
01/23/2023, 1:59 PMRoy Green
01/23/2023, 1:59 PMJosephK (exDatabricks)
01/23/2023, 2:00 PMRoy Green
01/23/2023, 2:00 PMJosephK (exDatabricks)
01/23/2023, 2:02 PMRoy Green
01/23/2023, 2:02 PMOmkar
01/23/2023, 2:09 PMRoy Green
01/23/2023, 2:10 PMOmkar
01/23/2023, 2:11 PMRoy Green
01/23/2023, 2:11 PMOmkar
01/23/2023, 2:15 PMbook_name
then you schema will be something like:
table BookCounter
book_name string (your string column)
book_count integer (it's overall count)
Periodically update this Counter table from your source table.Roy Green
01/23/2023, 2:15 PM