I have a common question but couldn't find a simple solution online: how do I remove duplicate rows from a Delta table? I'm using Azure Databricks and reading an external Delta table stored in an Azure storage account.
JosephK (exDatabricks)
05/23/2023, 5:30 PM
Most languages have a distinct method
liab
05/23/2023, 5:33 PM
Okay, are you saying I should create a view defined as SELECT DISTINCT?
JosephK (exDatabricks)
05/23/2023, 5:33 PM
Views don't delete anything
Just do a read, distinct, write operation to remove the duplicate rows.
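For reference, the read, distinct, write approach might look roughly like this in PySpark. This is only a sketch under a few assumptions: the helper name `dedupe_delta_table` is made up for illustration, `spark` is the session a Databricks notebook already provides, and overwriting the same path that was read relies on Delta's transactional writes, so try it on a copy of the table first.

```python
def dedupe_delta_table(spark, path):
    """Hypothetical helper: rewrite the Delta table at `path` without exact duplicate rows."""
    # Read the current contents of the Delta table.
    df = spark.read.format("delta").load(path)

    # Keep one copy of each fully identical row.
    deduped = df.distinct()

    # Overwrite the table in place with the de-duplicated rows.
    deduped.write.format("delta").mode("overwrite").save(path)
```

In a notebook this would be called as `dedupe_delta_table(spark, path)`, where `path` is the table's location in the storage account. If rows should count as duplicates based on only some columns, `df.dropDuplicates([...])` with those column names could be swapped in for `distinct()`.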