liab

05/23/2023, 5:27 PM
I have a common question but couldn't find a simple solution online. How do I remove duplicate rows from a Delta table? I'm using Azure Databricks and reading an external Delta table stored in an Azure storage account.

JosephK (exDatabricks)

05/23/2023, 5:30 PM
Most of the Spark language APIs have a distinct method (for example, `distinct()` on a DataFrame or `SELECT DISTINCT` in SQL).

liab

05/23/2023, 5:33 PM
Okay, are you saying I should create a view defined as SELECT DISTINCT?

JosephK (exDatabricks)

05/23/2023, 5:33 PM
Views don't delete anything.
Just do a read, distinct, write operation to remove the duplicate rows.
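Roughly, the read, distinct, write idea could look like the sketch below in PySpark. This is only a sketch under a few assumptions: the helper name `dedupe_delta_table` and the path placeholder are made up for illustration, `spark` is the session a Databricks notebook already provides, and "duplicate" means rows that are identical in every column.

```python
def dedupe_delta_table(spark, table_path):
    """Rewrite the Delta table at table_path without fully duplicated rows.

    Hypothetical helper for illustration; `spark` is an existing SparkSession
    and `table_path` is the storage location of the external Delta table.
    """
    # Read: load the current snapshot of the Delta table.
    df = spark.read.format("delta").load(table_path)

    # Distinct: keep one copy of each row that is identical across all columns.
    deduped = df.distinct()

    # Write: overwrite the table contents with the de-duplicated rows.
    deduped.write.format("delta").mode("overwrite").save(table_path)
    return deduped


# Example call in a notebook (placeholder path, replace with your own):
# dedupe_delta_table(spark, "abfss://<container>@<account>.dfs.core.windows.net/<path>")
```

If only some columns define a duplicate, `dropDuplicates([...])` with those column names would be the thing to swap in for `distinct()`. It's probably worth trying this on a copy of the table first, since the overwrite replaces the table's current contents.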