https://delta.io logo
n

Naveen Kumar Vadlamudi

06/18/2023, 4:19 PM
Hello Team, I am here with another question, Soo Deltalake is created to perform, incremental updates over cloud object stores that ensures ACID properties. As per the literature it mentions that, Delta lake took reference from hudi. In 2021 lakehouse architecture got published, but my question here is 1) Lakehouse combines the best worlds of both Datalake and Warehouse then whats the need of integrating with Snowflake (A Traditional Datawarehouse) When Photon Query engine, can do what the Snowflake does. Therefore plugging BI tools to visualize data directly from lakehouse is a viable option ? (As per the literature) This raises my above question again ! Is this for data governance , data auditing, data lineage purposes ? if so we can plugin data governance tools like Openmetadata to visualize and track everything directly from lakehouse ! Or else is this related to lack of support offered to lakehouse as of now ? Let me know your thoughts on this 🙂
j

Jordan Fox

06/18/2023, 7:18 PM
Seem all over the place with the questions. Photon is a proprietary query engine from Databricks. It's essentially a rewrite of spark in c utilizing vectors. Snowflake is a separate company that primarily has their own proprietary backend storage, but also support external datalake storage (external table). Snowflake primarily supports Iceberg. They would want to also support deltalake so more people can continue using their platform.
j

JosephK (exDatabricks)

06/18/2023, 7:51 PM
inspired from apache hudi is actually the opposite of the truth and that hudi is more inspired by delta. If you look at release times and roadmaps this would be born out.
n

Naveen Kumar Vadlamudi

06/18/2023, 7:53 PM
Sorry for the misconception, but the first release of incremental based updates on Hadoop was recorded in 2017 by hudi founder so I thought Delta lake did inspired from hudi.
Also in citation of Delta lake the authors mentioned apache hudi and apache iceberg as reference
j

JosephK (exDatabricks)

06/18/2023, 7:55 PM
reference and inspire are different words
👍 1
n

Naveen Kumar Vadlamudi

06/18/2023, 8:00 PM
I rephrased my statement
After researching a bit, I understood that Deltalake proposed the ACID properties over cloud object stores, and later released into the community. From there, other open table formats took it as reference for their projects and implemented it. So in the context of enforcement of ACID properties Deltalake has laid the foundation. But the only point i need to confirm is did delta design choices were in anyway inspired from hudi, iceberg projects.