Welcome to the world of Lakehouse

The rise of big data has led to the emergence of new data architectures that can handle the volume, variety, and velocity of data generated by modern organizations. Two of the most popular data architectures are traditional data warehouses and lakehouse architectures. In this post, we will compare... Continue Reading →

Spark ETL Chapter 10 with Lakehouse (Delta Lake vs Apache Iceberg vs Apache HUDI)

Previous blog/Context: In an earlier blog, we discussed Spark ETL with Lakehouse (with Apache Iceberg). See the blog post below for more details. https://developershome.blog/2023/03/21/spark-etl-chapter-9-with-lakehouse-apache-iceberg/ Introduction: In this blog, we will discuss the following points: Spark ETL with popular lakehouse formats (Delta Lake, Apache Iceberg, and Apache HUDI). Offerings from all these lakehouse data formats.... Continue Reading →

Spark ETL Chapter 9 with Lakehouse | Apache Iceberg

Previous blog/Context: In an earlier blog, we discussed Spark ETL with Lakehouse (with HUDI). See the blog post below for more details. https://developershome.blog/2023/03/22/spark-etl-chapter-8-with-lakehouse-apache-hudi/ Introduction: In this blog, we will discuss Spark ETL with Apache Iceberg. We will first understand what Apache Iceberg is and why to use it for creating a lakehouse. We will source... Continue Reading →
