Previous blog/Context: In an earlier blog, we discussed Spark ETL with Lakehouse (with Delta Lake). See the post below for more details: https://developershome.blog/2023/03/19/spark-etl-chapter-7-with-lakehouse-delta-lake/ Introduction: In this blog, we will discuss Spark ETL with Apache HUDI. We will first understand what Apache HUDI is and why it is used for creating a lakehouse. We... Continue Reading →
Spark ETL Chapter 4 with Cloud data lakes (AWS S3 bucket)
Previous blog/Context: In an earlier blog, we discussed Spark ETL with Cloud data lakes (Azure blob and Azure Data Lake services). See the post below for more details: https://developershome.blog/2023/03/08/spark-etl-chapter-3-with-cloud-data-lakes-azure-blob-azure-adls/ Introduction: In this blog, we will discuss Spark ETL with Cloud data lakes, this time using an AWS S3 bucket. We... Continue Reading →