Previous blog/Context: In an earlier blog, we discussed Spark ETL with APIs. See that post for more details: https://developershome.blog/2023/03/18/spark-etl-chapter-6-with-apis/ Introduction: In this blog, we will discuss Spark ETL with a lakehouse. We will first understand what a lakehouse is, why we need one, and what are the formats for storing... Continue Reading →
Spark ETL Chapter 6 with APIs
Previous blog/Context: In an earlier blog, we discussed Spark ETL with Hive. See that post for more details: https://developershome.blog/2023/03/15/spark-etl-chapter-5-with-hive-tables/ Introduction: In this blog, we will do Spark ETL with APIs. We will source data from an API and load it into one of the destinations below. We will learn how to call APIs from... Continue Reading →
Spark ETL Chapter 5 with Hive tables
Previous blog/Context: In an earlier blog, we discussed Spark ETL with cloud data lakes (AWS S3 bucket). See that post for more details: https://developershome.blog/2023/03/12/spark-etl-chapter-4-with-cloud-data-lakes-aws-s3-bucket/ Introduction: In this blog, we will discuss Hive tables/views and do ETL with Hive tables. We will learn how to create global and temporary Hive tables... Continue Reading →