Previous blog/Context: In an earlier blog, we discussed Spark ETL with HIVE. Please find below blog post for more details https://developershome.blog/2023/03/15/spark-etl-chapter-5-with-hive-tables/ Introduction: In this blog, we will do Spark ETL with APIs. We will source data from API and load data into one of the below destinations. We will learn how to call APIs from... Continue Reading →
Spark ETL Chapter 5 with Hive tables
Previous blog/Context: In an earlier blog, we discussed Spark ETL with Cloud data lakes (AWS S3 bucket). Please find below blog post for more details. https://developershome.blog/2023/03/12/spark-etl-chapter-4-with-cloud-data-lakes-aws-s3-bucket/ Introduction: In this blog, we will discuss HIVE tables/views and we will do ETL with Hive tables. We will learn about how to create global and temporary hive tables... Continue Reading →
Spark ETL Chapter 4 with Cloud data lakes (AWS S3 bucket)
Previous blog/Context: In an earlier blog, we discussed Spark ETL with Cloud data lakes (Azure blob and Azure Data Lake services). Please find below blog post for more details. https://developershome.blog/2023/03/08/spark-etl-chapter-3-with-cloud-data-lakes-azure-blob-azure-adls/ Introduction: In this blog, we will discuss Spark ETL with Cloud data lakes and we will be doing Spark ETL with AWS S3 bucket. We... Continue Reading →