Previous blog/Context: In an earlier blog post, we discussed Spark ETL with cloud data lakes (AWS S3 bucket). See that post for more details: https://developershome.blog/2023/03/12/spark-etl-chapter-4-with-cloud-data-lakes-aws-s3-bucket/ Introduction: In this blog post, we will discuss Hive tables and views and perform ETL with Hive tables. We will learn how to create global and temporary Hive tables... Continue Reading →
Spark ETL Chapter 4 with Cloud data lakes (AWS S3 bucket)
Previous blog/Context: In an earlier blog post, we discussed Spark ETL with cloud data lakes (Azure Blob Storage and Azure Data Lake Storage). See that post for more details: https://developershome.blog/2023/03/08/spark-etl-chapter-3-with-cloud-data-lakes-azure-blob-azure-adls/ Introduction: In this blog post, we will continue with Spark ETL on cloud data lakes, this time using an AWS S3 bucket. We... Continue Reading →
Spark ETL Chapter 3 with Cloud data lakes (Azure Blob | Azure ADLS)
Previous blog/Context: In an earlier blog post, we discussed Spark ETL with NoSQL databases (MongoDB). See that post for more details: https://developershome.blog/2023/03/07/spark-etl-chapter-2-with-nosql-database-mongodb-cassandra/ Introduction: In this blog post, we will discuss Spark ETL with cloud data lakes, using Azure Blob Storage. We will use public blob storage and... Continue Reading →