Previous blog/Context: In an earlier blog, we discussed Spark ETL with Lakehouse (with Delta Lake). Please find below blog post for more details. https://developershome.blog/2023/03/19/spark-etl-chapter-7-with-lakehouse-delta-lake/embed/#?secret=Z8M19UjerD#?secret=yljQcLJrZC Introduction: In this blog, we will discuss Spark ETL with Apache HUDI. We will first understand what Apache HUDI is and why Apache HUDI is used for creating Lake house. We... Continue Reading →
Spark ETL Chapter 2 with NoSQL Database (MongoDB | Cassandra)
Previous blog/Context: In an earlier blog, we discussed Spark ETL with SQL Databases (MySQL and PostgreSQL Database). Please find below blog post for more details. https://developershome.blog/2023/03/06/spark-etl-with-sql-databases-mysql-postgresql/ Introduction: In this blog, we will discuss Spark ETL with NoSQL database, and we are considering MongoDB and we will do all the Spark ETL with MongoDB database. All... Continue Reading →