Previous blog/Context: In an earlier blog, we discussed Spark ETL with SQL Databases (MySQL and PostgreSQL Database). Please find below blog post for more details. https://developershome.blog/2023/03/06/spark-etl-with-sql-databases-mysql-postgresql/ Introduction: In this blog, we will discuss Spark ETL with NoSQL database, and we are considering MongoDB and we will do all the Spark ETL with MongoDB database. All... Continue Reading →
Spark ETL Chapter 1 with SQL Databases (MySQL | PostgreSQL)
Previous blog/Context: In an earlier blog, we discussed Spark ETL with files (CSV, JSON, Text, Paraquet and ORC). Please find below blog post for more details. https://developershome.blog/2023/03/02/spark-etl-chapter-0-with-files-csv-json-parquet-orc/ Introduction: In this blog post, we will discuss Spark ETL with SQL Database. We will be considering MySQL and PostgreSQL for Spark ETL. All other SQL Databases like... Continue Reading →
Spark ETL Chapter 0 with files (CSV | JSON | Parquet | ORC)
Previous blog/Context: Please see the previous blog, where we have designed a plan for Spark ETL pipelines. In the coming days we will be doing spark ETL using all of the data sources mentioned. Please find the blog for more details. https://developershome.blog/2023/03/01/spark-etl-elt-data-connections/ Introduction: In this blog, we will be discussing Spark ETL with files. We... Continue Reading →