Spark ETL Chapter 1 with SQL Databases (MySQL | PostgreSQL)

Previous blog/Context: In an earlier blog, we discussed Spark ETL with files (CSV, JSON, Text, Paraquet and ORC). Please find below blog post for more details. https://developershome.blog/2023/03/02/spark-etl-chapter-0-with-files-csv-json-parquet-orc/ Introduction: In this blog post, we will discuss Spark ETL with SQL Database. We will be considering MySQL and PostgreSQL for Spark ETL. All other SQL Databases like... Continue Reading →

Spark ETL | ELT | Data Connections

Introduction: In this blog we will discuss Spark ETL (Extract, transform and load) or ELT (Extract, load and transform). In Spark to connect different data sources we need to install libraries, we will discuss how to install all the required libraries and how to connect with different data sources and extract, transform and load data.... Continue Reading →

Create a website or blog at WordPress.com

Up ↑