Spark Chapter 12 Spark Streaming with Apache Kafka

Previous blog/Context: In an earlier blog, we discussed Spark ETL with Lakehouse (All the famous lake house formats). Please find below blog post for more details. https://developershome.blog/2023/04/05/spark-etl-chapter-11-with-lakehouse-delta-table-optimization/ Introduction: Today, we will discuss the points below. What is Apache Kafka? Basic concepts of Apache Kafka (Publisher and Subscriber) Publish and subscribe messages from the command line... Continue Reading →

Welcome to the world of Lakehouse

Photo by Alexander Isreb on Pexels.com The rise of big data has led to the emergence of new data architectures that can handle the volume, variety, and velocity of data generated by modern organizations. Two of the most popular data architectures are traditional data warehouses and lake house architectures. In this post, we will compare... Continue Reading →

Delta Lake with Python (delta-rs)

Delta tables Read, Write, History check, and vacuum using Python "By the end of this article, you will learn how to access delta table using Python and how to do CRUD operations on delta table using Python" In the earlier blog, we discussed delta lake and learned how to implement a lake house using Delta... Continue Reading →

Create a website or blog at WordPress.com

Up ↑