Data Engineering using Kafka and Spark Structured Streaming

Get a Free Week of Skillshare

$9.99 Coupon code for Data Engineering using Kafka and Spark Structured Streaming Udemy Course. This is an exclusive discount coupon from the course instructor, it will be active for few days. Check ENROLL NOW button to get a maximum discount. We manually verified coupon code on February 17th, 2024 .

What you’ll learn

  • Setting up self support lab with Hadoop (HDFS and YARN), Hive, Spark, and Kafka
  • Overview of Kafka to build streaming pipelines
  • Data Ingestion to Kafka topics using Kafka Connect using File Source
  • Data Ingestion to HDFS using Kafka Connect using HDFS 3 Connector Plugin
  • Overview of Spark Structured Streaming to process data as part of Streaming Pipelines
  • Incremental Data Processing using Spark Structured Streaming using File Source and File Target
  • Integration of Kafka and Spark Structured Streaming – Reading Data from Kafka Topics

Here is a brief outline of the course. You can choose either Cloud9 or GCP to provision a server to set up the environment.

  • Setting up Environment using AWS Cloud9 or GCP
  • Setup Single Node Hadoop Cluster
  • Setup Hive and Spark on top of Single Node Hadoop Cluster
  • Setup Single Node Kafka Cluster on top of Single Node Hadoop Cluster
  • Getting Started with Kafka
  • Data Ingestion using Kafka Connect – Web server log files as a source to Kafka Topic
  • Data Ingestion using Kafka Connect – Kafka Topic to HDFS a sink
  • Overview of Spark Structured Streaming
  • Kafka and Spark Structured Streaming Integration
  • Incremental Loads using Spark Structured Streaming

Who this course is for:

  • Experienced ETL Developers who want to learn Kafka and Spark to build streaming pipelines
  • Experienced PL/SQL Developers who want to learn Kafka and Spark to build streaming pipelines
  • Beginner or Experienced Data Engineers who want to learn Kafka and Spark to build streaming pipelines

Recommended Courses 

  1. Apache Kafka Series – Kafka Cluster Setup & Administration
  2. Kafka fundamentals for java developers
  3. Apache Kafka Series – Kafka Monitoring & Operations
Deal Score-1
Disclosure: This post may contain affiliate links and we may get small commission if you make a purchase. Read more about Affiliate disclosure here.

Gain access to over 11,000+ courses for just $16.58 [₹850] per month

Choose between monthly or annual billing cycles, with the freedom to cancel at any time.

The future belongs to learners. Udemy online courses as low as $13.99

New customer offer! Top courses from $14.99 when you first visit Udemy

Gain the skills you need to reach your next career milestone for as little as $11.99

Course Coupon Club
Logo
Follow us on Telegram Join us on FB