Apache Spark Structured Streaming
Master Real-time Stream Processing using PySpark and Databricks
This course helps you understand real-time stream processing with Apache Spark and Databricks Cloud, and apply that knowledge to build real-time stream processing solutions. The course is example-driven and follows a working-session format: we code live and explain each concept as it is needed.
Real-time Stream Processing Concepts, Spark Structured Streaming APIs and Architecture, Working with Streaming Sources and Sinks
Kafka for Data Engineers, Working With Kafka Source and Integrating Spark with Kafka
Stateless and Stateful Streaming Transformations, Windowing Aggregates using Spark Structured Streaming, Watermarking and State Cleanup
Working with Azure Databricks and Streaming on Databricks Platform
What do you need to know before you start this course
Spark Fundamentals and exposure to Spark Dataframe APIs
Programming knowledge of Python
76 lectures - 18 hours video
About the Course
Course Prerequisite (Free Preview)
Source Code and Other Resources (Free Preview)
Spark Development Environments
Setup your Databricks Community Cloud Environment (Free Preview)
Working in Databricks Workspace (Free Preview)
Batch processing to stream processing
Your Spark application - Applying Best Practice
Your first streaming application - Implementing Stream
Stream Processing Model in Spark
Create Another Streaming Application
Stream Triggers
Incremental Batch Processing
Streaming Sources and Sinks
Creating Chain of Streams
An Introduction to Kafka
Creating Kafka Cluster in Cloud
Kafka Core Concepts
Producing Data to Kafka Topic
Consuming Data from Kafka Topic
Working with Kafka Topic Data
How to Implement Idempotence
Working with Kafka Sink
Streaming Aggregates and State Store
Incremental Aggregates and Update Mode
Spark Streaming Output Modes
Stateful vs. Stateless Aggregation
Implementing Stateless Streaming Aggregation
Timebound Stateful Tumbling Window Aggregation
Watermarking and State Store Cleanup
Sliding Window Aggregates
Introduction to Databricks
Creating Azure Free Account
Azure Portal Overview
Creating Azure Databricks Service
Introduction to Azure Databricks Workspace
Azure Databricks Architecture
Creating Azure Databricks Cluster
Introduction to Databricks Notebooks
Notebooks Magic Commands
Databricks Notebooks Utilities
Introduction to Databricks Unity Catalog
Introduction to Databricks Workflow Jobs
Introduction to Databricks Rest API
Introduction to Databricks CLI
We provide standard 3-year access to the course material from the date of purchase. However, our promotional offers may reduce the access duration for a discounted price. Please check access validity terms and conditions for your promotional offers.
Yes. You can ask for a refund within 7 days of your purchase or before completing 15% of the course material, whichever is earlier. We issue the refund after deducting a 6% payment processing charge.
We have a Q&A forum where you can ask questions, and our team will answer your queries.
Get in touch with your course coordinator to learn more about the course, our instructor-led programs, discount offers, group discounts, corporate training and additional payment methods.
Master Spark programming in Python (PySpark) from beginner to advanced. Hands-on learning and Capstone project.
Master Azure Databricks Cloud platform capabilities and Lakehouse architecture. Micro-projects and Capstone project.
Curated learning path for mastering big data engineering using Spark and Azure Databricks. Hands-on and Capstone projects.