Mastering PySpark Programming
Master Data Engineering skills with PySpark | Beginner to Pro
Mastering PySpark is a comprehensive course that will help you become proficient in PySpark programming, Spark SQL, the DataFrame APIs, Spark architecture, performance tuning and join optimization, and advanced concepts such as Adaptive Query Execution (AQE), Dynamic Partition Pruning (DPP), memory management, and unit testing.
PySpark Programming - Data Engineering and Data Processing using PySpark and Spark SQL
Spark Architecture - Understanding Spark internals, Performance optimization, Memory Management
Advanced Concepts - Data Sources and Sinks, Adaptive Query Execution, Dynamic Partition Pruning, Unit Testing
What do you need to know before you start this course
Programming knowledge of Python and SQL fundamentals
A Recent 64-bit Windows/Mac Machine with 8 GB RAM & Internet Connection
130 lectures - 25 hours video - Capstone Project
Ch1: Downloadable Documents
Ch1: Downloadable Materials
Big Data - Overview and Objectives
What is Big Data and How it Started
What is Big Data and How it Started - Notes
Hadoop Architecture, History, and Evolution
Hadoop Architecture, History, and Evolution - Notes
What is Data Lake and How it works
What is Data Lake and How it works - Notes
Introducing Apache Spark and Databricks Cloud
Introducing Apache Spark and Databricks Cloud - Notes
Spark Development Environment - Overview and Objectives
Spark Development Environments
Spark Development Environments - Notes
Setup your Databricks Community Cloud Environment
Setup your Databricks Community Cloud Environment - Notes
Introduction to Databricks Workspace
Introduction to Databricks Workspace - Notes
Create Your First Spark Application in Databricks Cloud
Create Your First Spark Application in Databricks Cloud - Notes
Setup Your Local Development IDE
Setup your Local Development IDE - Notes
Mac Users - Setup your Local Development IDE
Create Your First Spark Application using IDE
Create Your First Spark Application using IDE - Notes
Ch1: Assignment
Ch1: Test Your Knowledge [Requires 60% passing grade - Retake allowed]
Ch2: Downloadable Documents
Ch2: Downloadable Materials
Introduction to Spark Data Frames and Tables
Introduction to Spark Data Frames and Tables - Notes
Creating Spark Dataframe
Creating Spark Dataframe - Notes
Creating Spark Tables
Creating Spark Tables - Notes
Common Problem With Databricks Community
Common Problem With Databricks Community - Notes
Data Caching Essentials
Data Caching Essentials - Notes
Working With Spark SQL
Working With Spark SQL - Notes
Dataframe Transformations and Actions
Dataframe Transformations and Actions - Notes
Applying Transformations
Applying Transformations - Notes
Querying Spark Dataframe
Querying Spark Dataframe - Notes
More Dataframe Transformations
More Dataframe Transformations - Notes
Micro Project - Problem Statement
Micro Project - Problem Statement - Notes
Ch2: Assignments
Ch2: Test Your Knowledge [Requires 60% passing grade - Retake allowed]
Ch3: Downloadable Documents
Ch3: Downloadable Materials
Ch2: Assignment Solutions
Creating Spark Session
Creating Spark Session - Notes
Programmatically Creating Data Frames
Programmatically Creating Data Frames - Notes
Working with Spark Data Frame Schema
Working with Spark Data Frame Schema - Notes
Writing Spark Data Frames
Writing Spark Data Frames - Notes
Managed Vs External Tables
Managed Vs External Tables - Notes
Introducing Spark Web UI
Introducing Spark Web UI - Notes
Ch3: Assignment
Ch3: Test Your Knowledge [Requires 60% passing grade - Retake allowed]
Ch4: Downloadable Documents
Ch4: Downloadable Materials
Ch3: Assignment Solutions
Working with Spark Data Frame Rows
Working with Spark Data Frame Rows - Notes
Working with Spark Data Frame Columns
Working with Spark Data Frame Columns - Notes
Working with Select and SelectExpr
Working with Select and SelectExpr - Notes
Working with Literal Values
Working with Literal Values - Notes
Adding, Renaming and Removing Columns
Adding, Renaming and Removing Columns - Notes
Filtering Records and Finding Unique Rows
Filtering Records and Finding Unique Rows - Notes
Ch4: Assignment
Ch4: Assignment Resource for DOWNLOAD
Ch4: Test Your Knowledge [Requires 60% passing grade - Retake allowed]
Ch5: Downloadable Documents
Ch5: Downloadable Materials
Ch4: Assignment Solutions
Ch4: Assignment Solution Notebooks for DOWNLOAD
Sampling and Splitting
Sampling and Splitting - Notes
Sorting and Limiting
Sorting and Limiting - Notes
Nulls in Apache Spark
Nulls in Apache Spark - Notes
Working with Nulls
Working with Nulls - Notes
Spark Data Types
Spark Data Types - Notes
Casting and Aliases
Casting and Aliases - Notes
Ch5: Assignment
Ch5: Test Your Knowledge [Requires 60% passing grade - Retake allowed]
This course covers everything you need to know about PySpark. The depth and clarity are remarkable, making complex topics easy to understand. Highly recommend it for anyone serious about mastering PySpark!
Excellent course to become the master on PySpark at enterprise level
One of the excellent course in the market to learn Pyspark.
very clear
Till Now my experience is awesome
Excellent
Very good course
We provide standard 3-year access to the course material from the date of purchase. However, our promotional offers may reduce the access duration for a discounted price. Please check access validity terms and conditions for the promotional offers.
Yes. You can ask for a refund within 7 days of your purchase or before completing 15% of the course material, whichever is earlier. We issue the refund after deducting 6% for payment processing charges.
We have a Q&A forum where you can ask questions, and our team will answer your queries.
Get in touch with your course coordinator to learn more about the course, our instructor-led programs, discount offers, group discounts, corporate training and additional payment methods.
Want to speak to your course coordinator? We are just a WhatsApp message or a phone call away.
Email us your questions and we will reply by email.
Schedule a call with your course coordinator to discuss bundles, discounts, and live sessions.
Learn the Python programming language. Hands-on learning with a Capstone project. Just enough Python for Spark developers.
Master Apache Spark Structured Streaming and incremental data processing. Scenario-based learning and a Capstone project.
Curated learning path for mastering big data engineering using Spark and Azure Databricks. Hands-on and Capstone projects.