What you'll learn

Mastering PySpark is a comprehensive course that will help you become proficient in PySpark Programming, Spark SQL, Dataframe APIs, Spark Architecture, Performance Tuning and Join Optimization, Advanced Concepts such as AQE, DPP, Memory Management and Unit Testing

  • PySpark Programming - Data Engineering and Data Processing using PySpark and Spark SQL

  • Spark Architecture - Understanding Spark internals, Performance optimization, Memory Management

  • Advanced Concepts - Data Sources and Sinks, Adaptive Query Execution, Dynamic Partition Pruning, Unit Testing

Course Prerequisite

What do you need to know before you start this course

  • Programming Knowledge Using Python Programming Language and SQL Fundamentals

  • A Recent 64-bit Windows/Mac Machine with 8 GB RAM & Internet Connection

Course Content

130 lectures - 25 hours video - Capstone Project

    1. Ch1: Downloadable Documents

    2. Ch1: Downloadable Materials

    3. Big Data - Overview and Objectives

      FREE PREVIEW
    4. What is Big Data and How it Started

      FREE PREVIEW
    5. What is Big Data and How it Started - Notes

    6. Hadoop Architecture, History, and Evolution

    7. Hadoop Architecture, History, and Evolution - Notes

    8. What is Data Lake and How it works

    9. What is Data Lake and How it works - Notes

    10. Introducing Apache Spark and Databricks Cloud

      FREE PREVIEW
    11. Introducing Apache Spark and Databricks Cloud - Notes

    12. Spark Development Environment - Overview and Objectives

      FREE PREVIEW
    13. Spark Development Environments

      FREE PREVIEW
    14. Spark Development Environments - Notes

    15. Setup your Databricks Community Cloud Environment

    16. Setup your Databricks Community Cloud Environment - Notes

    17. Introduction to Databricks Workspace

      FREE PREVIEW
    18. Introduction to Databricks Workspace - Notes

    19. Create Your First Spark Application in Databricks Cloud

      FREE PREVIEW
    20. Create Your First Spark Application in Databricks Cloud - Notes

    21. Setup Your Local Development IDE

    22. Setup your Local Development IDE - Notes

    23. Mac Users - Setup your Local Development IDE

    24. Create Your First Spark Application using IDE

      FREE PREVIEW
    25. Create Your First Spark Application using IDE - Notes

    26. Ch1: Assignment

    27. Ch1: Test Your Knowledge [Requires 60% passing grade - Retake allowed]

    28. Review Rating

    1. Ch2: Downloadable Documents

    2. Ch2: Downloadable Materials

    3. Introduction to Spark Data Frames and Tables

    4. Introduction to Spark Data Frames and Tables - Notes

    5. Creating Spark Dataframe

    6. Creating Spark Dataframe - Notes

    7. Creating Spark Tables

    8. Creating Spark Tables - Notes

    9. Common Problem With Databricks Community

    10. Common Problem With Databricks Community - Notes

    11. Data Caching Essentials

    12. Data Caching Essentials - Notes

    13. Working With Spark SQL

    14. Working With Spark SQL - Notes

    15. Dataframe Transformations and Actions

    16. Dataframe Transformations and Actions - Notes

    17. Applying Transformations

    18. Applying Transformations - Notes

    19. Querying Spark Dataframe

    20. Querying Spark Dataframe - Notes

    21. More Dataframe Transformations

    22. More Dataframe Transformations - Notes

    23. Micro Project - Problem Statement

    24. Micro Project Problem Statement - Notes

    25. Ch2: Assignments

    26. Ch2: Test Your Knowledge [Requires 60% passing grade - Retake allowed]

    27. Review Rating

    1. Ch3: Downloadable Documents

    2. Ch3: Downloadable Materials

    3. Ch2: Assignment Solutions

    4. Creating Spark Session

    5. Creating Spark Session - Notes

    6. Programmatically Creating Data Frames

    7. Programmatically Creating Data Frames - Notes

    8. Working with Spark Data Frame Schema

    9. Working with Spark Data Frame Schema - Notes

    10. Writing Spark Data Frames

    11. Writing Spark Data Frames - Notes

    12. Managed Vs External Tables

    13. Managed Vs External Tables - Notes

    14. Introducing Spark Web UI

    15. Introducing Spark Web UI - Notes

    16. Ch3: Assignment

    17. Ch3: Test Your Knowledge [Requires 60% passing grade - Retake allowed]

    1. Ch4: Downloadable Documents

    2. Ch4: Downloadable Materials

    3. Ch3 Assignment Solutions

    4. Working with Spark Data Frame Rows

    5. Working with Spark Data Frame Rows - Notes

    6. Working with Spark Data Frame Columns

    7. Working with Spark Data Frame Columns - Notes

    8. Working with Select and SelectExpr

    9. Working with Select and SelectExpr - Notes

    10. Working with Literal Values

    11. Working with Literal Values - Notes

    12. Adding, Renaming and Removing Columns

    13. Adding, Renaming and Removing Columns - Notes

    14. Filtering Records and Finding Unique Rows

    15. Filtering Records and Finding Unique Rows - Notes

    16. Ch4: Assignment

    17. Ch4: Assignment Resource for DOWNLOAD

    18. Ch4: Test Your Knowledge [Requires 60% passing grade - Retake allowed]

    1. Ch5: Downloadable Documents

    2. Ch5: Downloadable Materials

    3. Ch4 Assignment Solutions

    4. Ch4 Assignment Solution Notebooks for DOWNLOAD

    5. Sampling and Splitting

    6. Sampling and Splitting - Notes

    7. Sorting and Limiting

    8. Sorting and Limiting - Notes

    9. Nulls in Apache Spark

    10. Nulls in Apache Spark - Notes

    11. Working with Nulls

    12. Working with Nulls - Notes

    13. Spark Data Types

    14. Spark Data Types - Notes

    15. Casting and Aliases

    16. Casting and Aliases - Notes

    17. Ch5: Assignment

    18. Ch5: Test Your Knowledge [Requires 60% passing grade - Retake allowed]

Course Features

  • Practice Quiz & Assignment
  • PDF & Source Code
  • Total Support

Course Reviews

5 star rating

In-Depth and Comprehensive!

Siva Das

This course covers everything you need to know about PySpark. The depth and clarity are remarkable, making complex topics easy to understand. Highly recommend it for anyone serious about mastering PySpark!

This course covers everything you need to know about PySpark. The depth and clarity are remarkable, making complex topics easy to understand. Highly recommend it for anyone serious about mastering PySpark!

Read Less
5 star rating

Excellent course to become the master on PySpark at enterprise level

Amarnath Reddy Danthuluru

Excellent course to become the master on PySpark at enterprise level

Excellent course to become the master on PySpark at enterprise level

Read Less
5 star rating

Mastering PySpark Programming

Sudarshana Pai

One of the excellent course in the market to learn Pyspark.

One of the excellent course in the market to learn Pyspark.

Read Less
5 star rating

pyspark

sravya nag

very clear

very clear

Read Less
5 star rating

Excellent Course

Shivam Gupta

Till Now my experience is awesome

Till Now my experience is awesome

Read Less
5 star rating

Excellent,its very good

Praveen Mekala

Excellent

Excellent

Read Less
5 star rating

Pyspark Transformation

Nava Divya

Very good course

Very good course

Read Less

Features & Support

  • Total Support

    We provide support throughout your learning and answer every question. You may also avail one-to-one and online technical support calls for blocker issues.

  • Certification Ready

    The course covers the curriculum for Databricks Certified Associate Developer for Apache Spark 3.0 - Python certification exam.

  • Capstone Project

    The course includes many hands-on assignments/solutions and a major real-life capstone project with the complete solution, source code, and explanation.

Course FAQ

  • How long can I access the course material?

    We provide standard 3-year access to the course material from the date of purchase. However, our promotional offers may reduce the access duration for a discounted price. Please check access validity terms and conditions for the promotional offers.

  • Do you have a refund policy?

    Yes. You can ask for a refund within 7 days of your purchase or before completing 15% of the course material, whichever is earlier. We provide a refund after deducting 6% of payment processing charges.

  • How do you provide support?

    We have a Q&A forum where you can ask questions, and our team will answer your queries.

Enroll into self-paced course

Discover your potential, start today and learn at your own schedule

Schedule a free call

Get in touch with your course coordinator to learn more about the course, our instructor-led programs, discount offers, group discounts, corporate training and additional payment methods.

  • WhatsApp

    WhatsApp: +91-93534 65988

    Want to speak to your course coordinator? We are just a WhatsApp message or a phone call away.

  • Email

    Email: [email protected]

    Drop us an email with all your queries and questions and we will get back to you over the email.