Free
Course

Introduction to Big Data with PySpark

See how big data is used across different industries and learn how to work with big data using PySpark!

4.44 out of 5 stars
4,431 learners enrolled
  • Skill level: Beginner
  • Time to complete: 4 hours
  • Certificate of completion: Included with paid plans
  • Prerequisites: None

About this course

This course introduces the core concepts behind big data through a practical, hands-on approach using PySpark. Big data is everywhere and touches data science, data engineering, and machine learning. It is becoming central to marketing, strategy, and research. The course covers the applications and implications of big data in finance, social media, health, and medicine. PySpark makes it easy to start analyzing big data, putting its potential within reach of anyone who knows Python.

Syllabus

2 lessons • 2 projects • 3 quizzes

Meet the creator of the course
Andrea Hassler
Andrea has a Master's in Applied Statistics from NYU and a Bachelor's in Psychology from SUNY New Paltz. She has worked with students individually as a tutor for many years. She has also contributed to research projects in the health sciences as a statistical consultant and research assistant.

Introduction to Big Data with PySpark course ratings and reviews

4.44 out of 5 stars
140 ratings
  • The progress I have made since starting to use Codecademy is immense! I can study for short periods or long periods at my own convenience - mostly late in the evenings.
    Chris
    Codecademy Learner @ USA
  • I felt like I learned months in a week. I love how Codecademy uses learning by practice and gives great challenges to help the learner to understand a new concept and subject.
    Rodrigo
    Codecademy Learner @ UK
  • Brilliant learning experience. Very interactive. Literally a game changer if you're learning on your own.
    John-Andrew
    Codecademy Learner @ USA

Our learners work at

  • Google
  • Meta
  • Apple
  • EA
  • Amazon
  • IBM
  • Microsoft
  • Reddit
  • Spotify
  • Uber
  • YouTube
  • Instagram

Join over 50 million learners and start Introduction to Big Data with PySpark today!

Unlock additional features with a paid plan

  • Practice Projects

    Guided projects that help you solidify the skills and concepts you're learning.
  • Assessments

    Auto-graded quizzes and immediate feedback help you reinforce your skills as you learn.
  • Certificate of Completion

    Earn a document to prove you've completed a course or path that you can share with your network.