Apache Spark and Data Stream Processing: A Crash Course

Build your first stream processing application using Spark and Structured Streaming.

What will I learn?

  • Learn to pinpoint when to use or not to use Spark.
  • Gain an in-depth understanding of basic distributed data processing concepts.
  • Learn to leverage Spark’s core capabilities for your data projects.
  • Learn how to launch Spark clusters on AWS.
  • Learn how to think about and solve data-intensive problems.
  • Build an original solution for a real-world, practical stream processing problem.
  • Walk away with working code that you can use in your own projects.

Course Curriculum

MODULE 1

Spark: Suitability, Concepts, and Capabilities

Gain an understanding of what Spark can and cannot do, and what use cases it is best suited for.

MODULE 2

Setting up a Spark Environment

Learn how to work with Spark locally and on AWS.

MODULE 3

Breaking in Your New Spark Environment

Learn the difference between transformations, actions, batch operations and stream processing.

MODULE 4

Capstone Project

With your instructor's guidance, use what you've learned in this class to build a custom project.

Live, Online, Instructor-Led

Learn face-to-face in live online sessions with your instructor and peers from anywhere in the world.

Hands On

All our trainings involve in-class, hands on practice that is relevant to your team's goals. At the end of the training, your team will be ready to hit the ground running.

Elite Instructors

Learn from an elite team of industry experts who have taught at universities such as Harvard, and have trained teams at companies such as ANZ Bank.

Help When You Need It

Forget about the frustration of getting stuck while watching online videos. Our instructors are here to help in-between sessions, so you can have a smooth learning experience.

Frequently Asked Questions

Have a question?

Contact us any time, we’d love to hear from you!