Apache Spark and Data Stream Processing: A Crash Course

Build your first stream processing application using Spark and Structured Streaming.

What will I learn?

  • Learn to pinpoint when to use Spark and when not to.
  • Gain an in-depth understanding of basic distributed data processing concepts.
  • Learn to leverage Spark’s core capabilities for your data projects.
  • Learn how to launch Spark clusters on AWS.
  • Learn how to think about and solve data-intensive problems.
  • Build an original solution for a real-world, practical stream processing problem.
  • Walk away with working code that you can use in your own projects.

Curriculum

MODULE 1

Spark: Suitability, Concepts, and Capabilities

Gain an understanding of what Spark can and cannot do, and what use cases it is best suited for.

MODULE 2

Setting up a Spark Environment

Learn how to work with Spark locally and on AWS.

MODULE 3

Breaking in Your New Spark Environment

Learn the difference between transformations and actions, and between batch operations and stream processing.

MODULE 4

Capstone Project

With your instructor's guidance, use what you've learned in this class to build a custom project.

Live, Online, Instructor-Led

Learn face-to-face in live online sessions with your instructor and peers from anywhere in the world.

Hands On

All our trainings involve in-class, hands-on practice that is relevant to your team's goals. At the end of the training, your team will be ready to hit the ground running.

Elite Instructors

Learn from an elite team of industry experts who have taught at universities such as Harvard and trained teams at companies such as ANZ Bank.

Help When You Need It

Forget about the frustration of getting stuck while watching online videos. Our instructors are here to help between sessions, so you can have a smooth learning experience.

Frequently Asked Questions

1. Who is this course for?

This course is aimed at developers who are already familiar with data-intensive systems and who want to get started with Spark and Structured Streaming and build their first stream processing application.

2. Is this a MOOC (Massive Open Online Course)?

Absolutely not. Unlike other internet courses, this course is completely live. Your instructor lectures in real time (either on site or live online) and can answer questions and provide real-time, tailored feedback so you can reach your goals as fast as possible. At Edlitera, we believe nothing beats the experience of learning live from an elite instructor with extensive industry experience.

3. What is the format of this course?

The course combines lectures, hands-on exercises solved both individually and in pairs, and one or two larger projects that require participants to apply their newly acquired knowledge to real-world problems.

Exercises and projects are chosen to be relevant to the type of problems that your team encounters daily. Our goal is to deliver practical, immediately applicable knowledge, and to empower your team to hit the ground running after each training. Our instructors can teach this course on site at your offices or live online.

4. Can you tweak the content of the course for my team?

Of course! While we do offer a standard version of this course, we can also work with you to tailor the content so that it fits your team's priorities.

5. What if I have other questions?

If you have other questions, or want to find out more about this course or other courses we offer, please ping us - we love questions! You can email us at contact@edlitera.com, or get in touch via the chat box below.

Have a question?

Contact us any time. We’d love to hear from you!