Edlitera works best in modern browsers that support HTML5 video. Click to learn more.

Apache Spark and Data Stream Processing: A Crash Course

Build your first stream processing application using Spark and Structured Streaming.

Apache Spark and Data Stream Processing: A Crash Course
GET IN TOUCH

What will I learn?

Walk away with working code that you can use in your own projects.

Build an original solution for a real-world, practical stream processing problem.

Learn how to think about and solve data-intensive problems.

Learn how to launch Spark clusters on AWS.

Learn to leverage Spark’s core capabilities for your data projects.

Gain an in-depth understanding of basic distributed data processing concepts.

Learn to pinpoint when to use or not to use Spark.

Course curriculum

Module 1
Spark: Suitability, Concepts, and Capabilities

Gain an understanding of what Spark can and cannot do, and what use cases it is best suited for.

Module 2
Setting up a Spark Environment

Learn how to work with Spark locally and on AWS.

Module 3
Breaking in Your New Spark Environment

Learn the difference between transformations, actions, batch operations and stream processing.

Module 4
Capstone Project

With your instructor's guidance, use what you've learned in this class to build a custom project.

Elite instructors

Learn from an elite team of industry experts who have taught at universities such as Harvard, and have trained teams at companies such as Qualcomm.

Tailored content

We customize training content to match your team's goals. During training, your team works on projects that are relevant to your business.

Hands on

All our trainings involve in-class, hands on practice that is relevant to your team's goals. At the end of the training, your team will be ready to hit the ground running.

On site and live online

Choose whether you prefer in-person training, or you need the online participation of a geographically distributed team.

REQUIREMENTS:

Familiarity with Python, and some experience working with data-intensive systems, like databases or data processing frameworks like Hadoop.
Exposure to AWS is a plus.

Frequently Asked Questions

1. Who is this course for?

This course is aimed at developers who are already familiar with data-intensive systems and who want to get started with Spark and Structured Streaming and build their first stream processing application.

2. Is this a MOOC (Massive Open Online Course)?

Absolutely not. Unlike other internet courses, this course is completely live. Your instructor lectures in real time (either on-site, or live on-line) and can answer questions and provide real-time, tailored feedback so you can reach your goals as fast as possible. At Edlitera, we believe nothing beats the experience of learning live from an elite instructor with extensive industry experience. 

3. What is the format of this course?

The course combines lecturing, hands-on exercises to be solved both individually and in pairs, as well as one or two larger projects that require participants to use their newly acquired knowledge to solve real-world problems.

Exercises and projects are chosen to be relevant to the type of problems that your team encounters daily. Our goal is to deliver practical, immediately applicable knowledge, and to empower your team to hit the ground running after each training. Our instructors can teach this course on site at your offices or live online.

4. Can you tweak the content of the course for my team?

Of course! While we do offer a standard version of this course, we can also work with you to tailor the content such that it fits your team's priorities perfectly. 

5. What if I have other questions?

If you have other questions, or want to find out more about this course or other courses we offer, please ping us - we love questions! You can email us at [email protected], or get in touch via the chat box below.