CG数据库 >> Spark Streaming with Scala

h264, yuv420p, 1280x720 |ENGLISH, aac, 44100 Hz, 2 channels | 7h 12mn | 2.74 GB

What you'll learn

Process data as it arrives

Handle data of any scale in real time

Use Structured Streaming and SQL on streaming data

Use low-level, high-control APIs with DStreams

Integrate Spark with Kafka, JDBC, Cassandra

Master advanced concepts including window functions and watermarking

Requirements

Scala and functional programming essentials, at the level of the beginners course (implicits are a plus)

Spark fundamentals, including Spark SQL, DataFrames and RDDs

Good general knowledge of parallel/distributed systems (nodes, threads, etc)

Description

Updated for Spark 3.0

In this course, we will learn how to stream big data with Apache Spark 3. You'll write 1500+ lines of Spark code yourself, with guidance, and you will become a rockstar.

This course is for Spark & Scala programmers who now need to work with streaming data, or who need to process data in real time.

Why Spark in Scala:

it's blazing fast for big data

its demand has exploded

it's a highly marketable skill

it's well maintained, with dozens of high-quality extensions

it's a foundation for a data scientist

I like to get to the point and get things done. This course

deconstructs all concepts into the critical pieces you need

selects the most important ideas and separates them into what's simple but critical and what's powerful

sequences ideas in a way that "clicks" and makes sense throughout the process of learning

applies everything in live code

The end benefits are still much greater:

a completely new mental model around data streaming

significantly more marketable resume

more enjoyable work - Spark is fun!

This course is for established programmers with experience with both Scala and Spark at least at the level of the Rock the JVM essential courses for Scala and Spark. I already assume a solid understanding of general programming fundamentals.

This course is NOT for you if

you've never written Scala or Spark code before

you don't have some essential parallel programming background (e.g. what's a process/a thread)

The course is comprehensive, but you'll always see me get straight to the point. So make sure you have a good level of focus and commitment to become a badass programmer.

I believe both theory and practice are important. That's why you'll get lectures with code examples, real life code demos and assignments, plus additional resources, instructions, exercises and solutions. At the end of the course, you'll have written thousands of lines of Spark.

I've seen that my students are most successful - and my best students work at Google-class companies - when they're guided, but not being told what to do. I have exercises waiting for you, where I offer my (opinionated) guidance but otherwise freedom to experiment and improve upon your code.

Definitely not least, my students are most successful when they have fun along the way!

So join me in this course and let's rock the JVM!

Who this course is for:

Scala & Spark programmers who need to set up data pipelines in real time

Data scientists who need to work with streaming data


Spark Streaming with Scala的图片1
Spark Streaming with Scala的图片2

发布日期: 2020-07-02