CG数据库 >> Mastering Big Data Analytics with PySpark

MP4 | Video: h264, 1920x1080 | Audio: AAC, 44.1 KHz, 2 Ch

June 26, 2020 | ISBN: 9781838640583 | English

Duration: 41 Lessons (8h 7m) | Size: 1.54 GB

Effectively apply Advanced Analytics to large datasets using the power of PySpark

Learn

Gain a solid knowledge of vital Data Analytics concepts via practical use cases

Create elegant data visualizations using Jupyter

Run, process, and analyze large chunks of datasets using PySpark

Utilize Spark SQL to easily load big data into DataFrames

Create fast and scalable Machine Learning applications using MLlib with Spark

Perform exploratory Data Analysis in a scalable way

Achieve scalable, high-throughput and fault-tolerant processing of data streams using Spark Streaming

About

PySpark helps you perform data analysis at-scale; it enables you to build more scalable analyses and pipelines. This course starts by introducing you to PySpark's potential for performing effective analyses of large datasets. You'll learn how to interact with Spark from Python and connect Jupyter to Spark to provide rich data visualizations. After that, you'll delve into various Spark components and its architecture.

You'll learn to work with Apache Spark and perform ML tasks more smoothly than before. Gathering and querying data using Spark SQL, to overcome challenges involved in reading it. You'll use the DataFrame API to operate with Spark MLlib and learn about the Pipeline API. Finally, we provide tips and tricks for deploying your code and performance tuning.

By the end of this course, you will not only be able to perform efficient data analytics but will have also learned to use PySpark to easily analyze large datasets at-scale in your organization.

All related code files are placed on a GitHub repository at:

Features

Solve your big data problems by building powerful Machine Learning models with Spark and implementing them using Python

Get up-and-running with Spark's essential libraries and tools (such as PySpark, Spark Streaming, Spark SQL, and Spark MLlib) and learn to apply them in practical, real-world big data applications

Leverage Spark 2.x—one of the most popular big data technologies—to discover how powerful Spark Machine Learning is how easily you can apply it!


Mastering Big Data Analytics with PySpark的图片1
Mastering Big Data Analytics with PySpark的图片2

发布日期: 2020-06-27