MP4 | Video: h264, 1920x1080 | Audio: AAC, 44.1 KHz, 2 Ch
Difficulty: Intermediate | Genre: eLearning | Language: English | Duration: 8 Lectures (49m) | Size: 757 MB
What you'll learn:
Explain the relationship between Dataproc, key components of the Hadoop ecosystem, and related GCP services
Create, customize, monitor, and scale Dataproc clusters
Run data processing jobs on Dataproc
Apply access control to Dataproc
Requirements
Hadoop or Spark experience (recommended)
Google Cloud Platform account (sign up for free trial atif you don’t have an account)
Description
Google Cloud Dataproc is a managed service for running Apache Hadoop and Spark jobs. It can be used for big data processing and machine learning.
But you could run these data processing frameworks on Compute Engine instances, so what does Dataproc do for you? Dataproc actually uses Compute Engine instances under the hood, but it takes care of the management details for you. It’s a layer on top that makes it easy to spin up and down clusters as you need them.
Who this course is for
Data professionals
People studying for the Google Professional Data Engineer exam
发布日期: 2020-03-27