25.9.10
This website uses cookies to ensure you get the best experience on our website. Learn more

GCP Data Engineer Pro: Dataset Processing

Skillsoft issued completion badges are earned based on viewing the percentage required or receiving a passing score when assessment is required. In the intricate world of data management, envisioning the flow of information as an oil pipeline offers a vivid analogy. Just as oil must be carefully extracted, transported, and refined, data too requires meticulous processes to ensure its value is maximized. In this course, learn about big data processing, including Dataproc cluster options, creating a Dataproc cluster and running a Spark job, and Dataprep flows, profiling, transforming, and sampling of data. Next, discover how to build and deploy robust Dataflow pipelines and options to fine-tune Dataflow pipeline performance. Finally, explore the setup and management of Data Fusion instances and pipelines, create a Data Fusion pipeline, and examine the pricing models for Dataproc, Dataprep, and Dataflow. This course is one of a collection that prepares learners for the Google Professional Data Engineer exam.