Step-by-step guides for Data Proc
Written by
Updated at April 14, 2024
Data Proc clusters
- Information about existing clusters
- Creating clusters
- Connecting to a cluster
- Updating a cluster
- Migrating a lightweight cluster to a different availability zone
- Deleting clusters
Subclusters Data Proc
Apache and other third-party services
Delta Lake
- Setting up Delta Lake in single-cluster mode
- Setting up Delta Lake in multi-cluster mode
- Tips for setting up and using Delta Lake
Jobs
- Managing jobs
- Running jobs
- Managing Spark jobs
- Managing PySpark jobs
- Managing Hive jobs
- Managing MapReduce jobs
Hive Metastore clusters
- Creating a Hive Metastore cluster
- Connecting Data Proc to Metastore
- Deleting a Hive Metastore cluster
Logs and monitoring
- Working with logs
- Monitoring the state of Data Proc clusters and hosts
- Monitoring the state of Spark applications
- Diagnostics and troubleshooting of Spark application performance issues