© 2022 Yandex.Cloud LLC

Working with logs

Written by
Yandex Cloud
  • Viewing log entries
  • Disabling log sending
  • Storing logs

Data Proc cluster logs are collected and displayed by Yandex Cloud Logging.

To monitor events on the cluster and its individual hosts, specify a relevant log group in the cluster settings. You can do this when creating or updating the cluster. If no log group has been selected for the cluster, the default log group in the cluster's folder is used to send and store logs.

For more information, see Logs in Data Proc.
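As a sketch, a log group can be assigned with the CLI at cluster creation time. The command mirrors the `yc dataproc cluster create` invocation used later in this article; the log group ID is a placeholder, and the `...` stands for the other required cluster parameters:

```shell
# Sketch: assign an existing Cloud Logging group to a new cluster.
# Requires an initialized yc profile; <log group ID> is a placeholder.
yc dataproc cluster create <cluster name> \
  ... \
  --log-group-id "<log group ID>"
```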

Viewing log entries

Management console
CLI
  1. Go to the folder page and select Data Proc.
  2. Click the name of the desired cluster and select the Logs tab.
  3. (Optional) Specify the output settings:
    • Message filter:

      • Getting the Data Proc job launch output:

        job_id="<job ID>"
        
      • Getting the stdout output for all YARN application containers:

        application_id="<YARN application ID>" AND yarn_log_type="stdout"
        
      • Getting the stderr output of a specific YARN container:

        container_id="<YARN container ID>" AND yarn_log_type="stderr"
        
      • Getting the YARN Resource Manager service logs from the cluster's master host:

        hostname="<FQDN of the master host>" AND log_type="hadoop-yarn-resourcemanager"
        
    • Logging levels: from TRACE to FATAL.

    • Number of messages per page.

    • Time interval (a preset or a custom one).
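The filter expressions above are plain strings, so they can be composed in a shell and reused with the CLI. A minimal sketch (the job ID below is a made-up placeholder):

```shell
# Hypothetical job ID, used for illustration only
JOB_ID="c9q1ab2cd3ef4gh5ij6k"
# Compose a filter for the job's stdout, mirroring the console filter syntax
FILTER="job_id=\"${JOB_ID}\" AND yarn_log_type=\"stdout\""
echo "${FILTER}"
```

The resulting string can be passed as-is to `yc logging read --filter "$FILTER"`.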

If you don't have the Yandex Cloud command line interface yet, install and initialize it.

View a description of the CLI command to get logs:

yc logging read --help

Examples:

  • To get logs of the Data Proc cluster's HDFS NameNode service, run the command:

    yc logging read \
      --group-id "<log group ID>" \
      --resource-ids "<cluster ID>" \
      --filter "log_type=hadoop-hdfs-namenode"
    
  • To get logs for the last two hours from all Data Proc clusters assigned to a specific log group, run the command:

    yc logging read \
      --group-id "<log group ID>" \
      --resource-types "dataproc.cluster" \
      --since 2h
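If only the raw messages are needed, the CLI's structured output can be post-processed. A sketch, assuming jq is installed and that entries in the JSON output of yc logging read carry a message field (IDs are placeholders):

```shell
# Print only log messages, dropping timestamps and metadata.
# Requires an initialized yc profile and jq; IDs are placeholders.
yc logging read \
  --group-id "<log group ID>" \
  --resource-ids "<cluster ID>" \
  --filter "log_type=hadoop-hdfs-namenode" \
  --format json | jq -r '.[].message'
```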
    

Disabling log sending

Management console
CLI

When creating or updating the cluster, add the dataproc:disable_cloud_logging property set to true.

If you don't have the Yandex Cloud command line interface yet, install and initialize it.

The folder specified in the CLI profile is used by default. You can specify a different folder using the --folder-name or --folder-id parameter.

When creating a cluster, pass an empty string ("") instead of a log group ID in the --log-group-id parameter; when updating a cluster, pass the dataproc:disable_cloud_logging=true value in the --property parameter:

yc dataproc cluster create <cluster name> \
   ... \
  --log-group-id=""
yc dataproc cluster update <cluster ID or name> \
  --property dataproc:disable_cloud_logging=true

Storing logs

Receiving and storing logs is charged according to the Yandex Cloud Logging pricing rules. To change the retention period or log access rules, edit the log group settings.
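For example, the retention period can be adjusted with the CLI. A sketch, assuming the --retention-period flag of the yc logging group command set; the group name is a placeholder:

```shell
# Keep logs for 3 days instead of the group's current setting.
# Requires an initialized yc profile; the group name is a placeholder.
yc logging group update default \
  --retention-period 72h
```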

Learn more about working with logs in the Yandex Cloud Logging documentation.
