Checking a job's logs

Find out how to get your job's logs while the job is running or after it is finished

Last updated 14th July, 2021

Objective

This guide will help you to check your job's logs while your job is running or after your job is finished.

To read an introduction about Data Processing service you can visit Data Processing Overview.

Requirements

  • Access to OVHcloud Manager
  • A job that has been already submitted to Data Processing

Get your running job's logs

When you launch a job with Data Processing, you may want to read your job's logs as it is running. There are three ways to get live logs:

  • Using the OVHcloud Manager
  • Using the OVHcloud API
  • Using the Data Processing CLI

With the OVHcloud Manager

To see your logs in the manager, you need to follow these steps:

  • Login to the OVHcloud Manager and select Public Cloud.
  • Select the project in which you submitted your job.
  • Select Data Processing from the left panel.
  • Click on the name of the job that you want to check the logs for, to open the job dashboard. (The job should be in RUNNING status.) Data Processing Engine
  • Click on the Logs tab in your job dashboard page.
  • If your job is still running, you will see its live logs in this tab. logs real time

Those logs will appear only if your job is running. Once your job has ended you will get a link to your Object Storage where your logs files are stored.

With the Data Processing CLI

If you are using the ovh-spark-submit CLI (see How to launch jobs through the CLI), then you do not have to take action.

The logs will appear in the standard output while your job is running.

Once the job has ended, the URL to the Object Storage container containing the job's logs will be displayed. This could be used to list your job's logs files through the OpenStack API. Thus, you will be able to upload them.

Please refer to the section below to know how.

If you use the OVHcloud Manager or the CLI, you may never see the last entries of your logs before the job stops. It is because the job has finished before the UI was updated. But don't worry, all the logs are uploaded to your Object Storage at the job's end.

With the OVHcloud API

An other way to read your job's logs is to use the OVH cloud API by calling the endpoint to GET a job's logs (see How to use the OVHcloud API)

To get the logs you have to use the GET on the /cloud/project/{serviceName}/dataProcessing/jobs/{jobId}/logs endpoint (where the service name is your Public Cloud project ID). This endpoint can take a query parameter which is from. This parameter allows you to specify the date from which you want to retrieve the logs. Its default value is 1970-01-01T00:00:00 UTC.

Whether you chose to set this start date or not, you will retrieve all the logs that came after in the limit of 10 000 characters.

Once the job has ended, the URL to the Object Storage container containing the job's logs will be displayed. This could be used to list your job's logs files through the OpenStack API. Thus, you will be able to upload them.

Please refer to the section below to know how.

When streaming logs you are limited to 10 000 characters at a time. Meaning you could experience missing logs in streaming mode (in the manager, in the CLI). All logs will be uploaded to your Object Storage at job end.

Download your logs files from Object Storage

Once your job is finished, its logs are uploaded to your Object Storage. While you can only retrieve the Spark driver node's logs when the job is running, you will have the logs from all the node (driver and executors) stored.

For each node, you will have at least 2 logs files:

  • one {jobId}/{nodeName}/odp.logs file which contains the logs generated by your job environment creation.
  • one or more {jobId}/{nodeName}/spark.log.yyyy-MM-ddThhhmmmss.sssssssss gathering all the logs generated by your Data Processing Job.

Some {jobId}/{nodeName}/spark.log.yyyy-MM-ddThhhmmmss.sssssssss files can be uploaded in Object Storage while your job is still running. This is due to the logs rotation that is configured to upload the files that reach the maximum logs files size of 100 MiB.

There is three ways to download your logs from your Object Storage:

  • with the OVHcloud Manager
  • with the OpenStack CLI
  • with the OpenStack API

With the OVHcloud Manager

To see your logs in the manager, you need to follow these steps:

  • Log in to the OVHcloud Manager and select Public Cloud.
  • Select the project in which you submitted your job.

From here, you can either go to your Object Storage section of your Public Cloud project and select the odp-logs container. Then filter the list of object with your job ID to get its logs. logs in object storage

Or you can go through your job dashboard instead, to do so:

  • Select Data Processing from the left panel.
  • Click on the name of the job that you want to download the logs for, to open the job dashboard. (The job should finished.)
  • Click on the Logs tab in your job dashboard page.
  • If your job is already finished, click on Download logs to download the output logs of your job from your Object Storage account.

logs finished

This button will lead you to the Object Storage container pre-filtered with the wanted job ID.

With the OpenStack CLI

You can access to your Object Storage by using the OpenStack CLI or the Swift CLI (Swift being the name of the OpenStack Object Storage).

Please follow the OpenStack documentation on installing the CLI and using the CLI.

In order to authenticate with the CLI you will have to set environment variables using an OpenStack RC file. You can find your RC file by following these steps:

  • Log in to the OVHcloud Manager and select Public Cloud.
  • Select the project in which you submitted your job.
  • Select Users & Roles from the left panel.
  • Click on the ... option button of your user and select Download OpenStack's RC file. get OpenStack's RC file

With the OpenStack API

In order to use the OpenStack API, you will need an OpenStack token. You can generate one in the OVHcloud manager by following these steps:

  • Log in to the OVHcloud Manager and select Public Cloud.
  • Select the project in which you submitted your job in the sidebar.
  • Select Users & Roles from the left panel.
  • Click on the ... option button of your user and select Generate an OpenStack token.
  • Enter your user password and retrieve your token.

With this token you should be able to list and to download the logs files of your jobs using the OpenStack API.

Go further

To learn more about using Data Processing and how to submit a job and process your data, we invite you to look at the Data Processing documentations page.

You can send your questions, suggestions or feedbacks in our community of users on https://community.ovh.com/en/ or on our Discord in the channel #dataprocessing-spark


Czy ten przewodnik był pomocny?

Zachęcamy do przesyłania sugestii, które pomogą nam ulepszyć naszą dokumentację.

Obrazy, zawartość, struktura - podziel się swoim pomysłem, my dołożymy wszelkich starań, aby wprowadzić ulepszenia.

Zgłoszenie przesłane za pomocą tego formularza nie zostanie obsłużone. Skorzystaj z formularza "Utwórz zgłoszenie" .

Dziękujemy. Twoja opinia jest dla nas bardzo cenna.


Inne przewodniki, które mogą Cię zainteresować...

OVHcloud Community

Dostęp do OVHcloud Community Przesyłaj pytania, zdobywaj informacje, publikuj treści i kontaktuj się z innymi użytkownikami OVHcloud Community.

Porozmawiaj ze społecznością OVHcloud

Zgodnie z Dyrektywą 2006/112/WE po zmianach, od dnia 1 stycznia 2015 r., ceny brutto mogą różnić się w zależności od kraju zameldowania klienta
(ceny brutto wyświetlane domyślnie zawierają stawkę podatku VAT na terenie Polski).