How to activate the Data Processing service for your cloud project

Find out how to activate the Data Processing service for your cloud project and what are its implications

Last updated 06th March, 2020

Objective

This guide explains how you can activate the OVHcloud Data Processing service on your cloud project. It will also explain what activating the service implies for you and your cloud project.

In this guide, we are assuming that you're using the OVHcloud Manager to use the Data Processing platform.

To read an introduction about Data Processing service you can visit Data Processing Overview.

Requirements

How to activate the Data Processing service

By default, the Data Processing service is not activated on any cloud project. You will need to go on the OVHcloud Manager and select the cloud project you would like to enable the service for first. Once done, if you access the Data Processing menu from the OVHcloud Manager, you will be greeted by a welcome page.

Data Processing Contract

To activate the service on the selected cloud project, click on the Start my first job button.

Activating the OVHcloud Data Processing service: What does it mean for you?

When activating the Data Processing service on one of your cloud projects, you are authorizing the OVHcloud Data Processing platform to have READ and WRITE access to the Object Storage associated with said cloud project. This is required to perform a restricted list of actions:

  • READ your cloud project's Object Storage:
    • We need to list all of your cloud project's Object Storage containers (in oder to make them selectable in the "Job Submit" form).
    • We need to read all files in the selected Object Storage container (in order to retrieve the application you want the Data Processing platform to execute).
  • READ your cloud project quotas.
  • WRITE to your cloud project's Object Storage:
    • We need to create a new container called "odp-logs".
    • We need to write your Data Processing job's output logs to this new container.

As the OVHcloud Data Processing service will continue to evolve and improve over time, the list of required permissions is subject to change.

To manage these permissions, the Data Processing service makes use of the Openstack Keystone mechanism called “Trust”. (you can learn more about it here https://wiki.openstack.org/wiki/Keystone/Trusts). Openstack Keystone Trusts are immutable: They can't be modified once they have been created. At this moment, the Data Processing platform can only create an Openstack Keystone Trust once for a given cloud project and can't delete it. This means that, for now, the only way to remove the Openstack Keystone Trust created by the Data Processing service is to delete the cloud project where it was applied.

We therefore recommend you to use a dedicated cloud project to minimize impacts if you decided to stop using the Data Processing service in the future. To learn more about cloud projects, please visit these pages:

Go further

To learn more about using Data Processing and how to submit a job and process your data, we invite you to look at Data Processing documentations page.

You can send your questions, suggestions or feedbacks in our community of users on https://community.ovh.com/en/ or on our Discord in the channel #dataprocessing-spark


Did you find this guide useful?

Please feel free to give any suggestions in order to improve this documentation.

Whether your feedback is about images, content, or structure, please share it, so that we can improve it together.

Your support requests will not be processed via this form. To do this, please use the "Create a ticket" form.

Thank you. Your feedback has been received.


These guides might also interest you...

OVHcloud Community

Access your community space. Ask questions, search for information, post content, and interact with other OVHcloud Community members.

Discuss with the OVHcloud community