Last updated 06th March, 2020
This guide explains how you can activate the OVHcloud Data Processing service on your cloud project. It will also explain what activating the service implies for you and your cloud project.
In this guide, we are assuming that you're using the OVHcloud Manager to use the Data Processing platform.
To read an introduction about Data Processing service you can visit Data Processing Overview.
- Access to OVHcloud Manager
- An OVHcloud account
- A cloud project in your OVHcloud account (see How to create a cloud project for details.)
How to activate the Data Processing service
By default, the Data Processing service is not activated on any cloud project. You will need to go on the OVHcloud Manager and select the cloud project you would like to enable the service for first. Once done, if you access the Data Processing menu from the OVHcloud Manager, you will be greeted by a welcome page.
To activate the service on the selected cloud project, you will have to read and agree to the terms and conditions from our Beta program by checking the related checkbox. The
submit my first processing job button will then become clickable, activating the service for your cloud project.
Activating the OVHcloud Data Processing service: What does it mean for you?
When activating the Data Processing service on one of your cloud projects, you are authorizing the OVHcloud Data Processing platform to have READ and WRITE access to the Object Storage associated with said cloud project. This is required to perform a restricted list of actions:
- READ your cloud project's Object Storage:
- We need to list all of your cloud project's Object Storage containers (in oder to make them selectable in the "Job Submit" form).
- We need to read all files in the selected Object Storage container (in order to retrieve the application you want the Data Processing platform to execute).
- READ your cloud project quotas.
- WRITE to your cloud project's Object Storage:
- We need to create a new container called "odp-logs".
- We need to write your Data Processing job's output logs to this new container.
As the OVHcloud Data Processing service will continue to evolve and improve over time, the list of required permissions is subject to change.
To manage these permissions, the Data Processing service makes use of the Openstack Keystone mechanism called “Trust”. (you can learn more about it here https://wiki.openstack.org/wiki/Keystone/Trusts). Openstack Keystone Trusts are immutable: They can't be modified once they have been created. At this moment, the Data Processing platform can only create an Openstack Keystone Trust once for a given cloud project and can't delete it. This means that, for now, the only way to remove the Openstack Keystone Trust created by the Data Processing service is to delete the cloud project where it was applied.
We therefore recommend you to use a dedicated cloud project to minimize impacts if you decided to stop using the Data Processing service in the future. To learn more about cloud projects, please visit these pages:
To learn more about using Data Processing and how to create cluster and process your data, we invite you to look at Data Processing documentations page.
These guides might also interest you...
How to submit a Python job on the Data Processing platform using the OVHcloud manager
How to fill the job submit form in the Data Processing page from the OVHcloud Manager