Data - Concept and best practices
Learn the concept behind AI Tools data
Last updated 11th April 2022
OVHcloud Object Storage should be used to persist any data needed by AI Training jobs or AI notebooks.
There are two ways to manage your data, one of which is the ovhai CLI. Next, find out how to manage your data with the CLI.

AI Training jobs and AI Notebooks can read data from and write data to the OVHcloud Object Storage. Here is what happens under the hood.
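For an AI Training job, data is synchronized from the Object Storage when the job starts, in the INITIALIZING phase. The volumes stay available for as long as the RUNNING phase lasts. Data is synchronized back to the Object Storage during the FINALIZING phase.

For an AI Notebook, the same steps take place in the STARTING phase, for as long as the RUNNING phase lasts, and in the STOPPING phase.

Users can grant 3 different access rights on filesystem volumes attached to jobs and notebooks: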
Read-only (RO): you are only able to read data in your job or notebook. It is forbidden to write, modify or delete data in this container.

When to attach a volume in read-only (RO)?
The purpose of read-only permissions is to ensure that you do not modify your data by mistake. This permission is recommended for your input data: images, videos, sounds, text or csv files, etc.
Volumes in read-only mode are not synchronized with the Object Storage during the FINALIZING or STOPPING phase, because there is no point in synchronizing data that has not changed during the life of a job or notebook.
If you have added data directly in the Object Storage to a container attached in RO, you can request an intermediate synchronization. Otherwise, your data will be synchronized at the next start of your job or notebook.
Read-write (RW): you have full access to your data in your job or notebook. You can therefore write, modify or add data in this container.

When to attach a volume in read-write (RW)?
The purpose of the read-write permission is to be able to modify the contents of the object container. This permission is recommended if your container contains code (e.g. a Python file), a notebook or if you want to save a model or connection weights after your training.
Volumes in read-write mode do not allow you to delete data from your Object Storage. If you delete files from a job or notebook, they will still be present in your object container, even after synchronization.
If you need to delete data from your object container, you can mount your volume in RWD.
Read-write-delete (RWD): you have full access to your data in your job or notebook. You can therefore write, modify, add or delete data in this container.

When to attach a volume in read-write-delete (RWD)?
The purpose of the read-write-delete permission is to be able to modify or delete the contents of the object container. You can make changes to the files in your container (Python code, requirements file, ...) and, above all, delete data from it. The RWD permission allows you to permanently delete data you no longer need from your object container. It is particularly useful if you want to delete old code or connection weights, which can be heavy files.
After the synchronisation phase, your data deleted from your notebook or job will be permanently deleted in the Object Storage.
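The three permissions can be summarized with a small simulation. This is only a sketch of the behavior described above, not OVHcloud code: the function name `sync_back`, the dictionaries standing in for a container and a volume, and the file names are all hypothetical, and it assumes the object container itself is not changed while the job or notebook runs.

```python
from typing import Dict

# Hypothetical model: a container and a mounted volume are both represented
# as {file_name: content} dictionaries.
Store = Dict[str, str]


def sync_back(container: Store, volume: Store, permission: str) -> Store:
    """Return the container content after the FINALIZING or STOPPING phase.

    `container` is the object container before the job or notebook started,
    `volume` is the content of the mounted volume when it ends.
    """
    if permission not in {"RO", "RW", "RWD"}:
        raise ValueError(f"unknown permission: {permission}")

    result = dict(container)
    if permission == "RO":
        # Read-only volumes are not synchronized back.
        return result

    # RW and RWD: added and modified files are uploaded.
    result.update(volume)

    if permission == "RWD":
        # RWD only: files deleted in the job or notebook are also deleted
        # from the object container.
        for name in set(container) - set(volume):
            del result[name]
    return result


before = {"train.csv": "v1", "old_weights.pt": "w0"}
# In the job: train.csv modified, model.pt added, old_weights.pt deleted.
after = {"train.csv": "v2", "model.pt": "w1"}

for permission in ("RO", "RW", "RWD"):
    print(permission, sorted(sync_back(before, after, permission)))
```

With RW, `old_weights.pt` survives in the container even though it was deleted in the job. Only RWD removes it.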
For volumes in RW or RWD: as long as your job or notebook is in the FINALIZING or STOPPING state, the upload is still in progress. Once the state changes to INTERRUPTED or STOPPED, all the data has been uploaded to your Object Storage.
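A script that needs the uploaded data can wait for the state to change before reading from the Object Storage. The sketch below only illustrates that waiting logic. `get_state` is a hypothetical callable that you would have to implement yourself to return the current state of your job or notebook as a string; it is not part of any OVHcloud library.

```python
import time
from typing import Callable

# States named in this guide.
UPLOADING_STATES = {"FINALIZING", "STOPPING"}
UPLOADED_STATES = {"INTERRUPTED", "STOPPED"}


def upload_finished(state: str) -> bool:
    """True once the state indicates that all data was uploaded."""
    return state in UPLOADED_STATES


def wait_for_upload(
    get_state: Callable[[], str],
    poll_seconds: float = 5.0,
    timeout_seconds: float = 600.0,
) -> str:
    """Poll until the state is no longer FINALIZING or STOPPING.

    Call this after the job or notebook has been asked to stop.
    Returns the first state seen that is not an uploading state.
    """
    deadline = time.monotonic() + timeout_seconds
    while True:
        state = get_state()
        if state not in UPLOADING_STATES:
            return state
        if time.monotonic() >= deadline:
            raise TimeoutError(f"still in state {state} after {timeout_seconds}s")
        time.sleep(poll_seconds)


# Usage with a fake state source standing in for a real status lookup.
fake_states = iter(["FINALIZING", "FINALIZING", "INTERRUPTED"])
final_state = wait_for_upload(lambda: next(fake_states), poll_seconds=0)
print(final_state, upload_finished(final_state))
```

The function returns whatever state follows the upload phase, so the caller should still check it with `upload_finished` before relying on the data.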
By default, filesystem volumes are created and deleted on the fly for each job that needs one.
Users can enable a cache feature that allows jobs and notebooks to reuse available volumes instead of creating a new one each time. That feature serves several purposes.
Unused volume data is regularly deleted. Users should not rely on this cache feature for long-term persistence.
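To illustrate why the cache is not a persistence mechanism, here is a toy model of volume reuse with periodic cleanup. It is purely an assumption for illustration: the class `VolumeCache`, the `max_idle_seconds` parameter and the cleanup rule are hypothetical and do not describe how OVHcloud actually implements or schedules the cache.

```python
import time
from typing import Dict, Optional, Tuple


class VolumeCache:
    """Toy model of cached volumes; not the actual OVHcloud implementation."""

    def __init__(self, max_idle_seconds: float) -> None:
        self.max_idle_seconds = max_idle_seconds
        # container name -> (volume data, time of last use)
        self._volumes: Dict[str, Tuple[dict, float]] = {}

    def purge(self, now: float) -> None:
        """Delete volumes that have not been used for too long."""
        stale = [
            name
            for name, (_, last_used) in self._volumes.items()
            if now - last_used > self.max_idle_seconds
        ]
        for name in stale:
            del self._volumes[name]

    def attach(self, container: str, now: Optional[float] = None) -> Tuple[dict, bool]:
        """Return (volume data, reused) for the given container."""
        now = time.monotonic() if now is None else now
        self.purge(now)
        entry = self._volumes.get(container)
        reused = entry is not None
        data = entry[0] if entry is not None else {}  # a new, empty volume otherwise
        self._volumes[container] = (data, now)
        return data, reused


cache = VolumeCache(max_idle_seconds=10)
_, first = cache.attach("my-dataset", now=0)     # no volume yet: created
_, second = cache.attach("my-dataset", now=5)    # recent volume: reused
_, third = cache.attach("my-dataset", now=100)   # idle too long: purged, created again
print(first, second, third)
```

In this model a volume is reused only while it is recent enough, which is why the Object Storage, not the cache, should hold anything you need to keep.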
You can check the OVHcloud documentation on how to create a data container.
You can check how to use your data in an AI Notebook.
You can check how to launch an AI Training job with attached volumes.
Please send us your questions, feedback and suggestions to improve the service:
Discuss with the OVHcloud community