Create Your Datasets
  • 21 Apr 2025
  • Dark
    Light
  • PDF

Create Your Datasets

  • Dark
    Light
  • PDF

Article summary

Create Datasets in Dataloop

The Dataloop storage is the internal dataset storage of Dataloop platform. Internal file storage allows you to store digital files, such as images, videos, audios, text files, and other data for annotation process.

  1. Log in to the Dataloop platform.
  2. Select Data from the left-side panel.
  3. In the Datasets tab, click Create Dataset, or click on the down-arrow and select Create Dataset from the list. The Data Management Resource Creation right-side panel is displayed.
  4. Dataset Name: Enter a Name for the dataset.
  5. Recipe (Optional): Select a recipe from the list.
  6. Provider: Ensure that, by default, Dataloop is selected. If not, select the Dataloop option from the list.
  7. Click Create Dataset. The new dataset will be created.

Create Datasets from Your Cloud Storage

Cloud storage services are online platforms that allow organization to store and manage their data. Dataloop supports the following cloud storage services:

  • Amazon Web Services (AWS) S3: Amazon S3 (Simple Storage Service) is a highly scalable, object storage service offered by AWS.
  • Microsoft Azure Blob Storage: Microsoft Azure provides Blob Storage for storing and managing unstructured data. It integrates well with other Azure services.
  • Google Cloud Storage: Google Cloud Storage is part of the Google Cloud Platform and offers object storage, archival storage, and data transfer services. It's often used alongside other GCP services.
Prerequisites

To create a Dataset based on external cloud storage, the process requires:

  1. Create a Storage-Driver to connect to the cloud-storage resource. For more information, see the Storage Driver Overview​.
  2. Create an integration. For more information, see the Integration Overview​.
  1. Log in to the Dataloop platform.
  2. Select Data from the left-side panel.
  3. Select the Datasets tab, if it is not selected by default.
  4. Click Create Dataset, or click on the down-arrow and select Create Dataset from the list. The Data Management Resource Creation right-side panel is displayed.
  5. Dataset Name: Enter a Name for the dataset.
  6. Recipe (Optional): Select a recipe from the list.
  7. Provider: Select one of the following external provider from the list, by default Dataloop is selected:
    1. AWS
    2. GCP
    3. Azure
  8. Storage Driver: Select a Storage Driver from the list. If not available, create a new Storage Driver​.
  9. Click Create Dataset.