Create Your Datasets
- 21 Apr 2025
Create Datasets in Dataloop
Dataloop storage is the Dataloop platform's internal dataset storage. It lets you store digital files, such as images, videos, audio, text files, and other data, for the annotation process.
- Log in to the Dataloop platform.
- Select Data from the left-side panel.
- In the Datasets tab, click the create button, or click the down-arrow and select Create Dataset from the list. The Data Management Resource Creation right-side panel is displayed.
- Dataset Name: Enter a Name for the dataset.
- Recipe (Optional): Select a recipe from the list.
- Provider: Ensure that Dataloop is selected (it is the default). If it is not, select Dataloop from the list.
- Click the button that confirms creation. The new dataset is created.
Create Datasets from Your Cloud Storage
Cloud storage services are online platforms that allow organizations to store and manage their data. Dataloop supports the following cloud storage services:
- Amazon Web Services (AWS) S3: Amazon S3 (Simple Storage Service) is a highly scalable, object storage service offered by AWS.
- Microsoft Azure Blob Storage: Microsoft Azure provides Blob Storage for storing and managing unstructured data. It integrates well with other Azure services.
- Google Cloud Storage: Google Cloud Storage is part of the Google Cloud Platform and offers object storage, archival storage, and data transfer services. It's often used alongside other GCP services.
Prerequisites
Creating a dataset based on external cloud storage requires the following:
- Create a storage driver to connect to the cloud storage resource. For more information, see the Storage Driver Overview.
- Create an integration. For more information, see the Integration Overview.

- Log in to the Dataloop platform.
- Select Data from the left-side panel.
- Select the Datasets tab, if it is not selected by default.
- Click the create button, or click the down-arrow and select Create Dataset from the list. The Data Management Resource Creation right-side panel is displayed.
- Dataset Name: Enter a Name for the dataset.
- Recipe (Optional): Select a recipe from the list.
- Provider: Select one of the following external providers from the list (Dataloop is selected by default):
- AWS
- GCP
- Azure
- Storage Driver: Select a Storage Driver from the list. If not available, create a new Storage Driver.
- Click the button that confirms creation.
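The difference between the two procedures comes down to which fields must be filled in. The following Python sketch models the creation panel's fields as a plain data structure and checks them. It is illustrative only and does not call Dataloop: the `DatasetForm` class, its field names, and the `problems` method are hypothetical, and treating an empty dataset name as an error is an assumption.

```python
from dataclasses import dataclass
from typing import List, Optional

# Provider names as listed in the panel; "Dataloop" is the internal default.
INTERNAL_PROVIDER = "Dataloop"
EXTERNAL_PROVIDERS = {"AWS", "GCP", "Azure"}


@dataclass
class DatasetForm:
    """Hypothetical model of the creation panel's fields (not a Dataloop API)."""

    dataset_name: str
    recipe: Optional[str] = None          # optional in the panel
    provider: str = INTERNAL_PROVIDER     # the panel's default selection
    storage_driver: Optional[str] = None  # only relevant for external providers

    def problems(self) -> List[str]:
        """Return the reasons the form is incomplete (empty list if ready)."""
        issues = []
        if not self.dataset_name.strip():
            # Assumption: a blank name is rejected.
            issues.append("Dataset Name is required")
        if self.provider not in EXTERNAL_PROVIDERS | {INTERNAL_PROVIDER}:
            issues.append(f"Unknown provider: {self.provider}")
        elif self.provider in EXTERNAL_PROVIDERS and not self.storage_driver:
            issues.append("Storage Driver is required for an external provider")
        return issues


# Internal storage: only a name is needed, the provider stays at its default.
internal = DatasetForm(dataset_name="my-images")
# External storage: choosing AWS without a storage driver leaves the form incomplete.
external = DatasetForm(dataset_name="s3-images", provider="AWS")

print(internal.problems())  # []
print(external.problems())  # ['Storage Driver is required for an external provider']
```

With an internal dataset, the name alone is enough. With AWS, GCP, or Azure, the storage driver from the prerequisites must exist before the form can be completed.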