1.3.8

Create Pipelines

26 Jul 2025

Print
Dark
Light
PDF

Create Pipelines

Updated On 26 Jul 2025

Print
Dark
Light
PDF

Article summary

Did you find this summary helpful?

Thank you for your feedback

Dataloop enables you to create pipelines either from scratch or using a template. You can choose from your organization's templates, Dataloop’s marketplace templates, or build one entirely on your own, providing both flexibility and efficiency in developing and deploying data processing and machine learning workflows.

This article guides you through creating pipelines on the Dataloop platform.

Using Templates

Create Pipeline Using Templates: The Marketplace is a comprehensive repository that contains a wide variety of pre-defined pipeline templates. These templates are designed to cover common use cases and scenarios across different industries and applications, providing a quick and efficient way to get started with pipeline creation. Here’s how it works:

Open the Pipelines page from the left-side menu.
Click Create Pipeline and select the Use a Template from the list. It opens the Select Pipeline Template popup window.
Select a template from the list.
Click Create Pipeline. The selected pipeline template will be displayed, and you can configure the available nodes as needed.

Create Your Pipeline

Creating pipelines from scratch is an approach suited for users with specific requirements that cannot be fully met by existing templates, or for those who prefer to have complete control over every aspect of their pipeline design. Here’s how it works:

Creating a pipeline in the Dataloop platform and activating it involves many steps, such as:

Create a Pipeline.
Place the nodes.
Verify the Starting Node
Configure Pipeline
Start the Pipeline
Trigger the Pipeline

1. Create a Pipeline

Open the Pipelines page from the left-side menu.
Click Create Pipeline and select the Start from Scratch from the list. It opens the Set Pipeline Name popup window.
Enter a name for the new pipeline.
Click Create Pipeline. The Pipeline window is displayed, and you can configure the available nodes as needed.

2. Place the Nodes

To compose a pipeline, drag and drop nodes onto the canvas and connect them by dragging the output port of one node to the input port of the next node.
Clicking on a node output port and releasing it will create an instant connection with the closest input port available.

Canvas Navigation

Left-click and hold on to any node to be able to drag it around the canvas.
Right-click and hold on to the canvas to be able to drag the entire canvas.

3. Verify the Pipeline Starting Node

The starting icon will appear next to the first node you place on the canvas. This icon can be dragged and placed on any node to mark it as the starting point of the pipeline.

When triggering data into a pipeline (for example, from the dataset-browser), the data enters the pipeline at the node set with the starting point.

4. Configure the Nodes

Configure the node inputs such as Fixed Value or Variables, Set Functions, Create Labeling Tasks, and Set Triggering Data in nodes, etc.

Refer to the following node category list and configure the nodes according to the type:
Learn more about the Node Inputs
Learn more about the Triggering Data

5. Activate the Pipeline

To activate your pipeline, click Publish in the pipeline page or the play button from the project's Pipeline page.

If you are unable to click Publish, or that the installation process has failed, it might be due to configuration issues of your pipeline nodes or errors in the pipeline composition:

To monitor node configuration issues, hover over the warning/errors icons on the nodes to see what issues need to be resolved. Resolve the issue, and the warning/error icon should disappear.

To monitor installation errors, click on the Error tab in the pipeline’s information panel on the right and check the error messages.

6. Trigger the Pipeline

Once you start the pipeline, you can trigger the pipeline by invoking data to pipeline by using Automatic, Manual, and via SDK invocation.

Learn more

Create Custom Applications

Dataloop allows you to create custom node applications using your external docker images.

1. Create a Pipeline

Open the Pipelines page from the left-side menu.
Click Create Pipeline and select the Start from Scratch from the list. It opens the Set Pipeline Name popup window.
Enter a name for the new pipeline.
Click Create Pipeline. The Pipeline window is displayed, and you can configure the available nodes as needed.

2. Place the Nodes

Select the starting nodes from the Node Library and drag them to the Canvas.
Add a Code node in the canvas and make the required updates.
Complete the connections.

Learn more about the pipeline nodes.

3. Customize the Code Node Using Your Docker Image

Prerequisites

To integrate your customized private container registry, you must create container registry integrations and secrets.

Learn how to Add Integrations
Learn how to Add Secrets
Learn how to Integrate Container Registries

Select the Code node and click Actions.
Select Edit Service Settings from the list.
Add Secrets and Integrations if your docker image is private. Ignore, if public.
Click Edit Configuration.
In the Docker Image field, enter your docker image URL.
Complete the remaining steps, if required.

4. Start the Pipeline

To activate your pipeline, click Publish in the pipeline editor screen or the play button from the project's Pipeline page.

Create Model Applications

Dataloop allows the installations for AI/ML models by allowing them to be hosted and executed on:

Dataloop's Managed Compute (internal infrastructure): The Models run on the Dataloop's Compute.
External Compute Providers (e.g., OpenAI, Azure, GCP, IBM, NVIDIA) via API Service Integration: The Models run on external provider's compute, and which requires secret credentials to complete the installation.

1. Create a Pipeline

Open the Pipelines page from the left-side menu.
Click Create Pipeline and select the Start from Scratch from the list. It opens the Set Pipeline Name popup window.
Enter a name for the new pipeline.
Click Create Pipeline. The Pipeline window is displayed, and you can configure the available nodes as needed.

2. Place the Nodes

Dataloop allows you to install Models and Applications running on Dataloop Compute and External Compute Providers (e.g., OpenAI, Azure, GCP, IBM, NVIDIA) via API Service Integration.

Select the required nodes (except model application) from the Node Library and drag them to the Canvas.
Click on the + icon next to the Node Library.
Select a model / application from the Models / Applications tab and click Add Node.
Select the model variations / application function from the list:
1. If the model / application is running on an external compute, click Proceed.
  1. Select an API Key, Secret or an Integration, as required. If not available,
    1. If there is no secret, click Add New Secret and follow the steps.
    2. To set the integration later, click Set Up Later.
    3. Click Install Model / Install Application
2. Else, click Install Model / Install Application pop-up window is displayed, click Install Model / Install Application. Click on the View Model to view it under the Model Management → Versions tab.
Go back to the Pipeline and view the newly added node under the Models' category in the left-side node library.
Drag it to the canvas and make the required configurations.
Complete the connections.

Learn more about the pipeline nodes.

3. Activate the Pipeline

To activate your pipeline, click Publish or the play button from the project's Pipeline page.

Create Pipeline Using SDK

Learn more about creating pipelines via SDK.

What's Next

Manage Pipelines

Table Of Contents

Using Templates
Create Your Pipeline
Create Custom Applications
Create Model Applications
Create Pipeline Using SDK