Model Nodes

15 Aug 2024

Pipeline automation involves creating an end-to-end automated workflow that includes data ingestion, preprocessing, model training, evaluation, and prediction. The Model Nodes are essential components in this automated pipeline. Here’s a detailed look at how each Model Node functions within pipeline automation.

Mandatory Secrets & Integrations

If an Integration and Secret have not been configured yet, a warning notification prompts you to set them.

Predict Model

A typical automated pipeline begins with the Predict node, which uses a pre-trained model to make predictions on unlabeled data. These predictions help identify samples that need human annotation. Unlike FaaS, the pipeline handles compute settings and resources for you.

The Predict Model node in Dataloop Pipelines is a valuable feature that lets you use pre-trained, trained, or deployed models such as SAM, YOLO, or ResNet, or your own custom models, for data annotation. This speeds up the annotation process and saves time and resources by automating annotation and producing pre-annotated data for human review and deployment.

Additionally, the node improves model performance through continuous learning by adding predicted annotations to the ground truth dataset. It can also be used for preparing data for honeypot or qualification tasks, customizing datasets for high-quality annotation work.
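
If you prefer to run the same prediction flow through the Dataloop Python SDK (dtlpy) rather than a pipeline node, a minimal sketch is shown below. The project, dataset, model, and file names are placeholders, and it assumes the model is already deployed and accessible to you.

```python
import dtlpy as dl

# Placeholder names; replace with entities from your own project
project = dl.projects.get(project_name='my-project')
dataset = project.datasets.get(dataset_name='unlabeled-data')
model = project.models.get(model_name='yolov8')

# Pick an unlabeled item and run prediction on it; the resulting
# annotations are uploaded as pre-annotations for human review
item = dataset.items.get(filepath='/images/sample.jpg')
execution = model.predict(item_ids=[item.id])
execution.wait()  # optionally block until the prediction execution finishes
```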

Details

When you click on a Predict Model node, its details, such as Configuration, Executions, Logs, Instances, and available Actions are shown on the right-side panel.

For the actions available on each node in the right-side panel, see the Pipeline Node Actions.

The Predict Model node details are presented in four tabs as follows:

Config Tab

  • Node Name: By default, Predict Model is displayed as the name. You can change it as needed.
  • Set Fixed Model or Set Variable: Allows you to set the selected model as a fixed model for prediction, or to set a Pipeline Variable instead (see Step 3).
  • Model: Select a model for prediction. You can select a specific trained, pre-trained, or deployed fixed model version from the list or use a variable that can be updated during the pipeline execution.
  • Node Input: Input channels are set to Type: Item by default. Click Set Parameter to set an input parameter for the Predict Model node. For more information, see the Node Inputs article.
  • Node Output: Output channels are set to be of type: Annotation by default.
  • Trigger (Optional): An Event/Cron trigger can be set on this node, enabling you to initiate the pipeline run from this specific point.

To use the Predict node, connect it to another node that accepts one of the following inputs:

  1. Item & annotation[]: Each predict node execution will initiate one execution of this node.
  2. Item & annotation: Each predict node execution will initiate N executions of this node, according to the number of items in the annotation list. Each execution will run with an item-annotation pair.
  3. Item/Annotation: Each predict node execution will initiate one execution of this node.
  4. Annotation[]: Each predict node execution will initiate one execution of this node.

For information on the Executions and Logs tabs, see the Node Details article.

Generate Model Node

The Generate Model node incorporates a generative AI model into your pipeline. The input is a prompt item that the model uses to generate a response, with the generated result returned as an annotation. You can easily adjust the model's configuration directly from the node settings, including options like system prompt, max tokens, temperature, and more.

Details

When you click on the Generate node, its details, such as Configuration, Executions, Logs, Instances, and available Actions are shown on the right-side panel.

The Generate model node details are presented in four tabs as follows:

Config Tab:

  • Node Name: By default, Generate is displayed as the name. You can change it if required.
  • Model: Select a Generate model from the list, either Set a fixed model or Set a variable (Type: Model) that can be updated dynamically during pipeline execution. If no model is available, click the Install Foundation Model link to install one from the Marketplace. You can also click the model or application name to view more information.
  • Model Configuration: You can easily adjust the model's configuration directly from the node settings (or via the SDK, as sketched after this list), including options like:
    • System prompt: The system prompt provides context or sets the tone for the model's responses. It acts as an initial instruction or guidance that influences how the model generates its output. Click Edit to make changes in the System Prompt.
    • Max tokens: It controls the maximum number of tokens (words or word pieces) the model can generate in response to a given prompt. By default, 1024 is displayed.
    • Temperature: It controls the randomness of the model's output. It influences how creative or conservative the generated text will be. The default temperature is set to 0.5, which strikes a balance between deterministic and random output. A lower temperature (closer to 0) makes the model more focused and deterministic, often generating more predictable responses. A higher temperature (closer to 1) increases randomness, leading to more diverse and creative responses.
    • Additional Parameters: Click Edit to make changes in the model configuration parameters.
  • Input: By default, Type: Item is selected.
  • Output: By default, Type: Item and Annotations are available.
  • Trigger (Optional): An Event/Cron trigger can be set on this node, enabling you to initiate the pipeline run from this specific point.
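
The same configuration values can also be read and updated through the SDK. The sketch below assumes a generative model is available in your project; the model name and the configuration keys (system_prompt, max_tokens, temperature) are illustrative and depend on the specific model adapter.

```python
import dtlpy as dl

project = dl.projects.get(project_name='my-project')
model = project.models.get(model_name='my-generate-model')  # placeholder name

# model.configuration is a plain dict; the keys below are adapter-specific
model.configuration['system_prompt'] = 'You are a concise technical assistant.'
model.configuration['max_tokens'] = 1024
model.configuration['temperature'] = 0.5
model.update()  # persist the updated configuration
```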

For information on the Executions, Logs, and Instances tabs, see the Node Details article.

For the actions available on each node in the right-side panel, see the Pipeline Node Actions.


Train Model

The Train Model node trains a newly created model version, optimizing its parameters on labeled data. You can include this AI model training node in your pipeline, using an untrained model created from a dataset as input. After training is complete, the trained model seamlessly continues to operate in the subsequent stages of the pipeline. The Train Model node allows you to trigger the train() function for any model that has a created status and is accessible to you.
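
As a rough SDK equivalent, the sketch below triggers train() on a model in created status and waits for the training execution to finish. The project and model names are placeholders.

```python
import dtlpy as dl

project = dl.projects.get(project_name='my-project')
model = project.models.get(model_name='yolov8-finetune-v1')  # a model in 'created' status

# Launch training; this returns an execution that runs on the training service
execution = model.train()
execution.wait()  # optionally block until training completes

# Refresh the entity to pick up the new status (expected: 'trained' on success)
model = project.models.get(model_id=model.id)
print(model.status)
```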

Details

When you click on a Train Model node, its details, such as Configuration, Executions, Logs, and available Actions are shown on the right-side panel.

Set an Integration

A notification with a link to set an integration will appear if no integration is set in the node.

The Train Model node details are presented in the following tabs:

Config Tab:

  • Node Name: By default, Train Model is displayed as the name. You can change it as needed.
  • Model Application: Allows you to select an installed trainable model for the training process. If no model is available, click the Install Foundation Model link to install a trainable model from the Marketplace. Once the model is installed, select it from the list.
  • Input: By default, Type: Model is selected. The input for the node is a model that you want to train.
  • Output: By default, Type: Model is selected. The output of the node is the trained model. The status will be updated to Trained.
  • Trigger (Optional): An Event/Cron trigger can be set on this node, enabling you to initiate the pipeline run from this specific point.

For information on the Executions and Logs tabs, see the Node Details article.

For the actions available on each node in the right-side panel, see the Pipeline Node Actions.

Evaluate Model

The Evaluate Model node creates predictions for a test dataset using the newly trained model and compares them against the ground truth annotations. The output is the same model used to generate the predictions. During execution, scores are uploaded to the platform so they are available further down the pipeline (e.g., for the Compare Models node). In this node, the performance of the trained model is evaluated using predefined evaluation metrics or custom evaluation logic.
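
For reference, a rough SDK sketch of the same step is shown below. It assumes model.evaluate accepts a dataset ID and a DQL filter as shown; the entity names and the subset tag used in the filter are placeholders.

```python
import dtlpy as dl

project = dl.projects.get(project_name='my-project')
model = project.models.get(model_name='yolov8-finetune-v1')  # trained model
test_dataset = project.datasets.get(dataset_name='ground-truth')

# Restrict evaluation to items tagged as the test subset (placeholder tag path)
filters = dl.Filters(field='metadata.system.tags.test', values=True)

# Predict on the filtered items, compare against ground truth, and upload scores
model.evaluate(dataset_id=test_dataset.id, filters=filters)
```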

Details

When you click on an Evaluate Model node, its details, such as Configuration, Executions, Logs, and available Actions are shown on the right-side panel.

For the actions available on each node in the right-side panel, see the Pipeline Node Actions.

The Evaluate Model node details are presented in the following tabs:

Config Tab

  • Node Name: By default, Evaluate Model is displayed as the name. You can change it as needed.
  • Model Application: Allows you to select an installed model for the evaluation process. If no model is available, click the Install Foundation Model link to install a model from the Marketplace. Once the model is installed, select it from the list.
  • Node Input: By default, the following are selected:
    • Type: Model: The trained model to evaluate.
    • Type: Dataset: The dataset used to evaluate the trained model, for example, the ground truth dataset in an active learning pipeline.
    • Type: JSON (filters): The DQL filter that selects the items used for the evaluation.
  • Node Output: By default, the following are selected:
    • Type: Model: The model after evaluation.
    • Type: Dataset: The dataset on which the trained model was evaluated.
  • Trigger (Optional): An Event/Cron trigger can be set on this node, enabling you to initiate the pipeline run from this specific point.

For information on the Executions and Logs tabs, see the Node Details article.


Embeddings Model

The Embeddings node allows you to integrate any embedding model into your pipeline, either directly from the Dataloop Marketplace or using your own custom model. It extracts embeddings from your data for tasks such as similarity search, clustering, data curation, RAG pipelines, and more.

Details

When you click on the Embeddings node, its details, such as Configuration, Executions, Logs, Instances, and available Actions are shown on the right-side panel.

The Embeddings model node details are presented in four tabs as follows:

Config Tab:

  • Node Name: By default, Embeddings is displayed as the name. You can change it if required.
  • Model: Select an Embedding model from the list (it shows all model entities available in the project with an 'embed'/'embed_items' function in the dpk.module), either Set a fixed model or Set a variable (Type: Model) that can be updated dynamically during pipeline execution. If no model is available, click the Install Foundation Model link to install one from the Marketplace. You can also click the model or application name to view more information.
  • Input: By default, Type: Item is selected.
  • Output: By default, Type: Item and JSON are available.
  • Trigger (Optional): An Event/Cron trigger can be set on this node, enabling you to initiate the pipeline run from this specific point.

For information on the Executions, Logs, and Instances tabs, see the Node Details article.
For the actions available on each node in the right-side panel, see the Pipeline Node Actions.


Compare Models

The 'Compare Models' node compares two trained model versions based on their evaluation results on the same test set, or on the metrics generated during the training process.

This node requires the following mandatory inputs: a comparison configuration (in JSON format) and two models for comparison, one designated as the Previous model and the other as the New model. The New model is tested, and if it demonstrates superior performance, it is output as 'Update model,' indicating it is ready for deployment. If not, it is labeled 'Discard.' For more details, refer to our Active Learning documentation.

The parameters that need defining are:

  • Node Inputs:
    • previous_model: By default, Type: Model is selected.
    • new_model: By default, Type: Model is selected.
    • compare_config: The configuration for the comparison (JSON). By default, Type: JSON is selected.
    • dataset: By default, Type: Dataset is selected.
  • Output:
    • winning_model: By default, Type: Model is selected. Available statuses are Update Model and Discard.
  • Trigger (Optional): An Event/Cron trigger can be set on this node, enabling you to initiate the pipeline run from this specific point.

Create New Model

The 'Create New Model' node allows you to generate a new model version by cloning an existing model, making it ready for fine-tuning.

This node requires the following inputs: a base model to clone (limited to trained or deployed models), model configurations (in JSON format), and datasets along with subsets (DQL filters) for training and validation. Inputs can be provided through parameters, either as fixed values or dynamic variables, or via node connections.

Upon execution, the node will produce the new model as output. For more details, refer to our Active Learning documentation.
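
For reference, the SDK sketch below clones a base model with a new configuration and train/validation subsets. It assumes clone() accepts the train_filter and validation_filter arguments shown; the entity names, configuration keys, and tag paths are placeholders.

```python
import dtlpy as dl

project = dl.projects.get(project_name='my-project')
base_model = project.models.get(model_name='pretrained-yolov8')
dataset = project.datasets.get(dataset_name='ground-truth')

# DQL filters defining the training and validation subsets (placeholder tag paths)
train_filter = dl.Filters(field='metadata.system.tags.train', values=True)
validation_filter = dl.Filters(field='metadata.system.tags.validation', values=True)

new_model = base_model.clone(
    model_name='yolov8-finetune-v2',
    dataset=dataset,
    configuration={'epochs': 25, 'batch_size': 4},  # adapter-specific keys, illustrative only
    train_filter=train_filter,
    validation_filter=validation_filter,
)
print(new_model.status)  # expected: 'created', ready for the Train Model node
```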

The parameters that need defining are:

  • Node Inputs:
    • base_model: The model to clone.
    • dataset: The dataset to train on.
    • train_subset: The DQL query for the subset of training items (JSON).
    • validation_subset: The DQL query for the subset of validation items (JSON).
    • model_configuration: The model configuration to use for training (JSON).
  • Output:
    • new_model: The newly created model.
    • base_model: The base model that was cloned.
  • Trigger (Optional): An Event/Cron trigger can be set on this node, enabling you to initiate the pipeline run from this specific point.

Model Data Split

The 'Model Data Split' node is a powerful data processing tool designed to streamline the process of dividing your data into subsets during runtime. This node allows you to easily segment your ground truth data into training, validation, and test sets, making data preparation more efficient.

You can define the desired distribution of these subsets, and the Data Split node will automatically assign each item to its respective subset by applying metadata tags, ensuring a smooth and organized data splitting process.
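
Once the split has run, the subset assignment can be queried with a DQL filter on the item metadata. The sketch below assumes the node writes a tag such as metadata.system.tags.train; the exact tag path and the entity names are placeholders.

```python
import dtlpy as dl

project = dl.projects.get(project_name='my-project')
dataset = project.datasets.get(dataset_name='ground-truth')

# Count the items that were tagged as the 'train' subset (placeholder tag path)
train_filter = dl.Filters(field='metadata.system.tags.train', values=True)
pages = dataset.items.list(filters=train_filter)
print(f'{pages.items_count} items assigned to the train subset')
```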

The parameters that need defining are:

  • Input: By default, Type: Item is selected.
  • Output: By default, Type: Item is selected.
  • Trigger (Optional): An Event/Cron trigger can be set on this node, enabling you to initiate the pipeline run from this specific point.

Dynamic Model Nodes

Dynamic Model Nodes are a versatile component within Dataloop pipelines that allows you to interact with machine learning models dynamically. These nodes provide an adaptable interface for selecting and performing the various operations supported by the model, such as predict, generate, or embed.

The flexibility of Dynamic Model Nodes lies in their ability to adapt to different tasks within your workflow. Whether you need to generate predictions from new data or extract embeddings for further analysis, these nodes make it easy to switch between functions without needing to reconfigure the entire model.

  • Predict Operation: This function is typically used for generating outputs based on new input data. For example, if your model is designed for image classification, the "Predict" operation would produce the most likely category or label for a given image.
  • Embed Operation: This function is used to create embeddings, which are compact, numerical representations of input data. These embeddings can be useful for various tasks, including clustering, similarity search, or as inputs to other models for more complex processing.
  • Generative AI Operation: The Generative AI Operation refers to the functionality within a machine learning model that is specifically designed to create new, original content rather than merely analyzing or predicting outcomes based on existing data.

Install Dynamic Model Nodes: You can add Dynamic Model Nodes by clicking the Plus icon on the left-side panel and installing them from the Marketplace model library. If needed, you can also remove them from the list at any time.


Dynamic Embeddings Nodes

Dynamic Embeddings Nodes are shortcuts in pipelines that provide a convenient way to interact with ML embedding models. They allow users to quickly choose and apply the different operations that the model supports, such as Predict or Embed.

When configuring these nodes, you will find an Operation field in the Node Configuration tab. This field allows you to select which operation you'd like the node to perform. If the ML model you're working with supports both Predict and Embed operations, the Predict operation will be selected by default. However, you can manually switch to the Embed operation if that better suits your needs.

For the actions available on each node in the right-side panel, see the Pipeline Node Actions.


Dynamic Generative Model

Dynamic Generate Nodes are shortcuts in pipelines that provide a convenient way to interact with ML generative models. They allow users to quickly select and apply the different operations that the model supports, such as Generate or Embed.

When configuring these nodes, you will find an Operation field in the Node Configuration tab. This field allows you to select which operation you'd like the node to perform. If the ML model you're working with supports both Generate and Embed operations, the Generate operation will be selected by default. However, you can manually switch to the Embed operation if that better suits your needs.

For the actions available on each node in the right-side panel, see the Pipeline Node Actions.