- 11 Sep 2024
- Print
- DarkLight
- PDF
Platform Release Notes
- Updated On 11 Sep 2024
- Print
- DarkLight
- PDF
August 15th, 2024
New Models Available in the Dataloop Marketplace
- Generative Models: Easily install generative models from top providers like Meta, Mistral, OpenAI, and more.
- Embedding Models: Install embedding models from the Dataloop Marketplace to extract features from your data, enabling applications like semantic search, data cleanup, and clustering analysis.
Easily integrate these models into your pipeline automations to create powerful workflows like Chatbot, Retrieval-Augmented Generation (RAG), RLHF, Content Summarization, and more.
Quickly Access and Install pre-built Datasets within Dataloop Marketplace
Whether you're new to Dataloop or exploring its diverse solutions, the 'Datasets' tab on the Dataloop Marketplace has everything you need. Browse and install datasets from top providers like Meta and Hugging Face directly into your project with just a few clicks.
Learn more
Enhanced Data Management Resource Creation Interface
We've enhanced the process of creating datasets and connecting them to your external storage.
Integrating your external storage with Dataloop is now simpler and more efficient.
You can now manage and create everything in one place:
- Cloud Integrations: Seamlessly connect with external storage providers (AWS, GCP, Azure).
- Storage Drivers: Easily configure and manage storage drivers.
- Datasets: Simplified dataset creation.
Labeling Tasks: Status Filtering
Tasks can now be easily filtered by their status, enabling quick viewing of tasks that are in progress or completed with just a few clicks.
Learn more
Annotation Studio: Status Log
The new Status Logs view in the annotation studio lets you to track and monitor your team's actions on data items. You can now easily follow task status changes, see who applied each status ('Completed,' 'Discard,' or custom status), and when it was done.
Task Analytics: Annotation Time Metric
A new metric, “Task Browser annotation time,” is now available on the Task Analytics page. It offers insights into labeling time in the Task Browser view (such as bulk classification), allowing you to monitor team efficiency and productivity.
You can easily find it under: Task Analytics > Performance Tab > User Stats.
CloudOps: Service Initialization Logs
Real-time progress updates are now provided during service initialization, reflecting the environment installation status in the service logs. This offers greater transparency and a smoother setup experience.
June 19th, 2024
Semantic Search with CLIP Model
Utilize the CLIP model to query your files using semantic search for both images and documents.
More embedding models can be found and installed on the Dataloop marketplace.
Learn more
Data Cleanup Tab
Use embeddings for similarity searches to clean your training datasets from overly similar items, whether they are documents, images, or other unstructured data. For images, also filter out those that are too dark, too bright, or blurred to maintain high data quality.
Learn More
Dataset Node Updates
Trigger pipeline over existing data
It is now possible to execute pipelines on existing data within a selected dataset when the pipeline is activated. This enhancement complements the ability to trigger pipelines on new data, offering increased flexibility.
Dataset Node Variables
Introducing the ability to set variables, enabling dynamic dataset selection within your pipeline using Pipeline Variable.
Learn More
Labeling Task and Workflow Templates
You can now create labeling tasks and use workflow templates to accelerate your labeling process. For instance, use the microtasks workflow to manage a series of small labeling tasks, enabling labelers to focus on specific tasks according to their expertise.
Learn more
Hiding Annotations Functionality Change
We've updated the keyboard shortcuts for hiding annotations to enhance performance and streamline the user experience. Now, use "J" to hide all annotations and "H" to hide/unhide selected annotations.
Learn More
Attributes Search
Users can now search for attribute values to streamline the annotation process, making it quicker and easier.
Learn More
Secret Management
We have improved the interface for secrets management. Continue to securely store and manage your sensitive information, such as API keys and database credentials, within Dataloop, ensuring your AI building process remains protected and efficient.
Learn More
May 1st, 2024
Video Performance Enhancements
Major performance changes were made to improve the user experience and allow for a smooth annotation flow, read more about it.
Labeling Tasks: Layout Deprecation
As recently announced, Labeling Task now features a unified page for managing tasks, assignments, and issues, replacing the old multipage view.
Please note that we no longer support the old layout at this time. To explore the new layout, on the navigation menu, click Labeling, read more about it.
Simplified File Export and Download
We've streamlined our data export and download processes for enhanced ease of use.
April 15th, 2024
Lidar Annotation Studio
We're excited to announce significant upgrades to our Lidar Annotation Studio, designed to boost performance and enrich your annotation capabilities.
This update includes:
- Timeline Feature: Effortlessly traverse your project's point cloud data (PCD) timeline. This feature not only allows you to visualize the start and end times of annotations but also provides a clearer insight into the scope and details of the annotations.
- Instance Segmentation: This annotation tool is designed for intricate segmentation of point clouds, allowing for a deeper level of detail and precision in your annotations.
- Polyline and Spline: This update introduces precise marking of linear elements, coupled with the ability to fine-tune their curvature.
Read more about LiDAR.
Task 2.0 in now the default task interface
We are excited to announce that the Task 2.0 interface is now the default tool for managing your tasks, designed to enhance your workflow and boost efficiency. The option to revert to the classic interface is available until May 1st. Simply click on Switch to Classic Labeling Tasks to make the switch.
We encourage you to explore the new interface and share your feedback!
Read more about Labeling Tasks 2.0.
Data Management
- New Clustering Tab: We're excited to introduce the Clustering Tab, a new feature designed to simplify the analysis of large data sets by using feature vector projections to organize similar items. This tool not only streamlines your data, but also highlights anomalies by identifying data points that stand out from the rest. Embrace enhanced insight and efficiency as you explore the hidden patterns and relationships within your data.
- New Insights Tab: The Insights Tab provides deep visibility into your annotations, offering features like an annotation location heat map, a histogram of annotation labels, and detailed attributes per label, among others. Accessing comprehensive information about your data with just a quick glance has never been simpler.
To experience the new insights and clustering tabs:
Go to the Marketplace > Applications > Install The Clustering App / Insights App > Go to one of your datasets> the new tabs now available, check them out!
Read more about Clustering tab and Insights tab.
Dataloop Marketplace
Filter by Trainability
In models tab, users can easily filter trainable model, to quickly get started with the model fine-tuning process.
Read more about Dataloop Marketplace.
Model Management
Possible to filter models in the model versions tab by model status, application, dataset, and creator for easy finding of models.
- Enhanced Model Filtering: Users can now seamlessly filter models within the Model Versions tab based on criteria such as model status, application, dataset, and creator, facilitating quick and efficient model identification.
- Use Data Tags to Simplify Model Training & Evaluation: Initiate model training and evaluation effortlessly by selecting data subsets tagged as Train, Test, or Validation from your dataset. This enhancement simplifies the process of initiating model training or evaluation, streamlining the workflow for users.
Read more about Model Management and Model Evaluation.
February 28th, 2024
Marketplace - Your ML applications marketplace
Marketplace revolutionizes the way developers of all skill levels and expertise build AI, providing them with models, applications, pipelines, and out-of-the-box end-to-end AI and GenAI workflows.
Pipelines
Expand your pipeline possibilities with our broad selection of new node types — the essential building blocks for ML projects. Install them via Marketplace and craft your pipelines effortlessly, using either built-in templates or starting from scratch with intuitive drag-and-drop features.
Models
Access and deploy models from popular & leading providers like Google, Microsoft, Nvidia, and more, enriching your project with cutting-edge AI capabilities.
Applications
Integrate FaaS applications with public and applications in your model development, streamlining workflows and simplifying project advancements.
To learn more about it, refer to the Marketplace article.
Evolving from Application Hub to CloudOps
The switch from the Applications page to CloudOps marks a significant upgrade in managing application services.
To learn more about it, refer to the CloudOps article.
Dataset Browser - Filtering by Task
The Tasks filter in the Dataset Browser introduces granular data sorting by Task Name and Task ID for advanced dataset management.
To learn more about it, refer to the Schema-Based Search article.
Pipelines
The new nodes panel makes it straightforward to discover and configure the wide variety of nodes available, supported by an organized layout and enhanced categorization.
In addition, new node types have been added to allow you to customize your pipeline exactly as you wish.
To learn more about it, refer to the Pipelines Nodes article.
Model Management - Simplified AI Deployment
- Foundation Models Installation: Foundation models are now easily installable on the Model Management page with just one click!
- Workflow Enhancements: Improved cloning, training, evaluating, and deploying models, especially with a more user-friendly cloning experience.
- Pre-trained Model Integration: New Pre-trained status enhances AI model utility across workflows, enabling better predictions, further training, easy deployment, and performance evaluation.
- Model Creation Update: Introduced a nuanced status flow for model creation and cloning, accurately reflecting each stage of a model's life cycle.
To learn more about it, refer to the Model Management article.
January 8th, 2024
Dataset Browser 2.1
The updated dataset browser features an enhanced interface and introduces new impactful functionalities. With Dataset Browser 2.1, you can elevate your browsing experience.
Smart Search with Dataloop
Explore the upgraded data filtering experience with Smart Search on the Dataloop data browser. Uncover an intelligent search feature that simplifies obtaining the precise data results you're looking for. Now, you can filter by different operators such as annotation labels, item media types, and more.
Click here to discover more about the enhanced functionalities of Smart Search.
Enhanced User Interface
Now, you can effortlessly access all the item information you need in one glance. Easily discover item metadata, basic information, and Function Executions. Find more about it in the Dataset Browser documentation.
Audio Annotation Studio - Transcription Finder
Introducing the Transcription Finder feature in the Audio Annotation Studio, designed to help users efficiently "find and replace" specific text within transcriptions. This tool supports annotators in ensuring data quality with ease.
Find more about it in the Audio Studio documentation.
Task Management 2.0 Enhancements
Continuing our commitment to an improved user experience, we've enhanced the Task Management interface. Now, effortlessly review issues and annotations marked "For Review." We've also streamlined access to the Task Browser and Task Analytics for a smoother user journey.
Find more about it on the Labeling Tasks documentation.
Rerun Pipeline Cycles
Addressing pipeline errors is now more efficient. Users can rerun cycles at the execution or cycle level, removing item status from tasks and facilitating reinsertion into assignments.
Find more about it on the Running and Monitoring pipelines documentation.
December 19th, 2023
Service Page
New tabs were added to the service page to streamline the process and make it easier for users to access useful features.
Now, users can analyze their service executions, logs, triggers, and instances conveniently through distinct tabs.
Find more about it in the Service Analytics documentation.
Data Governance
Exciting news! Our latest update introduces enhanced data governance, offering a centralized hub with dedicated tabs for Integrations and Secrets, providing a streamlined and provider-focused view for efficient data asset management and protection.
Try out the new Data Governance page by clicking Switch to V2.
Read more about it on the Data Governance documentation page.
Annotator Issue Creation in QA Task
Annotators can now create issues directly within the QA task, providing them with the ability to address concerns as needed.
For more information, see the Roles of Users in the QA Process article.
Pipeline Tasks
New UI for task node
The redesigned task node interface ensures a streamlined and organized view of task details, offering users instant access to all relevant information.
Edit tasks from the pipeline page when the pipeline is running
Introducing a new level of flexibility! Users can now edit pipeline tasks, even while the pipeline is running ensuring a dynamic workflow where any changes made seamlessly reflect in the ongoing pipeline process, guaranteeing an uninterrupted experience.
To learn more about editing a task, see the Annotation and QA tasks article.
Edit workload of pipeline Distribution tasks - New feature
You can now edit the workload item within the distribution of a pipeline task, providing more granular control over specific task details within the broader distribution process.
To learn more about editing the Workload Item, see the Edit the Workload Item article.
November 20th, 2023
My Projects
The My Projects user interface (UI) has been enhanced to include the ability to display additional information on project cards, the option to sort projects by name and creation date, and the introduction of a page refresh functionality.
For more information, see the My Projects article.
Task Management UI
- The Task Management page has undergone significant enhancements, with the introduction of tabs for Tasks, Assignments, and Issues, offering a more provider-centric perspective.
- Additionally, a detailed view of Tasks and Assignments has been incorporated into the new UI, featuring information such as Actions, General Information, Status, Metadata, and more in a right-side panel. This empowers users to access a wealth of information on a single page.
- Furthermore, the Search and Filter functionalities have been improved to provide more precise and effective search results.
For more information, see the Lebeling Tasks 2.0 article.
Pipeline Page and Nodes
- The Pipeline user interface (UI) has undergone a series of enhancements, offering a fresh perspective on the pipeline table list, buttons, menus, pipeline counters, popup messages, and search fields.
- The detailed view of Pipeline Nodes, including FaaS Node and Code Node, on the right-side panel has been enriched with new fields, information text, view and edit mode information, and node validation errors.
- The Action Menu feature for nodes on the right-side panel empowers users to perform various actions, such as opening datasets, accessing FaaS services, and viewing analytics, etc. according to the nodes.
- The Node Service Status feature allows users to monitor the current status of their nodes, including states like Active, Initializing, Inactive, or Failed.
For more information, see the Pipeline Overview and the Pipeline Nodes articles.
October 15th, 2023
New Features
VS Code Application
Introducing the new built-in VS Code for enhanced FaaS development. With its integrated code editor, you can easily view, edit, and debug code in the same environment where FaaS applications are built and deployed, instead of locally.
- For more information, see the Service Debugging article.
Pipeline: Evaluate Model Node
The new "Evaluate Model" node seamlessly integrates with your pipeline, allowing you to easily create model evaluations, fully automating the training process in conjunction with our "Train Model" and "Predict" nodes.
Assess model performance, compare predictions to ground truth, and access results effortlessly.
- For more information, see the Evaluate Model article.
Enhancements
New Data Management UI
- The Data Management page has been enhanced to show resources by tabs and enable a more provider-focused view.
- Users can now easily copy and paste the SDK script for the relevant resource that has been added to the Data Management page.
- The recipe feature has been moved out of the Data Management page.
For more information, see the Storage Drivers and Dataset topics.
September 20th, 2023
New Features
Logs of User Logins
User login logs are records that track and document when users access the Dataloop platform. It records the user login email, organization role, and timestamp. For more information, see the Logins topic.
Apple Quicktime Videos (MOV format) are Now Automatically Converted
Dataloop converts MOV video files to WebM format for frame-accurate annotations. For more information, see the Supported File Formats topic.
Redistribute items with Metadata (key & value) Numbers
Use numbers as metadata (key and value) to filter items while redistributing tasks. For more information, see the To redistribute a task topic.
Rerun Pipeline Cycles
Users can rerun pipeline cycles by selecting executions for rerun starting points or entire cycles for rerun to avoid regenerating the cycles again in the case of failure. Rerunning the cycle in the pipeline removes the item status from a task, so it will be reinserted into an assignment. For more information, see the Supported File Formats topic.
Enhancements
New Data Management UI
- The Data Management page has been enhanced to show resources by tabs and enable a more provider-focused view.
- Users can now easily copy and paste the SDK script for the relevant resource that has been added to the Data Management page.
- The recipe feature has been moved out of the Data Management page.
For more information, see the Storage Drivers and Dataset topics.
The Term "Contributors" is Now "Users"
The terminology Contributors is now replaced with the term Users, and the menu will indicate Team. There is no functionality change, only a change of terminology. For example,
- Menu term changed to Team
- The Contributors' title page becomes Project Team.
- Buttons and search boxes are now Users.
- Project contributors are now Project Users, etc.
For more information, see the Project Team topic.
August 9th, 2023
New Widget in Project Overview: Task Management
A task management widget is now available on the Project Overview page, where you can:
- View a list of available tasks.
- Create a task.
- Create a workflow via Pipeline.
For more information, see Project Dashboard.
Data Management
New Storage Integration Type: GCP Cross Project
From now on, Dataloop allows users with a GCP project to integrate their GCS bucket with the Dataloop platform and create datasets.
The GCP Cross Project integration stands as the best choice for GCP due to its significant advantages:
- Least Privilege Principle: By leveraging GCP Cross Project integration, fine-tuned IAM roles and permissions can be assigned, providing stringent access and reducing risk of third-party compromises.
- Rotational Capability: This integration allows the straightforward rotation of IAM roles, ensuring better security by refreshing third-party service credentials periodically.
- Better Audibility: GCP Cross Project integration aids in generating detailed action logs, facilitating more effective monitoring and detection of any unusual activity.
- Separation of Responsibilities: This feature enables precise control over GCP projects and task delegation to third parties without the need to expose credentials, thereby mitigating risk of misuse.
For more information, see the Cross Project Integration.
Integration Validation for all Supported Cloud Integrations
- All supported integration types will now be validated for actual resource creation on the customer's side.
- This does not impact existing storage integrations. Only applicable to new integrations.
- Supported on:
- GCP - Cross Project, Private Key
- Azure - Client Secret
- AWS - Cross Account, STS
For more information, see the Integration Validation for all Supported Cloud Integrations.
Supports Duplicate Naming Enforcement for New Integrations & Secrets
- Newly created integrations and secrets will now be enforced to be unique by name
- This does not impact existing integrations and secrets. Only applicable to new integrations and secrets.
For more information, see the Supports Duplicate Naming Enforcement for New Integrations & Secrets.
Data Management Deprecations:
Support for Elastic Container Registry (AWS)
Support for ECR in Dataloop platform Integrations has deprecated and no longer available. For more information, see Deprecate AWS ECR.
Legacy API Calls and Introducing Replacements
For more information about deprecated API calls and the replacements, see Deprecate annotations JSON and ZIP endpoints.
July 19th, 2023
Widget Management
You can now manage the widgets on the Project Overview page, enable or disable, and drag & drop widgets to rearrange. For more information, see Manage Widgets.
Annotation Studio
Video Keyframe Shortcuts
Keyframe setting keyboard shortcuts are now available for selected annotations, such as the annotation end-keyframe,
Annotation start-keyframe, and toggling occlusion on the current frame. For more information, see the Keyboard shortcuts.
Bulk attribute update for annotations from different labels
You can bulk select and update the attribute for annotations from different labels. For more information, see the Annotation Controls.
Remove item status before you edit
All users must require to remove the item status to edit the item. You have an Enable/disable force all users to remove status before editing annotations option in the Project settings to enable or disable this feature. For more information, see the Status Overlay On Completed Items.
Inactive user management
Inactive users are grayed out and are not available for redistribution, but are available for reassignment. You can view an inactive message next to the user who is no longer in the project. For more information, see the Inactive users.
Quality Tasks Scoring
Quality task scoring is available now for items, annotations, contributors, Avg. IOU, and Avg. Labels Consensus. When you activate a Consensus, Qualification, or Honeypot task while creating a task, a score function is enabled. All contributors, items, and annotations are scored via the function. The new Scores tab on the Analytics page gives you more information. For more information, see the Score Analytics.
Scores tab - New
The new Scores tab on the Analytics page is added to provide new information, such as:
- Number of contributors, items, and annotators.
- Average percentage of IoU and Label Consensus.
- Items consensus
- Annotations consensus
- Labels confusion matrix
- Contributors' confusion matrix
July 10th, 2023
Annotation Task - New UI
- A new UI has been introduced for the following Annotation task management features:
- Create or edit a task.
- Create a QA task for the labeling task.
- Add items to an existing task.
For more information, see Annotation and QA tasks.
Annotation Studio Filters
The filter in the annotation studio has been enhanced to include the following sections and updates:
- Annotation type: It displays annotations based on the annotation tool types, such as box, point, polygon, note, etc.
- Annotation status: It displays annotations based on the annotation status, such as no status, issue, for review, etc.
- Annotation properties: It displays annotations based on the annotation properties, such as object ID, description, etc.
- By default, all the filter are unchecked. When you select a filter, it displays only the annotations according to the selection you made.
For more information, see Annotations Filter.
Redistribute tasks by Metadata or DQL query
You can now redistribute items to the assignees based on the specific context of their workload. The items that do not meet the criteria will not be listed for redistribution and will remain with the current assignee. For more information, see Redistribute a Task or an Assignment of a Task.
Notification Center
Notification Channel - New
A new Notification Channel feature has been introduced in the Organization Overview page of the Dataloop platform to expand the notification capabilities by allowing users to connect additional channels, such as email, to receive notifications based on their notification subscription. For more information, see Notification Channels.
Notification Subscription
The notification subscription section in the User Settings dialog has been updated to include the option to unsubscribe from Task/Assignment (workflow) updates. For more information, see Notification Subscriptions.
June 7th, 2023
Filter Out Annotations in the annotation-studio
New recipe settings to filter annotations in the annotation-studio:
Task scope and Recipe scope.
Task scope only shows annotations created in the current task, while recipe scope only shows annotations that match the current recipe. These settings can help annotators and reviewers focus on their work and hide unnecessary annotations.
Qualification Tasks
Introducing qualification tasks in the annotation-studio to assess annotator skills. Create test tasks with hidden ground-truth answers and compare annotations to calculate scores for performance evaluation.
Learn more qualification tasks here.
Honeypot Tasks
Introducing honeypot tasks for ongoing monitoring of annotator performance. Embed known ground-truth items into tasks with raw data to estimate annotation accuracy and ensure quality standards. Effectively monitor and maintain the desired level of annotation accuracy and quality by strategically placing honeypot items within tasks.
Learn more honeypot tasks here.
May 24th, 2023
Workflow Notifications
The platform will send users notifications regarding workflow events. Managers are notified when tasks are created, updated and completed. Annotators are notified when they have new work assignments, new issues were created, additional items added to their assignments and more. For more information, read here.
Status Overlay
Setting a status (e.g. Completed) on items in annotation or QA tasks flags the item as done, and often triggers further automation steps in Pipelines and FaaS. It is therefore important to ensure no changes to items/annotations are done after setting the status, else those changes won't be in effect for the automation steps.
This situation is now resolved enforcing annotators to remove the status from the item, before they can make changes to annotations, even as part of fixing issues opened during a QA task.
For more information, read here.
Editing Integrations
While we encourage using cross-account integrations for better security, customers using standard IAM integrations can now update the key/password values when credentials expire.
Pipeline Variables
Pipeline Variables bring enhanced flexibility and control to pipelines. With Pipeline Variables, it is possible to easily set and manage dynamic parameters as node input, access them simultaneously across multiple nodes, and update their values during runtime. The variables can be managed easily from the pipeline interface. SDK support will be added on the next SDK release.
Pipeline Widget
A new widget is now available in the project overview page, providing information about running pipelines, pending pipeline cycles and errors.
Pipeline “Action” Filters - Supported by FaaS
Now available for FaaS nodes as well! Enhance your pipeline flows by setting actions on function outputs when creating your FaaS. Use the actions as filters in the Pipeline to selectively process the returned outputs of a FaaS node. Action filters empower you to create smarter pipeline flows with increased flexibility and control.
May 1st, 2023
Auto Segmentation With SAM Model
The Auto-segmentation feature has been updated to use Meta's latest Segment Anything Model, delivering amazingly accurate segmentation right from the first shot.
AWS Policy S3 Folder Limited Access
Storage integrations for AWS now support policies restricting access to specific folders.
Azure Datalake Gen2
Storage integration for Azure Datalake Gen2 is now available from the platforms' user-interface. When selecting the storage integration type of Azure, it is not possible to further refine and select between Azure Blob and Azure DataLake Gen2.
Ignoring WebM Format Warnings For Linked Items
Linked items do not reside on the platform, and therefore cannot be converted to WebM format (which would require storing the WebM file on the platform).
Projects using linked items can now set a project-wide configuration from the Project settings to hide and WebM format related messages.
Pipeline: Support Consensus Tasks in Pipeline
Consensus Tasks for QA workflow is now supported in pipeline as well - reviewers can work on items containing work from multiple contributors and flag issues on annotations or create note annotations to trigger correction work, and increase the quality of the annotations.
To read more about consensus tasks, click here.
Model Management: Define Train & Validation Subsets on Model
Train & validation subset were updated to be defined on the model itself. When setting a train/validation subset (as DQL filter), it will be added to the model metadata under
metadata.system.subsets.train
metadata.system.subsets.validation
April 4th, 2023
New Dataset Technology
A technological breakthrough enables Dataloop to offer Dataset supporting sub-second queries for hot storage of 30M items. Query by item attributes, item and annotations metadata, and user metadata that represents the customer’s project context, all at sub-second queries at industry-leading capacity.
The new technology now applies to all new Datasets. Migration of existing ones is available on-demand.
Multi-Labeling Service Vendors
Working with multiple labelling service providers is now inherently supported in the Dataloop platform. Labeling companies are added to a project, confined to view only their scope of the project and unaware of other companies. Managers from such labeling companies can easily import the workforce from their organization, individually or in bulk as groups.
Combined with the 'Pulling' work allocation method in annotation tasks, the workforce in tasks can dynamically be scaled according to budget and deadlines.
Employ multi-labeling service providers to achieve 24/7 work continuity, cross-company quality validation, or domain expertise.
New Project Dashboard/Overview
Completely redesigned the project overview/dashboard, and it is now based on widgets, the first 2 covering what the old one did, Contributors and Datasets. The new widgets provide more information and more functionality. Additional widgets will be added in future releases.
Consensus tasks - Reassignments
A new feature now allows reassigning Consensus task assignments from one contributor to another, ensuring work continuity.
For more information and to learn how to reassign a consensus assignment, read here.
Video - delete keyframe from right-bottom keyframes list
When annotating video files, a new feature makes it possible to delete key-frames. This is very useful when needs to undo an annotation change, or when working with uploaded model annotations, containing many key-frames.
Consensus Support In QA Workflows
Consensus work is supported in the QA workflow - reviewers can work on items containing work from multiple contributors and flag issues on annotations or create note annotations to trigger correction work, and increase the quality of the annotations.
When annotators correct their work and apply 'Complete' status to the item, their annotations are removed from the master item, and the new, corrected ones are copied over to represent the corrected higher-quality work. Reviewers can then review their work and set the 'Approve' status on the item.
Pipeline & FaaS
Service Shutdown - Docker Image
Our FaaS service restart loop is expanded to include monitoring for Docker image errors. Once an ImagePullBackOff
error is detected, preventing your service from scaling up, Dataloop system will shut the service down to save on computational costs. Also, you are notified of this action through the notification center.
Merge Flows in a pipeline cycle
Use the explicit Merge option which is now supported to bring flows within the pipeline back together into a single flow, as long they are all running in a single pipeline cycle.
Pipeline ML Nodes:
New Pipeline nodes are now available for you, supporting our Model Management tool directly from the pipeline.
Model Predict node:
Integrate the Model Predict node into your pipeline to invoke the predict()
function of your model in your pipeline automation flow. The node gets an item as input and completes running once the prediction is completed. For more information, read here.
Model Train node:
The Model Train node can integrate into your pipeline to execute your model’s train()
function during the pipeline run, allowing for seamless model training within your flow. For more information, read here.
Model-Related Trigger Events:
To streamline your work with Model Management tool, Dataloop introduces new trigger events that can use to automatically invoke your FaaS and Pipeline based on the model-related actions. The newly supported events are model_create
, model_update
, and model_delete
.
A system event is fired with every create, update, or delete action of your models.
March 20th, 2023
Organization Members & Groups Redesigned
The Dataloop user interface is redesigned to manage organization members and groups. Now it is easier to invite members into your organization, assign roles, and view the organization project in which they are members. The interface for creating groups is enhanced to allow the reusing of groups and subgroups while showing the actual number of users in the group (handling duplicates).
New storage driver - Azure Datalake Gen2
A new cloud storage driver is now available to support Azure Datalake Gen2.
To learn more, read here.
Cross-Account integration for AWS
Create cross-account integration for AWS, yielding higher security.
Eliminating the need for users to create and maintain IAM for use with the Dataloop platform, you can now assume the role of a Dataloop ARN.
For more information and detailed setup instructions
, read here.
Improved Data Sampling
The new data-sampling mechanism now samples data in a better-distributed way. When you create annotation and QA tasks, the data is evenly sampled from all task contributors (all assignments) while sampling data from the annotation task.