PDF Studio
  • 19 Feb 2025

Overview

The PDF Annotation Studio allows annotators to interact with PDF documents and perform text classification or labeling by highlighting specific text segments and associating them with predefined labels.


Key Features

  • Annotators can highlight specific portions of text directly within the PDF interface.
  • Once highlighted, these text segments can be assigned labels from a predefined set, such as "Name," "Date," "Location," or any custom label relevant to the project.
  • Intuitive text selection tools make it easy to highlight text, whether it's a single word, a phrase, or an entire paragraph.
  • Additional tools for undo/redo, zoom, and navigation ensure a smooth annotation experience, even with large documents.
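The highlight-and-label idea above can be sketched as a small data model. This is a minimal illustration, not the platform's internal representation: the `TextAnnotation` class, the `annotate` helper, the character-offset scheme, and the sample label set are all hypothetical names introduced here.

```python
from dataclasses import dataclass

# Hypothetical label set; in practice the labels come from the project's Recipe.
ALLOWED_LABELS = {"Name", "Date", "Location"}


@dataclass(frozen=True)
class TextAnnotation:
    """A labeled span of text, identified by character offsets (illustrative only)."""
    start: int  # inclusive offset of the first highlighted character
    end: int    # exclusive offset just past the last highlighted character
    label: str


def annotate(text: str, start: int, end: int, label: str) -> TextAnnotation:
    """Validate a highlighted span and attach a label from the predefined set."""
    if label not in ALLOWED_LABELS:
        raise ValueError(f"Unknown label: {label!r}")
    if not 0 <= start < end <= len(text):
        raise ValueError("Span is empty or falls outside the text")
    return TextAnnotation(start, end, label)


page_text = "Invoice issued to Jane Doe on 19 Feb 2025 in Berlin."
ann = annotate(page_text, 18, 26, "Name")
highlighted = page_text[ann.start:ann.end]  # the text covered by the annotation
```

Rejecting labels outside the predefined set mirrors the way the studio only offers labels that are defined for the project.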

Label Picker

The Label Picker is the section of the PDF Annotation Studio where you select the label to apply to highlighted text. The available labels are defined in the Recipe. In the Label Picker, you can:

  • Scroll and click a label to activate it.
  • Use the search bar to easily find labels.
  • Resize the label list to better fit your number of labels by clicking and dragging the separator line.
  • Use the Shortcut keys to navigate between labels.

Annotation Tools

Annotation tools are designed to facilitate the process of data annotation. Data annotation involves adding metadata, labels, or tags to raw data, making it understandable for machines.

Text Tool

The Text Highlighting Tool in the Dataloop platform is a core feature designed to enable annotators to interact directly with text-based content, such as documents, PDFs, or other textual datasets.

It facilitates the extraction, classification, and organization of textual information by allowing users to highlight and label specific text segments.


Annotations Tab

The Annotations tab on the right-side panel lets you manage annotations through the annotations list and the attribute controls. The attribute controls apply when attributes are configured in the Recipe.

Learn more about the annotations and actions available.

Item Tab

The Item tab displays information according to the type of the selected item.

Learn more about the item and actions available.


Item Info & Controls (Top-Panel)

The Item Info & Controls that are available depend on the type of annotation studio. For detailed information, refer to the following articles.

Keyboard shortcuts

General Shortcuts

Action                                Keyboard Shortcut
Save                                  S
Undo                                  Ctrl + Z
Redo                                  Ctrl + Y
Search Label                          Shift + L
Navigate in label picker              Up or Down Arrows
Select a label in the label picker    Enter
Previous Item                         Left Arrow
Next Item                             Right Arrow
Add Item Description                  T
Mark Item as Done                     Shift + F
Mark Item as Discarded                Shift + G
Hide/Show Selected Annotations        H
Hide/Show All Annotations             J
Go to annotation list                 Shift + ;
Navigate in annotation list           Up and Down Arrows
Select/deselect an annotation         Space

Item's View Controls (Bottom-Panel)

The bottom panel displays controls according to the annotation context and the work controls.

Workflow Context

Assignment controls include moving between items, displaying the item gallery, and the status buttons (Complete / Discard). They are displayed only while you work on an annotation or QA task.

  1. Browse between the assignment items using the Left and Right arrows
  2. Open the Thumbnails' gallery viewer, and click a thumbnail to open that item
  3. Save button - When the button is enabled, clicking it saves your changes to the Dataloop platform immediately, without waiting for auto-save.
  4. Status buttons - Complete and Discard.

Work in the PDF Studio

Before You Begin - Prerequisites

Access the PDF studio

  1. In the Data Browser, double-click on the PDF item or right-click -> Open With.
  2. Select PDF Studio from the list. The selected item will be opened in the PDF Studio.

Annotate using the text tool

  1. Double-click on the PDF file. The PDF studio is displayed.
  2. Select a label from the label picker.
  3. Identify the text or sentence to be annotated.
  4. Click and drag across the text, or double-click a word, to annotate it. The annotation is added to the annotation list.

Export PDF annotations

  1. In the PDF Annotations Studio, select the Item tab from the right-side panel.
  2. Click the Export icon.
  3. Select the Export Annotation option from the list. A JSON file will be downloaded.
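Once the JSON file is downloaded, it can be inspected with a short script. The sketch below counts annotations per label. It assumes, without confirmation from this article, that the export has a top-level "annotations" list whose entries each carry a "label" string; the `count_labels` helper and the inline sample are hypothetical, so check the field names against your own export first.

```python
import json
from collections import Counter


def count_labels(export: dict) -> Counter:
    """Count annotations per label in a parsed export.

    Assumes (not confirmed by this article) a top-level "annotations" list
    whose entries each carry a "label" string. Entries without one are skipped.
    """
    return Counter(a["label"] for a in export.get("annotations", []) if "label" in a)


# Inline sample standing in for the downloaded file; the field names are assumptions.
sample = json.loads("""
{
  "annotations": [
    {"label": "Name"},
    {"label": "Date"},
    {"label": "Name"}
  ]
}
""")

label_counts = count_labels(sample)
```

To run this against a real export, replace the inline sample with `json.load` on the opened file.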

Export PDF Item Structure

  1. In the Data Browser, select the item.
  2. Right-click -> File Actions -> Export JSON. A .zip file containing the JSON file will be downloaded.
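Because the export arrives as a .zip file, the JSON can be read directly from the archive without extracting it by hand. The following is a minimal sketch using only the Python standard library. The `load_json_members` helper and the member names in the sample archive are hypothetical, and the real layout of the archive may differ.

```python
import io
import json
import zipfile


def load_json_members(zip_source) -> dict:
    """Return {member name: parsed JSON} for every .json file in the archive."""
    with zipfile.ZipFile(zip_source) as archive:
        return {
            name: json.loads(archive.read(name))
            for name in archive.namelist()
            if name.lower().endswith(".json")
        }


# Build a small in-memory archive so the sketch runs without a real export.
buffer = io.BytesIO()
with zipfile.ZipFile(buffer, "w") as archive:
    archive.writestr("items/sample.json", json.dumps({"name": "sample.pdf"}))
    archive.writestr("readme.txt", "not JSON")
buffer.seek(0)

loaded = load_json_members(buffer)
```

For a downloaded export, pass the path of the .zip file to `load_json_members` instead of the in-memory buffer.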

Change Labels of an Annotation

  1. On the right-side panel, select the annotation from the annotations list.
  2. Click on either the Change label & attributes icon from the top (above the Search Annotations field) or the Edit label icon (pencil icon) from the right-side of the annotation. A pop-up window is displayed.
  3. Make sure you are on the labels tab, and select the desired new label to switch to.
  4. Click Save Changes.

Set Attributes for an Annotation

  1. On the Annotation list, click on the annotation name. A pop-up window is displayed.
  2. Select the Attributes tab.
  3. Set the attributes and click Save Changes.

Alternatively, select the annotation in the annotation list on the right-side panel and follow the steps above to set the attributes.


Delete annotations

  1. Select the relevant annotation(s) from the annotation list.
  2. Click Delete and confirm the deletion.