PDF Studio
  • 19 Dec 2024
  • Dark
    Light
  • PDF

PDF Studio

  • Dark
    Light
  • PDF

Article summary

Overview

The PDF Annotation Studio allows annotators to interact with PDF documents and perform text classification or labeling by highlighting specific text segments and associating them with predefined labels.


Key Features

  • Annotators can highlight specific portions of text directly within the PDF interface.
  • Once highlighted, these text segments can be assigned labels from a predefined set, such as "Name," "Date," "Location," or any custom label relevant to the project.
  • Intuitive text selection tools make it easy to highlight text, whether it's a single word, a phrase, or an entire paragraph.
  • Additional tools for undo/redo, zoom, and navigation ensure a smooth annotation experience, even with large documents.

Main Sections of the PDF Studio:


Section 1: Label Picker and Annotation Tools

This section allows you to use labels and annotation tools to perform the annotation process. The available labels and tools are determined in the Recipe.

Label Picker

A Label Picker is a feature within the PDF annotation studio that allows you to select labels to highlight texts in the PDF.

You can perform the following activities on the Label Picker section:

  • Scroll and click a label to activate it.
  • Use the search bar to easily find labels.
  • Resize the label list to better fit your number of labels by clicking and dragging the separator line at the bottom.

Annotation Tools

Below the Label Picker on the left-side panel, the Annotation Tools section provides various tools for creating and editing annotations. These tools include:

Text Tool

The Text Highlighting Tool in the Dataloop platform is a core feature designed to enable annotators to interact directly with text-based content, such as documents, PDFs, or other textual datasets.

It facilitates the extraction, classification, and organization of textual information by allowing users to highlight and label specific text segments.


Section 2: PDF Studio Canvas

The PDF Studio Canvas allows you to label the texts or sentences by using the text tool.


Work in the PDF Studio

Before You Begin - Prerequisites

Supported Data Types:

The PDF Studio supports PDF files. Refer to the Supported Image formats section to view the supported formats in the PDF Studio.

How to Upload Data?

Once the PDF file is ready, refer to the Upload Items article to upload the file into the Dataloop platform.

Prepare the Recipe

Ensure you prepare the recipe as required to start the PDF annotation process.

For more actions

Refer to the Basic and general explanations and actions article to learn more about the available actions.

How to Open the PDF Studio?

  1. In the Data Browser, double-click on the PDF item or right-click -> Open With.
  2. Select PDF Studio from the list. The selected item will be opened in the GIS Studio.

How to Annotate in the PDF Studio Using the Text Tool?

  1. Double-click on the PDF file. The PDF studio is displayed.
  2. Select a label from the label picker.
  3. Identify the text or sentence to be annotated.
  4. Click and drag, or double-click on the word to annotate. The annotation will be listed in the annotation list.

How to View PDF Annotations in the JSON Output Format?

  1. In the PDF Annotations Studio, select the Item tab from the right-side panel.
  2. Click Export icon.
  3. Select the Export Annotation option from the list. A JSON file will be downloaded.

How to View PDF Item Structure in the JSON Output Format?

  1. In the Data Browser, select the item.
  2. Right-click -> File Actions -> Export JSON. A JSON file will be downloaded in a .zip file.

How to Change Labels for an Annotation in the PDF Studio?

  1. On the right-side panel, select the annotation from the annotations list.
  2. Click on either the Change label & attributes icon from the top (above the Search Annotations field) or the Edit label icon (pencil icon) from the right-side of the annotation. A pop-up window is displayed.
  3. Make sure you are on the labels tab, and select the desired new label to switch to.
  4. Click Save Changes.

How to Set Attributes for an Annotation in the PDF Studio?

  1. On the Annotation list, click on the annotation name. A pop-up window is displayed.
  2. Select the Attributes tab.
  3. Set the attributes and click Save Changes.

Also, you can use the annotation list on the right-side panel, select the annotation, and follow the above steps to set the attributes.


How to Delete Annotations in the PDF Studio?

  1. Select the relevant annotation(s) from the annotation list.
  2. Click Delete and confirm it.

PDF Studio Keyboard Shortcuts

General Shortcuts

ActionKeyboard Shortcuts
SaveS
UndoCtrl + Z
RedoCtrl + Y
Search LabelShift + L
Navigate in label pickerUp or Down Arrows
Select a label in the label PickerEnter
Previous ItemLeft Arrow
Next ItemRight Arrow
Add Item DescriptionT
Mark Item as DoneShift + F
Mark Item as DiscardedShift + G
Hide/Show Selected AnnotationsH
Hide/Show All AnnotationsJ
Go to annotation listShift + ;
Navigate in annotation listUp and Down arrows
Select/deselect an annotationSpace


What's Next