- 19 Dec 2024
- Print
- DarkLight
- PDF
PDF Studio
- Updated On 19 Dec 2024
- Print
- DarkLight
- PDF
Overview
The PDF Annotation Studio allows annotators to interact with PDF documents and perform text classification or labeling by highlighting specific text segments and associating them with predefined labels.
Key Features
- Annotators can highlight specific portions of text directly within the PDF interface.
- Once highlighted, these text segments can be assigned labels from a predefined set, such as "Name," "Date," "Location," or any custom label relevant to the project.
- Intuitive text selection tools make it easy to highlight text, whether it's a single word, a phrase, or an entire paragraph.
- Additional tools for undo/redo, zoom, and navigation ensure a smooth annotation experience, even with large documents.
Main Sections of the PDF Studio:
- Section 1: Label Picker and Annotation Tools
- Section 2: PDF Studio Canvas (Center part)
- Section 3: Annotation Lists, Attributes, and Item Details (right-side panel)
- Section 4: Item Info & Controls (top-side panel)
Section 1: Label Picker and Annotation Tools
This section allows you to use labels and annotation tools to perform the annotation process. The available labels and tools are determined in the Recipe.
Label Picker
A Label Picker is a feature within the PDF annotation studio that allows you to select labels to highlight texts in the PDF.
You can perform the following activities on the Label Picker section:
- Scroll and click a label to activate it.
- Use the search bar to easily find labels.
- Resize the label list to better fit your number of labels by clicking and dragging the separator line at the bottom.
Annotation Tools
Below the Label Picker on the left-side panel, the Annotation Tools section provides various tools for creating and editing annotations. These tools include:
Text Tool
The Text Highlighting Tool in the Dataloop platform is a core feature designed to enable annotators to interact directly with text-based content, such as documents, PDFs, or other textual datasets.
It facilitates the extraction, classification, and organization of textual information by allowing users to highlight and label specific text segments.
Section 2: PDF Studio Canvas
The PDF Studio Canvas allows you to label the texts or sentences by using the text tool.
Work in the PDF Studio
Before You Begin - Prerequisites
Supported Data Types:
The PDF Studio supports PDF files. Refer to the Supported Image formats section to view the supported formats in the PDF Studio.
How to Upload Data?
Once the PDF file is ready, refer to the Upload Items article to upload the file into the Dataloop platform.
Prepare the Recipe
Ensure you prepare the recipe as required to start the PDF annotation process.
For more actions
Refer to the Basic and general explanations and actions article to learn more about the available actions.
How to Open the PDF Studio?
- In the Data Browser, double-click on the PDF item or right-click -> Open With.
- Select PDF Studio from the list. The selected item will be opened in the GIS Studio.
How to Annotate in the PDF Studio Using the Text Tool?
- Double-click on the PDF file. The PDF studio is displayed.
- Select a label from the label picker.
- Identify the text or sentence to be annotated.
- Click and drag, or double-click on the word to annotate. The annotation will be listed in the annotation list.
How to View PDF Annotations in the JSON Output Format?
- In the PDF Annotations Studio, select the Item tab from the right-side panel.
- Click Export icon.
- Select the Export Annotation option from the list. A JSON file will be downloaded.
How to View PDF Item Structure in the JSON Output Format?
- In the Data Browser, select the item.
- Right-click -> File Actions -> Export JSON. A JSON file will be downloaded in a .zip file.
How to Change Labels for an Annotation in the PDF Studio?
- On the right-side panel, select the annotation from the annotations list.
- Click on either the Change label & attributes icon from the top (above the Search Annotations field) or the Edit label icon (pencil icon) from the right-side of the annotation. A pop-up window is displayed.
- Make sure you are on the labels tab, and select the desired new label to switch to.
- Click Save Changes.
How to Set Attributes for an Annotation in the PDF Studio?
- On the Annotation list, click on the annotation name. A pop-up window is displayed.
- Select the Attributes tab.
- Set the attributes and click Save Changes.
Also, you can use the annotation list on the right-side panel, select the annotation, and follow the above steps to set the attributes.
How to Delete Annotations in the PDF Studio?
- Select the relevant annotation(s) from the annotation list.
- Click Delete and confirm it.
PDF Studio Keyboard Shortcuts
General Shortcuts
Action | Keyboard Shortcuts |
---|---|
Save | S |
Undo | Ctrl + Z |
Redo | Ctrl + Y |
Search Label | Shift + L |
Navigate in label picker | Up or Down Arrows |
Select a label in the label Picker | Enter |
Previous Item | Left Arrow |
Next Item | Right Arrow |
Add Item Description | T |
Mark Item as Done | Shift + F |
Mark Item as Discarded | Shift + G |
Hide/Show Selected Annotations | H |
Hide/Show All Annotations | J |
Go to annotation list | Shift + ; |
Navigate in annotation list | Up and Down arrows |
Select/deselect an annotation | Space |