Documents (beta)

Guide for labeling document (PDF) data.

Overview

When you attach a document dataset to a project, the Labelbox Editor interface will automatically adjust for document labeling.

For more information on the import format, see our docs on Document import format.

16001600

Supported annotation types

Below are the annotation types that you may include in your ontology for labeling document data. Classification-type annotations can be applied at the global level and/or nested within a bounding box annotation.

Annotation type

Import

Export

Bounding box

See reference

See reference

Entity (NER)

See reference

See reference

Annotation Relationships

Coming Soon

See reference

Radio classification

See reference

See reference

Checklist classification

See reference

See reference

Free-form text classification

See reference

See reference

Dropdown classification

Deprecated

Deprecated

Custom Text Layer

A unique aspect of our document editor is being able to view your text layer. You can toggle the text layer on - and it will appear any time you want to highlight an entity.

19201920

Exporting Raw Text

To export raw data alongside the entity labels in a PDF project, you can toggle on the "Save and Export Raw Text" option at the time of project creation. This option automatically pops up for our Document data type. Click the toggle to turn ‘save and export’ data on. With this on, the named entities will be exported with your text file.

19201920

Navigate the document

Use your mouse scroll wheel or trackpad to move forward and backward through the pages of the document. To jump to a specific page, highlight the current page number in the top navigation bar, type your desired page number, and press Enter.

12621262

To zoom in, press Z and click on the section of the page you want to zoom in on.
To zoom out, press Opt + Z and click on the page or press Shift + Z to return the page to its original zoom level.

Bounding box

To create a bounding box, use your cursor to create the shape around a character, word(s), or section in the document. To reposition the bounding box on the document, simply click + hold then use your mouse or trackpad to reposition the annotation on the document. You can also resize the bounding box by clicking on the corners and dragging them to their new position.

Shortcut: In the Tools panel, you will see a numerical hotkey next to the name of the annotation. Use the specified number hotkey (e.g., 1, 2, 3) in the Tools panel to activate the bounding box tool.

To create another instance of the bounding box, press the number hotkey again to activate the tool, then create another bounding box. Once all instances have been created, press e to submit your label.

12621262

Entity

To create an entity, click the desired starting character and drag to select a sequence of characters in the text. Characters are not restricted to a single class; entity annotations may overlap completely or partially. Entities may also span multiple pages. To edit an entity's class, right-click the entity and select Change class.

Shortcut: In the Tools panel, you will see a numerical hotkey next to the name of the annotation. Use the specified number hotkey (e.g., 1, 2, 3) in the Tools panel to activate the entity tool.

To create another entity, press the number hotkey again to activate the tool, then create another entity. Once all entities have been created, press e to submit your label.

12071207

Token Selection

We also support tokenization, so you can create and highlight entities at both word-level or character-level - this is determined by the data contained in your JSON upload. Clicking on a specific word will highlight the entire word. This is helpful when labeling text, as it can be easy to accidentally miss certain characters or words when highlighting text.

19201920

Relationships

To create a relationship between annotations, select a Relationship tool and hover over the annotation where you want the relationship to start to reveal the annotation's anchor points. Click an anchor point to create the starting point of the relationship, then bring your mouse over to the annotation you want to relate it to, hovering over it to reveal its anchor points. Finally, click one of the anchor points to complete the relationship.

Right-click a relationship to change its direction, make it bi-directional, or delete it from the asset.

13881388

Relationships for annotations across pages

If you want to create an annotation relationship for annotations that exist on different pages, you will need to follow a slightly different workflow:

  1. Select the annotation relationship tool
  2. Go to the annotation where you want to start the relationship, right click it, and click "select relationship start"
  3. Scroll to your destination annotation tool, right click it, and click "select relationship end"

After you have selected both the starting and end point of the relationship, you relationship will be established.

13881388

Radio classification

Create a radio classification by activating the classification question and inputting the answer value. In the below example, press 8, k, and esc to complete the radio classification.

Once all classifications have been completed, press e to submit your label.

12621262

Checklist classification

Create a checklist classification by activating the classification question and inputting the answer value(s). In the below example, pressing 7 and pressing Down + Enter on the answer values completes the checklist classification.

Once all classifications have been completed, press e to submit your label.

12621262

Free text classification

Create a free text classification by activating the classification question and inputting the answer value. In the below example, pressing 6, typing the answer value, and pressing Enter completes the free text classification.

Once all classifications have been completed, press e to submit your label.

12621262

Document Specific Hotkeys

Fucntion

Hotkey

Description

Show Text Layer

'Shift' + T

Show or hide the text layer.


Did this page help you?