Documents (beta)

Guide for labeling document (PDF) data.

Overview

When you attach a document dataset to a project, the Labelbox Editor interface will automatically adjust for document labeling.

For more information on the import format, see our docs on Document import format.

Supported annotation types

Below are the annotation types that you may include in your ontology for labeling document data. Classification-type annotations can be applied at the global level and/or nested within a bounding box annotation.

Annotation type

Import

Export

Bounding box

Coming soon

See reference

Entity (NER)

Coming soon

See reference

Radio classification

See reference

See reference

Checklist classification

See reference

See reference

Free-form text classification

See reference

See reference

Dropdown classification

Deprecated

Deprecated

Navigate the document

Use your mouse scroll wheel or trackpad to move forward and backward through the pages of the document. To jump to a specific page, highlight the current page number in the top navigation bar, type your desired page number, and press Enter.

To zoom in, press Z and click on the section of the page you want to zoom in on.
To zoom out, press Opt + Z and click on the page or press Shift + Z to return the page to its original zoom level.

Bounding box

To create a bounding box, use your cursor to create the shape around a character, word(s), or section in the document. To reposition the bounding box on the document, simply click + hold then use your mouse or trackpad to reposition the annotation on the document. You can also resize the bounding box by clicking on the corners and dragging them to their new position.

Shortcut: In the Tools panel, you will see a numerical hotkey next to the name of the annotation. Use the specified number hotkey (e.g., 1, 2, 3) in the Tools panel to activate the bounding box tool.

To create another instance of the bounding box, press the number hotkey again to activate the tool, then create another bounding box. Once all instances have been created, press e to submit your label.

Entity

This tool is currently in closed beta. To participate in the beta, sign up here.

To create an entity, click the desired starting character and drag to select a sequence of characters in the text. Characters are not restricted to a single class; entity annotations may overlap completely or partially. Entities may also span multiple pages. To edit an entity's class, right-click the entity and select Change class.

Shortcut: In the Tools panel, you will see a numerical hotkey next to the name of the annotation. Use the specified number hotkey (e.g., 1, 2, 3) in the Tools panel to activate the entity tool.

To create another entity, press the number hotkey again to activate the tool, then create another entity. Once all entities have been created, press e to submit your label.

Radio classification

Create a radio classification by activating the classification question and inputting the answer value. In the below example, press 8, k, and esc to complete the radio classification.

Once all classifications have been completed, press e to submit your label.

Checklist classification

Create a checklist classification by activating the classification question and inputting the answer value(s). In the below example, pressing 7 and pressing Down + Enter on the answer values completes the checklist classification.

Once all classifications have been completed, press e to submit your label.

Free text classification

Create a free text classification by activating the classification question and inputting the answer value. In the below example, pressing 6, typing the answer value, and pressing Enter completes the free text classification.

Once all classifications have been completed, press e to submit your label.


Did this page help you?