Import annotations as pre-labels

Learn how to use model-assisted labeling (MAL) to import annotations to a project as pre-labels.

Check out these end-to-end developer guides to learn how to import annotations.

Developer Guides:


Overview

The model-assisted labeling (MAL) workflow allows you to import computer-generated predictions (or simply annotations created outside of Labelbox) as pre-labels on an asset.

The imported annotations will be pre-populated in the labeling editor. However, in order to convert the pre-labels to real annotations, a human labeler will still need to open the data row in the editor and submit it. This functionality is designed to speed up human labeling.

Annotations imported as pre-labels help speed up human labeling. Annotations imported via MAL are pre-populated in the labeling editor

Annotations imported as pre-labels help speed up human labeling. Annotations imported via MAL are pre-populated in the labeling editor

How to import annotations as pre-labels

Generate Python SDK import annotation code

Based on your project's ontology, you can generate code snippets to import your annotations to the Labelbox platform. These code snippets serve as boilerplate code for your workflow and can help speed up the below steps. You can find these snippets by navigating to the automation tab of your project and selecting the "Step 1: Create annotation payload" and "Step 2: Import annotations payload" dropdown menus.

Step 1: Import data rows

To import annotations as pre-labels, you'll need to have a set of data rows to attach the annotations to. If you do not already have a set of data rows, you'll need to import data rows in a dataset in Catalog.

To learn how to import data rows via the Python SDK (Step 1), see this tutorial.

To learn more about creating data rows, see Create a dataset in Catalog.

Step 2: Create/select an ontology

When you import a set of annotations, you'll need to specify the ontology (also called taxonomy) that corresponds to the set of annotations. If the project ontology already exists in Labelbox, you may select the ontology that fits your annotations. If the ontology does not exist in Labelbox yet, you'll need to create an ontology.

To learn how to create an ontology via the Python SDK (Step 2), see this tutorial.

To learn more about creating ontologies, see Create/modify ontologies.

Step 3: Create a labeling project

Before you can import your annotations, you'll need to make sure you have a project to connect these annotations to. You cannot simply import annotations without specifying which project they'll be associated with. Oftentimes, you will already have a project set up with the correct ontology for your set of annotations. However, If you do not already have a project, you will need to create a project and attach the ontology that fits your annotations.

To learn how to set up a labeling project via the Python SDK (step 3), see this tutorial.

To learn more about creating a project, see Create a project.

Step 4: Send a batch of data rows to the project

Now that you have your project and ontology configured, you'll need to send a subset of data Rows (i.e., a batch) to the project's labeling queue. You will be attaching your annotations to this batch of data rows.

To learn how to create a batch via the Python SDK (Step 4), see this tutorial.

To learn more about batch mode, see our docs on Batches.

Step 5: Create the annotations payload

After you have successfully configured your project with the correct ontology and selected a batch of data rows to attach the annotations to, you are ready to prepare the annotation payload. To do this, you will need to use our Python SDK. Each imported annotation will need to reference an annotation class within the ontology (see step 2 above) and a specific data row ID. Labelbox supports two formats for the annotations payload: NDJSON and Python Annotation types.

Use the table below to find an annotation payload sample for your asset type. The "-" symbol indicates that importing as pre-labels is not supported for that annotation/asset type combination. To learn how to create an annotations payload (Step 5), see this tutorial.

ImageVideoTextDocumentsGeospatialAudioConversational textDICOM
Bounding boxPayloadPayloadN/APayloadPayloadN/AN/A-
PolygonPayload-N/AN/APayloadN/AN/A-
PointPayloadPayloadN/AN/APayloadN/AN/A-
PolylinePayloadPayloadN/AN/APayloadN/AN/APayload
Segmentation maskPayloadPayloadN/AN/A-N/AN/APayload
Text entityN/AN/APayloadPayloadN/AN/APayloadN/A
Classification - RadioPayloadPayloadPayloadPayloadPayloadPayloadPayload-
Classification - ChecklistPayloadPayloadPayloadPayloadPayloadPayloadPayload-
Classification - Free-form textPayload-PayloadPayloadPayloadPayloadPayload-
RelationshipPayload-PayloadPayload-N/APayload-

Step 6: Import the annotation payload to project

After you have constructed your annotation payload, you are ready to import the annotations to the labeling project. To do this you will need to use our Python SDK. See the table at the top of this page to find the end-to-end tutorial that most closely fits your use case.

To learn how to import an annotation payload via the SDK (Step 6), see this tutorial.

Best practices

  • Make sure the annotations are in the proper format. Use the charts above to determine whether an annotation type is supported for the data type you are labeling.
  • Before you begin a new import job, make sure there are no existing MAL annotations on the data row. Duplicate import jobs may result in unexpected behavior using this SDK call Project.bulk_import_requests().
  • To override an imported annotation on a data row, import the annotation again using the same uuid.