Queue data for labeling

Add or remove data from a labeling project

You can queue data to a Project using either Batch or Dataset based queues. Batch based queues provide more features and flexibility than the Dataset based queues and is the recommended way to orchestrate labeling in Labelbox.

The queue mode of your project is configured at project setup. If you want to change the queue mode after project creation, you can do so from Project > Settings > Queue

Settings to update queue mode.Settings to update queue mode.

Settings to update queue mode.

Batch based queues

📘

Batch based queues do not support benchmark and consensus currently.

Batch based queues allow you to individually select Data Rows to queue up for labeling through Catalog or the SDK for labeling. Each Batch can have a maximum of 25,000 data rows and each Data Row can only be submitted in one Batch per Project.

Batch based queues enable to you to do the following

  • Leverage the Catalog filters to find the most relevant data to label next. This includes using filters and similarity functions. For more details, please refer to the article Data curation.
  • Easily and quickly queue higher priority data for labeling and move it to the front of the labeling queue.
  • Randomly sample data from a large dataset into a project for labeling.

For more details on how to create batches, please refer to the article Batches.

Adding a batch to a project for labeling.Adding a batch to a project for labeling.

Adding a batch to a project for labeling.

View batches from within a project

The Data rows activity tab is where you can manage all of the batches for your project. Click this button (shown below) to view a menu of options for managing Data Rows.

View batch history

Click Batch history to view a changelog of added and removed Data Rows within this project. You'll also be able to see which batch the Data Rows belong to.

Manage batches

Click Manage batches to view all of the existing batches that have been added to your project. When you click the menu option on each batch, you'll see an option to rename or archive the batch. You'll also be able to remove the remaining unlabeled Data Rows from the labeling queue.

Dataset based queues

Dataset based queues are useful when you intend to label all the data rows in a data set.

Attach / Detach

❗️

Dataset limits per project

Please note that you can add at most 1500 datasets to a single project.

You can attach up to 1500 dataset to a labeling project. When you attach a dataset to a project, Labelbox will add all of the data rows in that dataset to the labeling queue.

If you have a large dataset (over 100k data rows), we recommend dividing the large dataset into smaller datasets. This will keep the queue size small and maintain high queue performance, especially if you are working with a large labeling team.

Attach a dataset during project creation.Attach a dataset during project creation.

Attach a dataset during project creation.

When you detach a dataset from a project, Labelbox will remove all its data rows from the project. However, you'll still be able to view the labels in the project. If the dataset is detached during a live labeling session, some of its data rows may remain reserved in the queue (depending upon the number of active labelers).

Go to project settings to attach a dataset to or detach a dataset from an existing project.Go to project settings to attach a dataset to or detach a dataset from an existing project.

Go to project settings to attach a dataset to or detach a dataset from an existing project.

Appending to a Dataset

Once a dataset is attached to one or more projects, you can continuously append new Data Rows to the label queue for those projects by adding them to the attached dataset. You can add more data rows via the app or you can use the Python SDK (recommended).


What’s Next
Did this page help you?