Processing issues (errors and warnings)

Learn about the warnings and errors you may encounter when importing a dataset to Labelbox.

Labelbox allows you to see the progress and status of data row processing. Once your data is ingested, Labelbox performs the following tasks on your data rows:

  1. Generate off-the-shelf embeddings for similarity search and the projector view
  2. Extract media attributes (e.g., width and height of an image, frame count of a video)
  3. Convert your data to a standard format (e.g., geospatial TIF data)

Processing issues help you identify problems with your ingested data rows early on. Data rows with processing issues will be caught during ingestion, rather than later in downstream workflows. For example, a data row with a processing issue, will not appear in the Catalog. Hence it cannot be sent to labeling from the user interface.

Labelbox offers the following states and error messages to help you troubleshoot your data rows with processing issues.

📘

Large files

Large files such as TIF, Video, and Audio can take a few seconds to process.

Processing states: processing, success, failure

StateDescription
Processing A data row ingested is initially in the processing state. The data row will remain in the processing state until it is fully processed.
SuccessThe processing of a data row will either succeed or fail. A success state indicates that the data row processing was successful.

If the data row processing was successful, the data row shows up in your Catalog dataset. The data row can now be used to achieve any workflow in Labelbox.
Failure The processing of a data row will either succeed or fail. A failure status indicates that the data row processing has failed.

If the processing of the data row fails, the data row will have a processing issue. It could be an error or a warning.

Processing issues help you identify problems with your ingested data early on. Data rows with issues will be caught during ingestion, rather than later in downstream workflows. For example, a data row with a processing issue, will not appear in the Catalog. Hence, it cannot be sent to labeling from the user interface.

View processing data rows

In Catalog, you can see which datasets with processing data rows. In the left panel of the Catalog tab, you can see which datasets have a processing icon.

This dataset has some data rows that are being processed

This dataset has some data rows that are being processed

When you click on the dataset, you'll see the number of data rows being processed. In the top right corner, there is a processing icon that indicates the count of data rows being processed. Click on this processing icon to get the most up-to-date count (you may need to refresh the webpage).

4794 data rows are being processed in this dataset

4794 data rows are being processed in this dataset

View failed data rows

Data rows that failed to process (errors and/or warnings) will show up in the processing issues view.

To view the failed data rows, go to Catalog and click on the issue icon to access the processing issues view.

Click this icon to access the processing issues view

Click this icon to access the processing issues view

The processing issues view will display all data rows with a processing issue (error or warning) in the dataset. It will also display the exhaustive list of errors and warnings found in the dataset.

The processing issues view shows all data rows with processing errors and warnings

The processing issues view shows all data rows with processing errors and warnings

You can filter to keep only data rows with a specific error or warning. There are two ways to do this in the Processing Issues view:

  1. Look to the right panel and click on the error or warning.
  2. Populate an error or warning filter at the top of the processing issues view.
Ways to filter data rows on specific errors or warnings

Ways to filter data rows on specific errors or warnings

You can reprocess data rows from the processing issues view.

You can reprocess data rows from the processing issues view.

Fix processing issues (re-process data rows)

If some of your data rows have processing issues (errors and/or warnings), you can take corrective action to fix them.

For example, if there is a permission-related issue with your cloud bucket, some of the data rows you import may have processing issues. You'll need to fix the issue in your cloud bucket and then ask Labelbox to re-process these Data Rows with processing issues.

To re-process your data rows, navigate to the processing issues view. From there, you can see all data rows with errors and warnings. Filter on specific errors and warnings, select the data rows you want to re-process, click on [X] selected, and click Re-process.

You can find here an exhaustive list of errors and warnings, with suggestions for corrective actions to fix them.

Errors

NameDescriptionSeverityUser remediation
InternalErrorAn internal error prevented the data row from being processed.ErrorReprocessing may or may not fix the issue. If the issue persists after reprocessing, contact support.
FetchFailedThe data row could not be fetched.ErrorCheck that the data row URL is correct and that the Labelbox backend has access to the data row URL.

If using Delegated Access, verify that it is set up correctly.

Then try reprocessing.
FetchTimeoutThe data row could not be fetched due to a timeout error.ErrorCheck that the data row URL is correct and that the server is responding to requests for the data row URL in a timely manner.

Then try reprocessing.
ForbiddenThe data row could not be fetched because access was forbidden.ErrorCheck that the data row URL is correct and that the Labelbox backend has access to the data row URL.

Then try reprocessing.
NotFoundThe data row could not be fetched because it was not found.ErrorCheck that the data row URL is correct and that the Labelbox backend has access to the data row URL.

Then try reprocessing.
TooManyRequestsThe data row could not be fetched due to a rate-limiting error.Error
InvalidCorsThe data row is being served with invalid CORS headers. This will result in a degraded labeling experience.Error for text data rowsCheck that the data row URL is being served with the correct CORS headers. See Configure CORS to learn more.

Then try reprocessing.
ConversionFailedThe data row failed to be converted from its original format to a new format required for labeling.ErrorThe geospatial file may not be compatible with Labelbox. Check the supported geospatial file types and requirements.

Try reprocessing. Contact support if the problem persists.
InvalidEspgThe data row has invalid ESPG metadata.ErrorReview Labelbox’s ESPG requirements (do we have any?), fix the ESPG metadata, and reupload.
TooManyBandsThe geospatial data row has too many bands.ErrorReduce the number of bands in the geospatial file and reupload.

View error count

A processing error is a serious issue that will undermine downstream workflows in Labelbox. A data row with a processing error will not show up in your Catalog dataset. As a consequence, you cannot send it e.g. to labeling from the user interface.

You can find processing errors in the Processing Issues view. You will need to fix the processing errors before you can use the data row in Labelbox.

Number of processing errors

Number of processing errors

Warnings

NameDescriptionSeverityUser remediation
InvalidCorsThe data row is being served with invalid CORS headers. This will result in a degraded labeling experience.Warning for non-text data rowsCheck that the data row URL is being served with correct CORS headers.

Then try reprocessing.

View warning count

A data row with a processing warning can be used in Labelbox, but the experience might be slightly downgraded for some features. A data row with a processing warning will still show up in your Catalog dataset.

You can find warnings in the Processing Issues view. We recommend that you fix the warnings in order to fully use the data row in Labelbox.

Number of processing warnings

Number of processing warnings

Processing issues and the Python SDK

Today, processing issues show up in the Labelbox user interface. As described above, data rows with processing issues do not show up in the Catalog. They cannot be used for downstream workflows: for example, you cannot send data rows with labeling issues to Annotate, since they do not show up in the Catalog.

The Labelbox Python SDK is more lenient than the Labelbox user interface:

  • if you export data rows from Catalog, using the SDK, all data rows are exported, including those with processing issues
  • if you send data rows to a labeling project, using the SDK, all data rows are sent to the labeling project, including those with processing issues.

We are working on making the Labelbox Python SDK less lenient and on par with the Labelbox user interface.