Improve model performance

Once you find a pattern of model errors, you can take action to improve your model. Here is an example of a data-centric iteration to improve model performance.

  1. Select the Data Rows on which your model is struggling. See Find model errors (Error Analysis) for techniques to surface and select these Data Rows.
  2. Open the selected Data Rows in the Catalog:
  • Click on 15 selected
  • Click on View in Catalog

The selected Data Rows will appear at the top of the Catalog.

  3. To identify similar Data Rows, create a Function. Follow these steps:
  • Click on 15 selected
  • Click on View in Catalog
  • Click on Create function and name your function (named difficult cases in this example).
  4. The newly created Function helps you surface data, across all of your Labelbox data, that is similar to this pattern of model failures. The goal is to surface this data, keep only the unlabeled Data Rows, label them first, and retrain your machine learning model on the improved dataset.
  • Go to the Catalog.
  • Select All datasets to explore all of your Labelbox data.
  • Filter on Functions to keep only Data Rows that look similar to the pattern of model failures. Labelbox automatically sorts Data Rows in decreasing order of Function similarity score.
  • Filter on Annotations to keep only unlabeled Data Rows.
  • Filter on Projects > not in to keep only Data Rows that are not already in your labeling project.
  5. To sample the top 100 of these Data Rows:
  • Click on Sample.
  • Select ordered.
  • Enter 100 as the number of Data Rows.

Then, you can submit the batch to your labeling project.

  • Select the destination labeling project.
  • Click on Submit batch.
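
The filtering and ordered sampling described above can be sketched in plain Python to make the selection logic explicit. This is a minimal illustration, not Labelbox SDK code: the `DataRow` record, its field names, and the `select_batch` helper are hypothetical and only mirror the Catalog filters (Function similarity, unlabeled, not in the project) followed by an ordered sample.

```python
from dataclasses import dataclass, field
from typing import List, Set


@dataclass
class DataRow:
    # Hypothetical record; these fields are assumptions, not Labelbox SDK attributes.
    id: str
    similarity: float                 # Function similarity score (higher = more similar)
    is_labeled: bool                  # True if the Data Row already has annotations
    project_ids: Set[str] = field(default_factory=set)  # projects containing this row


def select_batch(rows: List[DataRow], project_id: str,
                 min_similarity: float = 0.0, sample_size: int = 100) -> List[str]:
    """Return the IDs of the most similar, unlabeled rows not yet in the project."""
    candidates = [
        r for r in rows
        if r.similarity >= min_similarity      # filter on Functions
        and not r.is_labeled                   # filter on Annotations: unlabeled only
        and project_id not in r.project_ids    # filter on Projects: not in this project
    ]
    # "Ordered" sampling: keep the rows with the highest similarity scores.
    candidates.sort(key=lambda r: r.similarity, reverse=True)
    return [r.id for r in candidates[:sample_size]]


rows = [
    DataRow("a", 0.90, False),
    DataRow("b", 0.95, True),             # already labeled, excluded
    DataRow("c", 0.80, False, {"p1"}),    # already in project p1, excluded
    DataRow("d", 0.70, False),
    DataRow("e", 0.99, False, {"p2"}),    # in another project, still eligible
]
batch_ids = select_batch(rows, project_id="p1", sample_size=2)
```

An ordered sample keeps the rows closest to the failure pattern, which is why the sort happens before truncation rather than after.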

Once this data has been labeled in Labelbox, you can create a new Model Run, include these newly labeled Data Rows in your data splits, and retrain your machine learning model to improve its ability to detect basketball courts and ground track fields.
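
One way to include newly labeled Data Rows in existing data splits without reshuffling earlier rows is to derive the split from a hash of the Data Row ID. The sketch below is an assumption for illustration only: the `assign_split` helper, the split names, and the 80/10/10 ratios are hypothetical and are not prescribed by Labelbox.

```python
import hashlib


def assign_split(data_row_id: str, train: float = 0.8, validation: float = 0.1) -> str:
    """Deterministically map a Data Row ID to a split (ratios are assumed defaults)."""
    digest = hashlib.sha256(data_row_id.encode("utf-8")).digest()
    # Turn the first 8 bytes of the hash into a number in [0, 1).
    bucket = int.from_bytes(digest[:8], "big") / 2**64
    if bucket < train:
        return "training"
    if bucket < train + validation:
        return "validation"
    return "test"


# Hypothetical IDs of newly labeled Data Rows.
new_ids = ["row-001", "row-002", "row-003"]
splits = {row_id: assign_split(row_id) for row_id in new_ids}
```

Because the assignment depends only on the ID, a Data Row lands in the same split in every Model Run, which keeps evaluation results comparable across iterations.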

Congratulations, you have been through a data-centric iteration to improve your model!

