Before you start
Before you can label data, you need to import it into Labelbox. Take a look at our guides for connecting your cloud data to Labelbox, then come back to this page.Connect to cloud storage
Key definitions
The following are fundamental concepts for organizing your data in Labelbox. Understanding these terms is the first step to building powerful ML models. All data in Labelbox is organized into Datasets, which are made up of individual Data Rows.| Term | Definition |
|---|---|
| Asset | The file you want to label (e.g., an image, a video, a text file). This is your core, raw data. |
| Data row | The container for a single asset and all its related information. This includes the asset itself, its metadata, attachments, and any annotations (labels) created for it. |
| Dataset | A collection of data rows, typically from a single source or domain. For example, you might create one dataset for “Medical Images from Device X” and another for “Customer Support Transcripts.” |
| Attachment | Supplementary information you can add to an asset to provide extra context for your labelers. For example, you could attach a PDF with instructions or a reference image. |
| Global key | An optional but highly recommended unique ID for each data row. Using global keys helps prevent duplicate data uploads and makes it easy to map data rows in Labelbox back to your own external databases or file systems. |
Create a new dataset
- Navigate to the Catalog tab.
- Click the + New button to open the “Create a new dataset” dialog.
- Give your dataset a clear, descriptive name (e.g.,
q1-2026-night-driving-images). - Optionally, add a description for more context.
- Choose an import method:
- Click Create dataset.
Best practices
- Naming: Dataset names can be up to 256 characters and include letters, numbers, spaces, and
_-.,()/. Use names that clearly explain the data’s source and purpose. - Organization: Group data from a single domain or source into its own dataset. This simplifies labeling workflow setup. You can use Metadata to further organize and filter data rows within a dataset.
Append data to an existing dataset
You can add more data to a dataset at any time.- Go to Catalog and select your dataset from the list on the left.
- Click the Append to dataset button.
- You will be prompted to choose an import method to add your new data rows.
Delete a dataset
- In Catalog, select the dataset you wish to delete.
- Click the three-dot menu and select Delete dataset.
- A confirmation dialog will appear. Type
deleteand click the Delete dataset button to confirm.