Documentation Index Fetch the complete documentation index at: https://docs.labelbox.com/llms.txt
Use this file to discover all available pages before exploring further.
Format: HTML Import methods:
IAM Delegated Access
Signed URLs (https URLs only)
Parameters
Import methods:
IAM Delegated Access
Signed URLs (https URLs only)
Parameter Required Description row_dataYes https path to an HTML file. For IAM Delegated Access, this URL must be in virtual-hosted-style format . For older regions, your S3 bucket may be in thehttps://<bucket-name>.s3.<region>.amazonaws.com/<key> format. If your object URLs are formatted this way, make sure they are in the virtual-hosted-style format before importing.global_keyNo Unique user-generated file name or ID for the file. Global keys are enforced to be unique in your org. Data rows will not be imported if its global keys are duplicated to existing data rows. media_typeNo "HTML" (optional media type to provide better validation and error messaging)metadata_fieldsNo See Metadata . attachmentsNo See Attachments and Asset overlays ,
Delegated Access URL
Standard URL
HTML example
[
{
"row_data" : "https://lb-test-data.s3.us-west-1.amazonaws.com/sample_html_1.html" ,
"global_key" : "https://lb-test-data.s3.us-west-1.amazonaws.com/sample_html_1.html" ,
"metadata_fields" : [{ "name" : "<metadata_field_name>" , "value" : "tag_string" }],
"attachments" : [{ "type" : "HTML" , "value" : "https://storage.googleapis.com/labelbox-sample-datasets/Docs/windy.html" }]
},
{
"row_data" : "https://lb-test-data.s3.us-west-1.amazonaws.com/sample_html_2.html" ,
"global_key" : "https://lb-test-data.s3.us-west-1.amazonaws.com/sample_html_2.html" ,
"metadata_fields" : [{ "name" : "<metadata_field_name>" , "value" : "tag_string" }],
"attachments" : [{ "type" : "TEXT_URL" , "value" : "https://storage.googleapis.com/labelbox-sample-datasets/Docs/text_attachment.txt" }]
}
]
Python example
bulk import example
local files
from labelbox import Client
from uuid import uuid4 ## to generate unique IDs
import datetime
client = Client( api_key = "<YOUR_API_KEY>" )
dataset = client.create_dataset( name = "Bulk import example - HTML" )
assets = [
{
"row_data" : "https://storage.googleapis.com/labelbox-datasets/html_sample_data/sample_html_1.html" ,
"global_key" : "https://storage.googleapis.com/labelbox-datasets/html_sample_data/sample_html_1.html" ,
"metadata_fields" : [{ "name" : "<metadata_field_name>" , "value" : "tag_string" }],
"attachments" : [{ "type" : "HTML" , "value" : "https://storage.googleapis.com/labelbox-sample-datasets/Docs/windy.html" }]
},
{
"row_data" : "https://storage.googleapis.com/labelbox-datasets/html_sample_data/sample_html_2.html" ,
"global_key" : "https://storage.googleapis.com/labelbox-datasets/html_sample_data/sample_html_2.html" ,
"metadata_fields" : [{ "name" : "<metadata_field_name>" , "value" : "tag_string" }],
"attachments" : [{ "type" : "TEXT_URL" , "value" : "https://storage.googleapis.com/labelbox-sample-datasets/Docs/text_attachment.txt" }]
}
]
task = dataset.create_data_rows(assets)
task.wait_till_done()
print (task.errors)