📘

Supported file formats and size

Formats: PDF
Recommended size: Labelbox recommends documents of 100 pages or fewer. This applies to all import methods (upload, IAM Delegated Access, signed URLs).

❗️

Cloud-hosted assets

Cloud-hosted files must be provided as https URLs; http URLs are not supported.
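As a quick pre-import check, the https requirement above can be tested with the Python standard library. This is a minimal sketch, not part of any Labelbox tooling: the helper name `is_https_url` is hypothetical, and the `http://` URL is an invented counterexample used only for illustration.

```python
from urllib.parse import urlparse


def is_https_url(url: str) -> bool:
    """Return True only for absolute URLs whose scheme is https."""
    parsed = urlparse(url)
    return parsed.scheme == "https" and bool(parsed.netloc)


urls = [
    "https://storage.googleapis.com/jakub-test/bitcoin.pdf",
    # Hypothetical bad entry: same path, but served over plain http.
    "http://storage.googleapis.com/jakub-test/bitcoin.pdf",
]

# Collect every URL that would not be accepted under the https-only rule.
rejected = [u for u in urls if not is_https_url(u)]
print(rejected)
```

Running a check like this before building the import file makes it easier to catch unsupported URLs early.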

When importing document data to Labelbox, your JSON file must include the following information for each document.

| Parameter | Required | Description |
| --- | --- | --- |
| externalId | Yes | User-generated file name or ID for the file. For the best experience, this ID should be unique. |
| pdfUrl | Yes | https path to a cloud-hosted PDF. For IAM Delegated Access, this URL must be in virtual-hosted-style format. In older regions, S3 object URLs may use the https://<bucket-name>.s3-<region>.amazonaws.com/<key> format. If your object URLs are formatted this way, convert them to the virtual-hosted-style format (https://<bucket-name>.s3.<region>.amazonaws.com/<key>) before importing. |
| attachments | No | Learn more about attachments here. |
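The URL conversion described for pdfUrl can be sketched as a small rewrite step. This is an illustrative sketch under two assumptions: that the target format is the AWS virtual-hosted-style pattern `https://<bucket-name>.s3.<region>.amazonaws.com/<key>`, and that only the dash-style `s3-<region>` endpoint needs rewriting. The function name `to_virtual_hosted_style` and the dash-style sample URL are hypothetical.

```python
import re

# Older dash-style endpoint: https://<bucket-name>.s3-<region>.amazonaws.com/<key>
LEGACY_S3_URL = re.compile(
    r"^https://(?P<bucket>[^/]+?)\.s3-(?P<region>[a-z0-9-]+)\.amazonaws\.com/(?P<key>.*)$"
)


def to_virtual_hosted_style(url: str) -> str:
    """Rewrite a dash-style S3 URL; return any other URL unchanged."""
    match = LEGACY_S3_URL.match(url)
    if match is None:
        return url
    return "https://{bucket}.s3.{region}.amazonaws.com/{key}".format(
        **match.groupdict()
    )


# Hypothetical dash-style URL, used only to show the rewrite.
legacy = "https://lb-test-data.s3-us-west-1.amazonaws.com/invoice-7394.pdf"
print(to_virtual_hosted_style(legacy))
```

URLs that are already in virtual-hosted-style format, or that point to other providers, pass through untouched, so the function can be applied to every pdfUrl in a list.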

[
    {
        "externalId": "dogecoin-whitepaper",
        "pdfUrl": "https://storage.googleapis.com/jakub-test/dogecoin-whitepaper.pdf"
    },
    {
        "externalId": "bitcoin-whitepaper",
        "pdfUrl": "https://storage.googleapis.com/jakub-test/bitcoin.pdf"
    }
]

[
    {
        "externalId": "invoice-7394",
        "pdfUrl": "https://lb-test-data.s3.us-west-1.amazonaws.com/invoice-7394.pdf"
    },
    {
        "externalId": "invoice-7395",
        "pdfUrl": "https://lb-test-data.s3.us-west-1.amazonaws.com/invoice-7395.pdf"
    }
]
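Before importing, it can help to check a list of rows against the requirements above: both required fields present, an https pdfUrl, and ideally unique externalId values. The following is a minimal sketch using only the Python standard library. The function `validate_rows` and the deliberately broken sample rows are hypothetical, and it is not a Labelbox API.

```python
import json
from collections import Counter

REQUIRED_FIELDS = ("externalId", "pdfUrl")


def validate_rows(rows):
    """Return a list of problems; an empty list means the rows look importable."""
    problems = []
    for i, row in enumerate(rows):
        for field in REQUIRED_FIELDS:
            if not row.get(field):
                problems.append(f"row {i}: missing required field '{field}'")
        url = row.get("pdfUrl", "")
        if url and not url.startswith("https://"):
            problems.append(f"row {i}: pdfUrl must be an https URL")
    # Uniqueness is a recommendation in the table above, so report it as well.
    counts = Counter(row["externalId"] for row in rows if row.get("externalId"))
    for external_id, count in counts.items():
        if count > 1:
            problems.append(
                f"externalId '{external_id}' appears {count} times (should be unique)"
            )
    return problems


# Hypothetical rows: a duplicate ID, an http URL, and a missing pdfUrl.
rows = [
    {"externalId": "invoice-7394",
     "pdfUrl": "https://lb-test-data.s3.us-west-1.amazonaws.com/invoice-7394.pdf"},
    {"externalId": "invoice-7394",
     "pdfUrl": "http://lb-test-data.s3.us-west-1.amazonaws.com/invoice-7394.pdf"},
    {"externalId": "no-url"},
]

problems = validate_rows(rows)
for problem in problems:
    print(problem)

# Serialize only when nothing was flagged; the result matches the JSON layout above.
if not problems:
    print(json.dumps(rows, indent=4))
```

Collecting all problems in one pass, rather than stopping at the first, lets you fix a whole batch of rows before retrying the import.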