Find text
Search for text and uncover assets in Catalog.
When working with text data such as text, conversational text, documents, or HTML, you can use the find text filter in Catalog to surface data rows that contain specific keywords.
How the find text filter works
When you use the find text filter, you are allowed to specify multiple keywords in the expression. Labelbox will then surface data rows that contain those words.
Character/word limits
To view the minimum & maximum characters/words for the find text filter, visit our limits page.
If multiple words are provided in the expression, Labelbox will surface data rows in this order:
- Data rows containing all words
- Data rows containing all words except one
- Data rows containing all words except two
... and so on until all results containing at least one of the provided words are returned.
Supported media types
The filter is supported for the following data types:
Media type | Limit |
---|---|
Text | First 64k characters |
Conversational text | First 64k characters |
HTML | First 64k characters |
Documents | First 64k characters |
The limits above indicate the last character in the media file that the filter will reach. For example, with the current limits, if the keyword you are searching for appears after the 64,000th character in the media file, the data row will not appear in the filter results.
Combine find text with other filters
You can combine the find text filter with other filters in Catalog. Some filters are best used for targeting unstructured data and others are best for targeting structured data.
Combine the find text with the following filters to target data rows by structured data:
Combine find text with the following filters to target data rows by unstructured data:
Automate data curation with slices
After you populate the Catalog filters, you can save these filters as a slice of data. When you save a filter as a slice, you will not need to populate the same filters over and over again. Also, slices are dynamic, so any new incoming data rows will automatically appear in the relevant slices.
Read through the following resources to learn how to take action on the filtered data.
Updated 3 months ago