Sampling methods

You can automatically sample data from your filter query results to create a batch. To use sampling, you must configure two parameters:

  • Sampling method (random, ordered)
  • Number of Data Rows to include in the sampled batch (max of 25,000)
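The two parameters above can be sketched as a small validated configuration object. This is only an illustration and not part of any SDK: `SamplingConfig`, `MAX_SAMPLE_SIZE`, and `VALID_METHODS` are hypothetical names; only the two methods and the 25,000 limit come from this page.

```python
from dataclasses import dataclass

# Limit and method names taken from the parameter list above.
MAX_SAMPLE_SIZE = 25_000
VALID_METHODS = {"random", "ordered"}


@dataclass(frozen=True)
class SamplingConfig:
    """Hypothetical container for the two sampling parameters."""

    method: str  # "random" or "ordered"
    size: int    # number of Data Rows in the sampled batch

    def __post_init__(self):
        # Reject anything other than the two documented methods.
        if self.method not in VALID_METHODS:
            raise ValueError(f"method must be one of {sorted(VALID_METHODS)}")
        # Enforce the documented maximum batch size.
        if not 1 <= self.size <= MAX_SAMPLE_SIZE:
            raise ValueError(f"size must be between 1 and {MAX_SAMPLE_SIZE}")


config = SamplingConfig(method="random", size=100)
```

Validating both values up front means an oversized or misspelled request fails before any sampling is attempted.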

Random sampling

Random sampling is a useful selection technique when you are working with large amounts of data, because it gives you a manageable subset without reviewing every result. You can randomly select Data Rows from your Catalog query to create a batch for labeling or rework.

Here is an example to help you understand random sampling.

A user constructs a query to find plums in Catalog and clicks Sample. The user then sets the desired number of Data Rows in the batch to 100 and clicks Resample.

📘

Note

The random selection is always executed on the results of the query.

Creating a random sample of 100 data rows.
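Conceptually, random sampling draws rows without replacement from the query results only. The sketch below illustrates that idea with Python's standard library; it is not the Catalog implementation, and `random_sample` and `plum_rows` are hypothetical names standing in for the sampler and the results of the plum query.

```python
import random


def random_sample(query_results, size, seed=None):
    """Pick up to `size` distinct rows from the query results."""
    rng = random.Random(seed)  # a seed makes the draw repeatable
    # Never ask for more rows than the query returned.
    k = min(size, len(query_results))
    return rng.sample(query_results, k)


# Hypothetical stand-in for the rows matched by the plum query.
plum_rows = [f"plum-{i}" for i in range(1000)]

batch = random_sample(plum_rows, 100, seed=0)
```

Calling `random_sample` again with a different seed (or none) is loosely analogous to clicking Resample: the pool stays the same, but a new set of rows is drawn.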

Ordered sampling

Ordered sampling respects the sort order of the Data Rows shown in the results. You can sort the results by the Created At timestamp in ascending or descending order.

Here is an example to help you understand ordered sampling.

In the example below, a user constructs a query to find plums in Catalog and clicks Sample. The user then selects the Ordered parameter, sets the desired number of Data Rows in the batch to 100, and clicks Resample.

Using ordered samples with created at sort
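One way to picture ordered sampling is as a sort followed by a cut-off. The sketch below assumes the sample is the first N rows in the chosen Created At order, which this page does not state explicitly; `ordered_sample`, `plum_rows`, and the `created_at` field are hypothetical names used only for illustration.

```python
from datetime import datetime, timedelta


def ordered_sample(query_results, size, descending=False):
    """Sort rows by creation time, then keep the first `size` rows."""
    ordered = sorted(
        query_results,
        key=lambda row: row["created_at"],
        reverse=descending,  # True puts the newest rows first
    )
    # Assumption: the sample is the leading slice of the sorted results.
    return ordered[:size]


# Hypothetical stand-in for the plum query results, one row per minute.
start = datetime(2024, 1, 1)
plum_rows = [
    {"id": f"plum-{i}", "created_at": start + timedelta(minutes=i)}
    for i in range(1000)
]

oldest_first = ordered_sample(plum_rows, 100)
newest_first = ordered_sample(plum_rows, 100, descending=True)
```

Unlike the random case, repeating the call with the same sort direction returns the same rows each time.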
