# Offline multimodal chat evaluation

> Learn how to create offline multimodal chat evaluation projects for ranking and classifying model outputs in imported conversation data.

The **offline multimodal chat evaluation** editor allows you to evaluate generative models by importing existing conversations and adding annotations to model responses. The editor supports various data types, including text, images, videos, audio, and PDFs.

## Set up offline multimodal chat evaluation projects

The following steps show how to set up an offline multimodal chat evaluation project on the Labelbox platform. To set one up using the SDK instead, see [Multimodal chat evaluation projects](/reference/multimodal-chat-evaluation-project).

### Step 1: Create a project

1. On the [Annotate projects page](https://app.labelbox.com/projects), click the **+ New project** button.

2. Select **Multimodal chat**, and then select **Offline multimodal chat**.

3. Provide a **name** and an optional **description** for your project.

### Step 2: Add data

Click the **Add data** button to select a [conversation v2 JSON](/reference/import-multimodal-chat-data#sample-conversation-v2-json) dataset or create a new dataset. Alternatively, you can [import data using the SDK](/reference/import-multimodal-chat-data).
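
For orientation, a conversation row can be thought of as a small graph of actors and messages. The following is a minimal sketch that builds one human prompt with two candidate model responses using only the Python standard library. The field names (`actors`, `messages`, `rootMessageIds`, `childMessageIds`), the `type` string, and the actor metadata are assumptions recalled from the linked sample conversation v2 JSON, and `build_conversation` is a hypothetical helper, so compare the output against that sample before importing.

```python
import json
import uuid


def build_conversation(prompt, responses):
    """Return a dict shaped like a conversation v2 payload (assumed schema)."""
    human_id = uuid.uuid4().hex
    # Placeholder metadata; see the linked sample for the real keys.
    actors = {human_id: {"role": "human", "metadata": {"name": "user"}}}
    messages = {}
    response_ids = []

    for index, text in enumerate(responses, start=1):
        # One model actor per candidate response, so responses can be compared.
        actor_id = uuid.uuid4().hex
        actors[actor_id] = {"role": "model", "metadata": {"name": f"model-{index}"}}

        message_id = uuid.uuid4().hex
        messages[message_id] = {
            "actorId": actor_id,
            "content": [{"type": "text", "content": text}],
            "childMessageIds": [],
        }
        response_ids.append(message_id)

    # The human prompt is the root message; the responses are its children.
    prompt_id = uuid.uuid4().hex
    messages[prompt_id] = {
        "actorId": human_id,
        "content": [{"type": "text", "content": prompt}],
        "childMessageIds": response_ids,
    }

    return {
        "type": "application/vnd.labelbox.conversational.model-chat-evaluation",
        "version": 2,
        "actors": actors,
        "messages": messages,
        "rootMessageIds": [prompt_id],
    }


conversation = build_conversation(
    "What is the capital of France?",
    ["Paris.", "The capital of France is Paris."],
)
print(json.dumps(conversation, indent=2))
```

Keeping each candidate response under its own model actor is a design choice of this sketch; it makes it easy to tell responses apart when ranking or selecting them later.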

### Step 3: Set up an ontology

[Create an ontology](/docs/labelbox-ontology#create-a-new-ontology) for evaluating model responses, such as the following example:

<Frame>
  <img src="https://mintcdn.com/labelbox-1db23ff4/sVOsXQe1Ox1EwD0L/images/docs/75c49ca-image.png?fit=max&auto=format&n=sVOsXQe1Ox1EwD0L&q=85&s=56414f02db70b9ce25042e213b4fddea" alt="" width="996" height="778" data-path="images/docs/75c49ca-image.png" />
</Frame>

The editor supports the following options:

| Feature                        | Description                                                                                                                                                                               | Export format                                                                          |
| ------------------------------ | ----------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------- | -------------------------------------------------------------------------------------- |
| **Message ranking**            | Rank multiple model-generated responses to determine their relative quality or relevance.                                                                                                 | [Payload](/reference/export-multimodal-chat-annotations#message-ranking)               |
| **Message selection**          | Select single or multiple responses that meet specific criteria.                                                                                                                          | [Payload](/reference/export-multimodal-chat-annotations#message-selection)             |
| **Message step reasoning**     | Break responses into steps and evaluate the accuracy of each step by selecting from **correct**, **neutral**, and **incorrect**. For incorrect steps, add a rewrite with a justification. | [Payload](/reference/export-multimodal-chat-annotations#message-step-reasoning)        |
| **Classification - Radio**     | Select one option from a predefined set.                                                                                                                                                  | [Payload](/reference/export-multimodal-chat-annotations#classification-radio)          |
| **Classification - Checklist** | Choose multiple options from a list.                                                                                                                                                      | [Payload](/reference/export-multimodal-chat-annotations#classification-checklist)      |
| **Classification - Free text** | Add free text annotations.                                                                                                                                                                | [Payload](/reference/export-multimodal-chat-annotations#classification-free-form-text) |

Classification tasks can apply globally to the entire conversation or individually to a message. They can also nest subclassification tasks.
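
The scoping and nesting described above can be pictured as a small tree. The following sketch models it with plain Python dataclasses; it does not use the Labelbox SDK, and every name in it (`Classification`, `Option`, `flatten`, the `scope` strings, and the example task names) is hypothetical and chosen only for illustration.

```python
from dataclasses import dataclass, field
from typing import List


@dataclass
class Option:
    value: str
    # Subclassification tasks nested under this option.
    children: List["Classification"] = field(default_factory=list)


@dataclass
class Classification:
    name: str
    kind: str   # "radio", "checklist", or "text"
    scope: str  # "global" (entire conversation) or "message" (one message)
    options: List[Option] = field(default_factory=list)


def flatten(classification, depth=0):
    """Yield (depth, name, scope) for a task and its nested subtasks."""
    yield depth, classification.name, classification.scope
    for option in classification.options:
        for child in option.children:
            yield from flatten(child, depth + 1)


# A message-level radio task with a nested checklist under one option.
accuracy = Classification(
    name="factual_accuracy",
    kind="radio",
    scope="message",
    options=[
        Option("accurate"),
        Option(
            "inaccurate",
            children=[
                Classification(
                    name="error_type",
                    kind="checklist",
                    scope="message",
                    options=[Option("hallucination"), Option("outdated")],
                )
            ],
        ),
    ],
)

# A conversation-level free-text task.
overall = Classification(name="conversation_notes", kind="text", scope="global")

for task in (accuracy, overall):
    for depth, name, scope in flatten(task):
        print("  " * depth + f"{name} ({scope})")
```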

<Info>
  ### Experimental feature

  Message step reasoning is an [experimental](/docs/product-release-phases#experimental) feature. Currently, you can't import step reasoning labels using the SDK.
</Info>
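
As a rough illustration of message step reasoning, the sketch below represents each step with one of the three verdicts and flags steps that look unfinished. It assumes, for illustration only, that every incorrect step should carry both a rewrite and a justification. `StepEvaluation` and `incomplete_steps` are hypothetical names, and this structure is not the Labelbox export format.

```python
from dataclasses import dataclass
from typing import List, Optional

VERDICTS = {"correct", "neutral", "incorrect"}


@dataclass
class StepEvaluation:
    text: str
    verdict: str
    rewrite: Optional[str] = None
    justification: Optional[str] = None


def incomplete_steps(steps: List[StepEvaluation]) -> List[int]:
    """Return 1-based positions of steps that still need attention."""
    flagged = []
    for position, step in enumerate(steps, start=1):
        if step.verdict not in VERDICTS:
            # Unknown verdict: the step has not been rated properly.
            flagged.append(position)
        elif step.verdict == "incorrect" and not (step.rewrite and step.justification):
            # Incorrect step without both a rewrite and a justification.
            flagged.append(position)
    return flagged


steps = [
    StepEvaluation("12 * 4 = 48", "correct"),
    StepEvaluation("48 + 10 = 68", "incorrect"),
    StepEvaluation(
        "So the answer is 68",
        "incorrect",
        rewrite="So the answer is 58",
        justification="Follows from the corrected sum 48 + 10 = 58.",
    ),
]
print(incomplete_steps(steps))
```

Here only the second step is flagged, because it is marked incorrect but has no rewrite or justification yet.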

### Step 4: Complete annotation tasks

Click the **Start labeling** button, and then annotate the model responses to evaluate them. Complete all tasks in your [workflow](/docs/workflows).
