A/B testing — side-by-side model/prompt comparison in chat #2895

## Is your feature request related to a problem? Please describe.

I need to compare responses from two models (or two prompts) to the same question, but Chainlit has no way to show them side by side.

## Describe the solution you'd like

An A/B testing mode: one user message produces two responses, displayed in a split view, with the two variants running in parallel on the same conversation. The user picks the better one (A, B, or tie, with an optional comment), and the preference is stored in the data layer.
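To make the requested flow concrete, here is a minimal sketch in plain `asyncio`, not tied to any existing Chainlit API. The names `Variant`, `Preference`, `run_ab`, and the two stand-in variants are all hypothetical; a real implementation would call the models behind each variant and persist the record through the data layer.

```python
import asyncio
from dataclasses import dataclass
from typing import Awaitable, Callable, Literal, Optional

# Hypothetical type: a variant is any async function mapping a user message to a reply.
Variant = Callable[[str], Awaitable[str]]


@dataclass
class Preference:
    """Hypothetical record of one A/B vote (not an existing Chainlit type)."""
    message: str
    response_a: str
    response_b: str
    choice: Literal["A", "B", "tie"]
    comment: Optional[str] = None


async def run_ab(message: str, variant_a: Variant, variant_b: Variant) -> tuple[str, str]:
    """Send the same message to both variants concurrently and return both replies."""
    reply_a, reply_b = await asyncio.gather(variant_a(message), variant_b(message))
    return reply_a, reply_b


# Stand-in variants for illustration; a real app would call a model here.
async def echo_upper(message: str) -> str:
    await asyncio.sleep(0.01)
    return message.upper()


async def echo_reversed(message: str) -> str:
    await asyncio.sleep(0.01)
    return message[::-1]


def demo() -> Preference:
    """Run both variants on one message and build the preference record."""
    reply_a, reply_b = asyncio.run(run_ab("hello", echo_upper, echo_reversed))
    # In the proposed feature, the choice and comment would come from the user.
    return Preference("hello", reply_a, reply_b, choice="A", comment="clearer")
```

`asyncio.gather` keeps the two calls concurrent, so the split view would wait roughly as long as the slower variant rather than the sum of both.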

The two variants can be:
- Two different Chat Profiles (model A vs model B, or prompt A vs B)
- Same Chat Profile with different Chat Settings (e.g., temperature 0.3 vs 0.9)
- Two independent runs of the same config (to measure variance)
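The three cases above could be described with a single small data structure. The sketch below is only an illustration: `VariantConfig` and `describe_pair` are hypothetical names, and the profile names and settings keys are placeholders rather than real Chainlit objects.

```python
from dataclasses import dataclass, field
from typing import Any


@dataclass
class VariantConfig:
    """Hypothetical description of one side of the comparison."""
    profile: str                                   # Chat Profile name
    settings: dict[str, Any] = field(default_factory=dict)  # Chat Settings overrides


def describe_pair(a: VariantConfig, b: VariantConfig) -> str:
    """Classify a pair of variants into one of the three cases listed above."""
    if a.profile != b.profile:
        return "different profiles"
    if a.settings != b.settings:
        return "same profile, different settings"
    return "repeat run of the same config"
```

For example, `describe_pair(VariantConfig("profile-a", {"temperature": 0.3}), VariantConfig("profile-a", {"temperature": 0.9}))` falls into the second case.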

## Describe alternatives you've considered

- Chat Profiles + manual switching between two separate conversations — no side-by-side, no structured preference capture.
- External A/B tooling (LangSmith, PromptFoo…) — works offline but doesn't capture real in-app user preferences.

## Prerequisite

This feature depends on the ability to run two Chat Profiles in parallel on the same conversation. As a first step, hot Chat Profile swapping (changing profile without creating a new chat) is needed — tracked in #2899.


## Example
<img width="1999" height="792" alt="Image" src="https://github.com/user-attachments/assets/879ce653-fda3-4098-8025-88274ade6265" />
