Workbench

Run the same prompt across multiple models and temperatures. Results are cached automatically, so re-runs are instant.
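As a rough illustration of the idea above, here is a minimal sketch of running one prompt over a grid of models and temperatures with a result cache. It is not the Workbench's actual implementation: `cache_key`, `run_grid`, and `fake_call_model` are hypothetical names, the cache is a plain in-memory dict, and the model call is a stub standing in for a real API client.

```python
import hashlib
import json
from itertools import product
from typing import Callable, Dict, List, Tuple


def cache_key(prompt: str, model: str, temperature: float) -> str:
    """Build a stable key from everything that defines one run (assumed key design)."""
    payload = json.dumps(
        {"prompt": prompt, "model": model, "temperature": temperature},
        sort_keys=True,
    )
    return hashlib.sha256(payload.encode("utf-8")).hexdigest()


def run_grid(
    prompt: str,
    models: List[str],
    temperatures: List[float],
    call_model: Callable[[str, str, float], str],
    cache: Dict[str, str],
) -> Dict[Tuple[str, float], str]:
    """Run the prompt for every (model, temperature) pair, reusing cached results."""
    results: Dict[Tuple[str, float], str] = {}
    for model, temperature in product(models, temperatures):
        key = cache_key(prompt, model, temperature)
        if key not in cache:
            # Cache miss: this is the only place the (slow) model call happens.
            cache[key] = call_model(model, prompt, temperature)
        results[(model, temperature)] = cache[key]
    return results


def fake_call_model(model: str, prompt: str, temperature: float) -> str:
    """Hypothetical stand-in for a real model API call."""
    return f"[{model} @ T={temperature}] {prompt}"


if __name__ == "__main__":
    cache: Dict[str, str] = {}
    models = ["gpt-4o-mini", "claude-3.5-haiku"]
    first = run_grid("Say hello.", models, [0.0, 0.7], fake_call_model, cache)
    second = run_grid("Say hello.", models, [0.0, 0.7], fake_call_model, cache)
    print(len(first), first == second)
```

In this sketch a re-run with an identical prompt, model, and temperature never reaches the model, which is why it returns immediately. One consequence of keying the cache this way is that a run at a nonzero temperature returns the same stored sample each time instead of a fresh one.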

Experiment Setup

Results

gpt-4o-mini

Standby

Ready to test this model

claude-3.5-haiku

Standby

Ready to test this model