Workbench

Run the same prompt across multiple models and temperatures. Results are cached automatically — re-runs are instant.

Experiment Setup
gpt-4o-miniclaude-3.5-haiku

Results

gpt-4o-mini

Standby

Ready to test this model

claude-3.5-haiku

Standby

Ready to test this model