Task designer

Write the prompt as you would a production ticket, then add scored tests. Tool-call rows map to tools in the Email environment; the LLM judge and Node script fields are for export and grading in your own pipeline.

Task

Tests

No tests yet. Add at least one, or keep drafting the task first.