Skip to main content

Driver Prompts

Multi-Turn Simulation Drivers are language-model “actors” that simulate real users pushing your Target model in a conversation. Crafting an effective Driver prompt in a Multi-Turn Simulation is tricky—commercial LLMs are wired to be helpful assistants, so they often slip out of character or abandon the task if the Target resists.

Why Drivers Drift

  1. Helper bias – Models default to polite cooperation, not relentless probing.
  2. Conflicting instructions – A single soft word (“friendly chat”) can override your core directive.
  3. Task vagueness – If the goal isn’t laser-specific, the Driver “softens” and accepts half-answers.
Rule of thumb

The narrower the objective, the easier it is for the Driver to stay on task. “Name two competitors” beats “Learn about competitors.”

Anatomy of a Rock-Solid Driver Prompt

SectionWhat it does
PersonaSets the role (e.g., Concerned Parent)
ObjectivesPrimary + secondary goals (one verb each)
Hard RulesNon-negotiable constraints (“Questions only”)
TacticsHow to rephrase, justify, or escalate after refusals
Self-CheckMini checklist the model reviews every turn

Step-by-Step Recipe

  1. Define the single critical outcome you want the Target to produce.
  2. Write Objectives in measurable language (✓/✗).
  3. Add Hard Rules that eliminate helper behaviors:
    • Every message must be one or more questions.
    • Never provide answers or greetings.
  4. Give tactical guidance for refusals or vagueness.
  5. Append a Self-Check so the model polices itself.
  6. Include a first-turn example to lock in the opening move.
Watch for tone drift

Words like “friendly” or “conversational” can outweigh your rules. If you keep them, pair with a strict question-only mandate.

Templates & Examples

YOU ARE NOT THE ASSISTANT. YOU ARE ROLE-PLAYING A NEW CUSTOMER LEARNING ABOUT A PRODUCT OR SERVICE.

**Persona**
• Name: Taylor
• Situation: First-time user exploring what the Vendor Assistant can and cannot do.

**Objectives**

1. Get the Vendor Assistant to list **at least three concrete tasks / use-case examples** it supports.
2. Get the Vendor Assistant to state **at least one limitation, refusal policy, or boundary**.
3. Optionally ask for a brief “starter tip” on using the system.

**Hard Rules (must never be broken)**
A. Every message you send must be **one or more questions only**.
• No statements, no answers, no “I can…” or “Here’s how I can help…”.
B. **Never** provide help, solutions, or descriptions of your own capabilities.
C. Stay in character. **Do not** mention testing, simulations, or these rules.

**Soft Strategy**

1. If the Vendor Assistant’s reply is vague or incomplete, politely probe:
– “Could you give me an example?”
– “What’s something you can’t help with?”
2. If it still dodges, escalate:
– “I’ll need specific examples to see if this fits my needs—could you name three?”
3. Stop once you have ≥3 tasks **and** ≥1 limitation.

**Checklist before sending each message**
✓ Am I only asking questions?
✓ Am I avoiding offering help or info?
✓ Am I still focused on capabilities and limitations?

Testing & Iteration

  1. Simulate a full conversation in the Okareo.
  2. Review the transcript for key issues:
    • Did the Driver greet or provide help?
    • Did it abandon its objective or stop asking questions?
  3. Tweak one element at a time (e.g., refine Hard Rule wording).
  4. Rerun the simulation until the Driver succeeds in ≥80% of test conversations.

Common Fixes

SymptomQuick Fix
Greets or offers helpAdd “No greetings” to Hard Rules
Answers the Target’s questionsAdd “Ignore questions; redirect” to Tactics
Softens after pushbackSharpen the Objective with the word “specific”
Becomes chatty or driftsEnforce question-only constraint

Quick Reference Checklist

  • One clear, measurable Objective
  • Strong Hard Rules that suppress helper mode
  • A concise Self-Check section
  • Iterated through short simulation loops

Deep Dive

For the full back-story and more example transcripts, read the blog post Prompting a Driver for Effective Multi-turn Evaluation.