Purpose
Conduct the 15-minute conversational screen with the candidate. Score motivation, deal-breakers, role-fit signals.
Base model
Claude Sonnet 4.6 (via @anthropic-ai/sdk, zero retention) with tool use for structured outputs. OpenAI Whisper for STT.
Inputs
Live conversation audio (transcribed by Whisper), role context, candidate's CV summary. No protected-class signals.
Outputs
Per-trait scores (motivation, communication, deal-breakers), written reasoning per score, pass/review/reject for the next stage.
Evaluation
Golden dataset of 1,400 labelled screen conversations from beta; target accuracy 0.88; current 0.90. Calibration checked weekly against human recruiter ratings on a holdout set.
Limitations
Less reliable on candidates with strong accents or low-fluency English. Mitigation: candidates can switch language at any point; the language switch is logged but not used as a score input.
Last updated
22 May 2026, v1.0.