Voice Agent Prompt Engineer (VAPI) + Agent Evals

Remote, USA Full-time
We are hiring an experienced Voice Agent Prompt Engineer to refine and productionize system prompts for a VAPI-based real-time voice agent, using a structured evaluation-driven approach. This role is not just prompt writing. You will be expected to use agent evaluation tools (e.g., Langfuse) to test, score, and iteratively improve agent behavior until it meets strict business and conversational quality standards. Scope of Work Part 1: Prompt Engineering (Voice Agent) You will refine three separate system prompts, each for a distinct voice call use case: 1.24-hour pre-event reminder Confirm attendance, reduce no-shows, handle reschedules gracefully, and remain elegant and non-salesy. 2. Post-event feedback collection (attended) Collect structured feedback, surface business insights, and drive survey completion naturally. 3. Post-event follow-up (did not attend) Understand reasons for no-shows, reduce friction for future attendance, and log objections and intent signals. Business-Intelligent Behavior (Required) Across all prompts and test cases, the agent must: • Prioritize relationship and retention • Handle objections strategically • Capture structured insights (reason codes, sentiment, likelihood to attend again) • Remain concise and voice-natural • Never over-explain, argue, loop, or sound salesy Agent Evals (Mandatory) You must use agent evaluation tooling (Langfuse or equivalent) to drive refinement. Evaluation Requirements • Create agent evals for each use case • Minimum 20 test cases per use case • Each test case must define: o user utterance(s) o expected agent behavior o explicit pass/fail criteria Evaluation Dimensions (Examples) • Intent adherence • Conciseness • Tone consistency • Objection handling quality • Repetition avoidance • Data capture accuracy • Escalation / DNC compliance • Interruption handling Prompt iterations must be based on eval results, not intuition. ________________________________________ Deliverables Part 1 • 3 production-ready system prompts (Markdown, copy-paste ready) • Each prompt includes: o intent boundaries o anti-repetition rules o escalation + “do not record me” handling o question discipline (max 1–2 per turn) o voice/TTS phrasing and punctuation guidance • Agent eval test suite o 60+ total test cases o clear scoring or pass/fail logic • Optional: output schema guidance (attendance, sentiment, follow-up, etc.) ________________________________________ Part 2: VAPI Performance Tuning (Bonus / Add-On) Review our existing VAPI setup and suggest improvements to: • reduce perceived latency • improve turn-taking • handle interruptions more naturally • avoid awkward silences ________________________________________ Tools & Stack • VAPI (voice orchestration) • Deepgram (STT) • ElevenLabs (TTS) • Langfuse (agent evals & tracing) • LLM provider will be shared after onboarding ________________________________________ Required Experience • Voice agents deployed in production • Prompt engineering specifically for spoken conversations • Hands-on experience with Langfuse or similar eval frameworks • Strong understanding of real-time voice latency and interruption behavior • Ability to design test suites and evaluation criteria, not just prompts _______________________________________ Apply tot his job
Apply Now

Similar Jobs

Entry-Level Proofreader (Remote - English Content)

Remote, USA Full-time

Proofreader - Remote English Editing Jobs

Remote, USA Full-time

Remote Proofreader Jobs – Flexible Hours – Beginners Welcome

Remote, USA Full-time

Proofreader Remote

Remote, USA Full-time

SR Property Accountant

Remote, USA Full-time

Remote Property Accountant

Remote, USA Full-time

AI Prompt Engineer & RAG expert - Contract to Hire

Remote, USA Full-time

Property Accountant | Arrived | Remote US

Remote, USA Full-time

[Remote] Transaction Analyst - High-Tech Client

Remote, USA Full-time

[Remote] Proofreader

Remote, USA Full-time

Protocol Software Engineer (Job Code 92767)

Remote, USA Full-time

Customer Acquisition Account Executive

Remote, USA Full-time

E-commerce Customer Service Assistant for Fast-Growing Sports Equipment Brand blithequark - Shopify / Amazon - Contract to Hire

Remote, USA Full-time

Experienced Remote Data Entry Specialist - Part Time Opportunity at blithequark

Remote, USA Full-time

Operating Room Per Diem (PRN) Registered Nurse – Immediate Start Opportunity with San Marcos Surgery Center, HCA Healthcare Network

Remote, USA Full-time

Remote Senior Accountant, Client Services Team

Remote, USA Full-time

Market Surveillance Analyst

Remote, USA Full-time

Senior Prompt Engineer, NYC

Remote, USA Full-time

Experienced Pre-Licensed Customer Service Representative for Dynamic Insurance Support – Remote Opportunity with Professional Growth and Development

Remote, USA Full-time

Case Manager Registered Nurse - Field (Monmouth County, New Jersey)

Remote, USA Full-time
Back to Home