AI Agent Evaluation Analyst for Autonomous Agents (No coding required)

Remote, USA Full-time

We’re hiring detail-oriented, analytical contributors to help test and improve autonomous AI agent evaluations. This is part-time, fully remote work with flexible hours, ideal for people who enjoy finding edge cases, questioning assumptions, and strengthening complex systems. What you’ll do • Review and refine agent evaluation tasks and scenarios for logic, completeness, and realism • Identify inconsistencies, ambiguities, and missing assumptions • Define gold-standard expected behaviors for agents • Annotate reasoning paths, cause-effect relationships, and plausible alternatives • Collaborate with QA, writers, and developers to suggest refinements and expand edge case coverage • Ensure autonomous agents are tested thoroughly and realistically What we’re looking for • Strong analytical thinking and excellent attention to detail • Fluent written English with clear documentation skills • Comfort reading structured formats such as JSON or YAML (no need to write code) • Ability to reason about complex systems and spot what could break or be misinterpreted Nice to have Prior exposure to QA/test-case thinking, logic puzzles, or evaluation frameworks Apply tot his job

Apply Now

Experienced Disney Remote Customer Service Representative - Part-Time, Flexible Schedule, Competitive Pay, and Magical Benefits

Remote, USA Full-time

PANCE/PANRE Tutor/Instructor

Remote, USA Full-time

Back to Home

AI Agent Evaluation Analyst for Autonomous Agents (No coding required)

Similar Jobs

Lead Agentic AI Developer

Senior Technical Writer, Business Analyst with Gen. AI skills

AI Automation Specialist - Remote US

AI Automation Developer for Ongoing Work (AI, n8n, Make.com, Voiceflow) - Contract to Hire

AI Automation Specialist/Remote View Position

AI Agent Developer to Build an Autonomous Instagram Marketing System (Strategy + Automation)

AI Automation Engineer – Build Internal Business Applications

AI Automation Engineer – Extend Existing Salesforce-Based AI Outreach System - N8n / Salesforce

AI Automation Engineer (n8n + Playwright) for Google Flow Video Generation

Lead Data Engineer + AI Client - Altimetrik Takeda Location: Remote Need minimum 3 years of experien

Data Entry Remote Work

[Hiring] Inbound Outbound Queue Associate @CVS Health

Remote Part Time Data Entry / Typing Work From Home Opportunity with Remote Staffing

Social Media Strategist Needed to Build a Private Instagram Page Alongside Our Business Account

Clinical Quality Analyst RN - Clinical Practice...

[Remote] Amazon Account Strategist - Amazon-specific, Seller & Vendor Central required!

Food And Beverage Data Entry Specialist

Experienced Part-Time Remote Live Chat Representative – Delivering Exceptional Customer Experiences at blithequark

Experienced Disney Remote Customer Service Representative - Part-Time, Flexible Schedule, Competitive Pay, and Magical Benefits

PANCE/PANRE Tutor/Instructor

AI Agent Evaluation Analyst for Autonomous Agents (No coding required)

Similar Jobs

Lead Agentic AI Developer

Senior Technical Writer, Business Analyst with Gen. AI skills

AI Automation Specialist - Remote US

AI Automation Developer for Ongoing Work (AI, n8n, Make.com, Voiceflow) - Contract to Hire

AI Automation Specialist​/Remote View Position

AI Agent Developer to Build an Autonomous Instagram Marketing System (Strategy + Automation)

AI Automation Engineer – Build Internal Business Applications

AI Automation Engineer – Extend Existing Salesforce-Based AI Outreach System - N8n / Salesforce

AI Automation Engineer (n8n + Playwright) for Google Flow Video Generation

Lead Data Engineer + AI Client - Altimetrik Takeda Location: Remote Need minimum 3 years of experien

Data Entry Remote Work

[Hiring] Inbound Outbound Queue Associate @CVS Health

Remote Part Time Data Entry / Typing Work From Home Opportunity with Remote Staffing

Social Media Strategist Needed to Build a Private Instagram Page Alongside Our Business Account

Clinical Quality Analyst RN - Clinical Practice...

[Remote] Amazon Account Strategist - Amazon-specific, Seller & Vendor Central required!

Food And Beverage Data Entry Specialist

**Experienced Part-Time Remote Live Chat Representative – Delivering Exceptional Customer Experiences at blithequark**

**Experienced Disney Remote Customer Service Representative - Part-Time, Flexible Schedule, Competitive Pay, and Magical Benefits**

PANCE/PANRE Tutor/Instructor

AI Automation Specialist/Remote View Position

Experienced Part-Time Remote Live Chat Representative – Delivering Exceptional Customer Experiences at blithequark

Experienced Disney Remote Customer Service Representative - Part-Time, Flexible Schedule, Competitive Pay, and Magical Benefits