Azure open AI Engineer

Remote, USA Full-time
Role: AI EVAL Engineer Location: Bellevue, WA (Remote) Duration: 6+ months AI EVAL Engineering Azure OpenAI; EVAL; Bench Marking Required Skills - Strong understanding of LLMs and generative AI concepts, including model behavior and output evaluation - Experience with AI evaluation and benchmarking methodologies, including baseline creation and model comparison - Hands-on expertise in Eval testing, creating structured test suites to measure accuracy, relevance, safety, and performance - Ability to define and apply evaluation metrics (precisionrecall, BLEUROUGE, F1, hallucination rate, latency, cost per output)Prompt engineering and prompt testing experience across zero-shot, few-shot, and system prompt scenarios - Python other programming languages, for automation, data analysis, batch evaluation execution, and API integration - Experience with evaluation tools/frameworks (OpenAI Evals, HuggingFace evals, Promptfoo, Ragas, DeepEval, LM Eval Harness) - Ability to create datasets, test cases, benchmarks, and ground truth references for consistent scoring - Test design and test automation experience, including reproducible evaluation pipelines - Knowledge of AI safety, bias, security testing, and hallucination analysis Nice-to-Have - RAG evaluation experience - Azure OpenAI - OpenAI - Anthropic - Google AI platforms - Performance benchmarking (speed, throughput, cost) - Domain knowledge Office apps enterprise systems networking Apply tot his job
Apply Now

Similar Jobs

Remote FP&A Manager - AI Trainer ($50-$60/hour)

Remote, USA Full-time

[Remote] AI Trainer -Remote English Content Editor

Remote, USA Full-time

AI Trainer, LLM

Remote, USA Full-time

Freelance Cybersecurity Analyst - AI Trainer

Remote, USA Full-time

Remote AI Writing Trainer

Remote, USA Full-time

[Remote] Manager, Localization - Americas

Remote, USA Full-time

Head of Engineering Trading Systems

Remote, USA Full-time

Overnight Staff Pharmacist - Englewood, CO, Amazon Pharmacy

Remote, USA Full-time

Senior Marketing Manager, Amazon Pharmacy

Remote, USA Full-time

Amazon Web Services (AWS) Engineer Pro Bono (Global/Remote): UniversalGiving® – Make a Global Impact

Remote, USA Full-time

Revit/BIM Specialist Needed – Convert 3D CAD Files or Solid Works Files into Revit (BIM) Models

Remote, USA Full-time

Principal Cloud Architect // 100% remote.

Remote, USA Full-time

Remote Real Estate Researcher

Remote, USA Full-time

Experienced Remote Data Entry Specialist – Virtual Administrative Support and Data Management Opportunity at arenaflex

Remote, USA Full-time

**Experienced Full Stack Insurance Sales Executive – Remote Work Opportunity with Unlimited Earnings Potential**

Remote, USA Full-time

**Experienced Data Entry Clerk – Remote Work Opportunity for Entry-Level Professionals at blithequark**

Remote, USA Full-time

Experienced Customer Care Specialist II – Delivering Exceptional Support in 401(k) Administration and Cross-Functional Environments at blithequark

Remote, USA Full-time

Experienced Remote Data Entry Clerk - Flexible Hours and Professional Growth Opportunities with Blithequark

Remote, USA Full-time

Lead to Cash PMO, Senior Analyst / Manager

Remote, USA Full-time

Experienced Entry-Level arenaflex Data Entry Specialist – Launch Your Career in E-commerce with No Prior Experience Required (Part-Time Opportunity)

Remote, USA Full-time
Back to Home