Generalist Evaluator Expert

Remote, USA Full-time
Mercor is seeking detail-oriented writing experts to contribute to a high-impact AI research project with a leading lab. Freelancers will author prompt–golden answer pairs that train and evaluate advanced language models. This is a short-term, flexible opportunity for professionals with strong academic backgrounds and a knack for instructional clarity. It is ideal for those who enjoy distilling complex concepts into well-crafted text.

* * *

### **Job Details:**

- **Design and Optimize Prompts**: Create detailed prompts with multiple constraints and instructions.
- **Define and Document Evaluation Standards**: Establish high-level expectations for correct responses in general consumer contexts, and develop comprehensive rubrics.
- **Conduct Model Testing and Grading**: Run prompts through models and assess preliminary outputs against expectations.
- **Support Benchmarking and Quality Assurance**: Collaborate in QA review processes to ensure prompt tasks and rubrics are rigorous, consistent, and reliable before integration into official benchmarks.

### **Minimum Qualifications:**

- BS or BA from a reputable institution, completed or in progress.
- Strong writing and critical thinking skills.
- Ability to work independently and meet deadlines.
- Significant familiarity with ChatGPT or similar tools for personal decision-making, hobbies, or general interests.
- Based in the US or Canada.

### **Preferred Qualifications:**

- Experience in teaching or research.

### **Application & Onboarding Process:**

- Complete an AI-led interview, which should take around 15 minutes.
- Complete a 45-minute written assessment that will guide you through writing rubrics.
- If selected, you will be invited to work on the project.

### **More Details About This Role:**

- This is a **remote and asynchronous** role: work on your own schedule.
- Expect to contribute at least **20 hours per week**.
- Expect a commitment of around 1 month.
- You’ll be working in a structured project environment with clear goals and tools.

* * *

### **About Mercor**

- Our team is based in San Francisco, CA.
- We specialize in recruiting experts for top AI labs.
- Our investors include Benchmark, General Catalyst, Adam D’Angelo, Larry Summers, and Jack Dorsey.
