Role Overview
Mercor is partnering with a leading AI research lab on Project Vesuvius an initiative designed to evaluate and enhance the ability of large language models (LLMs) to generate structured high-quality research plans for open-ended machine learning problems.
We are seeking Machine Learning Researchers and PhDs to serve as annotators who will assess and provide structured feedback on AI-generated research plans. The goal is to improve how LLMs function as brainstorming partners for machine learning research.
This is a remote short-term engagement with flexible hours and opportunities to contribute to frontier AI evaluation and research.
Key Responsibilities
Evaluate and compare AI-generated research plans for clarity feasibility and technical soundness.
Design and compile ML tasks based on real-world challenges and research competitions.
Draft detailed executable natural language plans for machine learning workflows.
Implement and validate research plans in Python within a Docker environment.
Assess outputs against structured rubrics provide usefulness scores and deliver concise objective feedback.
Ideal Qualifications
5 years of experience in applied machine learning or a PhD in machine learning or related fields.
Strong understanding of ML research methodologies experimental design and evaluation practices.
Excellent analytical and technical writing skills.
Experience with reproducibility or benchmarking in ML research preferred.
Detail-oriented and able to deliver high-quality structured feedback independently.
Engagement Details
Type: Independent contractor
Mode: Fully remote and asynchronous work from anywhere on your own schedule.
Commitment: Maximum 80 hours per week
Project Name: Vesuvius
This role is designed for researchers and engineers who value autonomy precision and meaningful contribution to frontier AI development.
Compensation & Contract Terms
Hourly Pay Rate: Up to $140/hour
Payment: Weekly via Stripe Connect
Contract Type: Independent contractor engagement
Structure: Remote milestone-based evaluation with flexible scheduling
Application Process
Submit your resume or CV highlighting relevant ML research or engineering experience.
Complete a short AI-based interview and a brief questionnaire about your experience with reproducibility and model benchmarking.
Selected candidates will receive detailed onboarding materials and access to the project environment.
About Mercor
Mercor is a global talent marketplace connecting exceptional professionals with leading AI labs and research organizations. Our mission is to empower experts to contribute directly to the most influential and technically advanced AI projects worldwide. Mercor is backed by investors including Benchmark General Catalyst Adam DAngelo Larry Summers and Jack Dorsey. Thousands of professionals across disciplines from research and engineering to law and design have joined Mercor to build the next generation of artificial intelligence systems.
...The Talent Acquisition Partner is responsible for recruiting management and leadership talent for the Collegiate Hospitality line of business supporting the Mid-Atlantic region. The Mid-Atlantic region covers Pennsylvania, Delaware, Maryland, and West Virginia....
Apply Description The Student Intern in Civil Engineering will help prepare and 3-D model creations and drawings associated with... ...Coordinates. Ability to utilize and import information from GIS databases. Basic to intermediate skills with AutoDesk products...
Office Manager Overview: System One is partnering with a client seeking a detail-oriented Office Manager/Bookkeeper to oversee daily administrative and financial duties. This role requires strong QuickBooks experience, proficiency in Microsoft Word and Excel, and the...
...shooting, lighting, audio and grip for KQED remote productions. This includes shooting on-camera interviews, b-roll and drone footage. This position requires proficiency in operating cinematic field cameras, audio gear, lighting and basic grip equipment. The on-call...
...**Job PostingParsons is looking for an amazingly talented **Construction** **Inspector** to join our team! In this role you will get to... ...such as OSHA 10 hour (minimum) / 30 hour training (preferred).**Security Clearance Requirement:**NoneThis position is part of our...