$90-$110 per hour
Hourly contract Remote H C S
9 hired this month
Application Not started 0 of 3 steps completed 0%
Resume | Domain Expert Interview | Work Authorization
All application steps are reused whenever another role requires the same step, so you never have to upload your resume or take the same interview twice
We are building a benchmark dataset to evaluate AI models on professional document understanding and instruction following within the Technology domain.
Tasks consist of complex, multi-step requests grounded in real-world workspace files (technical specs, architecture docs, API references, codebases), web search, and code execution — each paired with a clearly defined ground truth output and an objective evaluation rubric. You will be responsible for authoring tasks that test an AI's ability to reason over technical documentation, follow precise instructions, and produce accurate, well-structured outputs.
We expect a minimum commitment of 15–20 hours per week.
Ideal candidates have 3+ years of hands-on experience in one or more of the following sub-domains:
We consider all qualified applicants without regard to legally protected characteristics and provide reasonable accommodations upon request.
Mercor partners with leading AI labs and enterprises to train frontier models using human expertise. You will work on projects that focus on training and enhancing AI systems. You will be paid competitively, collaborate with leading researchers, and help shape the next generation of AI systems in your area of expertise.
Share the referral link below, and earn up to $440 for each successful referral through this unique link. There's no limit on how many people you can refer. Restrictions may apply. Learn more
Tagged as: Data Science
Expert Professionals — STEM Research $70-$100 per hour W2 Contingent Role Remote Join a leading AI lab's cutting-edge research team...
ApplyPlease include your resume in your application. Genethra is seeking a Biological Data Platform & Knowledge Systems Engineer to...
ApplySTEM Computational Scientific Software & Evaluation Design – Structural & Mechanical Engineering $70-$100 per hour Hourly contract Remote About the...
ApplyPlease include your resume in your application email. Genethra is seeking a Systems Biology & Multi-Scale Modeling Scientist to...
ApplySTEM Computational Scientific Software & Evaluation Design – Electrical Engineering & RF/Circuit Design $70-$100 per hour Hourly contract Remote New...
ApplySTEM Computational Scientific Software & Evaluation Design – Astrophysics & Cosmology $70-$100 per hour Hourly contract Remote New opportunity Early...
ApplyPlease visit work.mercor.com.
Don't forget to mention that you found the position on jobRxiv!
