$70-$100 / hr
Hourly contract Remote
About the Project
We're building a large-scale benchmark to test how well advanced AI systems can solve hard scientific and engineering problems. As a task designer, you'll create challenging computational problems that check whether AI can use real scientific software to do research-level work — running simulations, interpreting results, designing experiments, and uncovering hidden information from data.
This isn't a typical data-labeling job. You'll design original, graduate-level problems based on real scientific workflows, test them against cutting-edge AI models, and fine-tune them until the difficulty is just right.
What You'll Do
You'll create problems that require skilled use of specialized scientific software. Some will ask the AI to compute exact answers from a fully defined setup — testing whether it can correctly carry out complex, multi-step workflows. Others will be harder: the AI must plan a series of queries or experiments to uncover information that isn't directly visible, which means thinking strategically about what to measure, how to read partial results, and how to narrow down the possibilities efficiently.
Each problem goes through a testing loop against state-of-the-art AI models, and you'll refine it until it hits the target difficulty.
Domains & Tools We're Hiring For
We're especially interested in experts with deep, hands-on experience in:
Astrophysics & Cosmology — working with astropy and related tools for cosmological calculations, angular power spectra, galaxy survey analysis, and observational data reduction pipelines.
Experience with other specialized software in this domain will also be considered.
What Makes a Strong Candidate
You have graduate-level expertise (MS or PhD preferred) in the domain above, with real hands-on experience using these tools — not just theoretical knowledge. You've written code using these libraries to solve actual research problems, and you understand where they break, what their edge cases are, and what makes a problem genuinely hard rather than just complicated.
Beyond domain expertise, the best candidates think like puzzle designers: building problems where the challenge comes from smart reasoning rather than raw computation, where several approaches seem plausible but only careful analysis reveals the right one, and where surface-level pattern matching won't get you to the answer.
Requirements
Nice to Have
Please note: This application includes a coding assessment as part of the evaluation process.
We consider all qualified applicants without regard to legally protected characteristics and provide reasonable accommodations upon request.
Mercor partners with leading AI labs and enterprises to train frontier models using human expertise. You will work on projects that focus on training and enhancing AI systems. You will be paid competitively, collaborate with leading researchers, and help shape the next generation of AI systems in your area of expertise.
To apply, please visit work.mercor.com.
Don't forget to mention that you found the position on jobRxiv!
