Mercor is looking for a Bilingual Simplified Chinese STEM Expert with experience in biology/physics/chemistry theory and problem-solving to develop and refine high-quality reasoning questions that evaluate AI models. Your knowledge in biology/physics/chemistry will ensure the accuracy, rigor, and instructional quality of these test items. You will create and evaluate Simplified Chinese/English prompts and responses, ensuring scientific precision while maintaining clear, natural Simplified Chinese phrasing and alignment with English where needed.
Design and Optimize STEM-Based Prompts (Bilingual): Create detailed prompts in Simplified Chinese and/or English with multiple constraints and scientific instructions.
Define and Document Evaluation Standards: Establish high-level expectations for correct responses in STEM contexts, and develop comprehensive rubrics that account for scientific rigor and—when in Simplified Chinese—linguistic clarity and appropriate academic tone.
Conduct Model Testing and Grading (Bilingual): Run prompts through models and assess preliminary outputs against expectations for scientific accuracy, reasoning quality, and clarity, comparing Simplified Chinese vs. English where needed.
Support Benchmarking and Quality Assurance: Collaborate in QA review processes to ensure prompt tasks and rubrics meet scientific rigor, maintaining consistency and reliability across Simplified Chinese-language benchmarks before integration into official benchmarks.
Native-level fluency in Simplified Chinese (written) with strong reading/writing ability in English
BS or BA in Biology/Physics/Chemistry
Familiarity with undergraduate-level biology/physics/chemistry topics
Strong writing and critical thinking skills
Ability to work independently and meet deadlines
Complete an AI-led interview (around 15 minutes)
Complete a 45-minute written assessment that will guide you through writing rubrics
If selected, you will be invited to work on the project
Expect to contribute at least 20 hours per week
Expect a commitment of around 1 month
You'll be working in a structured project environment with clear goals and tools
We consider all qualified applicants without regard to legally protected characteristics and provide reasonable accommodations upon request.
Mercor partners with leading AI labs and enterprises to train frontier models using human expertise. You will work on projects that focus on training and enhancing AI systems. You will be paid competitively, collaborate with leading researchers, and help shape the next generation of AI systems in your area of expertise.
Tagged as: Chemistry, Computer Science, Data Science, Earth Science, Engineering, Environmental Science, Life Sciences, Mathematics, Physics
Scientist At GSK, we have bold ambitions for patients, aiming to positively impact the health of 2.5 billion people by...
ApplyScientist – Development Pilot Line This role is part of the Upstream Pilot Line, which operates within the Bioprocess Research...
ApplyLead Data Governance Analyst The Lead Data Governance Analyst is responsible for establishing, implementing, and maintaining data governance frameworks that...
ApplyDirector, Pharmacovigilance Scientist Challenging. Meaningful. Life-changing. Those aren't words that are usually associated with a job. But working at Bristol...
ApplyAssociate Scientist, In-Vivo CAR-T Therapeutics At Lilly, we unite caring with discovery to make life better for people around the...
ApplyWitztum Lab – Junior Specialist The Department of Radiation Oncology at UCSF, in the laboratory of Dr. Alon Witztum (Clinical...
ApplyPlease visit work.mercor.com.
Don't forget to mention that you found the position on jobRxiv!
