Scientific Software Expert - Fully Remote | Up to $100/hr

Japan | Remote | Contract

<h3>About the job</h3>
<p><strong>Mercor</strong> connects elite creative and technical talent with leading AI research labs. We are headquartered in San Francisco, and our investors include <strong>Benchmark</strong>, <strong>General Catalyst</strong>, <strong>Peter Thiel</strong>, <strong>Adam D'Angelo</strong>, <strong>Larry Summers</strong>, and <strong>Jack Dorsey</strong>.</p>
<p><strong>Position:</strong> STEM Computational Scientific Software &amp; Evaluation Design<br><strong>Type:</strong> <strong>Contract</strong><br><strong>Compensation:</strong> <strong>$45–$100/hour</strong><br><strong>Location:</strong> <strong>Remote</strong><br><strong>Commitment:</strong> <strong>15–20 hours/week</strong></p>
<h3>Role Responsibilities</h3>
<ul><li>Design graduate-level computational problems using domain-specific scientific software libraries.</li><li>Evaluate AI models' ability to solve research-grade problems through strategic reasoning and problem-solving.</li><li>Develop and refine tasks through calibration loops with state-of-the-art AI models.</li><li>Collaborate asynchronously and work independently to meet deadlines and improve AI model performance.</li><li>Use <strong>Python</strong> to write problem setups, oracle functions, and solution validators in a <strong>Linux/terminal</strong> environment.</li></ul>
<h3>Qualifications</h3>
<h3><strong>Must-Have</strong></h3>
<ul><li><strong>Graduate-level training in a relevant STEM domain (<strong>MS, PhD, or equivalent research experience</strong>).</strong></li><li><strong>Proficiency with at least one scientific software library, evidenced by research publications, open-source contributions, or professional work.</strong></li><li><strong>Strong <strong>Python</strong> programming skills.</strong></li><li><strong>Ability to work independently and iterate on problem designs based on calibration feedback.</strong></li><li><strong>Comfort working in a <strong>Linux/terminal</strong> environment with remote compute sandboxes.</strong></li></ul>
<h3><strong>Preferred</strong></h3>
<ul><li><strong>Experience across multiple listed domains or tools.</strong></li><li><strong>Familiarity with benchmark or evaluation design.</strong></li><li><strong>Background in scientific pedagogy or exam/problem-set design.</strong></li><li><strong>Experience with computational reproducibility and containerized environments.</strong></li></ul>
<h3><strong>Application Process (takes 20–30 minutes to complete)</strong></h3>
<ul><li><strong>Upload resume</strong></li><li><strong>AI interview based on your resume</strong></li><li><strong>Submit form</strong></li></ul>
<h3><strong>Resources &amp; Support</strong></h3>
<ul><li><strong>For details about the interview process and platform information, please check: https://talent.docs.mercor.com/welcome</strong></li><li><strong>For any help or support, reach out to: support@mercor.com</strong></li></ul>
<p><strong><em>PS: Our team reviews applications daily. Please complete your AI interview and application steps to be considered for this opportunity.</em></strong></p>
<p>Originally posted on <a href="https://himalayas.app">Himalayas</a></p>
