Computational Bayesian Statistics Expert for AI Evaluation Design [Remote]
$70 - $100 per hourSaidGig
- Remote job
This role involves designing original, graduate-level computational problems that challenge AI systems to utilize real scientific software tools for solving complex research-grade problems. You will create tasks that test AI''s ability to query simulations, interpret outputs, design experimental strategies, and extract hidden information from data.
Unlike typical annotation roles, your focus will be on calibrating these problems against cutting-edge AI models, iterating on their design to ensure the difficulty aligns with target benchmarks.
Key Responsibilities- Design computational problems that require advanced use of domain-specific scientific software libraries.
- Create tasks that test solvers on multi-step scientific workflows, ensuring precise outputs from fully specified setups.
- Develop complex queries or experimental designs to uncover non-visible information, requiring strategic reasoning.
- Engage in a calibration loop, testing each task against state-of-the-art AI models and refining designs based on feedback.
We are particularly interested in candidates with extensive hands-on experience in:
- Computational Bayesian Statistics and Applied Mathematics, utilizing libraries such as PyMC, PyStan, PyJAGS, and CmdStanPy.
- Applied mathematics and numerical PDEs, including tools like FEniCS, FEniCSx, DOLFINx, scikit-fem, FiPy, Devito, and Dedalus.
- Computational topology, using GUDHI.
- Differential algebra, including DACEyPy.
Experience with MCMC, Bayesian modeling, finite element or finite difference methods, mesh-based numerical modeling, computational topology, or other specialized Python-based computational methods is also valuable. Candidates are not required to have experience with all listed packages, but proficiency in any one will be highly regarded.
What Makes a Strong CandidateThe ideal candidate will possess:
- Graduate-level expertise (MS or PhD preferred) in the relevant domain, with practical experience using specific software tools.
- Proven ability to write code that utilizes these libraries to address real research problems, understanding their limitations and complexities.
- A puzzle designer mindset, capable of constructing problems that emphasize reasoning strategy over mere computation.
- Graduate-level training in a relevant STEM domain (MS, PhD, or equivalent research experience).
- Demonstrated proficiency with at least one of the specified scientific software libraries, supported by research publications, open-source contributions, or professional experience.
- Strong Python programming skills for writing problem setups, oracle functions, and solution validators.
- Ability to work independently and iterate on problem designs based on calibration feedback.
- Comfort with Linux/terminal environments and remote compute sandboxes.
This is a remote, hourly position.
CompensationHourly compensation ranges from $70 to $100.
$73 per hour
...Mindrift Project-Based AI Opportunities Mindrift connects specialists... ...tech companies, focused on testing, evaluating, and improving AI systems. Participation... ...tasks, contributors may: Design original computational statistics problems that simulate real...SuggestedHourly payPermanent employmentTemporary workPart timeRemote work10 hours per week$70 - $100 per hour
...creative and technical talent with leading AI research labs. Headquartered in San Francisco... ...Summers , and Jack Dorsey . Position: STEM Computational Scientific Software & Evaluation Design - Computational Bayesian Statistics and Applied Mathematics Type: Contract...SuggestedContract workSummer workRemote work$70 - $100 per hour
...OverviewJoin a leading AI lab's cutting-edge... ...-level subject-matter experts (SMEs) to contribute to... ...supporting a frontier-model evaluation effort focused on... ...deep domain expertise to design, solve, and evaluate... ...Biology, Mathematics, Statistics, Computer Science, Electrical Engineering...SuggestedHourly payContract workRemote workWeekday work$130 per hour
...Mercor is hiring Emergency Medicine experts in Miami Gardens to design clinical scenarios and evaluate AI model outputs against evidence-based standards. This remote, 100% asynchronous role requires board-certified attending physicians or final-year residents. The position...SuggestedRemote workFlexible hours$130 per hour
...Mercor is seeking Emergency Medicine experts to design clinical scenarios and evaluate AI models focusing on acute and emergency care. Candidates should be board certified attending physicians, final-year residents, or board-eligible fellows. This role involves creating...SuggestedHourly payRemote workFlexible hours$150 per hour
...Modern MedEd is hiring Psychiatry experts to design clinical scenarios and evaluate AI-generated model outputs in healthcare. This role requires board certification and offers remote, flexible participation at a rate of $150–$350/hr based on experience. Responsibilities...Remote workFlexible hours- Alignerr is seeking a Biology Subject Matter Expert for AI training. You will design advanced biology questions, evaluate AI performance, and provide feedback to enhance scientific reasoning in AI systems. The position is fully remote and flexible, allowing you to set...Remote jobFlexible hours
- Alignerr is seeking Chemistry Masters and PhDs to design, solve, and evaluate complex chemistry problems that contribute to AI understanding. This role is fully remote and... ...to directly impact AI training while collaborating with leading experts. #J-18808-Ljbffr AlignerrRemote jobFlexible hours
$70 - $100 per hour
...connects elite creative and technical talent with leading AI research labs. Headquartered in San Francisco, our... ...Summers , and Jack Dorsey . Position: STEM Computational Scientific Software & Evaluation Design - Seismology & Geophysics Type: Contract Compensation...Contract workSummer workRemote work- ...Alignerr is seeking Chemistry Experts (Masters and PhDs) to design and evaluate challenging chemistry problems that advance AI comprehension. This fully remote role allows you... ...communication skills, and experience with computational tools. Enjoy a flexible work schedule...Remote workFlexible hours
- ...A leading AI data platform is seeking individuals with Computer Science expertise to work as self-employed AI Trainers. This role involves completing tasks related to training and evaluating AI models. Successful applicants will have a strong understanding of programming...Hourly paySelf employmentRemote workFlexible hours
- ...developing and implementing AI/ML solutions to solve... ...-functional teams to design, develop, and deploy AI... ...making.Conduct statistical analyses using ML techniques... ...industry experience in Computer Science, Software Engineering... ...techniques to evaluate performance to choose the...Remote work
$35 per hour
...Mindrift Computational Physics Problem Designer Mindrift connects specialists with project-based AI opportunities for leading tech companies, focused on testing, evaluating, and improving AI systems. Participation... ...model's answer. As an expert author, you: Pick...Hourly payPermanent employmentTemporary workPart timeFreelanceRemote work10 hours per week$76 per hour
...connects specialists with project-based AI opportunities for leading tech companies, focused on testing, evaluating, and improving AI systems. Participation... ...involves unique tasks, contributors may: Design original computational physics problems that simulate real...Hourly payPermanent employmentTemporary workPart timeFreelanceRemote work10 hours per week$100 per hour
...technical talent with leading AI research labs.... ...Electrical Engineering Expert (Semiconductor / ASIC /... ...ASIC , or Physical Design workflows. Review semiconductor... .... Assist in evaluating and improving AI systems... ..., or high-performance compute systems. Application...Contract workSummer workRemote work$15 - $40 per hour
...As a Design Expert (Multidisciplinary), you will leverage your creativity and design expertise to enhance AI-driven design solutions. This role is integral to the internal evaluations team, where you will review, evaluate, and refine interview content that encompasses...Remote jobHourly payContract work- ...Engineer, Senior AI (IT Python and AI programming (Expert) III Summary CIO Architecture... ...Technology (IT) design, develop, maintain, support... ...leadership by evaluating and driving implementation... ...models. Utilizes big data computation and storage tools to create...Permanent employmentWork experience placementInterim roleRemote work
$60 - $100 per hour
...Engineering, Data Science, and Systems Design Experts, C++ (5+ YOE) Type: Hourly contract Compensation... ...Contract Work Role Responsibilities Evaluate LLM-generated responses to coding and... ...Requirements BS, MS, or PhD in Computer Science or a closely related field Real...Hourly payFull timeContract workPart timeRemote work$130 per hour
...Mercor is seeking board-certified Internal Medicine experts to design and evaluate clinical scenarios for AI healthcare models. This remote role involves designing realistic prompts, grading AI responses, and providing feedback. The position is remote and requires...Hourly payRemote work$130 per hour
...Mercor is seeking Internal Medicine experts to collaborate with an AI lab on high-quality healthcare reasoning data. You will design clinical scenarios and evaluate AI model outputs based on evidence-driven standards. This role is remote and offers competitive pay at...Remote work$130 per hour
...Mercor is seeking Internal Medicine experts to shape AI reasoning in healthcare. You will design clinical scenarios and evaluate model outputs.This remote role involves 20 hrs/week with performance-based onboarding and ongoing reviews. Compensation ranges from $130 to...Hourly payWork at officeRemote work$130 per hour
...Mercor is hiring Internal Medicine experts to design clinical scenarios and evaluate AI outputs for patient care. This remote, ongoing role requires board-certified attending physicians or final-year residents. Pay rates range from $130 to $300/hr based on seniority,...Remote work$130 per hour
...Mercor is seeking Emergency Medicine experts to collaborate with a leading AI lab in training models on healthcare reasoning data. The role involves designing clinical scenarios, evaluating model outputs, and providing feedback on AI responses. Ideal candidates include...Hourly payRemote workFlexible hours- ...Mercor is looking for Emergency Medicine experts to collaborate with a leading AI lab on healthcare reasoning data. The role involves designing clinical scenarios, evaluating AI responses, and providing feedback to enhance model accuracy. Ideal candidates include board...Remote workFlexible hours
- Biochemistry Subject Matter Expert (AI Training) About the Role... ...hours/week What You'll Do Design Advanced Problems - Create challenging... ...Audit AI Outputs - Evaluate AI-generated biochemical... ...review Familiarity with computational biochemistry tools such as PyMOL...Hourly payOngoing contractContract workFreelanceRemote workFlexible hours
$180 per hour
...Mercor is seeking Cardiology experts in California to design clinical scenarios and evaluate AI outputs in healthcare. Contributions will shape AI reasoning about cardiovascular care through a remote, asynchronous role. Board certified attending physicians or final...Hourly payRemote work$180 per hour
...Mercor is seeking Hematology / Oncology experts to help train AI models on healthcare reasoning data. You will design clinical scenarios, evaluate AI outputs against evidence-based standards, and contribute to AI reasoning about cancer and hematologic care. This role...Hourly payRemote workFlexible hours$180 per hour
...Mercor is seeking Hematology / Oncology experts to design clinical scenarios and evaluate AI model outputs in Manchester, New Hampshire. The role involves providing feedback on AI responses and contributing to improving model behavior. Ideal candidates include board-certified...Weekly payRemote work$130 per hour
...Mercor is seeking Emergency Medicine experts to train AI models with healthcare reasoning data. This role involves designing clinical scenarios and evaluating AI outputs. Candidates must be board certified physicians or final-year residents. The position is remote and...Remote work$130 per hour
...Mercor is seeking Internal Medicine experts for a remote role working with a leading AI lab. You'll design clinical scenarios and evaluate model outputs, influencing AI's reasoning in patient care. This position offers flexible hours, with pay rates between $130 and $...Hourly payRemote workFlexible hours
Do you want to receive more vacancies?
Subscribe and receive similar vacancies to Computational Bayesian Statistics Expert for AI Evaluation Design [Remote]. Be the first to apply!


