Computational Bayesian Statistics Expert for AI Evaluation Design [Remote]

$70 - $100 per hour

SaidGig

Remote job

This role involves designing original, graduate-level computational problems that challenge AI systems to use real scientific software tools to solve complex, research-grade problems. You will create tasks that test an AI system's ability to query simulations, interpret outputs, design experimental strategies, and extract hidden information from data.

Unlike typical annotation roles, this role focuses on calibrating these problems against cutting-edge AI models and iterating on their design until the difficulty matches target benchmarks.

Key Responsibilities

Design computational problems that require advanced use of domain-specific scientific software libraries.
Create tasks that test solvers on multi-step scientific workflows, ensuring precise outputs from fully specified setups.
Develop complex queries or experimental designs to uncover non-visible information, requiring strategic reasoning.
Engage in a calibration loop, testing each task against state-of-the-art AI models and refining designs based on feedback.

Domains & Tools We're Hiring For

We are particularly interested in candidates with extensive hands-on experience in:

Computational Bayesian Statistics and Applied Mathematics, utilizing libraries such as PyMC, PyStan, PyJAGS, and CmdStanPy.
Applied mathematics and numerical PDEs, including tools like FEniCS, FEniCSx, DOLFINx, scikit-fem, FiPy, Devito, and Dedalus.
Computational topology, using GUDHI.
Differential algebra, including DACEyPy.

Experience with MCMC, Bayesian modeling, finite element or finite difference methods, mesh-based numerical modeling, computational topology, or other specialized Python-based computational methods is also valuable. Candidates are not required to have experience with all listed packages, but proficiency in any one will be highly regarded.

What Makes a Strong Candidate

The ideal candidate will possess:

Graduate-level expertise (MS or PhD preferred) in the relevant domain, with practical experience using specific software tools.
Proven ability to write code that utilizes these libraries to address real research problems, understanding their limitations and complexities.
A puzzle designer mindset, capable of constructing problems that emphasize reasoning strategy over mere computation.

Requirements

Graduate-level training in a relevant STEM domain (MS, PhD, or equivalent research experience).
Demonstrated proficiency with at least one of the specified scientific software libraries, supported by research publications, open-source contributions, or professional experience.
Strong Python programming skills for writing problem setups, oracle functions, and solution validators.
Ability to work independently and iterate on problem designs based on calibration feedback.
Comfort with Linux/terminal environments and remote compute sandboxes.

Work Terms

This is a remote, hourly position.

Compensation

Hourly compensation ranges from $70 to $100.


Vacancy posted 1 day ago

