
Computational Bayesian Statistics Expert for AI Evaluation Design [Remote]

$70 - $100 per hour

SaidGig

Remote
  • Remote job

This role involves designing original, graduate-level computational problems that challenge AI systems to utilize real scientific software tools for solving complex research-grade problems. You will create tasks that test AI's ability to query simulations, interpret outputs, design experimental strategies, and extract hidden information from data.

Unlike typical annotation roles, your focus will be on calibrating these problems against cutting-edge AI models, iterating on their design to ensure the difficulty aligns with target benchmarks.

Key Responsibilities
  • Design computational problems that require advanced use of domain-specific scientific software libraries.
  • Create tasks that test solvers on multi-step scientific workflows, ensuring precise outputs from fully specified setups.
  • Develop complex queries or experimental designs to uncover non-visible information, requiring strategic reasoning.
  • Engage in a calibration loop, testing each task against state-of-the-art AI models and refining designs based on feedback.
Domains & Tools We're Hiring For

We are particularly interested in candidates with extensive hands-on experience in:

  • Computational Bayesian statistics and applied mathematics, using libraries such as PyMC, PyStan, PyJAGS, and CmdStanPy.
  • Applied mathematics and numerical PDEs, including tools like FEniCS, FEniCSx, DOLFINx, scikit-fem, FiPy, Devito, and Dedalus.
  • Computational topology, using GUDHI.
  • Differential algebra, including DACEyPy.

Experience with MCMC, Bayesian modeling, finite element or finite difference methods, mesh-based numerical modeling, computational topology, or other specialized Python-based computational methods is also valuable. Candidates are not required to have experience with all listed packages, but proficiency in any one will be highly regarded.

What Makes a Strong Candidate

The ideal candidate will possess:

  • Graduate-level expertise (MS or PhD preferred) in the relevant domain, with practical experience using specific software tools.
  • Proven ability to write code that utilizes these libraries to address real research problems, understanding their limitations and complexities.
  • A puzzle designer mindset, capable of constructing problems that emphasize reasoning strategy over mere computation.
Requirements
  • Graduate-level training in a relevant STEM domain (MS, PhD, or equivalent research experience).
  • Demonstrated proficiency with at least one of the specified scientific software libraries, supported by research publications, open-source contributions, or professional experience.
  • Strong Python programming skills for writing problem setups, oracle functions, and solution validators.
  • Ability to work independently and iterate on problem designs based on calibration feedback.
  • Comfort with Linux/terminal environments and remote compute sandboxes.
Work Terms

This is a remote, hourly position.

Compensation

Hourly compensation ranges from $70 to $100.

Vacancy posted 1 day ago
