Sign up to access all features of our service.
  • Job search
  • Favorites
  • Create a CV
    New
  • Salaries
  • Subscriptions

Research Engineer, Frontier Evals - Finance

$310k - $380k

OpenAI

About the team
The Frontier Evals team builds north star model evaluations to drive progress towards safe AGI/ASI. This team builds ambitious evaluations to measure and steer our models, and creates self-improvement loops to steer our training, safety, and launch decisions. Some of the team's open-sourced evaluations include SWE-bench Verified , MLE-bench , PaperBench , and SWE-Lancer , and the team built and ran frontier evaluations for GPT4o , o1 , o3 , GPT 4.5 , ChatGPT Agent , and GPT5 . If you are interested in feeling firsthand the fast progress of our models, and steering them towards good, this is the team for you.

About you
We are seeking exceptional research engineers that can push the boundaries of our frontier models in the finance domain. We are looking for those who will help shape AI evaluations of financial reasoning and related capabilities, and will own individual threads within this endeavor end-to-end.

In this role, you'll:

  • Identify important model capabilities, skills, and behaviors that are crucial to financial workflows, and design methods to quantify performance in these areas

  • Own and pursue a research agenda to identify an important model capability (especially as it relates to financial reasoning) and build evals to measure it

  • Continuously refine evaluations of frontier AI models to assess the extent of frontier capabilities

We expect you to be:

  • Have prior background / domain expertise in finance, especially investment banking or private equity, and a passion for these problems

  • Detail-oriented and thorough

  • Team player / willing to do a variety of tasks to move the team forward

  • Passionate and knowledgeable about AGI/ASI measurement

  • Strong engineering and statistical analysis skills

  • Able to operate effectively in a dynamic and extremely fast-paced research environment as well as scope and deliver projects end-to-end

It would be great if you also have:

  • An ability to work cross-functionally

  • Excellent communication skills

About OpenAI

OpenAI is an AI research and deployment company dedicated to ensuring that general-purpose artificial intelligence benefits all of humanity. We push the boundaries of the capabilities of AI systems and seek to safely deploy them to the world through our products. AI is an extremely powerful tool that must be created with safety and human needs at its core, and to achieve our mission, we must encompass and value the many different perspectives, voices, and experiences that form the full spectrum of humanity. 

We are an equal opportunity employer, and we do not discriminate on the basis of race, religion, color, national origin, sex, sexual orientation, age, veteran status, disability, genetic information, or other applicable legally protected characteristic.

For additional information, please see OpenAI’s Affirmative Action and Equal Employment Opportunity Policy Statement .

Qualified applicants with arrest or conviction records will be considered for employment in accordance with applicable law, including the San Francisco Fair Chance Ordinance, the Los Angeles County Fair Chance Ordinance for Employers, and the California Fair Chance Act. For unincorporated Los Angeles County workers: we reasonably believe that criminal history may have a direct, adverse and negative relationship with the following job duties, potentially resulting in the withdrawal of a conditional offer of employment: protect computer hardware entrusted to you from theft, loss or damage; return all computer hardware in your possession (including the data contained therein) upon termination of employment or end of assignment; and maintain the confidentiality of proprietary, confidential, and non-public information. In addition, job duties require access to secure and protected information technology systems and related data security obligations.

To notify OpenAI that you believe this job posting is non-compliant, please submit a report through this form . No response will be provided to inquiries unrelated to job posting compliance.

We are committed to providing reasonable accommodations to applicants with disabilities, and requests can be made via this link .

At OpenAI, we believe artificial intelligence has the potential to help people solve immense global challenges, and we want the upside of AI to be widely shared. Join us in shaping the future of technology.

Compensation

$310K – $380K • Offers Equity

The base pay offered may vary depending on multiple individualized factors, including market location, job-related knowledge, skills, and experience. If the role is non-exempt, overtime pay will be provided consistent with applicable laws. In addition to the salary range listed above, total compensation also includes generous equity, performance-related bonus(es) for eligible employees, and the following benefits.

  • Medical, dental, and vision insurance for you and your family, with employer contributions to Health Savings Accounts

  • Pre-tax accounts for Health FSA, Dependent Care FSA, and commuter expenses (parking and transit)

  • 401(k) retirement plan with employer match

  • Paid parental leave (up to 24 weeks for birth parents and 20 weeks for non-birthing parents), plus paid medical and caregiver leave (up to 8 weeks)

  • Paid time off: flexible PTO for exempt employees and up to 15 days annually for non-exempt employees

  • 13+ paid company holidays, and multiple paid coordinated company office closures throughout the year for focus and recharge, plus paid sick or safe time (1 hour per 30 hours worked, or more, as required by applicable state or local law)

  • Mental health and wellness support

  • Employer-paid basic life and disability coverage

  • Annual learning and development stipend to fuel your professional growth

  • Daily meals in our offices, and meal delivery credits as eligible

  • Relocation support for eligible employees

  • Additional taxable fringe benefits, such as charitable donation matching and wellness stipends, may also be provided.

More details about our benefits are available to candidates during the hiring process.

This role is at-will and OpenAI reserves the right to modify base pay and other compensation components at any time based on individual performance, team or company results, or market conditions.

Vacancy posted more than 2 months ago
Similar jobs that could be interesting for youBased on the Research Engineer, Frontier Evals - Finance in San Francisco, CA vacancy
  • $320k

     ...a quickly growing group of committed researchers, engineers, policy experts, and business leaders...  ...beneficial AI systems. About The Team The Frontier Red Team (FRT) is a small, focused...  ...against advanced adversarial AI. Create evals and training environments to... 
    Suggested
    Relocation
    Visa sponsorship

    Anthropic

    San Francisco, CA
    2 days ago
  •  ...Our vast talent network trains frontier AI models in the same way...  ...team. You’ll work alongside researchers, operators, and AI companies...  ...About the Role As a Research Engineer at Mercor, you’ll work at the...  ...reasoning. You’ll design and run evals, build rubrics and scorers, and... 
    Suggested
    Work at office

    Mercor

    San Francisco, CA
    1 day ago
  •  ...Turing is the world’s leading research accelerator for frontier AI labs and a trusted...  ...GDP-value domains such as Finance, Sales, Retail, Developer...  ...Environments for Software Engineering / coding agents UI-Environments...  ..., RL environments, and evals for frontier AI labs in... 
    Suggested
    For contractors
    Flexible hours

    Cerebras

    San Francisco, CA
    5 days ago
  • $160k - $240k

    Research Engineer — Evals Location: San Francisco, CA (Hybrid) OR Remote (Americas, UTC-3 to UTC-10) Employment Type: Full time Department: Engineering Team Compensation: $160K - $240K • 0.01% - 0.10% Overview You'll build the evaluation systems that tell us whether Firecrawl... 
    Suggested
    Full time
    Temporary work
    Work at office
    Remote work

    AI Chopping Block, Inc.

    San Francisco, CA
    1 day ago
  •  ...problems where the edge cases matter most. We’re looking for a Research Engineer to help define how we measure and improve model quality. You’...  ..., and risk workflows Design and run offline and online evals that measure model performance on real customer tasks, not just... 
    Suggested

    Variance

    San Francisco, CA
    1 day ago
  • A leading technology company in San Francisco is seeking a Research Engineer for the Frontier Safety Loss of Control team. The role focuses on monitoring and controlling AI to mitigate risks associated with misaligned agents. Candidates should have a Bachelor’s degree... 

    Google Inc.

    San Francisco, CA
    4 days ago
  • $174k - $252k

    Research Engineer, Frontier Safety Loss of Control, DeepMind DeepMind - San Francisco, CA, USA Qualified applicants with arrest or conviction records will be considered for employment in accordance with the San Francisco Fair Chance Ordinance for Employers and the California... 
    Full time

    Google Inc.

    San Francisco, CA
    4 days ago
  • $300k

    Research Engineer, Agent Systems One of the most mission-driven organizations in AI is building the infrastructure that makes intelligent...  ...$300K - $600K+ Total Comp + Equity San Francisco · On-Site Frontier AI Lab · Confidential Highly Selective · 1 Engineer No Visa... 
    Visa sponsorship

    Aionia Group

    San Francisco, CA
    5 days ago
  •  ...in San Francisco is looking for a Member of Technical Staff focused on frontier AI model development. The role involves designing experiments, developing training methods, and iterating on research ideas. Candidates should be excited about AI's impact on science and technology... 

    Mirendil

    San Francisco, CA
    1 day ago
  •  ...superintelligence stack - from frontier agentic models to the infra that...  ...environments, secure sandboxes, verifiable evals, and our async RL trainer. We enable researchers, startups and enterprises to run...  ...contexts. As a Research Engineer in our Reasoning team, you'll... 
    Remote work
    Worldwide
    Visa sponsorship
    Relocation package
    Flexible hours

    Prime-Intellect

    San Francisco, CA
    3 days ago
  • $150k - $300k

     ...superintelligence stack - from frontier agentic models to the infra that...  ...environments, secure sandboxes, verifiable evals, and our async RL trainer. We enable researchers, startups and enterprises to run...  ...contexts. As a Research Engineer working on Distributed Training,... 
    Remote work
    Worldwide
    Visa sponsorship
    Relocation package
    Flexible hours

    Prime-Intellect

    San Francisco, CA
    2 days ago
  •  ...AI Research Scientist We're building the first truly private, personal AI that learns your skills, judgment, and...  ...augmenting people bottom-up. Our team previously created evals used by Open AI, completed frontier AI research at MIT/Cambridge/Oxford, worked in Stuart... 
    Shift work

    Workshop Labs

    San Francisco, CA
    4 days ago
  •  ...a team of world-class scientists, ML researchers, and engineers to work together to move beyond the beaten...  .... We are actively exploring the next frontier of model architectures for AI x...  ...development. Engineer meaningful evals and metrics which enable rapid model iteration... 
    Work at office

    Achira

    San Francisco, CA
    3 days ago
  •  ...Team The Future of Computing Research team is an applied research team...  ...of computing. We work at the frontier of multimodal AI, helping...  ...work closely across research, engineering, design, product, and safety...  ...clean experiments, reliable evals, and decision‑useful metrics.... 
    Work at office
    Immediate start
    Relocation package

    OpenAI

    San Francisco, CA
    1 day ago
  • $160k - $300k

     .... We deliver the intelligence that gives finance professionals a definitive edge. Our AI uncovers...  ...for matrix and deep, multi-source research. We’ve built our own agentic frameworks powered...  ...most powerful and scalable LLM inference engine - a distributed, asynchronous DAG... 
    Contract work
    For contractors
    For subcontractor
    Work at office

    Hebbia

    San Francisco, CA
    3 days ago
  • Refresh AI is seeking a Research Engineer in San Francisco to push the boundaries of benchmarking technology. You will build benchmarks that labs use for evaluating coding abilities and computer-use capability. Your role will require expertise in reinforcement learning... 
    Full time

    Refresh AI

    San Francisco, CA
    4 days ago
  • A leading AI company based in San Francisco is seeking a research systems engineer to train large language models and explore reinforcement learning techniques. The ideal candidate will work at the intersection of research and systems design experiments at scale. Benefits... 

    Applied Compute Inc.

    San Francisco, CA
    1 day ago
  • $350k

     ...a quickly growing group of committed researchers, engineers, policy experts, and business leaders...  ...research-heavy environment such as a frontier AI lab, quant research firm, or domain...  ...pattern matching * Scaling long-running evals and fickle training environments that... 
    Full time
    Work at office
    Visa sponsorship
    Flexible hours

    Anthropic

    San Francisco, CA
    1 day ago
  •  ...Location Type On-site Department Engineering Our Mission Reflection’s...  ...states. Our team of AI researchers and company builders come from...  ...teams to accelerate the pace of frontier model development. They are...  ...is building model evals and safety from the ground up... 
    Full time
    Relocation package

    B Capital

    San Francisco, CA
    20 hours ago
  • $320k

    Anthropic is seeking a Research Engineer in San Francisco to design and build autonomous AI systems that counter adversarial threats. This role involves creating evaluation environments and developing defensive agents while collaborating with experts in the field. Candidates... 
    Relocation package

    Anthropic

    San Francisco, CA
    1 day ago
  •  ...technology to create a safer world. Our team combines expertise from law enforcement, data science, engineering, and financial intelligence to combat terrorist financing, drug trafficking, human trafficking, and other serious crimes. Here, you\'ll join a mission-driven,... 
    Local area
    Remote work
    Work from home
    Worldwide
    Weekday work

    TRM Labs Inc.

    San Francisco, CA
    5 days ago
  • Job Title: AI Research Engineer About Xterra Xterra is a Khosla Ventures-backed company building...  ...datasets. This is a rare intersection of frontier AI and real-world scientific impact....  ...performance, building the harnesses that run evals at scale, and making sure our... 

    Xterraai

    San Francisco, CA
    4 days ago
  •  ...powers breakthrough AI models at leading research labs and enterprises. Since 2018, we've...  ...offering three integrated solutions for frontier AI development: Enterprise Platform & Tools...  .... Role Overview As an Applied Research Engineer, you will be at the forefront of... 
    Flexible hours

    HRB

    San Francisco, CA
    3 days ago
  •  ...well known in the AI community for seminal research accomplishments at top AI labs, have run...  ...Chai Discovery is seeking an AI Engineer to play a crucial role in developing our...  ...What We Offer Highly engaging work at the frontier of AI-driven drug discovery that will fundamentally... 
    Full time

    Menlo Ventures

    San Francisco, CA
    3 days ago
  •  ...work, it has to be a trustworthy searcher. We're hiring a Research Engineer to advance the science and engineering that goes into making...  ...care about, rather than overfitting to any one regime. Design frontier‑discriminating evaluations that distinguish genuine... 
    Visa sponsorship

    Nerdleveltech

    San Francisco, CA
    1 day ago
  •  ...these models to real-world industry and economy use cases. As a Research Engineer on our Physical AI team, you will lead pre-training and post...  ...and your impact will be direct. If you want to work at the frontier of generative modeling and physical AI, this is the team.... 
    Work at office

    Hedra

    San Francisco, CA
    3 days ago
  • About Us Sieve is the only AI research lab exclusively focused on video data. We combine...  ...sources to develop datasets that push the frontier of video modeling. Video makes up 80% of...  ...About the Role As an applied research engineering intern at Sieve, you’ll help build high... 
    Internship

    Sieve

    San Francisco, CA
    5 days ago
  • $200k - $400k

     ...team. About the Team Read more about the research team\'s work here: The Research team...  ...world deployments. Our goal is to push the frontier of applied conversational AI: agents...  ...scale. About the Role As a Voice Research Engineer, you’ll lead the development of the models... 
    Full time
    Work at office
    Local area

    Decagon

    San Francisco, CA
    4 days ago
  • $315k

     ...a whole. Our team is a quickly growing group of committed researchers, engineers, policy experts, and business leaders working together to build...  .... As a Research Engineer on this team, you\'ll ensure our frontier models train reliably, efficiently, and at scale. This is... 
    Full time
    Work at office
    Visa sponsorship
    Flexible hours
    Weekend work
    Afternoon shift

    Menlo Ventures

    San Francisco, CA
    1 day ago
  •  ...developers and non-developers alike. Our mission is to push the frontier of code generation and agentic reasoning, and deploy these...  ...designed for agentic coding. We operate across research, engineering, product, and infrastructure—owning the full lifecycle of experimentation... 
    Work at office
    Relocation package

    OpenAI

    San Francisco, CA
    5 days ago

Do you want to receive more vacancies?

Subscribe and receive similar vacancies to Research Engineer, Frontier Evals - Finance. Be the first to apply!