Research Engineer, Frontier Evals - Finance

$310k - $380k

OpenAI

About the team
The Frontier Evals team builds north star model evaluations to drive progress towards safe AGI/ASI. This team builds ambitious evaluations to measure and steer our models, and creates self-improvement loops to steer our training, safety, and launch decisions. Some of the team's open-sourced evaluations include SWE-bench Verified, MLE-bench, PaperBench, and SWE-Lancer, and the team built and ran frontier evaluations for GPT4o, o1, o3, GPT 4.5, ChatGPT Agent, and GPT5. If you are interested in feeling firsthand the fast progress of our models, and steering them towards good, this is the team for you.

About you
We are seeking exceptional research engineers who can push the boundaries of our frontier models in the finance domain. We are looking for people who will help shape AI evaluations of financial reasoning and related capabilities, and who will own individual threads within this endeavor end-to-end.

In this role, you'll:

Identify important model capabilities, skills, and behaviors that are crucial to financial workflows, and design methods to quantify performance in these areas
Own and pursue a research agenda to identify an important model capability (especially as it relates to financial reasoning) and build evals to measure it
Continuously refine evaluations of frontier AI models to assess the extent of frontier capabilities

We expect you to be:

Experienced in finance, with domain expertise especially in investment banking or private equity, and passionate about these problems
Detail-oriented and thorough
A team player, willing to do a variety of tasks to move the team forward
Passionate and knowledgeable about AGI/ASI measurement
Strong in engineering and statistical analysis
Able to operate effectively in a dynamic and extremely fast-paced research environment, and to scope and deliver projects end-to-end

It would be great if you also have:

An ability to work cross-functionally
Excellent communication skills

About OpenAI

OpenAI is an AI research and deployment company dedicated to ensuring that general-purpose artificial intelligence benefits all of humanity. We push the boundaries of the capabilities of AI systems and seek to safely deploy them to the world through our products. AI is an extremely powerful tool that must be created with safety and human needs at its core, and to achieve our mission, we must encompass and value the many different perspectives, voices, and experiences that form the full spectrum of humanity.

We are an equal opportunity employer, and we do not discriminate on the basis of race, religion, color, national origin, sex, sexual orientation, age, veteran status, disability, genetic information, or other applicable legally protected characteristic.

For additional information, please see OpenAI’s Affirmative Action and Equal Employment Opportunity Policy Statement.

Qualified applicants with arrest or conviction records will be considered for employment in accordance with applicable law, including the San Francisco Fair Chance Ordinance, the Los Angeles County Fair Chance Ordinance for Employers, and the California Fair Chance Act. For unincorporated Los Angeles County workers: we reasonably believe that criminal history may have a direct, adverse and negative relationship with the following job duties, potentially resulting in the withdrawal of a conditional offer of employment: protect computer hardware entrusted to you from theft, loss or damage; return all computer hardware in your possession (including the data contained therein) upon termination of employment or end of assignment; and maintain the confidentiality of proprietary, confidential, and non-public information. In addition, job duties require access to secure and protected information technology systems and related data security obligations.

To notify OpenAI that you believe this job posting is non-compliant, please submit a report through this form. No response will be provided to inquiries unrelated to job posting compliance.

We are committed to providing reasonable accommodations to applicants with disabilities, and requests can be made via this link.

At OpenAI, we believe artificial intelligence has the potential to help people solve immense global challenges, and we want the upside of AI to be widely shared. Join us in shaping the future of technology.

Compensation

$310K – $380K • Offers Equity

The base pay offered may vary depending on multiple individualized factors, including market location, job-related knowledge, skills, and experience. If the role is non-exempt, overtime pay will be provided consistent with applicable laws. In addition to the salary range listed above, total compensation also includes generous equity, performance-related bonus(es) for eligible employees, and the following benefits.

Medical, dental, and vision insurance for you and your family, with employer contributions to Health Savings Accounts
Pre-tax accounts for Health FSA, Dependent Care FSA, and commuter expenses (parking and transit)
401(k) retirement plan with employer match
Paid parental leave (up to 24 weeks for birth parents and 20 weeks for non-birthing parents), plus paid medical and caregiver leave (up to 8 weeks)
Paid time off: flexible PTO for exempt employees and up to 15 days annually for non-exempt employees
13+ paid company holidays, and multiple paid coordinated company office closures throughout the year for focus and recharge, plus paid sick or safe time (1 hour per 30 hours worked, or more, as required by applicable state or local law)
Mental health and wellness support
Employer-paid basic life and disability coverage
Annual learning and development stipend to fuel your professional growth
Daily meals in our offices, and meal delivery credits as eligible
Relocation support for eligible employees
Additional taxable fringe benefits, such as charitable donation matching and wellness stipends, may also be provided.

More details about our benefits are available to candidates during the hiring process.

This role is at-will and OpenAI reserves the right to modify base pay and other compensation components at any time based on individual performance, team or company results, or market conditions.

Vacancy posted more than 2 months ago
