Research Engineer, Frontier Evals - Finance
$310k - $380k
OpenAI
About the team
The Frontier Evals team builds north star model evaluations to drive progress towards safe AGI/ASI. This team builds ambitious evaluations to measure and steer our models, and creates self-improvement loops to steer our training, safety, and launch decisions. Some of the team's open-sourced evaluations include SWE-bench Verified, MLE-bench, PaperBench, and SWE-Lancer, and the team built and ran frontier evaluations for GPT4o, o1, o3, GPT 4.5, ChatGPT Agent, and GPT5. If you are interested in feeling firsthand the fast progress of our models, and steering them towards good, this is the team for you.
About you
We are seeking exceptional research engineers who can push the boundaries of our frontier models in the finance domain. We are looking for those who will help shape AI evaluations of financial reasoning and related capabilities, and will own individual threads within this endeavor end-to-end.
In this role, you'll:
Identify important model capabilities, skills, and behaviors that are crucial to financial workflows, and design methods to quantify performance in these areas
Own and pursue a research agenda to identify an important model capability (especially as it relates to financial reasoning) and build evals to measure it
Continuously refine evaluations of frontier AI models to assess the extent of frontier capabilities
We expect you to be:
Experienced in finance, with domain expertise especially in investment banking or private equity, and passionate about these problems
Detail-oriented and thorough
A team player, willing to do a variety of tasks to move the team forward
Passionate and knowledgeable about AGI/ASI measurement
Strong in engineering and statistical analysis
Able to operate effectively in a dynamic and extremely fast-paced research environment as well as scope and deliver projects end-to-end
It would be great if you also have:
An ability to work cross-functionally
Excellent communication skills
About OpenAI
OpenAI is an AI research and deployment company dedicated to ensuring that general-purpose artificial intelligence benefits all of humanity. We push the boundaries of the capabilities of AI systems and seek to safely deploy them to the world through our products. AI is an extremely powerful tool that must be created with safety and human needs at its core, and to achieve our mission, we must encompass and value the many different perspectives, voices, and experiences that form the full spectrum of humanity.
We are an equal opportunity employer, and we do not discriminate on the basis of race, religion, color, national origin, sex, sexual orientation, age, veteran status, disability, genetic information, or other applicable legally protected characteristic.
For additional information, please see OpenAI’s Affirmative Action and Equal Employment Opportunity Policy Statement.
Qualified applicants with arrest or conviction records will be considered for employment in accordance with applicable law, including the San Francisco Fair Chance Ordinance, the Los Angeles County Fair Chance Ordinance for Employers, and the California Fair Chance Act. For unincorporated Los Angeles County workers: we reasonably believe that criminal history may have a direct, adverse and negative relationship with the following job duties, potentially resulting in the withdrawal of a conditional offer of employment: protect computer hardware entrusted to you from theft, loss or damage; return all computer hardware in your possession (including the data contained therein) upon termination of employment or end of assignment; and maintain the confidentiality of proprietary, confidential, and non-public information. In addition, job duties require access to secure and protected information technology systems and related data security obligations.
To notify OpenAI that you believe this job posting is non-compliant, please submit a report through this form. No response will be provided to inquiries unrelated to job posting compliance.
We are committed to providing reasonable accommodations to applicants with disabilities, and requests can be made via this link.
At OpenAI, we believe artificial intelligence has the potential to help people solve immense global challenges, and we want the upside of AI to be widely shared. Join us in shaping the future of technology.
Compensation
$310K – $380K • Offers Equity
The base pay offered may vary depending on multiple individualized factors, including market location, job-related knowledge, skills, and experience. If the role is non-exempt, overtime pay will be provided consistent with applicable laws. In addition to the salary range listed above, total compensation also includes generous equity, performance-related bonus(es) for eligible employees, and the following benefits.
Medical, dental, and vision insurance for you and your family, with employer contributions to Health Savings Accounts
Pre-tax accounts for Health FSA, Dependent Care FSA, and commuter expenses (parking and transit)
401(k) retirement plan with employer match
Paid parental leave (up to 24 weeks for birth parents and 20 weeks for non-birthing parents), plus paid medical and caregiver leave (up to 8 weeks)
Paid time off: flexible PTO for exempt employees and up to 15 days annually for non-exempt employees
13+ paid company holidays, and multiple paid coordinated company office closures throughout the year for focus and recharge, plus paid sick or safe time (1 hour per 30 hours worked, or more, as required by applicable state or local law)
Mental health and wellness support
Employer-paid basic life and disability coverage
Annual learning and development stipend to fuel your professional growth
Daily meals in our offices, and meal delivery credits as eligible
Relocation support for eligible employees
Additional taxable fringe benefits, such as charitable donation matching and wellness stipends, may also be provided.
More details about our benefits are available to candidates during the hiring process.
This role is at-will and OpenAI reserves the right to modify base pay and other compensation components at any time based on individual performance, team or company results, or market conditions.

