Research Engineer, Frontier Evals - Finance
$310k - $380kOpenAI
About the team
The Frontier Evals team builds north star model evaluations to drive progress towards safe AGI/ASI. This team builds ambitious evaluations to measure and steer our models, and creates self-improvement loops to steer our training, safety, and launch decisions. Some of the team's open-sourced evaluations include SWE-bench Verified , MLE-bench , PaperBench , and SWE-Lancer , and the team built and ran frontier evaluations for GPT4o , o1 , o3 , GPT 4.5 , ChatGPT Agent , and GPT5 . If you are interested in feeling firsthand the fast progress of our models, and steering them towards good, this is the team for you.
About you
We are seeking exceptional research engineers that can push the boundaries of our frontier models in the finance domain. We are looking for those who will help shape AI evaluations of financial reasoning and related capabilities, and will own individual threads within this endeavor end-to-end.
In this role, you'll:
Identify important model capabilities, skills, and behaviors that are crucial to financial workflows, and design methods to quantify performance in these areas
Own and pursue a research agenda to identify an important model capability (especially as it relates to financial reasoning) and build evals to measure it
Continuously refine evaluations of frontier AI models to assess the extent of frontier capabilities
We expect you to be:
Have prior background / domain expertise in finance, especially investment banking or private equity, and a passion for these problems
Detail-oriented and thorough
Team player / willing to do a variety of tasks to move the team forward
Passionate and knowledgeable about AGI/ASI measurement
Strong engineering and statistical analysis skills
Able to operate effectively in a dynamic and extremely fast-paced research environment as well as scope and deliver projects end-to-end
It would be great if you also have:
An ability to work cross-functionally
Excellent communication skills
About OpenAI
OpenAI is an AI research and deployment company dedicated to ensuring that general-purpose artificial intelligence benefits all of humanity. We push the boundaries of the capabilities of AI systems and seek to safely deploy them to the world through our products. AI is an extremely powerful tool that must be created with safety and human needs at its core, and to achieve our mission, we must encompass and value the many different perspectives, voices, and experiences that form the full spectrum of humanity.
We are an equal opportunity employer, and we do not discriminate on the basis of race, religion, color, national origin, sex, sexual orientation, age, veteran status, disability, genetic information, or other applicable legally protected characteristic.
For additional information, please see OpenAI’s Affirmative Action and Equal Employment Opportunity Policy Statement .
Qualified applicants with arrest or conviction records will be considered for employment in accordance with applicable law, including the San Francisco Fair Chance Ordinance, the Los Angeles County Fair Chance Ordinance for Employers, and the California Fair Chance Act. For unincorporated Los Angeles County workers: we reasonably believe that criminal history may have a direct, adverse and negative relationship with the following job duties, potentially resulting in the withdrawal of a conditional offer of employment: protect computer hardware entrusted to you from theft, loss or damage; return all computer hardware in your possession (including the data contained therein) upon termination of employment or end of assignment; and maintain the confidentiality of proprietary, confidential, and non-public information. In addition, job duties require access to secure and protected information technology systems and related data security obligations.
To notify OpenAI that you believe this job posting is non-compliant, please submit a report through this form . No response will be provided to inquiries unrelated to job posting compliance.
We are committed to providing reasonable accommodations to applicants with disabilities, and requests can be made via this link .
At OpenAI, we believe artificial intelligence has the potential to help people solve immense global challenges, and we want the upside of AI to be widely shared. Join us in shaping the future of technology.
Compensation
$310K – $380K • Offers Equity
The base pay offered may vary depending on multiple individualized factors, including market location, job-related knowledge, skills, and experience. If the role is non-exempt, overtime pay will be provided consistent with applicable laws. In addition to the salary range listed above, total compensation also includes generous equity, performance-related bonus(es) for eligible employees, and the following benefits.
Medical, dental, and vision insurance for you and your family, with employer contributions to Health Savings Accounts
Pre-tax accounts for Health FSA, Dependent Care FSA, and commuter expenses (parking and transit)
401(k) retirement plan with employer match
Paid parental leave (up to 24 weeks for birth parents and 20 weeks for non-birthing parents), plus paid medical and caregiver leave (up to 8 weeks)
Paid time off: flexible PTO for exempt employees and up to 15 days annually for non-exempt employees
13+ paid company holidays, and multiple paid coordinated company office closures throughout the year for focus and recharge, plus paid sick or safe time (1 hour per 30 hours worked, or more, as required by applicable state or local law)
Mental health and wellness support
Employer-paid basic life and disability coverage
Annual learning and development stipend to fuel your professional growth
Daily meals in our offices, and meal delivery credits as eligible
Relocation support for eligible employees
Additional taxable fringe benefits, such as charitable donation matching and wellness stipends, may also be provided.
More details about our benefits are available to candidates during the hiring process.
This role is at-will and OpenAI reserves the right to modify base pay and other compensation components at any time based on individual performance, team or company results, or market conditions.
$320k
...a quickly growing group of committed researchers, engineers, policy experts, and business leaders... ...beneficial AI systems. About The Team The Frontier Red Team (FRT) is a small, focused... ...against advanced adversarial AI. Create evals and training environments to...SuggestedRelocationVisa sponsorship- ...Our vast talent network trains frontier AI models in the same way... ...team. You’ll work alongside researchers, operators, and AI companies... ...About the Role As a Research Engineer at Mercor, you’ll work at the... ...reasoning. You’ll design and run evals, build rubrics and scorers, and...SuggestedWork at office
- ...Turing is the world’s leading research accelerator for frontier AI labs and a trusted... ...GDP-value domains such as Finance, Sales, Retail, Developer... ...Environments for Software Engineering / coding agents UI-Environments... ..., RL environments, and evals for frontier AI labs in...SuggestedFor contractorsFlexible hours
$160k - $240k
Research Engineer — Evals Location: San Francisco, CA (Hybrid) OR Remote (Americas, UTC-3 to UTC-10) Employment Type: Full time Department: Engineering Team Compensation: $160K - $240K • 0.01% - 0.10% Overview You'll build the evaluation systems that tell us whether Firecrawl...SuggestedFull timeTemporary workWork at officeRemote work- ...problems where the edge cases matter most. We’re looking for a Research Engineer to help define how we measure and improve model quality. You’... ..., and risk workflows Design and run offline and online evals that measure model performance on real customer tasks, not just...Suggested
- A leading technology company in San Francisco is seeking a Research Engineer for the Frontier Safety Loss of Control team. The role focuses on monitoring and controlling AI to mitigate risks associated with misaligned agents. Candidates should have a Bachelor’s degree...
$174k - $252k
Research Engineer, Frontier Safety Loss of Control, DeepMind DeepMind - San Francisco, CA, USA Qualified applicants with arrest or conviction records will be considered for employment in accordance with the San Francisco Fair Chance Ordinance for Employers and the California...Full time$300k
Research Engineer, Agent Systems One of the most mission-driven organizations in AI is building the infrastructure that makes intelligent... ...$300K - $600K+ Total Comp + Equity San Francisco · On-Site Frontier AI Lab · Confidential Highly Selective · 1 Engineer No Visa...Visa sponsorship- ...in San Francisco is looking for a Member of Technical Staff focused on frontier AI model development. The role involves designing experiments, developing training methods, and iterating on research ideas. Candidates should be excited about AI's impact on science and technology...
- ...superintelligence stack - from frontier agentic models to the infra that... ...environments, secure sandboxes, verifiable evals, and our async RL trainer. We enable researchers, startups and enterprises to run... ...contexts. As a Research Engineer in our Reasoning team, you'll...Remote workWorldwideVisa sponsorshipRelocation packageFlexible hours
$150k - $300k
...superintelligence stack - from frontier agentic models to the infra that... ...environments, secure sandboxes, verifiable evals, and our async RL trainer. We enable researchers, startups and enterprises to run... ...contexts. As a Research Engineer working on Distributed Training,...Remote workWorldwideVisa sponsorshipRelocation packageFlexible hours- ...AI Research Scientist We're building the first truly private, personal AI that learns your skills, judgment, and... ...augmenting people bottom-up. Our team previously created evals used by Open AI, completed frontier AI research at MIT/Cambridge/Oxford, worked in Stuart...Shift work
- ...a team of world-class scientists, ML researchers, and engineers to work together to move beyond the beaten... .... We are actively exploring the next frontier of model architectures for AI x... ...development. Engineer meaningful evals and metrics which enable rapid model iteration...Work at office
- ...Team The Future of Computing Research team is an applied research team... ...of computing. We work at the frontier of multimodal AI, helping... ...work closely across research, engineering, design, product, and safety... ...clean experiments, reliable evals, and decision‑useful metrics....Work at officeImmediate startRelocation package
$160k - $300k
.... We deliver the intelligence that gives finance professionals a definitive edge. Our AI uncovers... ...for matrix and deep, multi-source research. We’ve built our own agentic frameworks powered... ...most powerful and scalable LLM inference engine - a distributed, asynchronous DAG...Contract workFor contractorsFor subcontractorWork at office- Refresh AI is seeking a Research Engineer in San Francisco to push the boundaries of benchmarking technology. You will build benchmarks that labs use for evaluating coding abilities and computer-use capability. Your role will require expertise in reinforcement learning...Full time
- A leading AI company based in San Francisco is seeking a research systems engineer to train large language models and explore reinforcement learning techniques. The ideal candidate will work at the intersection of research and systems design experiments at scale. Benefits...
$350k
...a quickly growing group of committed researchers, engineers, policy experts, and business leaders... ...research-heavy environment such as a frontier AI lab, quant research firm, or domain... ...pattern matching * Scaling long-running evals and fickle training environments that...Full timeWork at officeVisa sponsorshipFlexible hours- ...Location Type On-site Department Engineering Our Mission Reflection’s... ...states. Our team of AI researchers and company builders come from... ...teams to accelerate the pace of frontier model development. They are... ...is building model evals and safety from the ground up...Full timeRelocation package
$320k
Anthropic is seeking a Research Engineer in San Francisco to design and build autonomous AI systems that counter adversarial threats. This role involves creating evaluation environments and developing defensive agents while collaborating with experts in the field. Candidates...Relocation package- ...technology to create a safer world. Our team combines expertise from law enforcement, data science, engineering, and financial intelligence to combat terrorist financing, drug trafficking, human trafficking, and other serious crimes. Here, you\'ll join a mission-driven,...Local areaRemote workWork from homeWorldwideWeekday work
- Job Title: AI Research Engineer About Xterra Xterra is a Khosla Ventures-backed company building... ...datasets. This is a rare intersection of frontier AI and real-world scientific impact.... ...performance, building the harnesses that run evals at scale, and making sure our...
- ...powers breakthrough AI models at leading research labs and enterprises. Since 2018, we've... ...offering three integrated solutions for frontier AI development: Enterprise Platform & Tools... .... Role Overview As an Applied Research Engineer, you will be at the forefront of...Flexible hours
- ...well known in the AI community for seminal research accomplishments at top AI labs, have run... ...Chai Discovery is seeking an AI Engineer to play a crucial role in developing our... ...What We Offer Highly engaging work at the frontier of AI-driven drug discovery that will fundamentally...Full time
- ...work, it has to be a trustworthy searcher. We're hiring a Research Engineer to advance the science and engineering that goes into making... ...care about, rather than overfitting to any one regime. Design frontier‑discriminating evaluations that distinguish genuine...Visa sponsorship
- ...these models to real-world industry and economy use cases. As a Research Engineer on our Physical AI team, you will lead pre-training and post... ...and your impact will be direct. If you want to work at the frontier of generative modeling and physical AI, this is the team....Work at office
- About Us Sieve is the only AI research lab exclusively focused on video data. We combine... ...sources to develop datasets that push the frontier of video modeling. Video makes up 80% of... ...About the Role As an applied research engineering intern at Sieve, you’ll help build high...Internship
$200k - $400k
...team. About the Team Read more about the research team\'s work here: The Research team... ...world deployments. Our goal is to push the frontier of applied conversational AI: agents... ...scale. About the Role As a Voice Research Engineer, you’ll lead the development of the models...Full timeWork at officeLocal area$315k
...a whole. Our team is a quickly growing group of committed researchers, engineers, policy experts, and business leaders working together to build... .... As a Research Engineer on this team, you\'ll ensure our frontier models train reliably, efficiently, and at scale. This is...Full timeWork at officeVisa sponsorshipFlexible hoursWeekend workAfternoon shift- ...developers and non-developers alike. Our mission is to push the frontier of code generation and agentic reasoning, and deploy these... ...designed for agentic coding. We operate across research, engineering, product, and infrastructure—owning the full lifecycle of experimentation...Work at officeRelocation package
Do you want to receive more vacancies?
Subscribe and receive similar vacancies to Research Engineer, Frontier Evals - Finance. Be the first to apply!
- senior research engineer San Francisco, CA
- research engineer San Francisco, CA
- junior machine learning research engineer San Francisco, CA
- research programmer San Francisco, CA
- deep learning research engineer San Francisco, CA
- ai research engineer San Francisco, CA
- research assistant engineering San Francisco, CA
- research software engineer San Francisco, CA
- research professional San Francisco, CA
- research and development manager San Francisco, CA

