Sr. Lead AI Engineer (Inference Optimization, FM hosting, AI Platform)

$229.9k - $262.4k

Capital One

Overview:

At Capital One, we are creating responsible and reliable AI systems, changing banking for good. For years, Capital One has been an industry leader in using machine learning to create real-time, personalized customer experiences. Our investments in technology infrastructure and world-class talent — along with our deep experience in machine learning — position us to be at the forefront of enterprises leveraging AI. From informing customers about unusual charges to answering their questions in real time, our applications of AI & ML are bringing humanity and simplicity to banking. We are committed to continuing to build world-class applied science and engineering teams to deliver our industry leading capabilities with breakthrough product experiences and scalable, high-performance AI infrastructure. At Capital One, you will help bring the transformative power of emerging AI capabilities to reimagine how we serve our customers and businesses who have come to love the products and services we build.

Team Description:

The Intelligent Foundations and Experiences (IFX) team is at the center of bringing our vision for AI at Capital One to life. We work hand-in-hand with our partners across the company to advance the state of the art in science and AI engineering, and we build and deploy proprietary solutions that are central to our business and deliver value to millions of customers. Our AI models and platforms empower teams across Capital One to enhance their products with the transformative power of AI, in responsible and scalable ways for the highest leverage impact.

In this role, you will:

Partner with a cross-functional team of engineers, research scientists, technical program managers, and product managers to deliver AI-powered products that change how our associates work and how our customers interact with Capital One.
Design, develop, test, deploy, and support AI software components including foundation model training, large language model inference, similarity search, guardrails, model evaluation, experimentation, governance, and observability, etc.
Leverage a broad stack of Open Source and SaaS AI technologies such as AWS Ultraclusters, Huggingface, VectorDBs, Nemo Guardrails, PyTorch, and more.
Invent and introduce state-of-the-art LLM optimization techniques to improve the performance — scalability, cost, latency, throughput — of large scale production AI systems.
Contribute to the technical vision and the long term roadmap of foundational AI systems at Capital One.

The Ideal Candidate:

You love to build systems, take pride in the quality of your work, and also share our passion to do the right thing. You want to work on problems that will help change banking for good.
Passion for staying abreast of the latest research, and an ability to intuitively understand scientific publications and judiciously apply novel techniques in production.
You adapt quickly and thrive on bringing clarity to big, undefined problems. You love asking questions and digging deep to uncover the root of problems and can articulate your findings concisely with clarity. You have the courage to share new ideas even when they are unproven.
You are deeply Technical. You possess a strong foundation in engineering and mathematics, and your expertise in hardware, software, and AI enable you to see and exploit optimization opportunities that others miss.
You are a resilient trail blazer who can forge new paths to achieve business goals when the route is unknown.

Basic Qualifications:

Bachelor's degree in Computer Science, AI, Electrical Engineering, Computer Engineering, or related fields plus at least 6 years of experience developing AI and ML algorithms or technologies, or a Master's degree in Computer Science, AI, Electrical Engineering, Computer Engineering, or related fields plus at least 4 years of experience developing AI and ML algorithms or technologies
At least 6 years of experience programming with Python, Go, Scala, or Java

Preferred Qualifications:

7 years of experience deploying scalable and responsible AI solutions on cloud platforms (e.g. AWS, Google Cloud, Azure, or equivalent private cloud)
Experience designing, developing, integrating, delivering, and supporting complex AI systems
Demonstrated ability to lead and mentor an engineering team and influence cross-functional stakeholders
Experience developing AI and ML algorithms or technologies (e.g. LLM Inference, Similarity Search and VectorDBs, Guardrails, Memory) using Python, C++, C#, Java, or Golang
Experience developing and applying state-of-the-art techniques for optimizing training and inference software to improve hardware utilization, latency, throughput, and cost
Passion for staying abreast of the latest AI research and AI systems, and judiciously apply novel techniques in production
Excellent communication and presentation skills, with the ability to articulate complex AI concepts to peers

Capital One will consider sponsoring a new qualified applicant for employment authorization for this position.

The minimum and maximum full-time annual salaries for this role are listed below, by location. Please note that this salary information is solely for candidates hired to perform work within one of these locations, and refers to the amount Capital One is willing to pay at the time of this posting. Salaries for part-time roles will be prorated based upon the agreed upon number of hours to be regularly worked.

Cambridge, MA: $229,900 - $262,400 for Sr. Lead AI Engineer

McLean, VA: $229,900 - $262,400 for Sr. Lead AI Engineer

New York, NY: $250,800 - $286,200 for Sr. Lead AI Engineer

San Francisco, CA: $250,800 - $286,200 for Sr. Lead AI Engineer

San Jose, CA: $250,800 - $286,200 for Sr. Lead AI Engineer

Candidates hired to work in other locations will be subject to the pay range associated with that location, and the actual annualized salary amount offered to any candidate at the time of hire will be reflected solely in the candidate’s offer letter.

This role is also eligible to earn performance based incentive compensation, which may include cash bonus(es) and/or long term incentives (LTI). Incentives could be discretionary or non discretionary depending on the plan.

Capital One offers a comprehensive, competitive, and inclusive set of health, financial and other benefits that support your total well-being. Learn more at the Capital One Careers website ( . Eligibility varies based on full or part-time status, exempt or non-exempt status, and management level.

This role is expected to accept applications for a minimum of 5 business days.

No agencies please. Capital One is an equal opportunity employer (EOE, including disability/vet) committed to non-discrimination in compliance with applicable federal, state, and local laws. Capital One promotes a drug-free workplace. Capital One will consider for employment qualified applicants with a criminal history in a manner consistent with the requirements of applicable laws regarding criminal background inquiries, including, to the extent applicable, Article 23-A of the New York Correction Law; San Francisco, California Police Code Article 49, Sections 4901-4920; New York City’s Fair Chance Act; Philadelphia’s Fair Criminal Records Screening Act; and other applicable federal, state, and local laws and regulations regarding criminal background inquiries.

If you have visited our website in search of information on employment opportunities or to apply for a position, and you require an accommodation, please contact Capital One Recruiting at View phone number on click.appcast.io or via email at View email address on click.appcast.io . All information you provide will be kept confidential and will be used only to the extent required to provide needed reasonable accommodations.

For technical support or questions about Capital One's recruiting process, please send an email to View email address on click.appcast.io

Capital One does not provide, endorse nor guarantee and is not liable for third-party products, services, educational tools or other information available through this site.

Capital One Financial is made up of several different entities. Please note that any position posted in Canada is for Capital One Canada, any position posted in the United Kingdom is for Capital One Europe and any position posted in the Philippines is for Capital One Philippines Service Corp. (COPSSC).

Apply

Vacancy posted 5 days ago

Similar jobs that could be interesting for youBased on the Sr. Lead AI Engineer (Inference Optimization, FM hosting, AI Platform) in San Francisco, CA vacancy

Lead AI Engineer
$197.3k - $225.1k
Lead AI Engineer At Capital One, we are creating responsible and reliable... .... Our AI models and platforms empower teams across Capital... ...training, large language model inference, similarity search,... ...introduce state-of-the-art LLM optimization techniques to improve the performance...
Platform
Full time
Part time
Local area
Capital One
San Francisco, CA
1 day ago
Sr. Distinguished AI Engineer (Remote Eligible)
$286.2k - $326.7k
Sr. Distinguished AI Engineer (Remote Eligible) At Capital... ...deliver our industry leading capabilities with... ...Our AI models and platforms empower teams across... ...large language model inference, similarity search,... ...-of-the-art LLM optimization techniques to improve...
Platform
Senior
Full time
Part time
Local area
Remote work
Capital One Financial Corporation
San Francisco, CA
11 hours ago
Senior Staff AI Engineer
...role: SoFi’s Staff AI Engineer is a hands-on AI... ...our next-generation AI platform, particularly focusing... ...failovers Deep Model Optimization: Pioneer and institutionalize... ...platform designed to host internally fine-tuned... ..., low-latency inference across diverse hardware...
Platform
Senior
Full time
Sofi
San Francisco, CA
11 hours ago
Lead AI Engineer, Data Solutions
...Salesforce is a leading AI‑powered customer relationship management platform seeking a Lead AI Engineer to develop next‑generation AI and ML... ...training, evaluation, and inference Transform raw interaction... ...signals to drive continuous optimization Systems & API Development...
Platform
Salesforce
San Francisco, CA
4 days ago
Lead AI Engineer ( MLX, Gen AI Platform Services, Agentic AI)
$197.3k - $225.1k
...responsible and reliable AI systems, changing... ...science and engineering teams to deliver our industry leading capabilities with... ...Our AI models and platforms empower teams across... ...language model inference, similarity search... ...state‑of‑the‑art LLM optimization techniques to...
Platform
Full time
Part time
Local area
Capital One
San Francisco, CA
5 days ago
Staff + Senior Software Engineer, Cloud Inference
$300k
...interpretable, and steerable AI systems. We want AI... ...researchers, engineers, policy experts, and... ...Role The Cloud Inference team scales and optimizes Claude to serve the... ...Claude on each cloud platform—from API integration... ...group, and we host frequent research discussions...
Platform
Senior
Full time
Work at office
Visa sponsorship
Flexible hours
Anthropic
San Francisco, CA
11 hours ago
Senior AI Engineer
$250k - $300k
..., we are building the AI Search Infrastructure... ...backed information. Our platform combines proprietary... ...indexes with LLM-optimized retrieval systems to power... ...Our team includes engineers, researchers, product... ...agentic results, cutting inference cost and token usage,...
Platform
Senior
Full time
Immediate start
Remote work
Work from home
Flexible hours
You.com
San Francisco, CA
11 hours ago
Sr. Applied AI Engineer
...AI at Zapier At Zapier , we build and use automation every... ...excited about building the platform that makes AI and machine... ...’s AI Platform team as a Sr. Applied AI Engineer ! As a key member of this... ...problems like model access, inference reliability, observability,...
Platform
Senior
Remote job
Full time
Zapier
San Francisco, CA
11 hours ago
Software Engineer, Inference
$300k
...interpretable, and steerable AI systems. We want... ...researchers, engineers, policy experts,... ...role Our Inference team is responsible... ...in multiple cloud platforms. You may be a... ...LLM inference optimization, batching, and caching... ...group, and we host frequent research...
Platform
Full time
Work at office
Worldwide
Visa sponsorship
Flexible hours
Anthropic
San Francisco, CA
11 hours ago
Software Engineer - Voice AI (Inference Runtime)
...mission-critical inference for the world's most dynamic AI companies, like Cursor... ...help build the platform engineers turn to to ship AI... ...and server-level optimizations. Build large-scale... ...: Own and lead Voice AI product areas... ...profiling across host-device boundaries...
Platform
Full time
Flexible hours
Baseten
San Francisco, CA
11 hours ago
Lead AI Platform Engineer
...Overview Join a boutique quantamental hedge fund as our Lead AI Platform Engineer. Spearhead the buildout of a new internal data lake and platform... ...with a proven track record of selecting and evaluating optimal technology stacks Deep NLP expertise for extracting insights...
Platform
Full time
Immediate start
RBF Capital, LLC
San Francisco, CA
3 days ago
Software Engineer, Inference - Performance Optimization
...the Team Our team analyzes inference stack performance across the... ...into performance optimizations and models that project performance... ...Enjoy collaborating with engineering and research teams to improve... ...About OpenAI OpenAI is an AI research and deployment company...
Full time
OpenAI
San Francisco, CA
11 hours ago
Distinguished AI Engineer (Agentic AI Platform)
$269.1k - $307.2k
Distinguished AI Engineer (Agentic AI Platform) At Capital One,... ...deliver our industry leading capabilities with breakthrough... ...end performance by optimizing orchestration -... ...and evangelize - hosting architecture office... ...technologies (e.g. LLM Inference, Similarity Search...
Platform
Full time
Part time
Work at office
Local area
Capital One Financial Corporation
San Francisco, CA
11 hours ago
Lead AI Engineer
$300k - $400k
...reinvent how designers work in the AI era. We’re backed by top... ...About the Role We’re hiring an Lead AI Engineer to own and scale our AI infrastructure... ...You’ll Do Own the training-to-inference pipeline for large code models—optimize inference with quantization,...
Full time
Noon S.r.o
San Francisco, CA
11 hours ago
Senior Staff AI Engineer
$207k - $290k
...Description About JazzX AI: Vision:... ...seeking an experienced AI Engineer with deep expertise in... ...generation enterprise AGI platform. You will lead the design, development, and optimization of cutting-edge RL... ...techniques , including inference-time search, chain-of-...
Platform
Senior
Worldwide
Flexible hours
JazzX AI
San Francisco, CA
6 days ago
Senior AI Engineer
...Parasail is redefining AI infrastructure by enabling... ...network of GPUs, optimizing for cost, performance,... ...for a hungry, creative engineer who thrives in a high-trust... ...prompt engineering Lead experimental build cycles... ...Ensure alignment with platform architecture; seek guidance...
Platform
Senior
Full time
Parasail
San Francisco, CA
11 hours ago
Lead AI Software Engineer
$190k - $270k
...HP IQ is HP’s new AI innovation lab. Combining startup... ..., world‑class team—engineers, designers, researchers... .... We are looking for a Lead Software Engineer to... ...models and edge devices. Optimize data pipelines and storage... ...for real‑time AI inference and processing. Implement...
Full time
Temporary work
Local area
Flexible hours
ROLE
San Francisco, CA
1 day ago
Senior AI Inference Engineer - GPU, Rust & CUDA
$220k
...Perplexity is looking for an engineer to join their team in San Francisco. You will work on building and operating the inference engine, supporting new models, migrating GPU kernels, and developing a Rust-based serving runtime. The ideal candidate has 3+ years of experience...
Senior
Perplexity
San Francisco, CA
3 days ago
Senior AI Engineer Health Intelligence
...of integrating modern AI and LLMs into the Oura... ...years. As a Senior AI Engineer, you will design, build... ...safety for AI workflows: Lead the design and... ...multi-LLM and reasoning platform: Prototype and productionize... ...multi-objective optimization and guardrails for safety...
Platform
Senior
Full time
Temporary work
Work at office
Local area
Remote work
Flexible hours
Ura Corp
San Francisco, CA
11 hours ago
Senior AI Engineer
$192k - $237.1k
...brings unique perspectives that lead to better solutions.... ...automation by integrating intelligent AI capabilities into its platform. We are seeking a Senior AI Engineer to help design, build, and scale... ...production environments. Optimize for latency, cost, reliability...
Platform
Senior
Full time
Work at office
Immediate start
Worldwide
Monday to Friday
Flexible hours
Drata
San Francisco, CA
11 hours ago
Senior AI Engineer, Agentic Data Enrichment
$230k - $340k
...including FIS, Rho, Socure and leading loan infrastructure... ...real and worth trusting: gig platforms, marketplaces, AI companies, and commerce... ...We're hiring a Senior AI Engineer to own a slice of this enrichment... .... Cost/latency optimization: response caching, semantic...
Platform
Senior
Full time
Work at office
Flexible hours
Baselayer
San Francisco, CA
11 hours ago
Senior Staff Machine Learning Engineer, LLM/VLM Model Architecture & Optimization
$298k - $368k
...applied to a range of vehicle platforms and product use cases. The... ...with downstream teams on the optimization and integration into the... ...diverse set of sensors, enabling engineers like you to (1) develop... ...expertise in low-latency on-device inference techniques and a deep...
Platform
Senior
Full time
Remote work
Waymo
San Francisco, CA
11 hours ago
Software Engineer, Inference GPU Enablement
...the Team OpenAI’s Inference team ensures that our... ...at scale. We build and optimize the systems that power... ...broader set of compute platforms - for example AMD GPUs... ...Role We’re hiring engineers to scale and optimize... ...OpenAI OpenAI is an AI research and deployment...
Platform
Full time
OpenAI
San Francisco, CA
11 hours ago
Software Engineer, Inference - Multi Modal
...About the Team OpenAI’s Inference team powers the... ...- across a variety of platforms. Our work ensures these... ..., fast-moving team of engineers focused on delivering... ...the boundaries of what AI can do. We’re expanding... .... You'll build and optimize the systems that let users...
Platform
Full time
OpenAI
San Francisco, CA
11 hours ago
Lead AI Engineer
$200k
A top global hedge fund is looking for an AI Platform Engineer to lead the development and management of cutting-edge AI infrastructure for our complex enterprise environment. This hire will design and manage federated AI ecosystems, including MCP servers, agents and enterprise...
Platform
Xcede
San Francisco, CA
2 days ago
Principal AI/ML Engineer - AdTech
$300k - $400k
...NYSE: ZETA) is the AI-Powered Marketing... ...the Zeta Marketing Platform (ZMP), our vision... ...a Principal AI/ML Engineer in our AdTech team... ...solutions for campaign optimization, user... ...Learning Leadership: Lead the design and implementation... ...to real-time inference , for our real-...
Platform
Full time
Zeta Global
San Francisco, CA
11 hours ago
Applied AI Engineer
..., we're building the AI platform for IT teams. Our goal... ...by product and engineering leaders from Verkada... ...is backed by industry-leading investors like First... ...ground up. Develop and optimize Serval’s applied AI systems... ...and fine-tuning to inference and evaluation...
Platform
Full time
Serval
San Francisco, CA
11 hours ago
Software Engineer, Productivity - Inference Runtime
...a Developer Productivity engineer to support OpenAI’s Inference Runtime teams. These teams... ...launches, inference optimizations, cloud provider integrations... ...performance-sensitive inference platforms in the world. In... ...OpenAI OpenAI is an AI research and deployment company...
Platform
Full time
OpenAI
San Francisco, CA
11 hours ago
Senior AI Engineer, Agent Harness
$166.9k - $225.9k
...brings unique perspectives that lead to better solutions.... ...automation by integrating intelligent AI capabilities into its trust platform. We are seeking a Senior AI Engineer to help design, build, and... ...in production environments Optimize for latency, cost,...
Platform
Senior
Full time
Work at office
Immediate start
Worldwide
Monday to Friday
Flexible hours
Drata
San Francisco, CA
11 hours ago
Founding AI Engineer
$150k - $220k
...Optimized deploys AI agents into the operations that run the physical economy... ..., and we need a founding engineer to own it. As a Founding AI... ...Engineer, you'll architect the platform our agents run on:... ...and integration interfaces, inference and serving, observability,...
Platform
Optimized, Inc.
San Francisco, CA
1 day ago

Do you want to receive more vacancies?

Subscribe and receive similar vacancies to Sr. Lead AI Engineer (Inference Optimization, FM hosting, AI Platform). Be the first to apply!