Lead AI Engineer (FM Hosting, LLM Inference)

$197.3k - $225.1k

Capital One Financial Corp

Overview

At Capital One, we are creating responsible and reliable AI systems, changing banking for good. For years, Capital One has been an industry leader in using machine learning to create real-time, personalized customer experiences. Our investments in technology infrastructure and world-class talent - along with our deep experience in machine learning - position us to be at the forefront of enterprises leveraging AI. From informing customers about unusual charges to answering their questions in real time, our applications of AI & ML are bringing humanity and simplicity to banking. We are committed to continuing to build world-class applied science and engineering teams to deliver our industry leading capabilities with breakthrough product experiences and scalable, high-performance AI infrastructure. At Capital One, you will help bring the transformative power of emerging AI capabilities to reimagine how we serve our customers and businesses who have come to love the products and services we build.

Team Description:

The Intelligent Foundations and Experiences (IFX) team is at the center of bringing our vision for AI at Capital One to life. We work hand-in-hand with our partners across the company to advance the state of the art in science and AI engineering, and we build and deploy proprietary solutions that are central to our business and deliver value to millions of customers. Our AI models and platforms empower teams across Capital One to enhance their products with the transformative power of AI, in responsible and scalable ways for the highest leverage impact.

In this role, you will:

Partner with a cross-functional team of engineers, research scientists, technical program managers, and product managers to deliver AI-powered products that change how our associates work and how our customers interact with Capital One.
Design, develop, test, deploy, and support AI software components including foundation model training, large language model inference, similarity search, guardrails, model evaluation, experimentation, governance, and observability, etc.
Leverage a broad stack of Open Source and SaaS AI technologies such as AWS Ultraclusters, Huggingface, VectorDBs, Nemo Guardrails, PyTorch, and more.
Invent and introduce state-of-the-art LLM optimization techniques to improve the performance - scalability, cost, latency, throughput - of large scale production AI systems.
Contribute to the technical vision and the long term roadmap of foundational AI systems at Capital One.

The Ideal Candidate:

You love to build systems, take pride in the quality of your work, and also share our passion to do the right thing. You want to work on problems that will help change banking for good.
Passion for staying abreast of the latest research, and an ability to intuitively understand scientific publications and judiciously apply novel techniques in production.
You adapt quickly and thrive on bringing clarity to big, undefined problems. You love asking questions and digging deep to uncover the root of problems and can articulate your findings concisely with clarity. You have the courage to share new ideas even when they are unproven.
You are deeply Technical. You possess a strong foundation in engineering and mathematics, and your expertise in hardware, software, and AI enable you to see and exploit optimization opportunities that others miss.
You are a resilient trail blazer who can forge new paths to achieve business goals when the route is unknown.

Basic Qualifications:

Bachelor's degree in Computer Science, AI, Electrical Engineering, Computer Engineering, or related fields plus at least 4 years of experience developing AI and ML algorithms or technologies, or a Master's degree in Computer Science, AI, Electrical Engineering, Computer Engineering, or related fields plus at least 2 years of experience developing AI and ML algorithms or technologies
At least 4 years of experience programming with Python, Go, Scala, or Java

Preferred Qualifications:

6 years of experience deploying scalable and responsible AI solutions on cloud platforms (e.g. AWS, Google Cloud, Azure, or equivalent private cloud)
Experience designing, developing, delivering, and supporting AI services
Experience developing AI and ML algorithms or technologies (e.g. LLM Inference, Similarity Search and VectorDBs, Guardrails, Memory) using Python, C++, C#, Java, or Golang
Experience developing and applying state-of-the-art techniques for optimizing training and inference software to improve hardware utilization, latency, throughput, and cost
Passion for staying abreast of the latest AI research and AI systems, and judiciously apply novel techniques in production

Capital One will consider sponsoring a new qualified applicant for employment authorization for this position.

The minimum and maximum full-time annual salaries for this role are listed below, by location. Please note that this salary information is solely for candidates hired to perform work within one of these locations, and refers to the amount Capital One is willing to pay at the time of this posting. Salaries for part-time roles will be prorated based upon the agreed upon number of hours to be regularly worked.

McLean, VA: $197,300 - $225,100 for Lead AI Engineer

New York, NY: $215,200 - $245,600 for Lead AI Engineer

San Francisco, CA: $215,200 - $245,600 for Lead AI Engineer

San Jose, CA: $215,200 - $245,600 for Lead AI Engineer

Candidates hired to work in other locations will be subject to the pay range associated with that location, and the actual annualized salary amount offered to any candidate at the time of hire will be reflected solely in the candidate's offer letter.

This role is also eligible to earn performance based incentive compensation, which may include cash bonus(es) and/or long term incentives (LTI). Incentives could be discretionary or non discretionary depending on the plan.

Capital One offers a comprehensive, competitive, and inclusive set of health, financial and other benefits that support your total well-being. Learn more at the Capital One Careers website. Eligibility varies based on full or part-time status, exempt or non-exempt status, and management level.

This role is expected to accept applications for a minimum of 5 business days.

No agencies please. Capital One is an equal opportunity employer (EOE, including disability/vet) committed to non-discrimination in compliance with applicable federal, state, and local laws. Capital One promotes a drug-free workplace. Capital One will consider for employment qualified applicants with a criminal history in a manner consistent with the requirements of applicable laws regarding criminal background inquiries, including, to the extent applicable, Article 23-A of the New York Correction Law; San Francisco, California Police Code Article 49, Sections 4901-4920; New York City's Fair Chance Act; Philadelphia's Fair Criminal Records Screening Act; and other applicable federal, state, and local laws and regulations regarding criminal background inquiries.

If you have visited our website in search of information on employment opportunities or to apply for a position, and you require an accommodation, please contact Capital One Recruiting at View phone number on click.appcast.io or via email at View email address on click.appcast.io. All information you provide will be kept confidential and will be used only to the extent required to provide needed reasonable accommodations.

For technical support or questions about Capital One's recruiting process, please send an email to View email address on click.appcast.io

Capital One does not provide, endorse nor guarantee and is not liable for third-party products, services, educational tools or other information available through this site.

Capital One Financial is made up of several different entities. Please note that any position posted in Canada is for Capital One Canada, any position posted in the United Kingdom is for Capital One Europe and any position posted in the Philippines is for Capital One Philippines Service Corp. (COPSSC).

Apply

Vacancy posted 5 days ago

Similar jobs that could be interesting for youBased on the Lead AI Engineer (FM Hosting, LLM Inference) in San Francisco, CA vacancy

Sr. Lead AI Engineer (Inference Optimization, FM hosting, AI Platform)
$229.9k - $262.4k
...Sr. Lead AI Engineer (Inference Optimization, FM hosting, AI Platform) Overview: At Capital One, we are creating responsible and reliable AI systems, changing... ..., and more. Invent and introduce state-of-the-art LLM optimization techniques to improve the performance —...
Suggested
Full time
Part time
Local area
Capital One
San Francisco, CA
2 days ago
Lead AI Engineer (AI Foundations, LLM Core and Agentic AI)
$193.4k - $220.7k
...responsible and reliable AI systems, changing banking... ...class applied science and engineering teams to deliver our industry leading capabilities with... ...training, large language model inference, similarity search, guardrails... ...state-of-the-art LLM optimization techniques to...
Suggested
Local area
Capital One
San Francisco, CA
1 day ago
Lead AI Software Engineer
$190k - $270k
...HP IQ is HP’s new AI innovation lab. Combining startup... ..., world‑class team—engineers, designers, researchers... .... We are looking for a Lead Software Engineer to... ...solutions for real‑time AI inference and processing.... ...Python. Proficient in LLM integration into multi‑...
Suggested
Full time
Temporary work
Local area
Flexible hours
HP IQ
San Francisco, CA
2 days ago
Lead AI Software Engineer
$190k - $270k
...HP IQ is HP's new AI innovation lab. Combining... ...diverse, world-class team-engineers, designers, researchers... .... We are looking for a Lead Software Engineer to design... ...for real-time AI inference and processing. Implement... .... ~ Proficient in LLM integration into multi-...
Suggested
Full time
Temporary work
Local area
Flexible hours
HP IQ
San Francisco, CA
23 hours ago
Lead AI Engineer (MLX, Agentic AI, Gen AI platform Services)
$197.3k - $225.1k
...Overview Lead AI Engineer (MLX, Agentic AI, Gen AI platform Services) At Capital One, we... ...foundation model training, large language model inference, similarity search, guardrails, model... ...Invent and introduce state-of-the-art LLM optimization techniques to improve the...
Suggested
Full time
Part time
Local area
Capital One Financial Corp
San Francisco, CA
3 days ago
Senior Lead AI Engineer,(MLX, Agentic AI, Gen AI platform Services)
$229.9k - $262.4k
...Senior Lead AI Engineer (MLX, Agentic AI, Gen AI platform Services) At Capital One, we are creating... ...model training, large language model inference, similarity search, guardrails, model... ...Invent and introduce state‑of‑the‑art LLM optimization techniques to improve the...
Full time
Part time
Local area
Capital One
San Francisco, CA
23 hours ago
Lead AI Engineer
$197.3k - $225.1k
Lead AI Engineer At Capital One, we are creating responsible and reliable AI systems, changing... ...foundation model training, large language model inference, similarity search, guardrails, model... ...Invent and introduce state-of-the-art LLM optimization techniques to improve the...
Full time
Part time
Local area
Capital One
San Francisco, CA
4 hours ago
Lead AI Engineer
...About the Role An early-stage, AI-native product company is hiring a Lead AI Engineer to own the model layer of its core... ...and iterating on production-grade LLM systems with real-time user impact... ...& Optimization Optimize inference speed and throughput Improve reliability...
Harnham
San Francisco, CA
1 day ago
Lead AI Engineer, Data Solutions
Salesforce is a leading AI‑powered customer relationship management platform seeking a Lead AI Engineer to develop next‑generation AI and ML systems... ...AI agents that combine LLM reasoning, tool usage, and ML... ...for training, evaluation, and inference Transform raw interaction...
Salesforce
San Francisco, CA
23 hours ago
Senior AI Inference Engineer - GPU, Rust & CUDA
$220k
...Perplexity is looking for an engineer to join their team in San Francisco. You will work on building and operating the inference engine, supporting new models, migrating GPU kernels, and developing a Rust-based serving runtime. The ideal candidate has 3+ years of experience...
Perplexity
San Francisco, CA
2 days ago
Lead AI Engineer
...assets across global electricity markets is looking to hire a Leading AI Engineer to drive the next phase of their platform. Their technology... ...with power markets, trading & asset performance data Develop LLM-driven decision-making systems used in live environments Combine...
Storm4
San Francisco, CA
23 hours ago
Lead AI Engineer
$275k - $350k
...Job Description Hybrid role As an AI Lead at Weaver AI, you will be at the forefront of integrating... ...will be responsible for setting up the LLM architecture, ensuring low latency, and... ...accuracy. Collaborate closely with the engineering team to integrate AI models into the...
Immediate start
Visa sponsorship
AI Talent Now
San Francisco, CA
23 hours ago
Lead Applied AI Engineer
$159k - $305k
...About the Team The Cognitive AI Solutions Team is building the next generation... ...The Opportunity We are seeking a Lead Applied AI Engineer to be the technical engine of that delivery... ...techniques. Use foundation models, LLM orchestration, retrieval, and...
Work experience placement
Wells Fargo
San Francisco, CA
4 days ago
Lead AI Engineer
$200k
A top global hedge fund is looking for an AI Platform Engineer to lead the development and management of cutting-edge AI infrastructure for our complex... ...-as-code tools. Experience with AI agent frameworks, LLM integration, or Model Context Protocol (MCP) implementations...
Xcede
San Francisco, CA
1 day ago
Lead AI Engineer
Hilbert's AI Hilbert is building a reasoning engine that must navigate non-deterministic user behavior across data... ...of orchestrating multi-step inference over messy, high-stakes enterprise... ...answers don't exist. We're looking for a Lead AI Engineer who can own the technical...
Shift work
Gravity Engineering Services Pvt Ltd.
San Francisco, CA
1 day ago
Distinguished AI Engineer (Agentic AI Platform)
$269.1k - $307.2k
Distinguished AI Engineer (Agentic AI Platform) At Capital One, we are... ...to deliver our industry leading capabilities with breakthrough... ...will coach and evangelize - hosting architecture office hours, mentoring... ...or technologies (e.g. LLM Inference, Similarity Search and...
Full time
Part time
Work at office
Local area
Capital One
San Francisco, CA
23 hours ago
Lead AI Platform Engineer
...Overview Join a boutique quantamental hedge fund as our Lead AI Platform Engineer. Spearhead the buildout of a new internal data lake and platform... ...models, and feature engineering Demonstrated proficiency in LLM fine-tuning, system prompting, and multi-agent frameworks (e...
Full time
Immediate start
RBF Capital, LLC
San Francisco, CA
2 days ago
AI/ML Infra Engineer - Hosting
$250k
...career? Join a rapidly growing AI cloud infrastructure provider... ...large-scale AI training and inference workloads. With expanding GPU... ...As a Senior ML Infrastructure Engineer, the successful candidate will... ...using vLLM, SGLang, TensorRT-LLM, or Triton Knowledge of GPU...
Permanent employment
San Francisco, CA
29 days ago
AI GTM Engineer
$175k - $215k
...Samba TV AI-Native Revenue Organization Leader... ...this is the person who leads that build. Samba TV... ...MCP architecture, prompt engineering standards, and enablement... ...depend on ~ Hands-on LLM production experience:... ...scale, drift monitoring, inference cost management...
Samba TV
San Francisco, CA
2 days ago
AI Engineer (Full-stack)
$180k - $300k
...this role You'll build the AI systems that power taste evals... ...across the web Set up inference serving and APIs for client-facing... ...fullstack or backend software engineering Compensation & Additional... ...Python, React, TypeScript, LLM & Agent frameworks (LangChain,...
Full time
H1b
Work at office
Visa sponsorship
Tangerine Search, Inc.
San Francisco, CA
3 days ago
Senior AI Engineer
$250k - $300k
...Senior AI Engineer San Francisco (Hybrid) At You.com, we are building the AI Search Infrastructure... ...proprietary vertical indexes with LLM-optimized retrieval systems to power AI... ...— improving agentic results, cutting inference cost and token usage, and getting strong...
Full time
Immediate start
Remote work
Work from home
Flexible hours
Y.O.U.
San Francisco, CA
4 days ago
Senior AI Engineer
...We're building an agentic AI caregiver advocate that lives... ...and needs over time. The AI engineering challenge: build an autonomous... ...design and implement the agentic LLM systems, RAG pipelines, tool-use... ...landscape evolves. Optimize inference costs - our unit economics depend...
Immediate start
Remote work
Flexible hours
Citizen Health
San Francisco, CA
1 day ago
Founding AI Engineer
$150k - $220k
...Optimized deploys AI agents into the operations that run the physical... ...part, and we need a founding engineer to own it. As a Founding AI... ...and integration interfaces, inference and serving, observability, and... ...to-end Have built or deployed LLM-powered agents, tool-use...
Optimized, Inc.
San Francisco, CA
23 hours ago
Senior AI Engineer
$210k - $270k
...to help them hire. Senior AI Engineer Location - San Francisco... ...model accuracy, evaluation, inference, monitoring, and production AI... ...Teams deploying production LLM systems Organizations focused... ...Opportunity to build and lead the AI function Access to...
Remote work
Visa sponsorship
Recruiting from Scratch
San Francisco, CA
2 days ago
AI Engineer (Full-stack)
$180k - $300k
...the United States to help them hire. AI Engineer (Full-stack) Location: San Francisco,... ...scraping systems for visual datasets Build inference serving systems Develop APIs powering... ...AI systems Experience working with LLM systems Strong systems design...
H1b
Work at office
Remote work
Visa sponsorship
Recruiting from Scratch
San Francisco, CA
2 days ago
AI Engineer
$180k - $250k
...AI Engineer We're hiring a full-time AI Engineer to own the prompts, agents, evals, and... ...For Hands-on experience building LLM-powered features that shipped to real users... ...distributed compute frameworks for batch inference, eval pipelines, or scaling agent...
Full time
Work at office
Remote work
Relocation
Fluency Corp
San Francisco, CA
1 day ago
AI Engineer
$100k - $120k
...Junior-Level Ai Developer At Jacobs, we're challenging... ...with a solid software engineering foundation and a drive... ...wants to grow into a leading voice on our AI... ...back-end APIs, and cloud-hosted AI services Evaluate... ...building applications with AI/LLM APIs and frameworks ~...
Full time
Remote work
Jacobs Solutions
San Francisco, CA
3 days ago
Lead AI Engineer
$275k - $350k
...Lead AI Engineer Title of Role: Lead AI Engineer Location: San Francisco, hybrid Company Stage of Funding: Venture-Backed - AI, Software Development, Devtools, B2B Office Type: Hybrid Salary: $275K-$350K Company Description We're representing...
Work at office
Recruiting from Scratch
San Francisco, CA
4 hours ago
AI Engineer
...partnering with a fast-moving technology company to find an AI Engineer to build and ship LLM-powered applications. Our client is looking for someone... ...used daily by thousands of users. Build and optimize inference pipelines and real-time speech recognition systems....
Full time
Work experience placement
Work at office
Remote work
twenty80.io
San Francisco, CA
1 day ago
Founding AI Engineer
...building the next-generation retrieval engine for AI systems. We’re rethinking search from the... ...and what it takes to scale training and inference pipelines in the real world. You want to... ...Background in information retrieval, NLP, or LLM internals. Familiarity with type‑safe...
Zeroentropy
San Francisco, CA
2 days ago

Do you want to receive more vacancies?

Subscribe and receive similar vacancies to Lead AI Engineer (FM Hosting, LLM Inference). Be the first to apply!