Sign up to access all features of our service.
  • Job search
  • Favorites
  • Create a CV
    New
  • Salaries
  • Subscriptions

Lead AI Engineer (FM Hosting, LLM Inference)

$197.3k - $225.1k

Capital One Financial Corp

Lead AI Engineer (FM Hosting, LLM Inference)

Overview

At Capital One, we are creating responsible and reliable AI systems, changing banking for good. For years, Capital One has been an industry leader in using machine learning to create real-time, personalized customer experiences. Our investments in technology infrastructure and world-class talent - along with our deep experience in machine learning - position us to be at the forefront of enterprises leveraging AI. From informing customers about unusual charges to answering their questions in real time, our applications of AI & ML are bringing humanity and simplicity to banking. We are committed to continuing to build world-class applied science and engineering teams to deliver our industry leading capabilities with breakthrough product experiences and scalable, high-performance AI infrastructure. At Capital One, you will help bring the transformative power of emerging AI capabilities to reimagine how we serve our customers and businesses who have come to love the products and services we build.

Team Description:

The Intelligent Foundations and Experiences (IFX) team is at the center of bringing our vision for AI at Capital One to life. We work hand-in-hand with our partners across the company to advance the state of the art in science and AI engineering, and we build and deploy proprietary solutions that are central to our business and deliver value to millions of customers. Our AI models and platforms empower teams across Capital One to enhance their products with the transformative power of AI, in responsible and scalable ways for the highest leverage impact.

In this role, you will:
  • Partner with a cross-functional team of engineers, research scientists, technical program managers, and product managers to deliver AI-powered products that change how our associates work and how our customers interact with Capital One.
  • Design, develop, test, deploy, and support AI software components including foundation model training, large language model inference, similarity search, guardrails, model evaluation, experimentation, governance, and observability, etc.
  • Leverage a broad stack of Open Source and SaaS AI technologies such as AWS Ultraclusters, Huggingface, VectorDBs, Nemo Guardrails, PyTorch, and more.
  • Invent and introduce state-of-the-art LLM optimization techniques to improve the performance - scalability, cost, latency, throughput - of large scale production AI systems.
  • Contribute to the technical vision and the long term roadmap of foundational AI systems at Capital One.
The Ideal Candidate:
  • You love to build systems, take pride in the quality of your work, and also share our passion to do the right thing. You want to work on problems that will help change banking for good.
  • Passion for staying abreast of the latest research, and an ability to intuitively understand scientific publications and judiciously apply novel techniques in production.
  • You adapt quickly and thrive on bringing clarity to big, undefined problems. You love asking questions and digging deep to uncover the root of problems and can articulate your findings concisely with clarity. You have the courage to share new ideas even when they are unproven.
  • You are deeply Technical. You possess a strong foundation in engineering and mathematics, and your expertise in hardware, software, and AI enable you to see and exploit optimization opportunities that others miss.
  • You are a resilient trail blazer who can forge new paths to achieve business goals when the route is unknown.
Basic Qualifications:
  • Bachelor's degree in Computer Science, AI, Electrical Engineering, Computer Engineering, or related fields plus at least 4 years of experience developing AI and ML algorithms or technologies, or a Master's degree in Computer Science, AI, Electrical Engineering, Computer Engineering, or related fields plus at least 2 years of experience developing AI and ML algorithms or technologies
  • At least 4 years of experience programming with Python, Go, Scala, or Java
Preferred Qualifications:
  • 6 years of experience deploying scalable and responsible AI solutions on cloud platforms (e.g. AWS, Google Cloud, Azure, or equivalent private cloud)
  • Experience designing, developing, delivering, and supporting AI services
  • Experience developing AI and ML algorithms or technologies (e.g. LLM Inference, Similarity Search and VectorDBs, Guardrails, Memory) using Python, C++, C#, Java, or Golang
  • Experience developing and applying state-of-the-art techniques for optimizing training and inference software to improve hardware utilization, latency, throughput, and cost
  • Passion for staying abreast of the latest AI research and AI systems, and judiciously apply novel techniques in production

Capital One will consider sponsoring a new qualified applicant for employment authorization for this position.

The minimum and maximum full-time annual salaries for this role are listed below, by location. Please note that this salary information is solely for candidates hired to perform work within one of these locations, and refers to the amount Capital One is willing to pay at the time of this posting. Salaries for part-time roles will be prorated based upon the agreed upon number of hours to be regularly worked.

McLean, VA: $197,300 - $225,100 for Lead AI Engineer

New York, NY: $215,200 - $245,600 for Lead AI Engineer

San Francisco, CA: $215,200 - $245,600 for Lead AI Engineer

San Jose, CA: $215,200 - $245,600 for Lead AI Engineer

Candidates hired to work in other locations will be subject to the pay range associated with that location, and the actual annualized salary amount offered to any candidate at the time of hire will be reflected solely in the candidate's offer letter.

This role is also eligible to earn performance based incentive compensation, which may include cash bonus(es) and/or long term incentives (LTI). Incentives could be discretionary or non discretionary depending on the plan.

Capital One offers a comprehensive, competitive, and inclusive set of health, financial and other benefits that support your total well-being. Learn more at the Capital One Careers website. Eligibility varies based on full or part-time status, exempt or non-exempt status, and management level.

This role is expected to accept applications for a minimum of 5 business days.

No agencies please. Capital One is an equal opportunity employer (EOE, including disability/vet) committed to non-discrimination in compliance with applicable federal, state, and local laws. Capital One promotes a drug-free workplace. Capital One will consider for employment qualified applicants with a criminal history in a manner consistent with the requirements of applicable laws regarding criminal background inquiries, including, to the extent applicable, Article 23-A of the New York Correction Law; San Francisco, California Police Code Article 49, Sections 4901-4920; New York City's Fair Chance Act; Philadelphia's Fair Criminal Records Screening Act; and other applicable federal, state, and local laws and regulations regarding criminal background inquiries.

If you have visited our website in search of information on employment opportunities or to apply for a position, and you require an accommodation, please contact Capital One Recruiting at View phone number on click.appcast.io or via email at View email address on click.appcast.io. All information you provide will be kept confidential and will be used only to the extent required to provide needed reasonable accommodations.

For technical support or questions about Capital One's recruiting process, please send an email to View email address on click.appcast.io

Capital One does not provide, endorse nor guarantee and is not liable for third-party products, services, educational tools or other information available through this site.

Capital One Financial is made up of several different entities. Please note that any position posted in Canada is for Capital One Canada, any position posted in the United Kingdom is for Capital One Europe and any position posted in the Philippines is for Capital One Philippines Service Corp. (COPSSC).
Vacancy posted 3 days ago
Similar jobs that could be interesting for youBased on the Lead AI Engineer (FM Hosting, LLM Inference) in San Francisco, CA vacancy
  • $229.9k - $262.4k

     ...Sr. Lead AI Engineer (Inference Optimization, FM hosting, AI Platform) Overview: At Capital One, we are creating responsible and reliable AI systems,...  ...and more. ~ Invent and introduce state-of-the-art LLM optimization techniques to improve the performance -... 
    Suggested
    Full time
    Part time
    Local area

    Capital One Financial Corp

    San Francisco, CA
    4 days ago
  • $197.3k - $225.1k

     ...Lead AI Engineer (AI Foundations, LLM Core and Agentic AI) Overview At Capital One, we are creating responsible and reliable AI systems, changing...  ...foundation model training, large language model inference, similarity search, guardrails, model evaluation, experimentation... 
    Suggested
    Full time
    Part time
    Local area

    Capital One Financial Corp

    San Francisco, CA
    3 days ago
  • $300k - $400k

    Noon is looking for a Lead AI Engineer to scale AI infrastructure and product capabilities. You will build LLM-powered features, and define the technical direction of AI systems. Candidates should have over 5 years experience in software engineering and 2 years of hands... 
    Suggested

    Noon

    San Francisco, CA
    3 days ago
  •  ...Hilbert is building a reasoning engine that must navigate non-...  ...problem of orchestrating multi-step inference over messy, high-stakes...  ...'t exist. We're looking for a Lead AI Engineer who can own the technical...  ...(RAG) , vector databases, or LLM-powered search and recommendation... 
    Suggested
    Shift work

    Hilbert's Ai

    San Francisco, CA
    1 day ago
  • $197.3k - $225.1k

     ...Lead AI Engineer At Capital One, we are creating responsible and reliable AI systems, changing...  ...model training, large language model inference, similarity search, guardrails, model evaluation...  ...Invent and introduce state-of-the-art LLM optimization techniques to improve the... 
    Suggested
    Full time
    Part time
    Local area

    Capital One Financial Corp

    San Francisco, CA
    3 days ago
  • $225.4k - $257.2k

     ...responsible and reliable AI systems, changing banking...  ...class applied science and engineering teams to deliver our industry leading capabilities with...  ...training, large language model inference, similarity search, guardrails...  ...state‑of‑the‑art LLM optimization techniques to... 
    Full time
    Part time
    Local area

    Capital One National Association

    San Francisco, CA
    17 hours ago
  • $190k - $270k

     ...HP IQ is HP's new AI innovation lab. Combining...  ...diverse, world-class team-engineers, designers, researchers...  .... We are looking for a Lead Software Engineer to design...  ...for real-time AI inference and processing. Implement...  .... ~ Proficient in LLM integration into multi-... 
    Full time
    Temporary work
    Local area
    Flexible hours

    HP IQ

    San Francisco, CA
    3 days ago
  • $225.4k - $257.2k

     ...responsible and reliable AI systems, changing banking...  ...class applied science and engineering teams to deliver our industry leading capabilities with...  ...training, large language model inference, similarity search, guardrails...  ...state‑of‑the‑art LLM optimization techniques to... 
    Full time
    Part time
    Local area

    Capital One National Association

    San Francisco, CA
    2 days ago
  •  ...Lead AI Engineer – Salesforce Lead AI Engineer at Salesforce responsible for building next‑generation...  ...) Design AI agents that combine LLM reasoning, tool usage, and ML decisioning...  ...‑time) for training, evaluation, and inference Transform raw interaction data into features... 

    Salesforce.Com Inc

    San Francisco, CA
    1 day ago
  • $172.5k - $260.1k

     ...Job Category Software Engineering Job Details About...  ...Salesforce is the #1 AI CRM, where humans with...  ...career at the company leading workforce transformation...  ...AI agents that combine LLM reasoning, tool usage,...  ...training, evaluation, and inference Transform raw... 

    Salesforce

    San Francisco, CA
    17 hours ago
  •  ...looking for a Model Performance Engineer in San Francisco, California to optimize model inference speed, cost, and reliability. You...  ...that accelerates the AI team’s processes. The role covers...  ...should have deep experience with LLM serving frameworks, substantial... 

    Pantera Capital

    San Francisco, CA
    1 day ago
  • $229.9k - $262.4k

     ...Senior Lead AI Engineer (Gen AI Platform Services, Agentic AI) Overview At Capital One, we are...  ...foundation model training, large language model inference, similarity search, guardrails, model...  ..., and more. Invent state‑of‑the‑art LLM optimization techniques to improve... 
    Local area

    Capital One National Association

    San Francisco, CA
    17 hours ago
  • $229.9k - $262.4k

     ...Senior Lead AI Engineer (Gen AI Platform Services, Agentic AI) Overview: At Capital One...  ...foundation model training, large language model inference, similarity search, guardrails, model...  ...Invent and introduce state-of-the-art LLM optimization techniques to improve the... 
    Full time
    Part time
    Local area

    Capital One Financial Corp

    San Francisco, CA
    1 day ago
  • Principal AI Engineer (LLM Agents & Orchestration) Role Title: Principal AI Engineer (LLM Agents...  ...in Large Language Models (LLMs) to lead the architectural development of our new...  ...reliably. Latency & Reliability: Optimize inference pipelines for speed (streaming, token... 

    ImagineArt

    San Francisco, CA
    4 days ago
  •  ...people interact with the web by building AI agents that can reliably do everyday digital...  ...reward models) Scale infra for agentic inference (throughput and latency of perception‑...  ...generalist web‑agent Work closely with product engineers to translate cutting‑edge AI capabilities... 
    Work at office
    Relocation
    Visa sponsorship

    Yutori

    San Francisco, CA
    4 days ago
  • $220k

    We build and run the inference engine behind every Perplexity query and deploy dozens of model architectures at scale with tight latency and...  ...systems programming experience is a plus. You understand modern LLM architectures and are able to bring them up reliably in a... 

    Perplexity

    San Francisco, CA
    1 day ago
  • A leading financial institution in California seeks a Senior Lead AI Engineer to design and develop innovative AI products. The successful candidate will collaborate with diverse teams to optimize AI systems and solutions, contributing significantly to modern banking. A... 

    Capital One National Association

    San Francisco, CA
    4 days ago
  • $167.4k - $205.1k

     ...Sr AI/ML Developer JLL empowers you to shape a brighter way. Our people at JLL...  ...and build intelligent AI agent systems, LLM-based applications, and autonomous workflow...  ...Demonstrated proficiency in LLM fine-tuning, prompt engineering, and model optimization Demonstrated... 
    Daily paid

    JLL Technology Solutions (formerly BRG)

    San Francisco, CA
    15 days ago
  • $175k - $225k

     ...security. Our team is led by veteran operators and engineers, alumni of Sonos, Paypal, Tesla, Apple, and...  ...A* and Atomic with participation from other leading venture capital firms. The Role We're looking for an AI Inference Engineer who lives at the boundary of high-... 
    Local area
    Remote work

    Sauron

    San Francisco, CA
    2 days ago
  • $167.4k - $205.1k

     ...Sr AI/ML Developer JLL empowers you to shape a brighter way. Our people at JLL...  ...and build intelligent AI agent systems, LLM-based applications, and autonomous workflow...  ...Demonstrated proficiency in LLM fine-tuning, prompt engineering, and model optimization Demonstrated... 
    Daily paid

    JLL

    San Francisco, CA
    5 days ago
  • $220k

    Perplexity is looking for an engineer to join their team in San Francisco. You will work on building and operating the inference engine, supporting new models, migrating GPU kernels, and developing a Rust-based serving runtime. The ideal candidate has 3+ years of experience... 

    Perplexity

    San Francisco, CA
    17 hours ago
  • A leading cloud infrastructure company is seeking a Senior Engineer 2 to join their AI Inference Optimization team. The role involves leading the technical strategy for performance architecture and addressing complex performance issues ensuring industry-leading service... 
    Remote job

    DigitalOcean

    San Francisco, CA
    1 day ago
  • Linuxcareers is seeking a Software Engineer in San Francisco to design and maintain AI workflows that automate business processes. The engineer will manage cloud infrastructure, including LLM integration and production support. The ideal candidate will have strong cloud... 

    Linuxcareers

    San Francisco, CA
    3 days ago
  • Health Harbor, located in San Francisco, is seeking experienced engineers to build and scale their Voice AI LLM and orchestration system. The role demands strong problem-solving skills and the ability to work under high pressure, with a commitment of about 70 hours a week... 
    Flexible hours

    Health Harbor

    San Francisco, CA
    4 days ago
  • $180k - $250k

    Fluency in San Francisco is hiring a full-time AI Engineer to create LLM-powered features and improve AI output quality. You will build new features end to end, partner with product engineers, and evaluate models. Applicants should have experience in TypeScript or Python... 
    Full time

    Fluency

    San Francisco, CA
    2 days ago
  • $166.5k - $266.2k

     ...something unprecedented — an AI foundation that will push the...  ...areas. The Forward Deployed AI Engineer is the connective tissue between...  ...of multi‑omics data, leading end‑to‑end deployments from scoping...  ...and on‑call readiness Apply LLM, retrieval‑augmented generation... 
    Full time
    Flexible hours

    Eli Lilly and Company

    San Francisco, CA
    4 days ago
  • $203.5k

     ...professionally and personally. About Bain AI, Insights & Solutions (AIS)...  ..., design, architecture and engineering, and client stakeholders,...  ...You will design and implement LLM-driven applications and...  ...and non-technical stakeholders; lead working sessions, present recommendations... 
    Full time
    Temporary work
    Apprenticeship
    Work at office
    Local area
    Work from home
    Home office
    3 days per week

    Bain & Company

    San Francisco, CA
    4 days ago
  • $167.2k - $209k

    A leading cloud service provider is seeking a Senior Engineer 2 for their AI Inference Data Plane team. This remote role focuses on designing and developing high-scale, resilient data plane services that enhance AI-driven applications. The ideal candidate will have strong... 
    Remote job

    DigitalOcean

    San Francisco, CA
    10 days ago
  • Sciforium is seeking a skilled Distributed Training and Inference Engineer to optimize critical software stacks for AI training. The ideal candidate has over 5 years' experience and deep knowledge of ML frameworks. Responsibilities include maintaining ML libraries, optimizing... 
    Flexible hours

    Sciforium

    San Francisco, CA
    1 day ago
  • An innovative AI company is seeking a Software Engineer to develop infrastructure that supports AI training and inference workflows. This role requires strong object-oriented programming skills and a solid foundation in data structures and algorithms. The ideal candidate... 

    SpreeAI

    San Francisco, CA
    1 day ago

Do you want to receive more vacancies?

Subscribe and receive similar vacancies to Lead AI Engineer (FM Hosting, LLM Inference). Be the first to apply!