Senior Lead AI Engineer (FM Hosting, LLM Inference)

$229.9k - $262.4k

Capital One

Overview:

At Capital One, we are creating responsible and reliable AI systems, changing banking for good. For years, Capital One has been an industry leader in using machine learning to create real-time, personalized customer experiences. Our investments in technology infrastructure and world-class talent, along with our deep experience in machine learning, position us to be at the forefront of enterprises leveraging AI. From informing customers about unusual charges to answering their questions in real time, our applications of AI & ML are bringing humanity and simplicity to banking. We are committed to continuing to build world-class applied science and engineering teams to deliver our industry-leading capabilities with breakthrough product experiences and scalable, high-performance AI infrastructure. At Capital One, you will help bring the transformative power of emerging AI capabilities to reimagine how we serve our customers and businesses who have come to love the products and services we build.

Team Description:

The Intelligent Foundations and Experiences (IFX) team is at the center of bringing our vision for AI at Capital One to life. We work hand-in-hand with our partners across the company to advance the state of the art in science and AI engineering, and we build and deploy proprietary solutions that are central to our business and deliver value to millions of customers. Our AI models and platforms empower teams across Capital One to enhance their products with the transformative power of AI, in responsible and scalable ways for the highest leverage impact.

In this role, you will:

  • Partner with a cross-functional team of engineers, research scientists, technical program managers, and product managers to deliver AI-powered products that change how our associates work and how our customers interact with Capital One.

  • Design, develop, test, deploy, and support AI software components, including foundation model training, large language model inference, similarity search, guardrails, model evaluation, experimentation, governance, and observability.

  • Leverage a broad stack of open-source and SaaS AI technologies such as AWS UltraClusters, Hugging Face, VectorDBs, NeMo Guardrails, PyTorch, and more.

  • Invent and introduce state-of-the-art LLM optimization techniques to improve the performance (scalability, cost, latency, throughput) of large-scale production AI systems.

  • Contribute to the technical vision and the long-term roadmap of foundational AI systems at Capital One.

The Ideal Candidate:

  • You love to build systems, take pride in the quality of your work, and also share our passion to do the right thing. You want to work on problems that will help change banking for good.

  • You are passionate about staying abreast of the latest research, and you can intuitively understand scientific publications and judiciously apply novel techniques in production.

  • You adapt quickly and thrive on bringing clarity to big, undefined problems. You love asking questions and digging deep to uncover the root of problems, and you can articulate your findings concisely and clearly. You have the courage to share new ideas even when they are unproven.

  • You are deeply technical. You possess a strong foundation in engineering and mathematics, and your expertise in hardware, software, and AI enables you to see and exploit optimization opportunities that others miss.

  • You are a resilient trailblazer who can forge new paths to achieve business goals when the route is unknown.

Basic Qualifications:

  • Bachelor's degree in Computer Science, AI, Electrical Engineering, Computer Engineering, or related fields plus at least 6 years of experience developing AI and ML algorithms or technologies, or a Master's degree in Computer Science, AI, Electrical Engineering, Computer Engineering, or related fields plus at least 4 years of experience developing AI and ML algorithms or technologies

  • At least 6 years of experience programming with Python, Go, Scala, or Java

Preferred Qualifications:

  • 7 years of experience deploying scalable and responsible AI solutions on cloud platforms (e.g. AWS, Google Cloud, Azure, or equivalent private cloud)

  • Experience designing, developing, integrating, delivering, and supporting complex AI systems

  • Demonstrated ability to lead and mentor an engineering team and influence cross-functional stakeholders

  • Experience developing AI and ML algorithms or technologies (e.g. LLM Inference, Similarity Search and VectorDBs, Guardrails, Memory) using Python, C++, C#, Java, or Golang

  • Experience developing and applying state-of-the-art techniques for optimizing training and inference software to improve hardware utilization, latency, throughput, and cost

  • Passion for staying abreast of the latest AI research and AI systems, and for judiciously applying novel techniques in production

  • Excellent communication and presentation skills, with the ability to articulate complex AI concepts to peers

Capital One will consider sponsoring a new qualified applicant for employment authorization for this position.

The minimum and maximum full-time annual salaries for this role are listed below, by location. Please note that this salary information is solely for candidates hired to perform work within one of these locations, and refers to the amount Capital One is willing to pay at the time of this posting. Salaries for part-time roles will be prorated based upon the agreed upon number of hours to be regularly worked.

Cambridge, MA: $229,900 - $262,400 for Sr. Lead AI Engineer

McLean, VA: $229,900 - $262,400 for Sr. Lead AI Engineer

New York, NY: $250,800 - $286,200 for Sr. Lead AI Engineer

San Jose, CA: $250,800 - $286,200 for Sr. Lead AI Engineer

Candidates hired to work in other locations will be subject to the pay range associated with that location, and the actual annualized salary amount offered to any candidate at the time of hire will be reflected solely in the candidate's offer letter.

This role is also eligible to earn performance-based incentive compensation, which may include cash bonus(es) and/or long-term incentives (LTI). Incentives could be discretionary or non-discretionary depending on the plan.

Capital One offers a comprehensive, competitive, and inclusive set of health, financial and other benefits that support your total well-being. Learn more at the Capital One Careers website. Eligibility varies based on full or part-time status, exempt or non-exempt status, and management level.

This role is expected to accept applications for a minimum of 5 business days.

No agencies please. Capital One is an equal opportunity employer (EOE, including disability/vet) committed to non-discrimination in compliance with applicable federal, state, and local laws. Capital One promotes a drug-free workplace. Capital One will consider for employment qualified applicants with a criminal history in a manner consistent with the requirements of applicable laws regarding criminal background inquiries, including, to the extent applicable, Article 23-A of the New York Correction Law; San Francisco, California Police Code Article 49, Sections 4901-4920; New York City's Fair Chance Act; Philadelphia's Fair Criminal Records Screening Act; and other applicable federal, state, and local laws and regulations regarding criminal background inquiries.

If you have visited our website in search of information on employment opportunities or to apply for a position, and you require an accommodation, please contact Capital One Recruiting by phone or email (contact details are shown on click.appcast.io). All information you provide will be kept confidential and will be used only to the extent required to provide needed reasonable accommodations.

For technical support or questions about Capital One's recruiting process, please send an email to the address shown on click.appcast.io.

Capital One does not provide, endorse nor guarantee and is not liable for third-party products, services, educational tools or other information available through this site.

Capital One Financial is made up of several different entities. Please note that any position posted in Canada is for Capital One Canada, any position posted in the United Kingdom is for Capital One Europe and any position posted in the Philippines is for Capital One Philippines Service Corp. (COPSSC).

Vacancy posted 4 days ago