Sign up to access all features of our service.
  • Job search
  • Favorites
  • Create a CV
    New
  • Salaries
  • Subscriptions

Sr. Lead AI Engineer (Inference Optimization, FM hosting, AI Platform)

$229.9k - $262.4k

Information Technology Senior Management Forum

Sr. Lead AI Engineer (Inference Optimization, FM Hosting, AI Platform) Overview At Capital One, we are creating responsible and reliable AI systems, changing banking for good. For years, Capital One has been an industry leader in using machine learning to create real‑time, personalized customer experiences. Our investments in technology infrastructure and world‑class talent—along with our deep experience in machine learning—position us to be at the forefront of enterprises leveraging AI. From informing customers about unusual charges to answering their questions in real time, our applications of AI & ML are bringing humanity and simplicity to banking. We are committed to continuing to build world‑class applied science and engineering teams to deliver our industry leading capabilities with breakthrough product experiences and scalable, high‑performance AI infrastructure. At Capital One, you will help bring the transformative power of emerging AI capabilities to reimagine how we serve our customers and businesses who have come to love the products and services we build. Team Description The Intelligent Foundations and Experiences (IFX) team is at the center of bringing our vision for AI at Capital One to life. We work hand‑in‑hand with our partners across the company to advance the state of the art in science and AI engineering, and we build and deploy proprietary solutions that are central to our business and deliver value to millions of customers. Our AI models and platforms empower teams across Capital One to enhance their products with the transformative power of AI, in responsible and scalable ways for the highest leverage impact. In this role, you will Partner with a cross‑functional team of engineers, research scientists, technical program managers, and product managers to deliver AI‑powered products that change how our associates work and how our customers interact with Capital One. Design, develop, test, deploy, and support AI software components including foundation model training, large language model inference, similarity search, guardrails, model evaluation, experimentation, governance, and observability, etc. Leverage a broad stack of Open Source and SaaS AI technologies such as AWS Ultraclusters, Huggingface, VectorDBs, Nemo Guardrails, PyTorch, and more. Invent and introduce state‑of‑the‑art LLM optimization techniques to improve the performance—scalability, cost, latency, throughput—of large‑scale production AI systems. Contribute to the technical vision and the long‑term roadmap of foundational AI systems at Capital One. The Ideal Candidate You love to build systems, take pride in the quality of your work, and also share our passion to do the right thing. You want to work on problems that will help change banking for good. Passion for staying abreast of the latest research, and an ability to intuitively understand scientific publications and judiciously apply novel techniques in production. You adapt quickly and thrive on bringing clarity to big, undefined problems. You love asking questions and digging deep to uncover the root of problems and can articulate your findings concisely with clarity. You have the courage to share new ideas even when they are unproven. You are deeply technical. You possess a strong foundation in engineering and mathematics, and your expertise in hardware, software, and AI enable you to see and exploit optimization opportunities that others miss. You are a resilient trail‑blazer who can forge new paths to achieve business goals when the route is unknown. Basic Qualifications Minimum bachelor's degree in Computer Science, AI, Electrical Engineering, Computer Engineering, or related fields and at least 6 years of experience developing AI and ML algorithms or technologies, OR a master’s degree in the same fields with at least 4 years of experience. At least 6 years of experience programming with Python, Go, Scala, or Java. Preferred Qualifications 7 years of experience deploying scalable and responsible AI solutions on cloud platforms (e.g., AWS, Google Cloud, Azure, or equivalent private cloud). Experience designing, developing, integrating, delivering, and supporting complex AI systems. Demonstrated ability to lead and mentor an engineering team and influence cross‑functional stakeholders. Experience developing AI and ML algorithms or technologies (e.g., LLM inference, similarity search and vector databases, guardrails, memory) using Python, C++, C#, Java, or Go. Experience developing and applying state‑of‑the‑art techniques for optimizing training and inference software to improve hardware utilization, latency, throughput, and cost. Passion for staying abreast of the latest AI research and AI systems and judiciously applying novel techniques in production. Excellent communication and presentation skills, with the ability to articulate complex AI concepts to peers. Capital One will consider sponsoring a new qualified applicant for employment authorization for this position. Cambridge, MA: $229,900 - $262,400 for Sr. Lead AI Engineer McLean, VA: $229,900 - $262,400 for Sr. Lead AI Engineer New York, NY: $250,800 - $286,200 for Sr. Lead AI Engineer San Francisco, CA: $250,800 - $286,200 for Sr. Lead AI Engineer San Jose, CA: $250,800 - $286,200 for Sr. Lead AI Engineer This role is also eligible to earn performance‑based incentive compensation, which may include cash bonus(es) and/or long‑term incentives (LTI). Capital One offers a comprehensive, competitive, and inclusive set of health, financial and other benefits that support your total well‑being. Learn more at the Capital One Careers website. Eligibility varies based on full or part‑time status, exempt or non‑exempt status, and management level. This role is expected to accept applications for a minimum of 5 business days. No agencies please. Capital One is an equal‑opportunity employer (EOE, including disability/vet) committed to non‑discrimination in compliance with applicable federal, state, and local laws. Capital One promotes a drug‑free workplace. It will consider for employment qualified applicants with a criminal history in a manner consistent with the requirements of applicable laws regarding criminal background inquiries. If you need accommodations, please contact Capital One Recruiting at View phone number on click.appcast.io or via email at View email address on click.appcast.io. All information will be kept confidential and used only to the extent required to provide needed reasonable accommodations. For technical support or questions about Capital One's recruiting process, please send an email to View email address on click.appcast.io. Capital One does not provide, endorse, or guarantee and is not liable for third‑party products, services, educational tools or other information available through this site. Capital One Financial is made up of several different entities. Positions posted in Canada are for Capital One Canada, in the United Kingdom for Capital One Europe, and in the Philippines for Capital One Philippines Service Corp. (COPSSC). #J-18808-Ljbffr

Vacancy posted 1 day ago
Similar jobs that could be interesting for youBased on the Sr. Lead AI Engineer (Inference Optimization, FM hosting, AI Platform) in San Jose, CA vacancy
  • $229.9k - $262.4k

     ...Sr. Lead AI Engineer (Inference Optimization, FM hosting, AI Platform) Overview: At Capital One, we are creating responsible and reliable AI systems, changing banking for good. For years, Capital One has been an industry leader in using machine learning to create... 
    Platform
    Senior
    Full time
    Part time
    Local area

    Capital One Financial Corp

    San Jose, CA
    3 days ago
  •  ...Capital One is hiring a Sr. Lead AI Engineer in San Jose, CA to innovate AI systems that enhance customer interactions. This role requires leading cross-functional teams to develop and support AI-powered solutions that are robust and scalable. The ideal candidate has... 
    Platform
    Senior

    Information Technology Senior Management Forum

    San Jose, CA
    1 day ago
  • $229.9k - $262.4k

     ...Sr. Lead AI Engineer (Gen AI Platform Services) Overview At Capital One, we are creating responsible and...  ...model training, large language model inference, similarity search, guardrails,...  ...and introduce state‑of‑the‑art LLM optimization techniques to improve the performance... 
    Platform
    Senior
    Local area

    Capital One National Association

    San Jose, CA
    5 days ago
  • $229.9k - $262.4k

     ...Sr. Lead AI Engineer (Gen AI Platform Services) Overview: At Capital One, we are creating responsible...  ...model training, large language model inference, similarity search, guardrails,...  ...and introduce state-of-the-art LLM optimization techniques to improve the... 
    Platform
    Senior
    Full time
    Part time
    Local area

    Capital One

    San Jose, CA
    15 days ago
  • ## Sr AI/Agentic EngineerApplylocations...  ...*The Senior AI Engineer will deliver...  ...Intelligence, the AI team lead, and the Senior...  ...the shared AI platform — the prompt...  ...harness, and inference routing layer —...  ...and Bedrock-hosted models) to...  ...tuning, and prompt optimization techniques.*... 
    Platform
    Senior
    Full time
    Work at office
    Local area
    Shift work

    Lendistry

    Santa Clara, CA
    4 days ago
  • $184k - $356.5k

    NVIDIA Gruppe is looking for skilled software engineers to develop AI inference systems that operate with high efficiency. The role involves architecting high-performance inference frameworks and optimizing GPU processes. Ideal candidates should have extensive programming... 
    Senior

    NVIDIA Gruppe

    Santa Clara, CA
    3 days ago
  • $212.8k

     ...to building the inference infrastructure for...  ...frontier multimodal AI systems. Our...  ...throughput and latency optimization, kernel...  ...Science, Software Engineering, Artificial Intelligence...  ...TikTok is the leading destination for short...  .... Our platform connects people from... 
    Platform
    Senior
    Temporary work
    Local area

    Tik Tok

    San Jose, CA
    1 day ago
  • $250.8k - $286.2k

     ...A leading financial services firm in San Jose, CA seeks a Sr. Lead AI Engineer to enhance banking with AI technologies. The ideal candidate will have extensive experience...  ...functional teams, designing AI software, and optimizing performance. This role offers a salary range... 
    Platform
    Senior

    Capital One National Association

    San Jose, CA
    5 days ago
  • $314.8k - $359.3k

     ...Sr. Distinguished AI Engineer (Agentic AI Platform) Overview: At Capital One, we...  ...deliver our industry leading capabilities with...  ...end performance by optimizing orchestration -...  ...and evangelize - hosting architecture office...  ...technologies (e.g. LLM Inference, Similarity Search... 
    Platform
    Senior
    Full time
    Part time
    Work at office
    Local area

    Capital One

    San Jose, CA
    16 days ago
  • $250.8k - $286.2k

     ...Capital One National Association in San Jose is seeking a Senior Lead AI Engineer to design, develop, and support AI-powered products that...  ...experiences using techniques like machine learning and optimization strategies. The compensation ranges from $250,800 to $286,20... 
    Platform
    Senior

    Capital One National Association

    San Jose, CA
    5 days ago
  • $272k - $431.25k

     ...looking for a Principal Software Engineer to advance open-source AI inference. This hands-on role emphasizes running...  ...-performance inference on NVIDIA platforms and involves collaboration across...  .... Key responsibilities include optimizing inference runtimes, improving efficiency... 
    Platform

    NVIDIA Gruppe

    Santa Clara, CA
    5 days ago
  • $197.3k - $225.1k

     ...Lead AI Engineer Overview At Capital One, we are creating responsible...  .... Our AI models and platforms empower teams across Capital...  ...training, large language model inference, similarity search,...  ...introduce state-of-the-art LLM optimization techniques to improve the performance... 
    Platform
    Full time
    Part time
    Local area

    Capital One Financial Corp

    San Jose, CA
    12 days ago
  • $152.93k - $254.88k

     ...possible. Our industry-leading portfolio unlocks...  ...this in a simple and optimized way by connecting people...  ...are looking for a Lead AI Engineer to help build our next...  ...generation Agentic AI platform from 0-1. This is a hands...  ...across AI pipelines, inference services, orchestration... 
    Platform

    BMC Software

    Santa Clara, CA
    4 days ago
  • NVIDIA Corporation is looking for a Senior Inference Engineer to advance AIConfigurator, enhancing model serving and performance for large-...  ...integrating complex deployment configurations on NVIDIA GPU platforms. The ideal candidate will have over 10 years of software engineering... 
    Platform
    Senior

    NVIDIA Corporation

    Santa Clara, CA
    3 days ago
  • $197.3k - $225.1k

     ...Lead AI Engineer (AI Foundations, LLM Core and Agentic AI) Overview At...  ...customers. Our AI models and platforms empower teams across...  ...training, large language model inference, similarity search, guardrails...  ...state-of-the-art LLM optimization techniques to improve the performance... 
    Platform
    Full time
    Part time
    Local area

    Capital One Financial Corp

    San Jose, CA
    2 days ago
  •  ...Role: AI Inference Engineer Location: San Jose, CA Duration: 6 to 12 Months Overview...  ...of systems engineering, GPU optimization, and distributed infrastructure, and...  ...OpenShift, Helm, or similar orchestration platforms Implement tensor parallelism, pipeline... 
    Platform

    Triune Infomatics Inc

    San Jose, CA
    8 days ago
  • $110k - $145k

     ...Sr. AI Machine Learning Engineer Position Overview We are looking for a talented...  ..., including safe inference strategies to ensure reliable...  ...and pipelines Design and optimize LLM architectures to improve...  ...experience with cloud computing platforms (e.g., AWS, Azure, Google... 
    Platform
    Senior

    A10 Networks

    San Jose, CA
    3 days ago
  •  ...experiences-from AI and data centers,...  ...Senior Staff AI Infra Engineer who is passionate...  ...and software to optimize performance for next...  ...: • Lead technical initiatives...  ...LLM training and inference on AMD GPUs, improving...  ...training or inference platforms using Kubernetes,... 
    Platform

    Advanced Micro Devices , Inc.

    Santa Clara, CA
    4 days ago
  • $124k - $195.5k

    ## AI Inference Performance Engineer - New College Grad 2026Applylocations: US, CA, Santa...  ...id: JR2014441We optimize and benchmark GenAI inference...  ...benchmark timelines, and lead a world-class team.**What...  ...cost-effective computing platform driving our success in this... 
    Platform

    NVIDIA Corporation

    Santa Clara, CA
    2 days ago
  •  ...A leading AI technology company in Sunnyvale, California, is seeking a skilled software engineer to optimize its AI cloud platform for model training and inference. In this role, you'll enhance deployment efficiency and ensure system reliability and scalability. The ideal... 
    Platform

    Cerebras

    Sunnyvale, CA
    5 days ago
  •  ...generation computing experiences—from AI and data centers, to PCs, gaming and...  ...ROLE We are seeking a Principal GenAI Inference Optimization Engineer to join our Models and Applications...  ...generative AI inference workloads on AMD GPU platforms. You will contribute to optimizing... 
    Platform

    Advanced Micro Devices , Inc.

    San Jose, CA
    4 days ago
  • $229.9k - $262.4k

     ...Overview Senior Lead AI Engineer (Gen AI Platform Services, Agentic AI) Overview...  ...training, large language model inference, similarity search,...  ...introduce state-of-the-art LLM optimization techniques to improve the...  ...: $229,900 - $262,400 for Sr. Lead AI Engineer New... 
    Platform
    Senior
    Full time
    Part time
    Local area

    Capital One

    San Jose, CA
    a month ago
  • $246.5k

     ...the #1 TV streaming platform in the U.S.,...  ...time multi-objective optimization across distributed...  ...Reinforcement Learning, AI, Control and...  ...Machine Learning and Inference Platform that...  ...architect, design, and lead the development of...  ...excited to mentor engineers, innovate at scale... 
    Platform
    Work at office
    Local area
    Remote work
    Monday to Thursday
    Flexible hours

    Roku

    San Jose, CA
    3 days ago
  •  ...AI Platform Engineer - Training & Inference Saviynt's AI-powered identity platform manages and governs human...  ...that protect and empower the world's leading brands, Fortune 500 companies and...  ...with cost-aware fallback between self-hosted SLMs and cloud LLMs • Build RL... 
    Platform

    Saviynt

    Milpitas, CA
    2 days ago
  •  ...A forward-thinking AI infrastructure company is seeking a Staff AI Runtime Engineer to lead the design and optimization of their AI compute platform. In this leadership role, you'll enhance AI training and inference capabilities. Successful candidates will have over 8... 
    Platform
    Senior

    FlexAI

    Santa Clara, CA
    4 days ago
  • $244.8k

     ...applied research in Generative AI and CV/Multimodal...  ...Multimodal Model Training and Inference Optimization Engineer with expertise in optimizing...  ...CapCut and Pico as well as platforms specific to the China market...  ...things with great people. We lead with curiosity, humility,... 
    Platform
    Senior
    Temporary work
    Local area

    ByteDance

    San Jose, CA
    1 day ago
  •  ...A leading technology company seeks an engineering leader to drive compiler innovation for AI applications. The successful candidate will enhance...  ...of data center GPU platforms and mentor senior staff....  ...include designing LLVM backend optimizations and collaborating with... 
    Platform

    Advanced Micro Devices , Inc.

    San Jose, CA
    4 days ago
  • NVIDIA Gruppe in Santa Clara, California is seeking AI Compiler Engineers to drive technological innovation within their compiler organization. The role involves working on kernel generation and optimization for next-generation NVIDIA GPUs and solving complex compilation... 

    NVIDIA Gruppe

    Santa Clara, CA
    3 days ago
  • $152k - $241.5k

     ...NVIDIA Gruppe is seeking a Senior Software Engineer – AI Inference in Santa Clara, California. This role involves enhancing open-source LLM serving optimizations and implementing high-performance runtime capabilities. Candidates should have 5+ years of experience in building... 
    Senior

    NVIDIA Gruppe

    Santa Clara, CA
    4 days ago
  • $184k - $287.5k

     ...unlimited potential of AI to define the next era...  ...High Performance AI Engineer to build groundbreaking...  ...doing: Design, build and optimize agentic AI systems for...  ...training, and inference/serving—and with model...  ...resource-constrained platforms. Deep expertise in GPU... 
    Platform
    Senior

    2100 NVIDIA USA

    Santa Clara, CA
    5 days ago

Do you want to receive more vacancies?

Subscribe and receive similar vacancies to Sr. Lead AI Engineer (Inference Optimization, FM hosting, AI Platform). Be the first to apply!