Senior Lead AI Engineer (FM Hosting, LLM Inference)

$229.9k - $262.4k

Capital One

Senior Lead AI Engineer (FM Hosting, LLM Inference) Overview: At Capital One, we are creating responsible and reliable AI systems, changing banking for good. For years, Capital One has been an industry leader in using machine learning to create real-time, personalized customer experiences. Our investments in technology infrastructure and world-class talent — along with our deep experience in machine learning — position us to be at the forefront of enterprises leveraging AI. From informing customers about unusual charges to answering their questions in real time, our applications of AI & ML are bringing humanity and simplicity to banking. We are committed to continuing to build world-class applied science and engineering teams to deliver our industry leading capabilities with breakthrough product experiences and scalable, high-performance AI infrastructure. At Capital One, you will help bring the transformative power of emerging AI capabilities to reimagine how we serve our customers and businesses who have come to love the products and services we build. Team Description: The Intelligent Foundations and Experiences (IFX) team is at the center of bringing our vision for AI at Capital One to life. We work hand-in-hand with our partners across the company to advance the state of the art in science and AI engineering, and we build and deploy proprietary solutions that are central to our business and deliver value to millions of customers. Our AI models and platforms empower teams across Capital One to enhance their products with the transformative power of AI, in responsible and scalable ways for the highest leverage impact. In this role, you will: * Partner with a cross-functional team of engineers, research scientists, technical program managers, and product managers to deliver AI-powered products that change how our associates work and how our customers interact with Capital One. * Design, develop, test, deploy, and support AI software components including foundation model training, large language model inference, similarity search, guardrails, model evaluation, experimentation, governance, and observability, etc. * Leverage a broad stack of Open Source and SaaS AI technologies such as AWS Ultraclusters, Huggingface, VectorDBs, Nemo Guardrails, PyTorch, and more. * Invent and introduce state-of-the-art LLM optimization techniques to improve the performance — scalability, cost, latency, throughput — of large scale production AI systems. * Contribute to the technical vision and the long term roadmap of foundational AI systems at Capital One. The Ideal Candidate: * You love to build systems, take pride in the quality of your work, and also share our passion to do the right thing. You want to work on problems that will help change banking for good. * Passion for staying abreast of the latest research, and an ability to intuitively understand scientific publications and judiciously apply novel techniques in production. * You adapt quickly and thrive on bringing clarity to big, undefined problems. You love asking questions and digging deep to uncover the root of problems and can articulate your findings concisely with clarity. You have the courage to share new ideas even when they are unproven. * You are deeply Technical. You possess a strong foundation in engineering and mathematics, and your expertise in hardware, software, and AI enable you to see and exploit optimization opportunities that others miss. * You are a resilient trail blazer who can forge new paths to achieve business goals when the route is unknown. Basic Qualifications: * Bachelor's degree in Computer Science, AI, Electrical Engineering, Computer Engineering, or related fields plus at least 6 years of experience developing AI and ML algorithms or technologies, or a Master's degree in Computer Science, AI, Electrical Engineering, Computer Engineering, or related fields plus at least 4 years of experience developing AI and ML algorithms or technologies * At least 6 years of experience programming with Python, Go, Scala, or Java Preferred Qualifications: * 7 years of experience deploying scalable and responsible AI solutions on cloud platforms (e.g. AWS, Google Cloud, Azure, or equivalent private cloud) * Experience designing, developing, integrating, delivering, and supporting complex AI systems * Demonstrated ability to lead and mentor an engineering team and influence cross-functional stakeholders * Experience developing AI and ML algorithms or technologies (e.g. LLM Inference, Similarity Search and VectorDBs, Guardrails, Memory) using Python, C++, C#, Java, or Golang * Experience developing and applying state-of-the-art techniques for optimizing training and inference software to improve hardware utilization, latency, throughput, and cost * Passion for staying abreast of the latest AI research and AI systems, and judiciously apply novel techniques in production * Excellent communication and presentation skills, with the ability to articulate complex AI concepts to peers Capital One will consider sponsoring a new qualified applicant for employment authorization for this position. The minimum and maximum full-time annual salaries for this role are listed below, by location. Please note that this salary information is solely for candidates hired to perform work within one of these locations, and refers to the amount Capital One is willing to pay at the time of this posting. Salaries for part-time roles will be prorated based upon the agreed upon number of hours to be regularly worked. Cambridge, MA: $229,900 - $262,400 for Sr. Lead AI Engineer McLean, VA: $229,900 - $262,400 for Sr. Lead AI Engineer New York, NY: $250,800 - $286,200 for Sr. Lead AI Engineer San Jose, CA: $250,800 - $286,200 for Sr. Lead AI Engineer Candidates hired to work in other locations will be subject to the pay range associated with that location, and the actual annualized salary amount offered to any candidate at the time of hire will be reflected solely in the candidate's offer letter. This role is also eligible to earn performance based incentive compensation, which may include cash bonus(es) and/or long term incentives (LTI). Incentives could be discretionary or non discretionary depending on the plan. Capital One offers a comprehensive, competitive, and inclusive set of health, financial and other benefits that support your total well-being. Learn more at the Capital One Careers website . Eligibility varies based on full or part-time status, exempt or non-exempt status, and management level. This role is expected to accept applications for a minimum of 5 business days.No agencies please. Capital One is an equal opportunity employer (EOE, including disability/vet) committed to non-discrimination in compliance with applicable federal, state, and local laws. Capital One promotes a drug-free workplace. Capital One will consider for employment qualified applicants with a criminal history in a manner consistent with the requirements of applicable laws regarding criminal background inquiries, including, to the extent applicable, Article 23-A of the New York Correction Law; San Francisco, California Police Code Article 49, Sections 4901-4920; New York City's Fair Chance Act; Philadelphia's Fair Criminal Records Screening Act; and other applicable federal, state, and local laws and regulations regarding criminal background inquiries. If you have visited our website in search of information on employment opportunities or to apply for a position, and you require an accommodation, please contact Capital One Recruiting at View phone number on click.appcast.io or via email at View email address on click.appcast.io . All information you provide will be kept confidential and will be used only to the extent required to provide needed reasonable accommodations. For technical support or questions about Capital One's recruiting process, please send an email to View email address on click.appcast.io Capital One does not provide, endorse nor guarantee and is not liable for third-party products, services, educational tools or other information available through this site. Capital One Financial is made up of several different entities. Please note that any position posted in Canada is for Capital One Canada, any position posted in the United Kingdom is for Capital One Europe and any position posted in the Philippines is for Capital One Philippines Service Corp. (COPSSC).

Apply

Vacancy posted 21 hours ago

Similar jobs that could be interesting for youBased on the Senior Lead AI Engineer (FM Hosting, LLM Inference) in San Jose, CA vacancy

Sr. Lead AI Engineer (Inference Optimization, FM hosting, AI Platform)
$229.9k - $262.4k
...Overview Sr. Lead AI Engineer (Inference Optimization, FM hosting, AI Platform) Overview: At Capital One, we are creating responsible and reliable... ...more. ~ Invent and introduce state-of-the-art LLM optimization techniques to improve the performance —...
Senior
Full time
Part time
Local area
Capital One
San Jose, CA
a month ago
Senior Lead AI Engineer
$209k - $238.5k
...Overview Senior Lead AI Engineer Overview: At Capital One, we are creating responsible... ...model training, large language model inference, similarity search, guardrails, model... ...Invent and introduce state-of-the-art LLM optimization techniques to improve the...
Senior
Full time
Part time
Local area
Capital One
San Jose, CA
7 days ago
Lead AI Engineer (AI Foundations, LLM Core and Agentic AI)
$197.3k - $225.1k
...Overview Lead AI Engineer (AI Foundations, LLM Core and Agentic AI) Overview At Capital One, we are creating responsible and reliable AI systems... ...foundation model training, large language model inference, similarity search, guardrails, model evaluation, experimentation...
Suggested
Full time
Part time
Local area
Capital One
San Jose, CA
a month ago
Senior Lead AI Engineer (Gen AI Platform Services, Agentic AI)
$229.9k - $262.4k
...Overview Senior Lead AI Engineer (Gen AI Platform Services, Agentic AI) Overview: At Capital... ...model training, large language model inference, similarity search, guardrails, model... ...Invent and introduce state-of-the-art LLM optimization techniques to improve the...
Senior
Full time
Part time
Local area
Capital One
San Jose, CA
more than 2 months ago
Lead Machine Learning Engineer - Agentic Models, LLM, RAG, GenAI
Lead the team in: research, design, development, and deployment of advanced AI agents and agentic systems. Architect and implement complex multi-agent systems... ...-tuning strategies (QLORA, DPO) and inference optimization (vLLM, TensorRT-LLM). Research experience in agentic AI...
Suggested
Full time
Work experience placement
Eightfold
Santa Clara, CA
8 hours ago
AI Inference Engineer - Speech (San Jose)
$151.8k - $332.2k
...What you can expect We are looking for an AI Inference Engineer with a solid background in speech recognition and model inference. In this role... ...fields with 3+ years of experience in speech recognition, speech-llm or AI model inference.Display knowledge in deep learning and...
Full time
Part time
Work at office
Remote work
Zoom
San Jose, CA
17 hours ago
Principal Software Engineer - Large-Scale LLM Memory and Storage Systems
$272k - $425.5k
Principal Software Engineer – Large-Scale LLM Memory and... ...throughput, low-latency inference framework for serving generative AI and reasoning... ...GPU memory, pinned host memory, RDMA-accessible... ...with leading LLM serving engines... ...memory pools.* Mentor senior and junior...
Local area
Remote work
NVIDIA
Santa Clara, CA
8 hours ago
Sr. AI / Embedded ML Engineer
$150k - $225k
...overall quality of life. As a Senior AI / Embedded Engineer, you will be responsible... ...software, and hybrid LLM integration to deliver production... ...to reduce model size and inference latency ◦ Use frameworks... ...inference and cloud or edge-hosted LLM components ◦ Evaluate trade...
Senior
Full time
Work at office
Immediate start
Visa sponsorship
Night shift
E-Space
Saratoga, CA
8 hours ago
Senior Software Engineer, Quantized Inference
$152k - $241.5k
We are now looking for a Senior Software Engineer for Quantized Inference! NVIDIA is seeking software engineers to accelerate... ...in inference engines (vLLM, TRT‑LLM, SGLang). The candidate will... ...concise, well‑tested code; fluent with AI‑assisted tooling Experience with ML...
Senior
NVIDIA
Santa Clara, CA
4 days ago
Senior Manager, AI Engineering (People Leader) (Gen AI Platform Services)
...responsible and reliable AI systems, changing banking... ...class applied science and engineering teams to deliver our industry‑leading capabilities with... ...training, large language model inference, similarity search, guardrails... ...state‑of‑the‑art LLM optimization techniques to...
Senior
Full time
Local area
SwiftCruit
San Jose, CA
8 hours ago
AI Platform Engineer, Training and Inference
$240k - $260k
...Description Job Description AI Platform Engineer – Training & Inference Saviynt's AI-powered... ...and empower the world's leading brands, Fortune 500... ...H100s, the multi-engine LLM inference mesh (vLLM, SGLang... ...fallback between self-hosted SLMs and cloud LLMs • Build...
Saviynt
Milpitas, CA
12 days ago
Senior AI Software Engineer, Kernel Libraries (Santa Clara)
$184k - $287.5k
...We're looking for outstanding AI systems engineers to develop groundbreaking technologies in the inference systems software stack! We build innovative AI systems software... ...efficient attention kernel implementations, new LLM inference runtimes components, and kernel code...
Senior
Full time
Part time
Nvidia
Santa Clara, CA
12 hours ago
Senior AI Workflow Engineer (Santa Clara)
$184k - $287.5k
...the unlimited potential of AI to define the next era of computing... ...the world.NVIDIA is hiring senior software engineers in its Infrastructure,... ...versed with Large Language Mode (LLM), Machine Learning (ML),... ...(SQL/NoSQL), and deployment/hosting (e.g., AWS, Azure, GCP).Experience...
Senior
Full time
Part time
Work experience placement
Nvidia
Santa Clara, CA
12 hours ago
Senior AI Platform Engineer (San Jose)
$151.8k - $265.35k
...content effortlessly. The AI for Engineering team builds a scalable... ..., memory systems, inference services, data flows,... ...Agentic AI systems and LLM-native architectures.Design... ...AI systems.Mentor senior engineers in modern AI... .... Adobe’s industry-leading offerings including Adobe...
Senior
Full time
Temporary work
Part time
Local area
Worldwide
Adobe Systems
San Jose, CA
12 hours ago
Lead Machine Learning Engineer - Agentic Models, LLM, RAG, GenAI (Santa Clara)
$193.13k - $257.5k
...About Eightfold.ai:Eightfold is a global leader in AI-... ..., and high standards. Our engineers, product leaders, and go-to... ...languages.Responsibilities:Lead the team in: research,... ...strategies (QLORA, DPO) and inference optimization (vLLM, TensorRT-LLM).Desired Skills & Experience...
Part time
Work experience placement
Work at office
Remote work
Flexible hours
3 days per week
Eightfold
Santa Clara, CA
12 hours ago
Principal LLM Inference Engineer (Santa Clara)
...potential of generative AI to power the transformation... ...Group sits at the leading edge of what’s possible with LLM inference on heterogeneous hardware... ...an applied research and engineering team that moves fast, ships... ...real hardware.• Small, senior team with high autonomy...
Part time
d-Matrix
Santa Clara, CA
12 hours ago
Senior Solutions Architect, AI Performance Engineering (Santa Clara)
$184k - $287.5k
...Architect with a performance engineering background who can help... ...accelerate Physical AI workloads using NVIDIA'... ...model training and inference.Effective verbal/written... ...least one of these areas: LLM and HPC. Having... ...ability to build modules and lead engineering approaches...
Senior
Full time
Part time
Remote work
Nvidia
Santa Clara, CA
12 hours ago
Senior Software Engineer, AI Inference Systems (Santa Clara)
$184k - $287.5k
...highly skilled and motivated software engineers to join us and build AI inference systems that serve large-scale... ...NVIDIA’s submissions to the industry-leading MLPerf Inference benchmarking suite... ...crowdExperience building and optimizing LLM inference engines (e.g., vLLM,...
Senior
Full time
Part time
Nvidia
Santa Clara, CA
12 hours ago
Senior / Principal AI Engineer for Business Intelligence (7063)
$123.24k - $200k
Overview Of Role As a Sr./Principal AI Engineer within TSMC's Artificial... ...environment. Responsibilities Lead System Architecture: Own the... ...systems leveraging frontier LLM models, including the design... ...to both engineering teams and senior management audiences. Education...
Senior
Work at office
TSMC
San Jose, CA
21 hours ago
Lead AI Engineer
$197.3k - $225.1k
...Overview Lead AI Engineer Overview: At Capital One, we are creating responsible and... ...foundation model training, large language model inference, similarity search, guardrails, model... ...Invent and introduce state-of-the-art LLM optimization techniques to improve the...
Full time
Part time
Local area
Capital One
San Jose, CA
6 days ago
Senior AI Engineer
$159.5k - $271.2k
...expert teams of physicists, engineers, data scientists and... ...with the world's leading technology providers to... ...development engineers and senior product technology process... ...and motivated Senior AI Engineer with experience... ...~ Experience with LLM pre-training is optional...
Senior
Minimum wage
Work experience placement
Flexible hours
KLA
Milpitas, CA
2 days ago
AI/ML Enablement Lead Engineer (Cupertino)
$148.5k - $174.7k
...location three (3) or more days per week.U.S. Bank is seeking an AI/ML Enablement Lead Engineer to help drive the adoption, delivery, and scaling of... ...AI-enabled software solutions.Experience implementing LLM, RAG, Agentic AI, GenAI, or other modern AI solutions in production...
Full time
Part time
For contractors
Local area
3 days per week
US Bank
Cupertino, CA
12 hours ago
Senior Account Manager, Host Networking Segment Sales (Santa Clara)
$248k - $396.75k
...leader to grow NVIDIA's Host Networking business for AI infrastructure through AI... .... This Segment Sales Lead will drive strategic direction... ...product, sales, engineering, architecture, marketing... ...computing, generative AI, LLM training and inference, Network SerDes, in-network...
Senior
Full time
Part time
Nvidia
Santa Clara, CA
17 hours ago
Senior Staff / Principal Machine Learning Scientist, AI Inference & Optimization (Santa Clara)
$182.5k - $260.5k
...have built the market-leading cloud security... ...Positions are available at Senior Staff and above.... ..., you own the inference and optimization layer that makes AI in agentic workflows... ...systems and backend engineers to ship capabilities... .../SGLang, TensorRT-LLM, ONNX Runtime, llama...
Senior
Part time
Work at office
Netskope
Santa Clara, CA
12 hours ago
Inference Software Engineer
$2,000 per month
...About Etched Etched is building AI chips that are hard-coded for... ...Job Summary Etched’s Inference SW team enables optimal mapping... ...requests across multiple chips, hosts and racks. We are seeking a highly skilled and motivated engineer to join our team as we work towards...
Full time
Work at office
Relocation package
Etched
San Jose, CA
8 hours ago
Senior Staff AI Engineer, Edge AI (Sunnyvale)
$227k - $300k
...the transformation to AI-enabled software-defined... .... That’s why leading OEMs trust Sonatus to... ...are looking for a great Senior Staff AI Engineer to join our seasoned AI... ...(e.g., Transformers, LLM, CNN, LSTM, Trees) to... ...modern C++ (C++14/17 for inference).Deep proficiency with...
Senior
Part time
Work at office
Worldwide
Flexible hours
Shift work
3 days per week
Sonatus
Sunnyvale, CA
12 hours ago
Lead Machine Learning Inference Engineer, Advertising (San Jose)
$246.5k
...Learning, Reinforcement Learning, AI, Control and Optimization... ...this is our Machine Learning and Inference Platform that powers the... ...you will architect, design, and lead the development of a SOTA Inference... ...- someone excited to mentor engineers, innovate at scale, and shape...
Part time
Work at office
Local area
Remote work
Monday to Thursday
Flexible hours
Roku
San Jose, CA
17 hours ago
Senior Customer Facing Applied AI Engineer (San Jose)
$139k - $257.55k
...newly formed Forward Deployment Engineering (FDE) team within Adobe’s... ...We are a team of world-class AI and traditional engineers dedicated... ...+ years working with ML/AI or LLM-based applications.An... ...experiences. Adobe’s industry-leading offerings including Adobe Acrobat...
Senior
Full time
Temporary work
Part time
Local area
Worldwide
Adobe Systems
San Jose, CA
12 hours ago
Sr. Forward Deployed AI Engineer (Santa Clara)
...generation computing experiences—from AI and data centers, to PCs,... ...a Forward Deployed Research Engineer to build, evaluate, and deploy... ...-grade AI applications, LLM agents, and engineering automation... ...Agent EvaluationAI Training or Inference InfrastructureDeep understanding...
Senior
Part time
AMD
Santa Clara, CA
12 hours ago
Senior Applied AI Engineer- Creative Systems & Brand Intelligence, Adobe Express (San Jose)
...exploring the next generation of AI-native creative systems that... ...looking for forward-thinking engineers who are excited to explore ambiguous... ....Design and integrate LLM-powered and multimodal AI workflows... ...experiences. Adobe’s industry-leading offerings including Adobe...
Senior
Part time
Worldwide
Adobe Systems
San Jose, CA
12 hours ago

Do you want to receive more vacancies?

Subscribe and receive similar vacancies to Senior Lead AI Engineer (FM Hosting, LLM Inference). Be the first to apply!