Sr. Lead AI Engineer (Inference Optimization, FM hosting, AI Platform)

$229.9k - $262.4k

Full-time

Capital One

Sr. Lead AI Engineer (Inference Optimization, FM hosting, AI Platform) Overview: At Capital One, we are creating responsible and reliable AI systems, changing banking for good. For years, Capital One has been an industry leader in using machine learning to create real-time, personalized customer experiences. Our investments in technology infrastructure and world-class talent — along with our deep experience in machine learning — position us to be at the forefront of enterprises leveraging AI. From informing customers about unusual charges to answering their questions in real time, our applications of AI & ML are bringing humanity and simplicity to banking. We are committed to continuing to build world-class applied science and engineering teams to deliver our industry leading capabilities with breakthrough product experiences and scalable, high-performance AI infrastructure. At Capital One, you will help bring the transformative power of emerging AI capabilities to reimagine how we serve our customers and businesses who have come to love the products and services we build. Team Description: The Intelligent Foundations and Experiences (IFX) team is at the center of bringing our vision for AI at Capital One to life. We work hand-in-hand with our partners across the company to advance the state of the art in science and AI engineering, and we build and deploy proprietary solutions that are central to our business and deliver value to millions of customers. Our AI models and platforms empower teams across Capital One to enhance their products with the transformative power of AI, in responsible and scalable ways for the highest leverage impact. In this role, you will: Partner with a cross-functional team of engineers, research scientists, technical program managers, and product managers to deliver AI-powered products that change how our associates work and how our customers interact with Capital One. Design, develop, test, deploy, and support AI software components including foundation model training, large language model inference, similarity search, guardrails, model evaluation, experimentation, governance, and observability, etc. Leverage a broad stack of Open Source and SaaS AI technologies such as AWS Ultraclusters, Huggingface, VectorDBs, Nemo Guardrails, PyTorch, and more. Invent and introduce state-of-the-art LLM optimization techniques to improve the performance — scalability, cost, latency, throughput — of large scale production AI systems. Contribute to the technical vision and the long term roadmap of foundational AI systems at Capital One. The Ideal Candidate: You love to build systems, take pride in the quality of your work, and also share our passion to do the right thing. You want to work on problems that will help change banking for good. Passion for staying abreast of the latest research, and an ability to intuitively understand scientific publications and judiciously apply novel techniques in production. You adapt quickly and thrive on bringing clarity to big, undefined problems. You love asking questions and digging deep to uncover the root of problems and can articulate your findings concisely with clarity. You have the courage to share new ideas even when they are unproven. You are deeply Technical. You possess a strong foundation in engineering and mathematics, and your expertise in hardware, software, and AI enable you to see and exploit optimization opportunities that others miss. You are a resilient trail blazer who can forge new paths to achieve business goals when the route is unknown. Basic Qualifications: Bachelor's degree in Computer Science, AI, Electrical Engineering, Computer Engineering, or related fields plus at least 6 years of experience developing AI and ML algorithms or technologies, or a Master's degree in Computer Science, AI, Electrical Engineering, Computer Engineering, or related fields plus at least 4 years of experience developing AI and ML algorithms or technologies At least 6 years of experience programming with Python, Go, Scala, or Java Preferred Qualifications: 7 years of experience deploying scalable and responsible AI solutions on cloud platforms (e.g. AWS, Google Cloud, Azure, or equivalent private cloud) Experience designing, developing, integrating, delivering, and supporting complex AI systems Demonstrated ability to lead and mentor an engineering team and influence cross-functional stakeholders Experience developing AI and ML algorithms or technologies (e.g. LLM Inference, Similarity Search and VectorDBs, Guardrails, Memory) using Python, C++, C#, Java, or Golang Experience developing and applying state-of-the-art techniques for optimizing training and inference software to improve hardware utilization, latency, throughput, and cost Passion for staying abreast of the latest AI research and AI systems, and judiciously apply novel techniques in production Excellent communication and presentation skills, with the ability to articulate complex AI concepts to peers Capital One will consider sponsoring a new qualified applicant for employment authorization for this position. The minimum and maximum full-time annual salaries for this role are listed below, by location. Please note that this salary information is solely for candidates hired to perform work within one of these locations, and refers to the amount Capital One is willing to pay at the time of this posting. Salaries for part-time roles will be prorated based upon the agreed upon number of hours to be regularly worked. Cambridge, MA: $229,900 - $262,400 for Sr. Lead AI Engineer McLean, VA: $229,900 - $262,400 for Sr. Lead AI Engineer New York, NY: $250,800 - $286,200 for Sr. Lead AI Engineer San Francisco, CA: $250,800 - $286,200 for Sr. Lead AI Engineer San Jose, CA: $250,800 - $286,200 for Sr. Lead AI Engineer Candidates hired to work in other locations will be subject to the pay range associated with that location, and the actual annualized salary amount offered to any candidate at the time of hire will be reflected solely in the candidate’s offer letter. This role is also eligible to earn performance based incentive compensation, which may include cash bonus(es) and/or long term incentives (LTI). Incentives could be discretionary or non discretionary depending on the plan. Capital One offers a comprehensive, competitive, and inclusive set of health, financial and other benefits that support your total well-being. Learn more at the Capital One Careers website. Eligibility varies based on full or part-time status, exempt or non-exempt status, and management level. This role is expected to accept applications for a minimum of 5 business days. No agencies please. Capital One is an equal opportunity employer (EOE, including disability/vet) committed to non-discrimination in compliance with applicable federal, state, and local laws. Capital One promotes a drug-free workplace. Capital One will consider for employment qualified applicants with a criminal history in a manner consistent with the requirements of applicable laws regarding criminal background inquiries, including, to the extent applicable, Article 23-A of the New York Correction Law; San Francisco, California Police Code Article 49, Sections 4901-4920; New York City’s Fair Chance Act; Philadelphia’s Fair Criminal Records Screening Act; and other applicable federal, state, and local laws and regulations regarding criminal background inquiries. If you have visited our website in search of information on employment opportunities or to apply for a position, and you require an accommodation, please contact Capital One Recruiting at View phone number on click.appcast.io or via email at View email address on click.appcast.io. All information you provide will be kept confidential and will be used only to the extent required to provide needed reasonable accommodations. For technical support or questions about Capital One's recruiting process, please send an email to View email address on click.appcast.io Capital One does not provide, endorse nor guarantee and is not liable for third-party products, services, educational tools or other information available through this site. Capital One Financial is made up of several different entities. Please note that any position posted in Canada is for Capital One Canada, any position posted in the United Kingdom is for Capital One Europe and any position posted in the Philippines is for Capital One Philippines Service Corp. (COPSSC). At Capital One, we’re building a leading information-based technology company. Still founder-led by Chairman and Chief Executive Officer Richard Fairbank, Capital One is on a mission to help our customers succeed by bringing ingenuity, simplicity, and humanity to banking. We measure our efforts by the success our customers enjoy and the advocacy they exhibit.

Apply

Vacancy posted 1 day ago

Similar jobs that could be interesting for youBased on the Sr. Lead AI Engineer (Inference Optimization, FM hosting, AI Platform) in San Francisco, CA vacancy

Lead AI Engineer (FM Hosting, LLM Inference)
$197.3k - $225.1k
...Lead AI Engineer (FM Hosting, LLM Inference) Overview At Capital One, we are creating responsible and... ...of customers. Our AI models and platforms empower teams across Capital One... ...and introduce state-of-the-art LLM optimization techniques to improve the performance...
Platform
Full time
Part time
Local area
Capital One Financial Corp
San Francisco, CA
3 days ago
Sr. Lead AI Engineer
$209k - $238.5k
...Sr. Lead AI Engineer Overview: At Capital One, we are creating responsible... .... Our AI models and platforms empower teams across Capital... ...training, large language model inference, similarity search,... ...introduce state-of-the-art LLM optimization techniques to improve the performance...
Platform
Senior
Full time
Part time
Local area
Capital One
San Francisco, CA
5 days ago
Sr. Lead AI Engineer (GenAI Platform)
...responsible and reliable AI systems, changing... ...science and engineering teams to deliver our industry leading capabilities with... ...Our AI models and platforms empower teams across... ...language model inference, similarity search... ...state‑of‑the‑art LLM optimization techniques to...
Platform
Senior
Local area
Capital One National Association
San Francisco, CA
22 hours ago
Lead AI Engineer
...building a reasoning engine that must navigate non... ...our demand intelligence platform doesn't just call APIs... ...multi-step inference over messy, high-stakes... ...We're looking for a Lead AI Engineer who can own the... ...new approaches versus optimize existing ones. Agentic...
Platform
Shift work
Hilbert's AI
San Francisco, CA
18 days ago
Staff + Sr. Software Engineer, Cloud Inference Launch Engineering
$320k
...interpretable, and steerable AI systems. We want AI... ...researchers, engineers, policy experts, and... ...Role The Cloud Inference team scales and optimizes Claude to serve the... ...Claude on each cloud platform, from API... ...collaborative group, and we host frequent research discussions...
Platform
Senior
Work at office
Visa sponsorship
Flexible hours
Anthropic
San Francisco, CA
2 days ago
Lead AI Engineer
$197.3k - $225.1k
...Lead AI Engineer At Capital One, we are creating responsible and reliable... .... Our AI models and platforms empower teams across Capital... ...training, large language model inference, similarity search,... ...introduce state-of-the-art LLM optimization techniques to improve the performance...
Platform
Full time
Part time
Local area
Capital One
San Francisco, CA
1 day ago
Senior AI Model Serving Engineer — Low-Latency Inference
A leading data and AI company in San Francisco is seeking a Senior Engineer to enhance their Model Serving platform. This role requires expertise in building large-scale distributed systems... ...and collaboration across teams to optimize performance and reliability. Ideal...
Platform
Senior
Jobleads-US
San Francisco, CA
22 hours ago
Lead AI Engineer (AI Foundations, LLM Core and Agentic AI)
$197.3k - $225.1k
...Lead AI Engineer (AI Foundations, LLM Core and Agentic AI) Overview... ...customers. Our AI models and platforms empower teams across Capital... ..., large language model inference, similarity search, guardrails... ...introduce state-of-the-art LLM optimization techniques to improve the...
Platform
Full time
Part time
Local area
Capital One
San Francisco, CA
22 hours ago
Senior AI Kernel Engineer for Edge Inference
Quadric in San Francisco is looking for an experienced AI Kernel Engineer to develop and optimize AI kernels for their innovative neural processing platform. This role involves enhancing performance for various hardware configurations and providing technical support to...
Platform
Senior
Quadric
San Francisco, CA
2 days ago
Senior Staff AI Engineer
...SoFi's Senior Staff AI Engineer is a hands-on AI... ...our next-generation AI platform, particularly focusing... ...failovers Deep Model Optimization: Pioneer and institutionalize... ...platform designed to host internally fine-tuned... ..., low-latency inference across diverse hardware...
Platform
Senior
Remote work
SoFi
San Francisco, CA
1 day ago
Sr Director, AI Engineering
$286.2k - $326.7k
...responsible and reliable AI systems, changing... ...science and engineering teams to deliver our industry leading capabilities with... ...Our AI models and platforms empower teams across... ...‑language‑model inference, similarity search... ...state‑of‑the‑art LLM optimization techniques to...
Platform
Senior
Local area
Hobbsnews
San Francisco, CA
13 hours ago
Lead AI Engineer - Scalable Backend & Agentic Systems
...Experienced backend engineer. 5-7+ years of... ...generative & agentic AI in production. You’... ...building shared AI/ML platforms or services used by... ...job involves As a Lead AI Engineer in the... ...Scale AI data and inference pipelines. Develop... ...Implement services that host and orchestrate AI...
Platform
Work at office
Local area
Klaviyo
San Francisco, CA
22 hours ago
Applied AI Inference Engineer
...Baseten powers mission‑critical inference for the world's most dynamic AI companies, like Cursor, Notion, OpenEvidence... .... Join us and help build the platform engineers turn to to ship AI products. THE... ...and outcomes for our customers. Optimize and enhance AI/ML projects,...
Platform
Work experience placement
Flexible hours
Baseten
San Francisco, CA
2 days ago
AI Platform Engineer, Training and Inference
AI Platform Engineer - Training & Inference Saviynt's AI-powered identity platform manages and governs human and... ...that protect and empower the world's leading brands, Fortune 500 companies and... ...with cost-aware fallback between self-hosted SLMs and cloud LLMs. Build RL...
Platform
Medium
San Francisco, CA
13 hours ago
Lead AI Engineer
$275k - $350k
...Lead AI Engineer Title of Role: Lead AI Engineer Location: San Francisco, hybrid... ...Analyze system performance and implement optimizations to ensure high availability and... ...Preferred Experience with cloud platforms and services that enhance AI capabilities...
Platform
Work at office
Recruiting from Scratch
San Francisco, CA
4 days ago
Senior Engineer, Inference Data Plane
$139.2k - $174k
...expanding its AI Infrastructure... ...seeking a Senior Engineer 2 to play a key... ...AI workloads— inference, training, fine... ...large Gen AI model hosting infrastructure.... ...our global platform remains simple,... ...Operational Excellence: Lead the operational... ...enable fleet optimization and help evolve...
Platform
Senior
Local area
Remote work
Worldwide
Flexible hours
DigitalOcean
San Francisco, CA
2 days ago
Lead AI Engineer
$167.4k - $205.1k
...design and build intelligent AI agent systems, LLM-based... ...hands-on experience working as Sr AI/ML Developer with at... ...in LLM fine-tuning, prompt engineering, and model optimization Demonstrated proficiency... ...Experience with cloud AI platforms (Azure OpenAI, AWS Bedrock,...
Platform
Daily paid
JLL
San Francisco, CA
1 day ago
Scientific Lead, Generative AI Engineer, Applied Intelligence for Discovery
$181.5k - $283.8k
...something unprecedented, an AI foundation that will... ...delivery and core platform development. AI4D's mission... ...As a Generative AI Engineer, you will design, build... ...Design, build, and optimize RAG pipelines over internal... ...Women's Initiative for Leading at Lilly (WILL), enAble...
Platform
Full time
Flexible hours
Eli Lilly
San Francisco, CA
1 day ago
Senior/Lead AI Software Engineer, Agentforce for Supply Chain
$13 per hour
...Salesforce****Salesforce is the #1 AI CRM, where humans with agents... ...chain with an AI-powered platform for designing, automating, and... ...managed by the same founders and engineers who built the original company... ...You’ll Do****## As a Senior/Lead AI Software Engineer, for the...
Platform
Senior
Immediate start
Salesforce, Inc.
San Francisco, CA
13 hours ago
Senior/Staff AI Engineer
$150k - $250k
...Senior/Staff AI Engineer Job Locations US-CA-San Francisco... ...-world model serving and inference. This is the role for engineers... ...What you'll do Build and optimize LLM serving and inference... ...performance systems, storage platforms, or adjacent distributed systems...
Platform
Senior
Full time
Remote work
DataDirect Networks Inc
San Francisco, CA
4 days ago
Lead, AI Engineering
$203.5k
...personally. About Bain AI, Insights & Solutions... ...design, architecture and engineering, and client... ...and traditional data platforms), covering indexing strategies... ...tuning and preference optimization approaches). Write clean... ...stakeholders; lead working sessions, present...
Platform
Full time
Temporary work
Apprenticeship
Work at office
Local area
Work from home
Home office
3 days per week
Bain & Company
San Francisco, CA
22 hours ago
Senior AI Serving Engineer, Backend
...Sciforium AI Infrastructure Role Sciforium... ...high-efficiency serving platform. Backed by multi-million... ...-on support from AMD engineers the team is scaling... ...how large AI models are optimized and deployed at scale,... ...Monitoring, and distributed inference features....
Platform
Senior
Work at office
Flexible hours
Sciforium
San Francisco, CA
3 days ago
VCA AI Agent Engineering Sr. Manager
...Job Description The Sr. Manager is responsible... ...and integration of AI agents, ensuring seamless... ...and ongoing performance optimization. This role works... ...Agent Implementation: Lead the hands-on development... ...enterprise tools to enhance platform capabilities. Code Deployment...
Platform
Senior
Work experience placement
Work at office
Local area
Visa
San Francisco, CA
3 days ago
Senior AI Engineer
...re building an agentic AI caregiver advocate... ...over time. The AI engineering challenge: build an autonomous... ...closely with Product, Platform Engineering, and... ...engineering and model optimization Design prompt architectures... ...evolves. Optimize inference costs our unit economics...
Platform
Senior
Immediate start
Remote work
Flexible hours
Citizen Health
San Francisco, CA
4 days ago
Sr. AI Engineer, Time-Series Signal Processing
...Sr. AI Engineer, Time-Series Signal Processing Bright.... ...intelligent automation. Our platform processes visual,... ...Signal Processing to lead the development of AI... ...experimentation and optimization of signal-processing... ...processing, and real-time inference, and incorporate them...
Platform
Senior
Remote job
Immediate start
BrightAI Corporation
San Francisco, CA
more than 2 months ago
Lead AI Engineer, Data Solutions
$172.5k
...Category Software Engineering Job Details... ...Salesforce is the #1 AI CRM, where humans... ...at the company leading workforce transformation... ...evaluate, optimize, and improve agent... ..., evaluation, and inference Transform raw interaction... ...Collaborate with platform teams while owning...
Platform
Salesforce
San Francisco, CA
6 days ago
Lead Sr Staff Software Engineer - Front End Fundamentals
$166.2k - $304.7k
...technology company and the world's leading independent platform for digital advertising,... ...Lead Senior Staff Software Engineers are end-to-end owners who... ...Continuously analyze and optimize team efficiency, removing... ...microservices, Kubernetes, and AI/LLM integration into...
Platform
Senior
Full time
Temporary work
Local area
Worldwide
The Trade Desk
San Francisco, CA
3 days ago
Staff/Senior Software AI engineer: Predictive Analytics
$150k - $200k
...skyrocketing, driven by AI factories, electric vehicles... ...top of Amperesand's IoT platform. This is a hands... ..., forecasting, and optimization signals. The role requires... ...data ingestion, feature engineering, model training, inference, deployment, and monitoring...
Platform
Senior
Temporary work
Work experience placement
Local area
Shift work
Amperesand
San Francisco, CA
1 day ago
Senior Staff AI Engineer
$207k - $290k
...Description About JazzX AI: Vision:... ...seeking an experienced AI Engineer with deep expertise in... ...generation enterprise AGI platform. You will lead the design, development, and optimization of cutting-edge RL... ...techniques , including inference-time search, chain-of-...
Platform
Senior
Worldwide
Flexible hours
JazzX AI
San Francisco, CA
14 days ago
Logistics Optimization - Lead Senior/Staff Software Engineer
...By using the same technologies that power leading marketplace and last-mile platforms, we deliver care where people are, especially... ...the Role We're hiring a Senior Software Engineer to join our Logistics Optimization team, where we tackle some of the hardest algorithmic...
Platform
Senior
Temporary work
Work at office
Flexible hours
Sprinter Health
San Francisco, CA
3 days ago

Do you want to receive more vacancies?

Subscribe and receive similar vacancies to Sr. Lead AI Engineer (Inference Optimization, FM hosting, AI Platform). Be the first to apply!