Sr. Lead AI Engineer (Inference Optimization, FM hosting, AI Platform)
$229.9k - $262.4kCapital One Financial Corp
Sr. Lead AI Engineer (Inference Optimization, FM hosting, AI Platform)
Overview: At Capital One, we are creating responsible and reliable AI systems, changing banking for good. For years, Capital One has been an industry leader in using machine learning to create real-time, personalized customer experiences. Our investments in technology infrastructure and world-class talent - along with our deep experience in machine learning - position us to be at the forefront of enterprises leveraging AI. From informing customers about unusual charges to answering their questions in real time, our applications of AI & ML are bringing humanity and simplicity to banking. We are committed to continuing to build world-class applied science and engineering teams to deliver our industry leading capabilities with breakthrough product experiences and scalable, high-performance AI infrastructure. At Capital One, you will help bring the transformative power of emerging AI capabilities to reimagine how we serve our customers and businesses who have come to love the products and services we build. Team Description: The Intelligent Foundations and Experiences (IFX) team is at the center of bringing our vision for AI at Capital One to life. We work hand-in-hand with our partners across the company to advance the state of the art in science and AI engineering, and we build and deploy proprietary solutions that are central to our business and deliver value to millions of customers. Our AI models and platforms empower teams across Capital One to enhance their products with the transformative power of AI, in responsible and scalable ways for the highest leverage impact. In this role, you will:- Partner with a cross-functional team of engineers, research scientists, technical program managers, and product managers to deliver AI-powered products that change how our associates work and how our customers interact with Capital One.
- Design, develop, test, deploy, and support AI software components including foundation model training, large language model inference, similarity search, guardrails, model evaluation, experimentation, governance, and observability, etc.
- Leverage a broad stack of Open Source and SaaS AI technologies such as AWS Ultraclusters, Huggingface, VectorDBs, Nemo Guardrails, PyTorch, and more.
- Invent and introduce state-of-the-art LLM optimization techniques to improve the performance - scalability, cost, latency, throughput - of large scale production AI systems.
- Contribute to the technical vision and the long term roadmap of foundational AI systems at Capital One.
- You love to build systems, take pride in the quality of your work, and also share our passion to do the right thing. You want to work on problems that will help change banking for good.
- Passion for staying abreast of the latest research, and an ability to intuitively understand scientific publications and judiciously apply novel techniques in production.
- You adapt quickly and thrive on bringing clarity to big, undefined problems. You love asking questions and digging deep to uncover the root of problems and can articulate your findings concisely with clarity. You have the courage to share new ideas even when they are unproven.
- You are deeply Technical. You possess a strong foundation in engineering and mathematics, and your expertise in hardware, software, and AI enable you to see and exploit optimization opportunities that others miss.
- You are a resilient trail blazer who can forge new paths to achieve business goals when the route is unknown.
- Bachelor's degree in Computer Science, AI, Electrical Engineering, Computer Engineering, or related fields plus at least 6 years of experience developing AI and ML algorithms or technologies, or a Master's degree in Computer Science, AI, Electrical Engineering, Computer Engineering, or related fields plus at least 4 years of experience developing AI and ML algorithms or technologies
- At least 6 years of experience programming with Python, Go, Scala, or Java
- 7 years of experience deploying scalable and responsible AI solutions on cloud platforms (e.g. AWS, Google Cloud, Azure, or equivalent private cloud)
- Experience designing, developing, integrating, delivering, and supporting complex AI systems
- Demonstrated ability to lead and mentor an engineering team and influence cross-functional stakeholders
- Experience developing AI and ML algorithms or technologies (e.g. LLM Inference, Similarity Search and VectorDBs, Guardrails, Memory) using Python, C++, C#, Java, or Golang
- Experience developing and applying state-of-the-art techniques for optimizing training and inference software to improve hardware utilization, latency, throughput, and cost
- Passion for staying abreast of the latest AI research and AI systems, and judiciously apply novel techniques in production
- Excellent communication and presentation skills, with the ability to articulate complex AI concepts to peers
Vacancy posted 2 days ago
Similar jobs that could be interesting for youBased on the Sr. Lead AI Engineer (Inference Optimization, FM hosting, AI Platform) in San Francisco, CA vacancy
$197.3k - $225.1k
...Lead AI Engineer (FM Hosting, LLM Inference) Overview At Capital One, we are creating responsible and... ...of customers. Our AI models and platforms empower teams across Capital One... ...and introduce state-of-the-art LLM optimization techniques to improve the performance...PlatformFull timePart timeLocal area$209k - $238.5k
...Sr. Lead AI Engineer Overview: At Capital One, we are creating responsible... .... Our AI models and platforms empower teams across Capital... ...training, large language model inference, similarity search,... ...introduce state-of-the-art LLM optimization techniques to improve the performance...PlatformSeniorFull timePart timeLocal area- ...responsible and reliable AI systems, changing... ...science and engineering teams to deliver our industry leading capabilities with... ...Our AI models and platforms empower teams across... ...language model inference, similarity search... ...state‑of‑the‑art LLM optimization techniques to...PlatformSeniorLocal area
$225.4k - $257.2k
...responsible and reliable AI systems,... ...science and engineering teams to deliver our industry leading capabilities with... ...AI models and platforms empower teams... ...language model inference, similarity search... ...of‑the‑art LLM optimization techniques to... ...- $257,200 for Sr. Lead AI Engineer...PlatformSeniorFull timePart timeLocal area$229.9k - $262.4k
Senior Lead AI Engineer (Gen AI Platform Services, Agentic AI) Overview At Capital One, we are creating... ...model training, large language model inference, similarity search, guardrails,... ...more. Invent state‑of‑the‑art LLM optimization techniques to improve performance,...PlatformSeniorLocal area- ...building a reasoning engine that must navigate non... ...our demand intelligence platform doesn't just call APIs... ...multi-step inference over messy, high-stakes... ...We're looking for a Lead AI Engineer who can own the... ...new approaches versus optimize existing ones. Agentic...PlatformShift work
- A leading financial institution in California seeks a Senior Lead AI Engineer to design and develop innovative AI products. The successful candidate will collaborate with diverse teams to optimize AI systems and solutions, contributing significantly to modern banking....PlatformSenior
$320k
...Staff + Sr. Software Engineer, Cloud Inference San Francisco, CA About Anthropic... ..., and steerable AI systems. We want AI... ...team scales and optimizes Claude to serve the... ...Claude on each cloud platform, from API integration... ...collaborative group, and we host frequent research...PlatformSeniorWork at officeVisa sponsorshipFlexible hours$197.3k - $225.1k
...Lead AI Engineer At Capital One, we are creating responsible and reliable... .... Our AI models and platforms empower teams across Capital... ...training, large language model inference, similarity search,... ...introduce state-of-the-art LLM optimization techniques to improve the performance...PlatformFull timePart timeLocal area- A leading data and AI company in San Francisco is seeking a Senior Engineer to enhance their Model Serving platform. This role requires expertise in building large-scale distributed systems... ...and collaboration across teams to optimize performance and reliability. Ideal...PlatformSenior
$197.3k - $225.1k
...Lead AI Engineer (AI Foundations, LLM Core and Agentic AI) Overview... ...customers. Our AI models and platforms empower teams across Capital... ..., large language model inference, similarity search, guardrails... ...introduce state-of-the-art LLM optimization techniques to improve the...PlatformFull timePart timeLocal area- Quadric in San Francisco is looking for an experienced AI Kernel Engineer to develop and optimize AI kernels for their innovative neural processing platform. This role involves enhancing performance for various hardware configurations and providing technical support to...PlatformSenior
- Lead AI Engineer - Salesforce Lead AI Engineer at Salesforce responsible... ...training, evaluation, and inference Transform raw interaction... ...signals to drive continuous optimization Systems & API Development... ...workflows Collaborate with platform teams while owning application...Platform
- ...SoFi's Senior Staff AI Engineer is a hands-on AI... ...our next-generation AI platform, particularly focusing... ...failovers Deep Model Optimization: Pioneer and institutionalize... ...platform designed to host internally fine-tuned... ..., low-latency inference across diverse hardware...PlatformSeniorRemote work
$286.2k - $326.7k
...responsible and reliable AI systems, changing... ...science and engineering teams to deliver our industry leading capabilities with... ...Our AI models and platforms empower teams across... ...‑language‑model inference, similarity search... ...state‑of‑the‑art LLM optimization techniques to...PlatformSeniorLocal area- ...Baseten powers mission‑critical inference for the world's most dynamic AI companies, like Cursor, Notion, OpenEvidence... .... Join us and help build the platform engineers turn to to ship AI products. THE... ...and outcomes for our customers. Optimize and enhance AI/ML projects,...PlatformWork experience placementFlexible hours
$275k - $350k
...Lead AI Engineer Title of Role: Lead AI Engineer Location: San Francisco, hybrid... ...Analyze system performance and implement optimizations to ensure high availability and... ...Preferred Experience with cloud platforms and services that enhance AI capabilities...PlatformWork at office$139.2k - $174k
...expanding its AI Infrastructure... ...seeking a Senior Engineer 2 to play a key... ...AI workloads— inference, training, fine... ...large Gen AI model hosting infrastructure.... ...our global platform remains simple,... ...Operational Excellence: Lead the operational... ...enable fleet optimization and help evolve...PlatformSeniorLocal areaRemote workWorldwideFlexible hours$167.4k - $205.1k
...Sr AI/ML Developer JLL empowers you to shape a brighter way. Our people... ...proficiency in LLM fine-tuning, prompt engineering, and model optimization Demonstrated proficiency in... ...management Experience with cloud AI platforms (Azure OpenAI, AWS Bedrock, Google Cloud...PlatformDaily paid$167.4k - $205.1k
...and build intelligent AI agent systems, LLM-... ...experience working as Sr AI/ML Developer with... ...fine-tuning, prompt engineering, and model optimization Demonstrated proficiency... ...with cloud AI platforms (Azure OpenAI, AWS Bedrock... ...and affiliates, is a leading global provider of...PlatformDaily paidLocal area$181.5k - $283.8k
...something unprecedented, an AI foundation that will... ...delivery and core platform development. AI4D's mission... ...As a Generative AI Engineer, you will design, build... ...Design, build, and optimize RAG pipelines over internal... ...Women's Initiative for Leading at Lilly (WILL), enAble...PlatformFull timeFlexible hours$150k - $250k
...Senior/Staff AI Engineer Job Locations US-CA-San Francisco... ...-world model serving and inference. This is the role for engineers... ...What you'll do Build and optimize LLM serving and inference... ...performance systems, storage platforms, or adjacent distributed systems...PlatformSeniorFull timeRemote work$203.5k
...personally. About Bain AI, Insights & Solutions... ...design, architecture and engineering, and client... ...and traditional data platforms), covering indexing strategies... ...tuning and preference optimization approaches). Write clean... ...stakeholders; lead working sessions, present...PlatformFull timeTemporary workApprenticeshipWork at officeLocal areaWork from homeHome office3 days per week- ...Sciforium AI Infrastructure Role Sciforium... ...high-efficiency serving platform. Backed by multi-million... ...-on support from AMD engineers the team is scaling... ...how large AI models are optimized and deployed at scale,... ...Monitoring, and distributed inference features....PlatformSeniorWork at officeFlexible hours
- ...Sr. Manager AI Agent Implementation About Us Visa is a world leader... ...deployment, and ongoing performance optimization. This role works closely... ...Agent Implementation: Lead the hands-on development and... ...enterprise tools to enhance platform capabilities. Code Deployment...PlatformSeniorWork experience placementWork at officeLocal area
- ...re building an agentic AI caregiver advocate... ...over time. The AI engineering challenge: build an autonomous... ...closely with Product, Platform Engineering, and... ...engineering and model optimization Design prompt architectures... ...evolves. Optimize inference costs our unit economics...PlatformSeniorImmediate startRemote workFlexible hours
- ...Sr. AI Engineer, Time-Series Signal Processing Bright.... ...intelligent automation. Our platform processes visual,... ...Signal Processing to lead the development of AI... ...experimentation and optimization of signal-processing... ...processing, and real-time inference, and incorporate them...PlatformSeniorRemote jobImmediate start
$172.5k
...Category Software Engineering Job Details... ...Salesforce is the #1 AI CRM, where humans... ...at the company leading workforce transformation... ...evaluate, optimize, and improve agent... ..., evaluation, and inference Transform raw interaction... ...Collaborate with platform teams while owning...Platform$166.2k - $304.7k
...technology company and the world's leading independent platform for digital advertising,... ...Lead Senior Staff Software Engineers are end-to-end owners who... ...Continuously analyze and optimize team efficiency, removing... ...microservices, Kubernetes, and AI/LLM integration into...PlatformSeniorFull timeTemporary workLocal areaWorldwide- ...By using the same technologies that power leading marketplace and last-mile platforms, we deliver care where people are, especially... ...the Role We're hiring a Senior Software Engineer to join our Logistics Optimization team, where we tackle some of the hardest algorithmic...PlatformSeniorTemporary workWork at officeFlexible hours
Do you want to receive more vacancies?
Subscribe and receive similar vacancies to Sr. Lead AI Engineer (Inference Optimization, FM hosting, AI Platform). Be the first to apply!
Related searches
- lead operating engineer San Francisco, CA
- lead engineer San Francisco, CA
- lead infrastructure engineer San Francisco, CA
- lead integration engineer San Francisco, CA
- lead security engineer San Francisco, CA
- lead backend developer San Francisco, CA
- lead algorithm engineer San Francisco, CA
- lead network engineer San Francisco, CA
- lead system engineer San Francisco, CA
- lead product engineer San Francisco, CA

