Sr. Lead AI Engineer (Inference Optimization, FM hosting, AI Platform)
$229.9k - $262.4kCapital One
Sr. Lead AI Engineer (Inference Optimization, FM hosting, AI Platform) Overview: At Capital One, we are creating responsible and reliable AI systems, changing banking for good. For years, Capital One has been an industry leader in using machine learning to create real-time, personalized customer experiences. Our investments in technology infrastructure and world-class talent — along with our deep experience in machine learning — position us to be at the forefront of enterprises leveraging AI. From informing customers about unusual charges to answering their questions in real time, our applications of AI & ML are bringing humanity and simplicity to banking. We are committed to continuing to build world-class applied science and engineering teams to deliver our industry leading capabilities with breakthrough product experiences and scalable, high-performance AI infrastructure. At Capital One, you will help bring the transformative power of emerging AI capabilities to reimagine how we serve our customers and businesses who have come to love the products and services we build. Team Description: The Intelligent Foundations and Experiences (IFX) team is at the center of bringing our vision for AI at Capital One to life. We work hand-in-hand with our partners across the company to advance the state of the art in science and AI engineering, and we build and deploy proprietary solutions that are central to our business and deliver value to millions of customers. Our AI models and platforms empower teams across Capital One to enhance their products with the transformative power of AI, in responsible and scalable ways for the highest leverage impact. In this role, you will: Partner with a cross-functional team of engineers, research scientists, technical program managers, and product managers to deliver AI-powered products that change how our associates work and how our customers interact with Capital One. Design, develop, test, deploy, and support AI software components including foundation model training, large language model inference, similarity search, guardrails, model evaluation, experimentation, governance, and observability, etc. Leverage a broad stack of Open Source and SaaS AI technologies such as AWS Ultraclusters, Huggingface, VectorDBs, Nemo Guardrails, PyTorch, and more. Invent and introduce state-of-the-art LLM optimization techniques to improve the performance — scalability, cost, latency, throughput — of large scale production AI systems. Contribute to the technical vision and the long term roadmap of foundational AI systems at Capital One. The Ideal Candidate: You love to build systems, take pride in the quality of your work, and also share our passion to do the right thing. You want to work on problems that will help change banking for good. Passion for staying abreast of the latest research, and an ability to intuitively understand scientific publications and judiciously apply novel techniques in production. You adapt quickly and thrive on bringing clarity to big, undefined problems. You love asking questions and digging deep to uncover the root of problems and can articulate your findings concisely with clarity. You have the courage to share new ideas even when they are unproven. You are deeply Technical. You possess a strong foundation in engineering and mathematics, and your expertise in hardware, software, and AI enable you to see and exploit optimization opportunities that others miss. You are a resilient trail blazer who can forge new paths to achieve business goals when the route is unknown. Basic Qualifications: Bachelor's degree in Computer Science, AI, Electrical Engineering, Computer Engineering, or related fields plus at least 6 years of experience developing AI and ML algorithms or technologies, or a Master's degree in Computer Science, AI, Electrical Engineering, Computer Engineering, or related fields plus at least 4 years of experience developing AI and ML algorithms or technologies At least 6 years of experience programming with Python, Go, Scala, or Java Preferred Qualifications: 7 years of experience deploying scalable and responsible AI solutions on cloud platforms (e.g. AWS, Google Cloud, Azure, or equivalent private cloud) Experience designing, developing, integrating, delivering, and supporting complex AI systems Demonstrated ability to lead and mentor an engineering team and influence cross-functional stakeholders Experience developing AI and ML algorithms or technologies (e.g. LLM Inference, Similarity Search and VectorDBs, Guardrails, Memory) using Python, C++, C#, Java, or Golang Experience developing and applying state-of-the-art techniques for optimizing training and inference software to improve hardware utilization, latency, throughput, and cost Passion for staying abreast of the latest AI research and AI systems, and judiciously apply novel techniques in production Excellent communication and presentation skills, with the ability to articulate complex AI concepts to peers Capital One will consider sponsoring a new qualified applicant for employment authorization for this position. The minimum and maximum full-time annual salaries for this role are listed below, by location. Please note that this salary information is solely for candidates hired to perform work within one of these locations, and refers to the amount Capital One is willing to pay at the time of this posting. Salaries for part-time roles will be prorated based upon the agreed upon number of hours to be regularly worked. Cambridge, MA: $229,900 - $262,400 for Sr. Lead AI Engineer McLean, VA: $229,900 - $262,400 for Sr. Lead AI Engineer New York, NY: $250,800 - $286,200 for Sr. Lead AI Engineer San Francisco, CA: $250,800 - $286,200 for Sr. Lead AI Engineer San Jose, CA: $250,800 - $286,200 for Sr. Lead AI Engineer Candidates hired to work in other locations will be subject to the pay range associated with that location, and the actual annualized salary amount offered to any candidate at the time of hire will be reflected solely in the candidate’s offer letter. This role is also eligible to earn performance based incentive compensation, which may include cash bonus(es) and/or long term incentives (LTI). Incentives could be discretionary or non discretionary depending on the plan. Capital One offers a comprehensive, competitive, and inclusive set of health, financial and other benefits that support your total well-being. Learn more at the Capital One Careers website. Eligibility varies based on full or part-time status, exempt or non-exempt status, and management level. This role is expected to accept applications for a minimum of 5 business days. No agencies please. Capital One is an equal opportunity employer (EOE, including disability/vet) committed to non-discrimination in compliance with applicable federal, state, and local laws. Capital One promotes a drug-free workplace. Capital One will consider for employment qualified applicants with a criminal history in a manner consistent with the requirements of applicable laws regarding criminal background inquiries, including, to the extent applicable, Article 23-A of the New York Correction Law; San Francisco, California Police Code Article 49, Sections 4901-4920; New York City’s Fair Chance Act; Philadelphia’s Fair Criminal Records Screening Act; and other applicable federal, state, and local laws and regulations regarding criminal background inquiries. If you have visited our website in search of information on employment opportunities or to apply for a position, and you require an accommodation, please contact Capital One Recruiting at View phone number on click.appcast.io or via email at View email address on click.appcast.io. All information you provide will be kept confidential and will be used only to the extent required to provide needed reasonable accommodations. For technical support or questions about Capital One's recruiting process, please send an email to View email address on click.appcast.io Capital One does not provide, endorse nor guarantee and is not liable for third-party products, services, educational tools or other information available through this site. Capital One Financial is made up of several different entities. Please note that any position posted in Canada is for Capital One Canada, any position posted in the United Kingdom is for Capital One Europe and any position posted in the Philippines is for Capital One Philippines Service Corp. (COPSSC). At Capital One, we’re building a leading information-based technology company. Still founder-led by Chairman and Chief Executive Officer Richard Fairbank, Capital One is on a mission to help our customers succeed by bringing ingenuity, simplicity, and humanity to banking. We measure our efforts by the success our customers enjoy and the advocacy they exhibit.
$197.3k - $225.1k
...Lead AI Engineer (FM Hosting, LLM Inference) Overview At Capital One, we are creating responsible and reliable... ...of customers. Our AI models and platforms empower teams across Capital One... ...introduce state-of-the-art LLM optimization techniques to improve the...PlatformFull timePart timeLocal area- ...Sr. Lead AI Engineer At Klaviyo, we value the unique backgrounds, experiences... ...—it's systems that can run, optimize, and adapt themselves based... ...of services that host and orchestrate AI models (LLMs... ...Marketing AI and other product/platform teams—clarifying ownership boundaries...PlatformSeniorLocal area
$229.9k - $262.4k
...Sr. Lead AI Engineer (AI Foundations) Overview: At Capital One, we are... ...customers. Our AI models and platforms empower teams across... ...training, large language model inference, similarity search, guardrails... ...state-of-the-art LLM optimization techniques to improve the performance...PlatformSeniorFull timePart timeLocal area$225.4k - $257.2k
...Senior Lead AI Engineer (AI Foundations, LLM Core and Agentic... .... Our AI models and platforms empower teams across... ...large language model inference, similarity search, guardrails... ...state-of-the-art LLM optimization techniques to improve... ...5,400 - $257,200 for Sr. Lead AI Engineer...PlatformSeniorFull timePart timeLocal area- ...Akamai Technologies GmbH is seeking a Senior II Software Engineer Lead to drive innovation in AI systems within the Akamai Inference Cloud Team. This role involves leading the design and implementation of platform components, driving key technical decisions, and...PlatformSeniorRemote workFlexible hours
$193.4k - $220.7k
...Lead AI Engineer At Capital One, we are creating responsible and reliable... .... Our AI models and platforms empower teams across Capital... ...training, large language model inference, similarity search,... ...introduce state-of-the-art LLM optimization techniques to improve the performance...PlatformFull timePart timeLocal area- A leading technology company is seeking a Senior Software Engineer to develop the AI Inference Server. The role involves managing the full lifecycle of building and releasing system software across various architectures. You will work on cloud infrastructure and enhance...PlatformSenior
$180k - $230k
...with cutting-edge AI/ML and data solutions... ...Data Systems, Platforms at Scale, and Complex... ...operators and engineers to rapidly build,... ...for an experienced Lead AI/ML Software Engineer... ...capability Optimize platform... ...model integration, inference optimization, and...PlatformLive inRemote work$87.52k - $140.77k
...Role: Senior Software Engineer ( Gen AI) Location :... ...Blue Yonder is a leading AI-driven global supply... ...advancing sustainability by optimizing resources. Revolutionizing... ...and optimize AI inference systems for large... ...Familiarity with cloud platforms (AWS, GCP, or Azure)...PlatformSeniorLocal areaRemote workRelocationFlexible hours$314.8k - $359.3k
Sr. Distinguished AI Engineer At Capital One, we are creating responsible... ...our industry leading capabilities with breakthrough... ...Our AI models and platforms empower teams... ...language model inference, similarity search,... ...state‑of‑the‑art LLM optimization techniques to...PlatformSeniorFull timePart timeLocal area$150k - $180k
...a new team focused on accelerating AI adoption and usage, the AI Engineer will design and deploy AI-powered automation... ..., orchestration, and model inference layers Participate in technical... ...and architecture discussions Data Platform Integration Utilize Snowflake as...PlatformWork at officeRemote workRelocation2 days per week$180k - $280k
...mission is to be the leading telecommunications company... ...and experienced Lead AI Engineer to join our dynamic... ...learning strategies to optimize and automate decision-... ...enterprise applications and platforms. ~ Solid... ...frameworks and model inference engines such as Triton...Platform$152k - $228k
...systems that can learn, run, and optimize themselves based on... ...building the next generation of AI and machine learning technologies... ...for any business. As a Lead AI Engineer , you’ll take ownership of... ...klaviyo.com), instant messaging platforms, or unsolicited calls. By...Platform- ...Senior AI Engineer 1 Your journey at Crowe starts here: At... ...smarter, transform faster, and lead in a platform-driven economy. Built for... ...design end-to-end AI solutions, optimize model performance, and... ...datasets, model telemetry, and inference performance to drive...PlatformSeniorRemote workFlexible hours
- ...Senior Software Engineer - AI & Data Engineering... ...domains. We partner with leading organizations to deliver... ...Generative AI, cloud platforms, and data-driven automation... ..., factoring in LLM inference latency and API rate... .... Integrate and optimize tools/frameworks such...PlatformSenior
$150k - $210k
...live longer by using AI to transform... ...recommendations. Our AI platform is central to this... ...a Senior AI/ML Engineer to help scale the... ...across WHOOP products. Lead end-to-end AI... ...direct preference optimization (DPO), and reinforcement... ...deployments with inference optimization,...PlatformSeniorFull timeWork at officeRelocation$229.9k - $262.4k
...Senior Lead Machine Learning Engineer (Intelligent Foundations... ...emerging technology platforms. You'll focus on... ...in building AI/ML capabilities... ...experimentation, LLM inference, similarity... ...products that enable optimized associate... ...- $262,400 for Sr. Lead Machine Learning...PlatformSeniorFull timePart timeInternshipLocal area- A leading technology firm is seeking a Sr. Managed Services Engineer specializing in AI and Copilot solutions. Candidates should have a strong technical... ...include managing customer platforms, collaborating on projects, and optimizing systems. The role is remote with...PlatformSeniorRemote work
$133.65k - $220.68k
...believe the future of AI is open and we are... .... The Red Hat AI Inference team accelerates AI... ...GenAI deployments. As leading developers and... ...provides a stable platform for enterprises to build, optimize, and scale LLM deployments... ...Senior Software Engineer to build and...PlatformSeniorPermanent employmentFull timeContract workWork experience placementWork at officeRemote workFlexible hours$185.13k - $226.27k
...Lead AI Engineer Ready to help us transform healthcare? Bring your true colors to blue. About the Role: Blue Cross Blue Shield... ...Linux/Unix scripting. Experience working with multiple cloud platforms (AWS, GCP, Azure). Solution Architect Associate...PlatformWork at officeLocal areaFlexible hoursShift work$129.5k - $186.1k
...workers start and end their days with our workforce operating platform. Helping people get paid, grow in their careers, and shape the... ...work matters-and so do you. We are seeking an experienced AI Engineer to design, build, and scale AI-driven solutions and autonomous...PlatformLocal area- ...Job Title: Generative AI Engineer (Senior / Lead / Principal)- Multiple openings Experience... ...include custom development, cloud data platforms, AI/ML integrations, and data... ...workflows . LLM Integration & Prompt Optimization Work with OpenAI, Claude, Bard...PlatformSeniorWork at officeRemote work
- ...high-value tasks to intelligent AI agents that help reshape the... ...edge of AI, blending deep engineering with practical product impact... ...from scratch, but on composing, optimizing, and scaling AI systems that... ...- work closely with product, platform, and backend teams to ensure...PlatformSeniorShift work
- A leading software solutions provider seeks a Forward Deployed Engineer to manage and optimize Large Language Model (LLM) inference systems for client environments. The ideal candidate will have over... ...particularly with Kubernetes and AI inference techniques. Responsibilities...
- ...and faster. Opportunity Mission-driven engineering: Every automation you build helps... ...emissions What You'll Do At M-KOPA, you'll lead the AI Ops team from day one - owning delivery... ...Identity automation (Entra), internal platforms, Notion, HiBob integrations Existing Foundation...PlatformApprenticeshipInternshipLocal areaRemote workFlexible hours
- ...Senior AI Engineer Description We are looking for a Senior AI Engineer to join our Data & AI Platform Team in building a next-generation data platform which will leverage aPriori... ..., prompt versioning, cost/performance optimization. Combine traditional AI/ML with...PlatformSeniorFlexible hoursNight shift
$148k - $222k
...in software that can run and optimize itself based on outcome or reward... ...to build state-of-the-art AI and machine learning... ...campaigns for any business. As a Sr. AI Engineer at Klaviyo, you'll play a key... ...klaviyo.com), instant messaging platforms, or unsolicited calls. By...PlatformSenior$180k - $250k
...Senior AI Engineer — Voice & Agentic Systems Location: Remote (US) / Hybrid (Boston or... ...scalable AI nurse concierge. Their platform automates care coordination workflows across... ...AI quality through reward-driven optimization Build real-time STT / TTS pipelines...PlatformSeniorH1bWork at officeRemote work$148k - $222k
...Title: Senior Software Engineer - Marketing Agent... ...the next generation of AI agents that can automatically... ...create, execute, and optimize marketing and customer... ...for them. You will lead and design the next generation... ...), instant messaging platforms, or unsolicited calls....PlatformSeniorWork experience placement$60 per hour
...developing cutting-edge AI systems, while enjoying the... ...become available on our platform. Benefits Fully remote: work... ..., experimental analysis, optimization, and statistical inference. Write clear technical... ...Computer Science, Mathematics, Engineering, or similar); a master's...PlatformHourly payFull timeRemote workFlexible hours
Do you want to receive more vacancies?
Subscribe and receive similar vacancies to Sr. Lead AI Engineer (Inference Optimization, FM hosting, AI Platform). Be the first to apply!
- lead engineer Cambridge, MA
- ai developer Cambridge, MA
- ai engineer Cambridge, MA
- senior ai engineer Cambridge, MA
- senior manager clinical operations Cambridge, MA
- senior lead project manager Cambridge, MA
- senior manager quality engineering Cambridge, MA
- senior device engineer Cambridge, MA
- senior research manager Cambridge, MA
- senior hvac project manager Cambridge, MA


