Lead AI Engineer (FM Hosting, LLM Inference) (Hiring Immediately)
$197.3k - $225.1k
Capital One
Job Description
At Capital One, we are creating responsible and reliable AI systems, changing banking for good. For years, Capital One has been an industry leader in using machine learning to create real-time, personalized customer experiences. Our investments in technology infrastructure and world-class talent, along with our deep experience in machine learning, position us to be at the forefront of enterprises leveraging AI. From informing customers about unusual charges to answering their questions in real time, our applications of AI & ML are bringing humanity and simplicity to banking. We are committed to continuing to build world-class applied science and engineering teams to deliver our industry-leading capabilities with breakthrough product experiences and scalable, high-performance AI infrastructure. At Capital One, you will help bring the transformative power of emerging AI capabilities to reimagine how we serve our customers and businesses who have come to love the products and services we build.
Team Description:
The Intelligent Foundations and Experiences (IFX) team is at the center of bringing our vision for AI at Capital One to life. We work hand-in-hand with our partners across the company to advance the state of the art in science and AI engineering, and we build and deploy proprietary solutions that are central to our business and deliver value to millions of customers. Our AI models and platforms empower teams across Capital One to enhance their products with the transformative power of AI, in responsible and scalable ways for the highest leverage impact.
In this role, you will:
Partner with a cross-functional team of engineers, research scientists, technical program managers, and product managers to deliver AI-powered products that change how our associates work and how our customers interact with Capital One.
Design, develop, test, deploy, and support AI software components, including foundation model training, large language model inference, similarity search, guardrails, model evaluation, experimentation, governance, and observability.
Leverage a broad stack of open source and SaaS AI technologies such as AWS UltraClusters, Hugging Face, VectorDBs, NeMo Guardrails, PyTorch, and more.
Invent and introduce state-of-the-art LLM optimization techniques to improve the performance (scalability, cost, latency, throughput) of large-scale production AI systems.
Contribute to the technical vision and the long-term roadmap of foundational AI systems at Capital One.
The Ideal Candidate:
You love to build systems, take pride in the quality of your work, and also share our passion to do the right thing. You want to work on problems that will help change banking for good.
You have a passion for staying abreast of the latest research, and an ability to intuitively understand scientific publications and judiciously apply novel techniques in production.
You adapt quickly and thrive on bringing clarity to big, undefined problems. You love asking questions and digging deep to uncover the root of problems, and you can articulate your findings concisely and clearly. You have the courage to share new ideas even when they are unproven.
You are deeply technical. You possess a strong foundation in engineering and mathematics, and your expertise in hardware, software, and AI enables you to see and exploit optimization opportunities that others miss.
You are a resilient trailblazer who can forge new paths to achieve business goals when the route is unknown.
Basic Qualifications:
Bachelor's degree in Computer Science, AI, Electrical Engineering, Computer Engineering, or related fields plus at least 4 years of experience developing AI and ML algorithms or technologies, or a Master's degree in Computer Science, AI, Electrical Engineering, Computer Engineering, or related fields plus at least 2 years of experience developing AI and ML algorithms or technologies
At least 4 years of experience programming with Python, Go, Scala, or Java
Preferred Qualifications:
6 years of experience deploying scalable and responsible AI solutions on cloud platforms (e.g. AWS, Google Cloud, Azure, or equivalent private cloud)
Experience designing, developing, delivering, and supporting AI services
Experience developing AI and ML algorithms or technologies (e.g. LLM Inference, Similarity Search and VectorDBs, Guardrails, Memory) using Python, C++, C#, Java, or Golang
Experience developing and applying state-of-the-art techniques for optimizing training and inference software to improve hardware utilization, latency, throughput, and cost
Passion for staying abreast of the latest AI research and AI systems, and for judiciously applying novel techniques in production
Capital One will consider sponsoring a new qualified applicant for employment authorization for this position.
The minimum and maximum full-time annual salaries for this role are listed below, by location. Please note that this salary information is solely for candidates hired to perform work within one of these locations, and refers to the amount Capital One is willing to pay at the time of this posting. Salaries for part-time roles will be prorated based upon the agreed upon number of hours to be regularly worked.
McLean, VA: $197,300 - $225,100 for Lead AI Engineer
New York, NY: $215,200 - $245,600 for Lead AI Engineer
San Jose, CA: $215,200 - $245,600 for Lead AI Engineer
Candidates hired to work in other locations will be subject to the pay range associated with that location, and the actual annualized salary amount offered to any candidate at the time of hire will be reflected solely in the candidate’s offer letter.
This role is also eligible to earn performance based incentive compensation, which may include cash bonus(es) and/or long term incentives (LTI). Incentives could be discretionary or non discretionary depending on the plan.
Capital One offers a comprehensive, competitive, and inclusive set of health, financial and other benefits that support your total well-being. Learn more at the Capital One Careers website. Eligibility varies based on full or part-time status, exempt or non-exempt status, and management level.
This role is expected to accept applications for a minimum of 5 business days.
No agencies please. Capital One is an equal opportunity employer (EOE, including disability/vet) committed to non-discrimination in compliance with applicable federal, state, and local laws. Capital One promotes a drug-free workplace. Capital One will consider for employment qualified applicants with a criminal history in a manner consistent with the requirements of applicable laws regarding criminal background inquiries, including, to the extent applicable, Article 23-A of the New York Correction Law; San Francisco, California Police Code Article 49, Sections 4901-4920; New York City’s Fair Chance Act; Philadelphia’s Fair Criminal Records Screening Act; and other applicable federal, state, and local laws and regulations regarding criminal background inquiries.
If you have visited our website in search of information on employment opportunities or to apply for a position, and you require an accommodation, please contact Capital One Recruiting by phone or email (contact details are available on apply.placed-app.com). All information you provide will be kept confidential and will be used only to the extent required to provide needed reasonable accommodations.
For technical support or questions about Capital One's recruiting process, please send an email to the address listed on apply.placed-app.com.
Capital One does not provide, endorse, or guarantee, and is not liable for, third-party products, services, educational tools or other information available through this site.
Capital One Financial is made up of several different entities. Please note that any position posted in Canada is for Capital One Canada, any position posted in the United Kingdom is for Capital One Europe and any position posted in the Philippines is for Capital One Philippines Service Corp. (COPSSC).


