Lead AI Engineer (FM Hosting, LLM Inference)

$197.3k - $225.1k

Capital One

Overview

At Capital One, we are creating responsible and reliable AI systems, changing banking for good. For years, Capital One has been an industry leader in using machine learning to create real-time, personalized customer experiences. Our investments in technology infrastructure and world-class talent — along with our deep experience in machine learning — position us to be at the forefront of enterprises leveraging AI. From informing customers about unusual charges to answering their questions in real time, our applications of AI & ML are bringing humanity and simplicity to banking. We are committed to continuing to build world-class applied science and engineering teams to deliver our industry leading capabilities with breakthrough product experiences and scalable, high-performance AI infrastructure. At Capital One, you will help bring the transformative power of emerging AI capabilities to reimagine how we serve our customers and businesses who have come to love the products and services we build.

Team Description:

The Intelligent Foundations and Experiences (IFX) team is at the center of bringing our vision for AI at Capital One to life. We work hand-in-hand with our partners across the company to advance the state of the art in science and AI engineering, and we build and deploy proprietary solutions that are central to our business and deliver value to millions of customers. Our AI models and platforms empower teams across Capital One to enhance their products with the transformative power of AI, in responsible and scalable ways for the highest leverage impact.

In this role, you will:

Partner with a cross-functional team of engineers, research scientists, technical program managers, and product managers to deliver AI-powered products that change how our associates work and how our customers interact with Capital One.
Design, develop, test, deploy, and support AI software components including foundation model training, large language model inference, similarity search, guardrails, model evaluation, experimentation, governance, and observability, etc.
Leverage a broad stack of Open Source and SaaS AI technologies such as AWS Ultraclusters, Huggingface, VectorDBs, Nemo Guardrails, PyTorch, and more.
Invent and introduce state-of-the-art LLM optimization techniques to improve the performance — scalability, cost, latency, throughput — of large scale production AI systems.
Contribute to the technical vision and the long term roadmap of foundational AI systems at Capital One.

The Ideal Candidate:

You love to build systems, take pride in the quality of your work, and also share our passion to do the right thing. You want to work on problems that will help change banking for good.
Passion for staying abreast of the latest research, and an ability to intuitively understand scientific publications and judiciously apply novel techniques in production.
You adapt quickly and thrive on bringing clarity to big, undefined problems. You love asking questions and digging deep to uncover the root of problems and can articulate your findings concisely with clarity. You have the courage to share new ideas even when they are unproven.
You are deeply Technical. You possess a strong foundation in engineering and mathematics, and your expertise in hardware, software, and AI enable you to see and exploit optimization opportunities that others miss.
You are a resilient trail blazer who can forge new paths to achieve business goals when the route is unknown.

Basic Qualifications:

Bachelor's degree in Computer Science, AI, Electrical Engineering, Computer Engineering, or related fields plus at least 4 years of experience developing AI and ML algorithms or technologies, or a Master's degree in Computer Science, AI, Electrical Engineering, Computer Engineering, or related fields plus at least 2 years of experience developing AI and ML algorithms or technologies
At least 4 years of experience programming with Python, Go, Scala, or Java

Preferred Qualifications:

6 years of experience deploying scalable and responsible AI solutions on cloud platforms (e.g. AWS, Google Cloud, Azure, or equivalent private cloud)
Experience designing, developing, delivering, and supporting AI services
Experience developing AI and ML algorithms or technologies (e.g. LLM Inference, Similarity Search and VectorDBs, Guardrails, Memory) using Python, C++, C#, Java, or Golang
Experience developing and applying state-of-the-art techniques for optimizing training and inference software to improve hardware utilization, latency, throughput, and cost
Passion for staying abreast of the latest AI research and AI systems, and judiciously apply novel techniques in production

Capital One will consider sponsoring a new qualified applicant for employment authorization for this position.

The minimum and maximum full-time annual salaries for this role are listed below, by location. Please note that this salary information is solely for candidates hired to perform work within one of these locations, and refers to the amount Capital One is willing to pay at the time of this posting. Salaries for part-time roles will be prorated based upon the agreed upon number of hours to be regularly worked.

Cambridge, MA: $197,300 - $225,100 for Lead AI Engineer McLean, VA: $197,300 - $225,100 for Lead AI Engineer New York, NY: $215,200 - $245,600 for Lead AI Engineer San Jose, CA: $215,200 - $245,600 for Lead AI Engineer

Candidates hired to work in other locations will be subject to the pay range associated with that location, and the actual annualized salary amount offered to any candidate at the time of hire will be reflected solely in the candidate’s offer letter.

This role is also eligible to earn performance based incentive compensation, which may include cash bonus(es) and/or long term incentives (LTI). Incentives could be discretionary or non discretionary depending on the plan.

Capital One offers a comprehensive, competitive, and inclusive set of health, financial and other benefits that support your total well-being. Learn more at theCapital One Careers website. Eligibility varies based on full or part-time status, exempt or non-exempt status, and management level.

This role is expected to accept applications for a minimum of 5 business days.No agencies please. Capital One is an equal opportunity employer (EOE, including disability/vet) committed to non-discrimination in compliance with applicable federal, state, and local laws. Capital One promotes a drug-free workplace. Capital One will consider for employment qualified applicants with a criminal history in a manner consistent with the requirements of applicable laws regarding criminal background inquiries, including, to the extent applicable, Article 23-A of the New York Correction Law; San Francisco, California Police Code Article 49, Sections 4901-4920; New York City’s Fair Chance Act; Philadelphia’s Fair Criminal Records Screening Act; and other applicable federal, state, and local laws and regulations regarding criminal background inquiries.

If you have visited our website in search of information on employment opportunities or to apply for a position, and you require an accommodation, please contact Capital One Recruiting at View phone number on click.appcast.io or via email at View email address on click.appcast.io. All information you provide will be kept confidential and will be used only to the extent required to provide needed reasonable accommodations.

For technical support or questions about Capital One's recruiting process, please send an email to View email address on click.appcast.io

Capital One does not provide, endorse nor guarantee and is not liable for third-party products, services, educational tools or other information available through this site.

Capital One Financial is made up of several different entities. Please note that any position posted in Canada is for Capital One Canada, any position posted in the United Kingdom is for Capital One Europe and any position posted in the Philippines is for Capital One Philippines Service Corp. (COPSSC).

Apply

Vacancy posted 4 days ago

Similar jobs that could be interesting for youBased on the Lead AI Engineer (FM Hosting, LLM Inference) in New York, NY vacancy

Lead AI Engineer (FM Hosting, LLM Inference) (Hiring Immediately)
$197.3k - $225.1k
Lead AI Engineer (FM Hosting, LLM Inference) Overview At Capital One, we are creating responsible and reliable AI systems, changing banking for good. For years, Capital One has been an industry leader in using machine learning to create real-time, personalized customer...
Suggested
Full time
Part time
Local area
Immediate start
Capital One
New York, NY
23 hours ago
Lead AI Engineer (FM Hosting, LLM Inference)
$197.3k - $225.1k
Lead AI Engineer (FM Hosting, LLM Inference) Overview At Capital One, we are creating responsible and reliable AI systems, changing banking for good. For years, Capital One has been an industry leader in using machine learning to create real-time, personalized customer...
Suggested
Full time
Part time
Local area
Capital One
New York, NY
9 hours ago
Lead AI Engineer (FM Hosting, LLM Inference)
...Technical. You possess a strong foundation in engineering and mathematics, and your expertise in hardware, software, and AI enable you to see and exploit optimization... ...AI and ML algorithms or technologies (e.g. LLM Inference, Similarity Search and VectorDBs, Guardrails...
Suggested
Full time
Part time
Capital One
New York, NY
2 days ago
Lead AI Engineer (AI Foundations, LLM Customization and Finetuning) (Hiring Immediately)
$197.3k - $225.1k
Lead AI Engineer (AI Foundations, LLM Customization and Finetuning) Overview At Capital One, we are creating responsible and reliable AI systems, changing... ...foundation model training, large language model inference, similarity search, guardrails, model evaluation, experimentation...
Suggested
Full time
Part time
Local area
Immediate start
Capital One
New York, NY
23 hours ago
Lead AI Engineer (AI Foundations, LLM Customization and Finetuning) (Hiring Immediately)
$197.3k - $225.1k
Lead AI Engineer (AI Foundations, LLM Customization and Finetuning) Overview At Capital One, we are creating responsible and reliable AI systems, changing... ...foundation model training, large language model inference, similarity search, guardrails, model evaluation, experimentation...
Suggested
Full time
Part time
Local area
Immediate start
Capital One
New York, NY
23 hours ago
Senior Lead AI Engineer (AI Foundations, LLM Core and Agentic AI) (Hiring Immediately)
$225.4k - $257.2k
Senior Lead AI Engineer (AI Foundations, LLM Core and Agentic AI) Overview: At Capital One, we are creating responsible and reliable AI systems, changing... ...foundation model training, large language model inference, similarity search, guardrails, model evaluation, experimentation...
Full time
Part time
Local area
Immediate start
Capital One
New York, NY
23 hours ago
Lead AI Engineer (AI Foundations, LLM Core and Agentic AI)
$197.3k - $225.1k
Lead AI Engineer (AI Foundations, LLM Core and Agentic AI) Overview At Capital One, we are creating responsible and reliable AI systems, changing banking... ...foundation model training, large language model inference, similarity search, guardrails, model evaluation, experimentation...
Full time
Local area
Capital One National Association
New York, NY
3 days ago
Lead AI Engineer (Hiring Immediately)
$197.3k - $225.1k
Lead AI Engineer At Capital One, we are creating responsible and reliable AI systems, changing... ...foundation model training, large language model inference, similarity search, guardrails, model... ...Invent and introduce state-of-the-art LLM optimization techniques to improve the...
Full time
Part time
Local area
Immediate start
Capital One
New York, NY
23 hours ago
Applied AI, Technical Lead - Forward Deployed AI Engineer
...Technical Lead, Applied AI At Mistral AI, we believe in the power of... ...project teams of Applied AI Engineers, ensuring the successful deployment... ...workflows, and custom LLM applications, ensuring alignment... ...practices for fine-tuning, inference, and deployment. Work...
Work at office
Visa sponsorship
Mistral AI
New York, NY
2 days ago
Lead AI Engineer: Foundations, LLM Core & Agentic AI
A leading financial services company is seeking a skilled AI Engineer in New York City to develop innovative AI and ML algorithms. This role requires at least 4 years of experience with programming in Python and expertise in deploying scalable AI solutions on cloud platforms...
Capital One
New York, NY
17 hours ago
AI Red Team Lead Engineer
$133.37k - $156.9k
AI Red Team Lead Engineer page is loaded## AI Red Team Lead Engineerlocations: Minneapolis, MN: Atlanta... ...) + Training, evaluation, and inference pipelines + Data ingestion, labeling,... ...Experience with cloud, containerized, and AI-hosting environments* Proficiency in one or...
Temporary work
Work experience placement
Local area
3 days per week
U.S. Bank
New York, NY
1 day ago
Lead AI Engineer — Foundations, LLM Core & Agentic AI
$215.2k - $245.6k
Capital One National Association is seeking a Lead AI Engineer to create responsible AI systems that redefine banking experiences. This role involves partnering with diverse teams to develop AI solutions, leveraging cutting-edge technologies. Requirements include a Bachelor...
Capital One National Association
New York, NY
1 day ago
Senior AI Engineer - LLM Inference & Production Systems
A leading financial services company in New York seeks an experienced AI Engineer to develop and optimize machine learning algorithms. Candidates should hold a Bachelor's or Master's degree and have significant experience programming in Python and leveraging cloud platforms...
Capital One
New York, NY
17 hours ago
Senior AI Engineer - LLM Inference & Production Systems
A leading financial services company is seeking a technical professional to develop AI and ML algorithms. The ideal candidate has at least 4 years of experience in programming and deploying AI solutions on cloud platforms. Responsibilities include building state-of-the...
Capital One
New York, NY
2 days ago
Lead AI Engineer - Gen AI Platform & LLM Systems
A leading financial services company is seeking an experienced Lead AI Engineer to enhance AI systems and create transformative AI-powered banking products. This role involves collaborating with diverse teams to advance AI capabilities and requiring a strong background...
Capital One National Association
New York, NY
2 days ago
Founding AI Engineer (Fintech/LLM)
...Job Description Job Description Founding AI Engineer (Fintech/LLM) Location: New York City, NY (Hybrid, 4 days onsite) Role Overview Founding AI Engineer role owning the end-to-end design, development, and scaling of LLM-powered product features within a high-...
David Joseph & Company
New York, NY
26 days ago
Remote Clojure AI / LLM Engineer - Stylitics
...We are looking for a versatile and experienced AI / LLM Data Engineer to join our team and help shape the future of how Stylitics leverages large language models. In this role, you’ll combine your expertise in data science, software engineering, and prompt design to drive...
Work experience placement
Remote work
WorksHub
New York, NY
17 hours ago
Lead/Staff AI Acceleration Engineer
$500 per month
...liquidity for millions of Americans. As a leading innovator in the U.S. financial... ...most. We’re hiring a Lead / Staff AI Acceleration Engineer to help define how AI is built and used... ...engineering ~ Hands-on experience building LLM-powered applications, agents, or tool-...
Temporary work
Work at office
Local area
Remote work
Work from home
Home office
Flexible hours
GrabJobs
New York, NY
1 day ago
Staff AI Engineer
$175k - $250k
...market outcomes. The Staff AI Engineer will be responsible for... ...strategies. Model & Inference Infrastructure... ...Transition from external LLM dependence to controlled... ...intelligence, evaluating hosting strategies ranging from... ...systems where predictions lead to actions that generate...
Full time
Immediate start
Remote work
Shift work
MLabs
New York, NY
3 days ago
Lead Applied AI Engineer
...We're building cutting-edge LLM-powered tools that supercharge... ...professional services firms, and leading consulting practices:... ...Description As one of our first AI/ML hires, you'll have the unique... ...prioritization Shape our engineering culture We're live with 2...
Summer work
Work at office
10 hours per week
Junior Company
New York, NY
4 days ago
Applied AI Engineer
...deliver healthcare. Our AI sensing platform enables... ...for an Applied AI Engineer to take our growing collection... ...fleet Deploy across our inference surfaces: third-party APIs, self-hosted, and on-robot edge... ...pipelines (STT, TTS, streaming LLM) Production RAG systems...
Norbert Health
Brooklyn, NY
3 days ago
Senior Python Developer with GenAI, LLM
...Developer with GenAI, LLM Location: Jersey City... .... Understanding of AI governance, model... ...programs (e.g., Expert Engineers) starting to be firm-wide... ...sources within secure, cloud-hosted environments (e.g., AWS... ...to deliver industry-leading capabilities to our clients...
Contract work
Casual work
InterSources
Jersey City, NJ
1 day ago
Lead AI/ML Engineer
$170k - $190k
...mission is simple: deliver the best AI-powered customer experience—... ...Job Description You will lead the design and delivery of end... ...handling, streaming inference, and audio quality, and can translate... ...leadership within the team, mentoring engineers and promoting best practices...
Remote work
ASAPP
New York, NY
2 days ago
Senior AI Engineer for Equities: Scalable LLM Pipelines
A leading tech firm in New York is seeking a Senior AI Engineer to join its Equities Technology AI group. This role entails managing the design and optimization of... ...have a strong background in Python, experience with LLM frameworks, and a PhD or Master's with substantial...
Work experience placement
Talent Algo
New York, NY
3 days ago
AI Lead Engineer, Client Technology - SVP - NYC
$176.72k - $265.08k
...are seeking a highly skilled and pragmatic AI Lead to design, develop, and deploy advanced... ...based systems, machine learning, software engineering and management best practices. This... ...reliability, and security. Machine Learning & LLM Integration: Integrate various machine...
Full time
Citigroup Inc.
New York, NY
1 day ago
Edge AI Inference Engineer On-Device ML Systems
...A technology company in Georgia is seeking a C++ Engineer to own the inference backbone of its AI stack, focusing on deploying models to edge devices. You'll collaborate closely with researchers and manage a cross-functional team to enhance existing products with AI features...
Framework Ventures
New York, NY
17 hours ago
AI Platform Engineer, Python - Lead Software Engineer
...Lead Software Engineer We have an opportunity to impact your career and provide an adventure where... ...development, and technical troubleshooting for AI-enabled applications. Develop secure,... ..., moderation) and comprehensive LLM usage logging/monitoring (no prior experience...
Work at office
Chase
Jersey City, NJ
2 days ago
Senior AI Agents Engineer: Production-Grade LLM Systems
A leading financial institution in New York seeks an AI Agents Applied Research/Engineering Senior Associate. In this role, you will lead the lifecycle of LLM-based agents, implementing research and best practices for AI systems. You will ensure the delivery of efficient...
JPMorgan Chase & Co.
New York, NY
1 day ago
Lead Applied AI Software Engineer ( AI)
$170.8k - $234.8k
...caring community The Enterprise AI organization at Humana is a... .... By collaborating with world-leading experts, we are at the forefront... ...managers, data scientists, engineers, policy experts, and business... ...databases, embedding models, LLM APIs, and orchestration frameworks...
Bi-weekly pay
Full time
Temporary work
Apprenticeship
Casual work
Work at office
Work from home
Home office
Humana Inc
New York, NY
3 days ago
Senior AI Engineer: Multi-Agent Systems & LLM Orchestration
A leading media company is seeking an Associate Director, AI Engineer in New York. This senior role involves building AI agents and ensuring high-quality delivery of... ...experience, strong skills in Python, and knowledge of LLM frameworks. Responsibilities include designing...
Flexible hours
Dentsu Aegis Network Ltd.
New York, NY
17 hours ago

Do you want to receive more vacancies?

Subscribe and receive similar vacancies to Lead AI Engineer (FM Hosting, LLM Inference). Be the first to apply!