Lead AI Engineer (FM Hosting, LLM Inference)

$197.3k - $225.1k

Capital One National Association

Lead AI Engineer (FM Hosting, LLM Inference) Overview At Capital One, we are creating responsible and reliable AI systems, changing banking for good. For years, Capital One has been an industry leader in using machine learning to create real‑time, personalized customer experiences. Our investments in technology infrastructure and world‑class talent – along with our deep experience in machine learning – position us to be at the forefront of enterprises leveraging AI. From informing customers about unusual charges to answering their questions in real time, our applications of AI and ML are bringing humanity and simplicity to banking. We are committed to continuing to build world‑class applied science and engineering teams to deliver our industry‑leading capabilities with breakthrough product experiences and scalable, high‑performance AI infrastructure. At Capital One, you will help bring the transformative power of emerging AI capabilities to reimagine how we serve our customers and businesses who have come to love the products and services we build. Team Description The Intelligent Foundations and Experiences (IFX) team is at the center of bringing our vision for AI at Capital One to life. We work hand‑in‑hand with our partners across the company to advance the state of the art in science and AI engineering, and we build and deploy proprietary solutions that are central to our business and deliver value to millions of customers. Our AI models and platforms empower teams across Capital One to enhance their products with the transformative power of AI, in responsible and scalable ways for the highest leverage impact. In this role, you will: Partner with a cross‑functional team of engineers, research scientists, technical program managers, and product managers to deliver AI‑powered products that change how our associates work and how our customers interact with Capital One. Design, develop, test, deploy, and support AI software components including foundation model training, large language model inference, similarity search, guardrails, model evaluation, experimentation, governance, and observability, etc. Leverage a broad stack of open source and SaaS AI technologies such as AWS Ultraclusters, HuggingFace, vector databases, Nemo Guardrails, PyTorch, and more. Invent and introduce state‑of‑the‑art LLM optimization techniques to improve the performance – scalability, cost, latency, throughput – of large‑scale production AI systems. Contribute to the technical vision and the long‑term roadmap of foundational AI systems at Capital One. The Ideal Candidate You love to build systems, take pride in the quality of your work, and also share our passion to do the right thing. You want to work on problems that will help change banking for good. Passion for staying abreast of the latest research, and an ability to intuitively understand scientific publications and judiciously apply novel techniques in production. You adapt quickly and thrive on bringing clarity to big, undefined problems. You love asking questions and digging deep to uncover the root of problems and can articulate your findings concisely with clarity. You have the courage to share new ideas even when they are unproven. You are deeply technical. You possess a strong foundation in engineering and mathematics, and your expertise in hardware, software, and AI enable you to see and exploit optimization opportunities that others miss. You are a resilient trailblazer who can forge new paths to achieve business goals when the route is unknown. Basic Qualifications Bachelor's degree in Computer Science, AI, Electrical Engineering, Computer Engineering, or related fields plus at least 4 years of experience developing AI and ML algorithms or technologies; or a Master's degree in Computer Science, AI, Electrical Engineering, Computer Engineering, or related fields plus at least 2 years of experience developing AI and ML algorithms or technologies. At least 4 years of experience programming with Python, Go, Scala, or Java. Preferred Qualifications 6 years of experience deploying scalable and responsible AI solutions on cloud platforms (e.g. AWS, Google Cloud, Azure, or equivalent private cloud). Experience designing, developing, delivering, and supporting AI services. Experience developing AI and ML algorithms or technologies (e.g. LLM inference, similarity search and vector databases, guardrails, memory) using Python, C++, C#, Java, or Golang. Experience developing and applying state‑of‑the‑art techniques for optimizing training and inference software to improve hardware utilization, latency, throughput, and cost. Passion for staying abreast of the latest AI research and AI systems, and judiciously applying novel techniques in production. Salary and Incentives Salary ranges by location: Cambridge, MA: $197,300 – $225,100; McLean, VA: $197,300 – $225,100; New York, NY: $215,200 – $245,600; San Jose, CA: $215,200 – $245,600. This role is also eligible to earn performance‑based incentive compensation, which may include cash bonuses and/or long‑term incentives (LTI). Incentives could be discretionary or non‑discretionary depending on the plan. Benefits Capital One offers a comprehensive, competitive, and inclusive set of health, financial and other benefits that support your total well‑being. Equal Opportunity Statement Capital One is an equal‑opportunity employer (EOE, including disability/vet) committed to non‑discrimination in compliance with applicable federal, state, and local laws. Capital One promotes a drug‑free workplace. Capital One considers for employment qualified applicants with a criminal history in a manner consistent with the requirements of applicable laws regarding criminal background inquiries, including, to the extent applicable, Article 23-A of the New York Correction Law; San Francisco, California Police Code Article 49, Sections 4901‑4920; New York City’s Fair Chance Act; Philadelphia’s Fair Criminal Records Screening Act; and other applicable federal, state, and local laws and regulations regarding criminal background inquiries. #J-18808-Ljbffr Capital One National Association

Apply

Vacancy posted 4 days ago

Similar jobs that could be interesting for youBased on the Lead AI Engineer (FM Hosting, LLM Inference) in New York, NY vacancy

Lead AI Engineer (FM Hosting, LLM Inference)
$197.3k - $225.1k
...Lead AI Engineer (FM Hosting, LLM Inference) Overview At Capital One, we are creating responsible and reliable AI systems, changing banking for good. For years, Capital One has been an industry leader in using machine learning to create real-time, personalized...
Suggested
Full time
Part time
Local area
Capital One
New York, NY
4 days ago
Senior Lead AI Engineer (FM Hosting, LLM Inference)
$229.9k - $262.4k
Senior Lead AI Engineer (FM Hosting, LLM Inference) Overview: At Capital One, we are creating responsible and reliable AI systems, changing banking for good. For years, Capital One has been an industry leader in using machine learning to create real-time, personalized customer...
Suggested
Full time
Part time
Local area
Capital One
New York, NY
6 hours ago
Lead AI Engineer (AI Foundations, LLM Customization and Finetuning)
$197.3k - $225.1k
...Lead AI Engineer (AI Foundations, LLM Customization and Finetuning) Overview At Capital One, we are creating responsible and reliable AI systems... ...foundation model training, large language model inference, similarity search, guardrails, model evaluation, experimentation...
Suggested
Full time
Part time
Local area
Capital One Financial Corp
New York, NY
4 days ago
Lead AI Engineer (AI Foundations, LLM Core and Agentic AI)
$197.3k - $225.1k
...Lead AI Engineer (AI Foundations, LLM Core and Agentic AI) Overview At Capital One, we are creating responsible and reliable AI systems, changing banking... ...foundation model training, large language model inference, similarity search, guardrails, model evaluation, experimentation...
Suggested
Full time
Local area
Capital One National Association
New York, NY
4 days ago
Lead AI Engineer - LLM Inference & Production AI
Capital One National Association is looking for a Lead AI Engineer in New York, NY, to drive the development of responsible AI systems and solutions. This role involves collaborating with cross-functional teams, designing, and deploying AI products using cutting-edge technologies...
Suggested
Capital One National Association
New York, NY
4 days ago
Lead AI Engineer (AI Foundations, LLM Core and Agentic AI)
...Technical. You possess a strong foundation in engineering and mathematics, and your expertise in hardware, software, and AI enable you to see and exploit optimization... ...AI and ML algorithms or technologies (e.g. LLM Inference, Similarity Search and VectorDBs, Guardrails...
Full time
Part time
Capital One
New York, NY
3 days ago
AI Engineer (LLM Products) - Remote
...dexter health, we build AI-powered software for... ...looking for a high-agency AI Engineer to help us build new AI... ...Design and implement LLM-based workflows,... ...provider Support self-hosted model workflows where they... ...hosted LLMs, model serving, inference optimization, or...
Remote job
dexter health
New York, NY
3 days ago
Lead AI Inference Engineer 100% Remote
About the job You will own the inference backbone behind QVAC's local AI stack: the C++ systems layer that makes models run fast, reliably, and predictably on real user hardware. The role is centered on engineering quality at runtime level, including startup behavior, memory...
Remote job
Local area
Framework Ventures
New York, NY
1 day ago
Lead AI Engineer
$197.3k - $225.1k
...Lead AI Engineer At Capital One, we are creating responsible and reliable AI systems, changing... ...model training, large language model inference, similarity search, guardrails, model evaluation... ...Invent and introduce state-of-the-art LLM optimization techniques to improve the...
Full time
Part time
Local area
Capital One Financial Corp
New York, NY
7 days ago
Lead AI Engineer, Data Solutions
...Lead AI Engineer – Salesforce Lead AI Engineer at Salesforce responsible for building next‑generation... ...) Design AI agents that combine LLM reasoning, tool usage, and ML decisioning... ...‑time) for training, evaluation, and inference Transform raw interaction data into features...
Salesforce.Com Inc
New York, NY
4 days ago
Senior Lead AI Engineer (GenAI Platform Services)
...Senior Lead AI Engineer (GenAI Platform Services) Overview At Capital One, we are creating responsible... ...model training, large language model inference, similarity search, guardrails, model... ...Invent and introduce state‑of‑the‑art LLM optimization techniques to improve the...
Full time
Part time
Local area
Comfort Systems USA
New York, NY
3 days ago
Applied AI, Technical Lead - Forward Deployed AI Engineer
# Applied AI, Technical Lead - Forward Deployed AI EngineerMistral AIVia company... ...project teams of Applied AI Engineers, ensuring the successful... ...workflows, and custom LLM applications, ensuring alignment... ...practices for fine-tuning, inference, and deployment.**- Work closely...
CVFine by Instrovate Technologies
New York, NY
1 day ago
Lead AI Engineer — Foundations, LLM Core & Agentic AI
$215.2k - $245.6k
Capital One National Association is seeking a Lead AI Engineer to create responsible AI systems that redefine banking experiences. This role involves partnering with diverse teams to develop AI solutions, leveraging cutting-edge technologies. Requirements include a Bachelor...
Capital One National Association
New York, NY
5 days ago
Forward Deployed Engineer, Lead - AI Engineer
...nation states. Our team of AI researchers and company... ...exceptional technical lead to build and scale... ...Reflection’s Forward Deployed Engineering function within the AI... ...sophisticated LLM workflows and integrating... ...internal codebase for inference, fine‑tuning, and evaluation...
Relocation package
AI Chopping Block, Inc.
New York, NY
5 days ago
Lead AI Engineer: Architect LLM Pipelines for Security
A cybersecurity ratings leader in New York City is seeking a Senior AI Engineer to develop AI-powered product features. This hands-on role involves designing workflows, integrating AI, and leading code reviews. Ideal candidates will have 5+ years of software engineering...
SecurityScorecard Inc.
New York, NY
19 days ago
Senior Lead AI Engineer - Foundations & LLM Systems
Capital One National Association is seeking a Senior Lead AI Engineer in New York, NY to build and optimize AI systems that enhance customer interactions. This role involves partnering with cross-functional teams to deliver innovative AI solutions and driving the technical...
Capital One National Association
New York, NY
3 days ago
Senior Lead AI Engineer (GenAI Platform, Agentic Infrastructure)
$209k - $286.2k
...responsible and reliable AI systems, changing... ...talent position us to lead enterprises leveraging... ...art in science and AI engineering, building and deploying... ..., large language model inference, similarity search, guardrails... ...introduce state‑of‑the‑art LLM optimization techniques...
Local area
Capital One
New York, NY
2 days ago
AI/LLM Engineer (Agentic AI & Generative AI)
...Overview: Job Title: AI/LLM Engineer (Agentic AI & Generative AI) Location: New York, NY Job Type: Full-Time Experience: 8-10 Years Job Summary We are seeking an experienced AI/LLM Engineer to design, develop, and deploy enterprise-grade Generative...
Full time
Purple Drive
New York, NY
1 day ago
AI/LLM Engineer
$70 - $75 per hour
...Location: New York (Need to come to office for F2F Round and work from office 3 days a week) Requirement ID: 93120-1 Title: AI/LLM Engineer - Agent Frameworks & Intelligent Systems Experience Required: 8-10 Years Pay Range: $70 - $75/hr Duration: 6 months...
Work at office
3 days per week
Artech
New York, NY
5 days ago
Remote AI & LLM Engineer (Python)
Mad Devs LLC is seeking an AI Engineer to oversee the AI layer of our products. You will have end-to-end ownership of AI initiatives, tackling... ...and shaping AI-powered products. The role involves designing LLM pipelines and building production-grade Python services while...
Remote job
Mad Devs LLC
New York, NY
1 day ago
Lead Applied AI Engineer
...We're building cutting-edge LLM-powered tools that supercharge... ...professional services firms, and leading consulting practices:... ...Description As one of our first AI/ML hires, you'll have the unique... ...prioritization Shape our engineering culture We're live with 2...
Summer work
Work at office
10 hours per week
Junior Company
New York, NY
4 days ago
Applied AI Engineer
...deliver healthcare. Our AI sensing platform enables... ...for an Applied AI Engineer to take our growing collection... ...fleet Deploy across our inference surfaces: third-party APIs, self-hosted, and on-robot edge... ...pipelines (STT, TTS, streaming LLM) Production RAG systems...
Norbert Health
Brooklyn, NY
3 days ago
Lead AI/ML Engineer
$170k - $190k
...mission is simple: deliver the best AI-powered customer experience—... ...Job Description You will lead the design and delivery of end... ...handling, streaming inference, and audio quality, and can translate... ...leadership within the team, mentoring engineers and promoting best practices...
Remote work
ASAPP
New York, NY
2 days ago
GenAI/LLM Fine-tuning and Agentic AI Engineer
Generative AI/ LLM Fine-tuning and Agentic AI Engineer USA - Remote Full time (W2 only) Key Responsibilities LLM Customization: Design, execute, and manage the fine-tuning of Large Language Models (LLMs) for specific enterprise tasks and domain-specific knowledge. Agentic...
Full time
Remote work
Sesheng, LLC
New York, NY
3 days ago
AI Engineer: IP Intelligence & LLM Pipelines
Alumni Ventures is on the lookout for an AI Engineer to join our team in New York. In this role, you’ll develop AI/ML algorithms for cutting-edge IP applications and design robust LLM pipelines. We are seeking candidates with experience in deploying data science products...
Flexible hours
Alumni Ventures
New York, NY
3 days ago
Senior AI Engineer, LLM Code Generation & RAG
Framework Ventures is looking for a Senior AI Engineer to join their team. This role focuses on LLM-based code generation and involves optimizing large language models for their platform, Caffeine. Responsibilities include developing multi-agent architectures, optimizing...
Framework Ventures
New York, NY
3 days ago
Remote AI Engineer — Agentic AI & LLM Apps
A leading AI company is seeking an AI Engineer to design and ship end-to-end AI solutions for enterprise problems. The role involves building LLM-powered applications and requires 3+ years of hands-on AI/ML engineering experience. You will work alongside experts to ensure...
Remote job
Flexible hours
h2o.ai
New York, NY
3 days ago
Remote Clojure AI / LLM Engineer - Stylitics
We are looking for a versatile and experienced AI / LLM Data Engineer to join our team and help shape the future of how Stylitics leverages large language models. In this role, you’ll combine your expertise in data science, software engineering, and prompt design to drive...
Remote job
Work experience placement
WorksHub
New York, NY
3 days ago
AI Microservices Engineer: LLM/API Innovator
Side is looking for an AI Microservices Developer to create APIs leveraging Large Language Models and Generative AI for various projects... ...possess a Bachelor’s in Computer Science and have experience with Python-based REST APIs and LLM technologies. #J-18808-Ljbffr Side
Side
New York, NY
5 days ago
Senior AI Engineer for Equities: Scalable LLM Pipelines
A leading tech firm in New York is seeking a Senior AI Engineer to join its Equities Technology AI group. This role entails managing the design and optimization of... ...have a strong background in Python, experience with LLM frameworks, and a PhD or Master's with substantial...
Work experience placement
Talent Algo
New York, NY
3 days ago

Do you want to receive more vacancies?

Subscribe and receive similar vacancies to Lead AI Engineer (FM Hosting, LLM Inference). Be the first to apply!