Lead AI Engineer (FM Hosting, LLM Inference)
$197.3k - $225.1k
Capital One Financial Corp
Overview

At Capital One, we are creating responsible and reliable AI systems, changing banking for good. For years, Capital One has been an industry leader in using machine learning to create real-time, personalized customer experiences. Our investments in technology infrastructure and world-class talent - along with our deep experience in machine learning - position us to be at the forefront of enterprises leveraging AI. From informing customers about unusual charges to answering their questions in real time, our applications of AI & ML are bringing humanity and simplicity to banking. We are committed to continuing to build world-class applied science and engineering teams to deliver our industry leading capabilities with breakthrough product experiences and scalable, high-performance AI infrastructure. At Capital One, you will help bring the transformative power of emerging AI capabilities to reimagine how we serve our customers and businesses who have come to love the products and services we build.

Team Description:

The Intelligent Foundations and Experiences (IFX) team is at the center of bringing our vision for AI at Capital One to life. We work hand-in-hand with our partners across the company to advance the state of the art in science and AI engineering, and we build and deploy proprietary solutions that are central to our business and deliver value to millions of customers. Our AI models and platforms empower teams across Capital One to enhance their products with the transformative power of AI, in responsible and scalable ways for the highest leverage impact.

In this role, you will:
- Partner with a cross-functional team of engineers, research scientists, technical program managers, and product managers to deliver AI-powered products that change how our associates work and how our customers interact with Capital One.
- Design, develop, test, deploy, and support AI software components including foundation model training, large language model inference, similarity search, guardrails, model evaluation, experimentation, governance, and observability, etc.
- Leverage a broad stack of open-source and SaaS AI technologies such as AWS Ultraclusters, Hugging Face, VectorDBs, NeMo Guardrails, PyTorch, and more.
- Invent and introduce state-of-the-art LLM optimization techniques to improve the performance (scalability, cost, latency, throughput) of large-scale production AI systems.
- Contribute to the technical vision and the long-term roadmap of foundational AI systems at Capital One.

The ideal candidate:
- You love to build systems, take pride in the quality of your work, and also share our passion to do the right thing. You want to work on problems that will help change banking for good.
- You are passionate about staying abreast of the latest research, and you can intuitively understand scientific publications and judiciously apply novel techniques in production.
- You adapt quickly and thrive on bringing clarity to big, undefined problems. You love asking questions and digging deep to uncover the root of problems and can articulate your findings concisely with clarity. You have the courage to share new ideas even when they are unproven.
- You are deeply technical. You possess a strong foundation in engineering and mathematics, and your expertise in hardware, software, and AI enables you to see and exploit optimization opportunities that others miss.
- You are a resilient trailblazer who can forge new paths to achieve business goals when the route is unknown.

Qualifications:
- Bachelor's degree in Computer Science, AI, Electrical Engineering, Computer Engineering, or related fields plus at least 4 years of experience developing AI and ML algorithms or technologies, or a Master's degree in Computer Science, AI, Electrical Engineering, Computer Engineering, or related fields plus at least 2 years of experience developing AI and ML algorithms or technologies
- At least 4 years of experience programming with Python, Go, Scala, or Java
- 6 years of experience deploying scalable and responsible AI solutions on cloud platforms (e.g. AWS, Google Cloud, Azure, or equivalent private cloud)
- Experience designing, developing, delivering, and supporting AI services
- Experience developing AI and ML algorithms or technologies (e.g. LLM Inference, Similarity Search and VectorDBs, Guardrails, Memory) using Python, C++, C#, Java, or Golang
- Experience developing and applying state-of-the-art techniques for optimizing training and inference software to improve hardware utilization, latency, throughput, and cost
- Passion for staying abreast of the latest AI research and AI systems, and for judiciously applying novel techniques in production
Vacancy posted 1 day ago

