Senior Lead AI Engineer (FM Hosting, LLM Inference)

$229.9k - $262.4k
Full-time

Capital One

Overview:

At Capital One, we are creating responsible and reliable AI systems, changing banking for good. For years, Capital One has been an industry leader in using machine learning to create real-time, personalized customer experiences. Our investments in technology infrastructure and world-class talent, along with our deep experience in machine learning, position us to be at the forefront of enterprises leveraging AI. From informing customers about unusual charges to answering their questions in real time, our applications of AI & ML are bringing humanity and simplicity to banking. We are committed to continuing to build world-class applied science and engineering teams to deliver our industry-leading capabilities with breakthrough product experiences and scalable, high-performance AI infrastructure. At Capital One, you will help bring the transformative power of emerging AI capabilities to reimagine how we serve our customers and businesses who have come to love the products and services we build.

Team Description:

The Intelligent Foundations and Experiences (IFX) team is at the center of bringing our vision for AI at Capital One to life. We work hand-in-hand with our partners across the company to advance the state of the art in science and AI engineering, and we build and deploy proprietary solutions that are central to our business and deliver value to millions of customers. Our AI models and platforms empower teams across Capital One to enhance their products with the transformative power of AI, in responsible and scalable ways for the highest leverage impact.

In this role, you will:
  • Partner with a cross-functional team of engineers, research scientists, technical program managers, and product managers to deliver AI-powered products that change how our associates work and how our customers interact with Capital One.
  • Design, develop, test, deploy, and support AI software components, including foundation model training, large language model inference, similarity search, guardrails, model evaluation, experimentation, governance, and observability.
  • Leverage a broad stack of open source and SaaS AI technologies such as AWS UltraClusters, Hugging Face, VectorDBs, NeMo Guardrails, PyTorch, and more.
  • Invent and introduce state-of-the-art LLM optimization techniques to improve the performance (scalability, cost, latency, throughput) of large-scale production AI systems.
  • Contribute to the technical vision and the long-term roadmap of foundational AI systems at Capital One.

The Ideal Candidate:
  • You love to build systems, take pride in the quality of your work, and share our passion to do the right thing. You want to work on problems that will help change banking for good.
  • You are passionate about staying abreast of the latest research, and you can intuitively understand scientific publications and judiciously apply novel techniques in production.
  • You adapt quickly and thrive on bringing clarity to big, undefined problems. You love asking questions and digging deep to uncover the root of problems, and you can articulate your findings clearly and concisely. You have the courage to share new ideas even when they are unproven.
  • You are deeply technical. You possess a strong foundation in engineering and mathematics, and your expertise in hardware, software, and AI enables you to see and exploit optimization opportunities that others miss.
  • You are a resilient trailblazer who can forge new paths to achieve business goals when the route is unknown.

Basic Qualifications:
  • Bachelor's degree in Computer Science, AI, Electrical Engineering, Computer Engineering, or related fields plus at least 6 years of experience developing AI and ML algorithms or technologies, or a Master's degree in Computer Science, AI, Electrical Engineering, Computer Engineering, or related fields plus at least 4 years of experience developing AI and ML algorithms or technologies
  • At least 6 years of experience programming with Python, Go, Scala, or Java

Preferred Qualifications:
  • 7 years of experience deploying scalable and responsible AI solutions on cloud platforms (e.g. AWS, Google Cloud, Azure, or equivalent private cloud)
  • Experience designing, developing, integrating, delivering, and supporting complex AI systems
  • Demonstrated ability to lead and mentor an engineering team and influence cross-functional stakeholders
  • Experience developing AI and ML algorithms or technologies (e.g. LLM Inference, Similarity Search and VectorDBs, Guardrails, Memory) using Python, C++, C#, Java, or Golang
  • Experience developing and applying state-of-the-art techniques for optimizing training and inference software to improve hardware utilization, latency, throughput, and cost
  • Passion for staying abreast of the latest AI research and AI systems, and for judiciously applying novel techniques in production
  • Excellent communication and presentation skills, with the ability to articulate complex AI concepts to peers

Capital One will consider sponsoring a new qualified applicant for employment authorization for this position.

The minimum and maximum full-time annual salaries for this role are listed below, by location. Please note that this salary information is solely for candidates hired to perform work within one of these locations, and refers to the amount Capital One is willing to pay at the time of this posting. Salaries for part-time roles will be prorated based upon the agreed upon number of hours to be regularly worked.
  • Cambridge, MA: $229,900 - $262,400 for Sr. Lead AI Engineer
  • McLean, VA: $229,900 - $262,400 for Sr. Lead AI Engineer
  • New York, NY: $250,800 - $286,200 for Sr. Lead AI Engineer
  • San Jose, CA: $250,800 - $286,200 for Sr. Lead AI Engineer

Candidates hired to work in other locations will be subject to the pay range associated with that location, and the actual annualized salary amount offered to any candidate at the time of hire will be reflected solely in the candidate’s offer letter.

This role is also eligible to earn performance-based incentive compensation, which may include cash bonus(es) and/or long-term incentives (LTI). Incentives could be discretionary or non-discretionary depending on the plan.

Capital One offers a comprehensive, competitive, and inclusive set of health, financial and other benefits that support your total well-being. Learn more at the Capital One Careers website. Eligibility varies based on full or part-time status, exempt or non-exempt status, and management level.

This role is expected to accept applications for a minimum of 5 business days. No agencies please.

Capital One is an equal opportunity employer (EOE, including disability/vet) committed to non-discrimination in compliance with applicable federal, state, and local laws. Capital One promotes a drug-free workplace. Capital One will consider for employment qualified applicants with a criminal history in a manner consistent with the requirements of applicable laws regarding criminal background inquiries, including, to the extent applicable, Article 23-A of the New York Correction Law; San Francisco, California Police Code Article 49, Sections 4901-4920; New York City’s Fair Chance Act; Philadelphia’s Fair Criminal Records Screening Act; and other applicable federal, state, and local laws and regulations regarding criminal background inquiries.

If you have visited our website in search of information on employment opportunities or to apply for a position, and you require an accommodation, please contact Capital One Recruiting at View phone number on click.appcast.io or via email at View email address on click.appcast.io. All information you provide will be kept confidential and will be used only to the extent required to provide needed reasonable accommodations.

For technical support or questions about Capital One's recruiting process, please send an email to View email address on click.appcast.io.

Capital One does not provide, endorse, or guarantee, and is not liable for, third-party products, services, educational tools or other information available through this site.

Capital One Financial is made up of several different entities. Please note that any position posted in Canada is for Capital One Canada, any position posted in the United Kingdom is for Capital One Europe and any position posted in the Philippines is for Capital One Philippines Service Corp. (COPSSC).

At Capital One, we’re building a leading information-based technology company. Still founder-led by Chairman and Chief Executive Officer Richard Fairbank, Capital One is on a mission to help our customers succeed by bringing ingenuity, simplicity, and humanity to banking. We measure our efforts by the success our customers enjoy and the advocacy they exhibit.

Vacancy posted 7 hours ago