Sign up to access all features of our service.
  • Job search
  • Favorites
  • Create a CV
    New
  • Salaries
  • Subscriptions

Sr. Lead AI Engineer (Inference Optimization, FM hosting, AI Platform)

$229.9k - $262.4k
Full-time

Capital One

Sr. Lead AI Engineer (Inference Optimization, FM hosting, AI Platform) Overview: At Capital One, we are creating responsible and reliable AI systems, changing banking for good. For years, Capital One has been an industry leader in using machine learning to create real-time, personalized customer experiences. Our investments in technology infrastructure and world-class talent — along with our deep experience in machine learning — position us to be at the forefront of enterprises leveraging AI. From informing customers about unusual charges to answering their questions in real time, our applications of AI & ML are bringing humanity and simplicity to banking. We are committed to continuing to build world-class applied science and engineering teams to deliver our industry leading capabilities with breakthrough product experiences and scalable, high-performance AI infrastructure. At Capital One, you will help bring the transformative power of emerging AI capabilities to reimagine how we serve our customers and businesses who have come to love the products and services we build. Team Description: The Intelligent Foundations and Experiences (IFX) team is at the center of bringing our vision for AI at Capital One to life. We work hand-in-hand with our partners across the company to advance the state of the art in science and AI engineering, and we build and deploy proprietary solutions that are central to our business and deliver value to millions of customers. Our AI models and platforms empower teams across Capital One to enhance their products with the transformative power of AI, in responsible and scalable ways for the highest leverage impact. In this role, you will: Partner with a cross-functional team of engineers, research scientists, technical program managers, and product managers to deliver AI-powered products that change how our associates work and how our customers interact with Capital One. Design, develop, test, deploy, and support AI software components including foundation model training, large language model inference, similarity search, guardrails, model evaluation, experimentation, governance, and observability, etc. Leverage a broad stack of Open Source and SaaS AI technologies such as AWS Ultraclusters, Huggingface, VectorDBs, Nemo Guardrails, PyTorch, and more. Invent and introduce state-of-the-art LLM optimization techniques to improve the performance — scalability, cost, latency, throughput — of large scale production AI systems. Contribute to the technical vision and the long term roadmap of foundational AI systems at Capital One. The Ideal Candidate: You love to build systems, take pride in the quality of your work, and also share our passion to do the right thing. You want to work on problems that will help change banking for good. Passion for staying abreast of the latest research, and an ability to intuitively understand scientific publications and judiciously apply novel techniques in production. You adapt quickly and thrive on bringing clarity to big, undefined problems. You love asking questions and digging deep to uncover the root of problems and can articulate your findings concisely with clarity. You have the courage to share new ideas even when they are unproven. You are deeply Technical. You possess a strong foundation in engineering and mathematics, and your expertise in hardware, software, and AI enable you to see and exploit optimization opportunities that others miss. You are a resilient trail blazer who can forge new paths to achieve business goals when the route is unknown. Basic Qualifications: Bachelor's degree in Computer Science, AI, Electrical Engineering, Computer Engineering, or related fields plus at least 6 years of experience developing AI and ML algorithms or technologies, or a Master's degree in Computer Science, AI, Electrical Engineering, Computer Engineering, or related fields plus at least 4 years of experience developing AI and ML algorithms or technologies At least 6 years of experience programming with Python, Go, Scala, or Java Preferred Qualifications: 7 years of experience deploying scalable and responsible AI solutions on cloud platforms (e.g. AWS, Google Cloud, Azure, or equivalent private cloud) Experience designing, developing, integrating, delivering, and supporting complex AI systems Demonstrated ability to lead and mentor an engineering team and influence cross-functional stakeholders Experience developing AI and ML algorithms or technologies (e.g. LLM Inference, Similarity Search and VectorDBs, Guardrails, Memory) using Python, C++, C#, Java, or Golang Experience developing and applying state-of-the-art techniques for optimizing training and inference software to improve hardware utilization, latency, throughput, and cost Passion for staying abreast of the latest AI research and AI systems, and judiciously apply novel techniques in production Excellent communication and presentation skills, with the ability to articulate complex AI concepts to peers Capital One will consider sponsoring a new qualified applicant for employment authorization for this position. The minimum and maximum full-time annual salaries for this role are listed below, by location. Please note that this salary information is solely for candidates hired to perform work within one of these locations, and refers to the amount Capital One is willing to pay at the time of this posting. Salaries for part-time roles will be prorated based upon the agreed upon number of hours to be regularly worked. Cambridge, MA: $229,900 - $262,400 for Sr. Lead AI Engineer McLean, VA: $229,900 - $262,400 for Sr. Lead AI Engineer New York, NY: $250,800 - $286,200 for Sr. Lead AI Engineer San Francisco, CA: $250,800 - $286,200 for Sr. Lead AI Engineer San Jose, CA: $250,800 - $286,200 for Sr. Lead AI Engineer Candidates hired to work in other locations will be subject to the pay range associated with that location, and the actual annualized salary amount offered to any candidate at the time of hire will be reflected solely in the candidate’s offer letter. This role is also eligible to earn performance based incentive compensation, which may include cash bonus(es) and/or long term incentives (LTI). Incentives could be discretionary or non discretionary depending on the plan. Capital One offers a comprehensive, competitive, and inclusive set of health, financial and other benefits that support your total well-being. Learn more at the Capital One Careers website. Eligibility varies based on full or part-time status, exempt or non-exempt status, and management level. This role is expected to accept applications for a minimum of 5 business days. No agencies please. Capital One is an equal opportunity employer (EOE, including disability/vet) committed to non-discrimination in compliance with applicable federal, state, and local laws. Capital One promotes a drug-free workplace. Capital One will consider for employment qualified applicants with a criminal history in a manner consistent with the requirements of applicable laws regarding criminal background inquiries, including, to the extent applicable, Article 23-A of the New York Correction Law; San Francisco, California Police Code Article 49, Sections 4901-4920; New York City’s Fair Chance Act; Philadelphia’s Fair Criminal Records Screening Act; and other applicable federal, state, and local laws and regulations regarding criminal background inquiries. If you have visited our website in search of information on employment opportunities or to apply for a position, and you require an accommodation, please contact Capital One Recruiting at View phone number on click.appcast.io or via email at View email address on click.appcast.io. All information you provide will be kept confidential and will be used only to the extent required to provide needed reasonable accommodations. For technical support or questions about Capital One's recruiting process, please send an email to View email address on click.appcast.io Capital One does not provide, endorse nor guarantee and is not liable for third-party products, services, educational tools or other information available through this site. Capital One Financial is made up of several different entities. Please note that any position posted in Canada is for Capital One Canada, any position posted in the United Kingdom is for Capital One Europe and any position posted in the Philippines is for Capital One Philippines Service Corp. (COPSSC). At Capital One, we’re building a leading information-based technology company. Still founder-led by Chairman and Chief Executive Officer Richard Fairbank, Capital One is on a mission to help our customers succeed by bringing ingenuity, simplicity, and humanity to banking. We measure our efforts by the success our customers enjoy and the advocacy they exhibit.

Vacancy posted 1 day ago
Similar jobs that could be interesting for youBased on the Sr. Lead AI Engineer (Inference Optimization, FM hosting, AI Platform) in New York, NY vacancy
  • $197.3k - $225.1k

     ...Lead AI Engineer (FM Hosting, LLM Inference) Overview At Capital One, we are creating responsible and...  ...of customers. Our AI models and platforms empower teams across Capital One...  ...and introduce state-of-the-art LLM optimization techniques to improve the performance... 
    Platform
    Full time
    Part time
    Local area

    Capital One Financial Corp

    New York, NY
    3 days ago
  • $229.9k - $262.4k

     ...Sr. Lead AI Engineer (AI Foundations) Overview: At Capital One, we...  ...customers. Our AI models and platforms empower teams across...  ...training, large language model inference, similarity search, guardrails...  ...state-of-the-art LLM optimization techniques to improve the performance... 
    Platform
    Senior
    Full time
    Part time
    Local area

    Capital One

    New York, NY
    3 days ago
  • $229.9k - $262.4k

    Sr. Lead AI Engineer (GenAI Platform) Overview At Capital One, we are creating responsible and reliable AI...  ...model training, large language model inference, similarity search, guardrails,...  ...and introduce state‑of‑the‑art LLM optimization techniques to improve the performance... 
    Platform
    Senior
    Local area

    COMFORT SYSTEMS

    New York, NY
    3 days ago
  •  ...Senior AI Engineer — Inference & Agent Systems United States Title: Applied AI Engineer — Inference...  ...correct. The Work Inference Optimization Drive TTFT below 400ms for multi-step...  ...shipped inference systems at An agent platform (any domain) Or you've built eval/harness... 
    Platform
    Senior

    Arcana Analytics Inc.

    New York, NY
    4 days ago
  • $165k - $290k

     ...Overview AI Lead Engineer responsible for shaping and executing the AI vision...  ...fit for purpose and optimize ROI for the business. Build...  ...model orchestration tools, inference frameworks, cloud‑native ML...  ...science. Expertise with cloud platforms (AWS, GCP, or Azure), API architecture... 
    Platform
    Local area
    Remote work

    Commerce.com US, Inc.

    New York, NY
    4 days ago
  •  ...execution, leveraging AI, data, and...  ...AI Automation to optimize productivity, streamline...  ...data pipelines, host and deploy working...  ...alongside an AI Strategy Lead who defines the...  ...tools, and AI platforms Host and deploy solutions...  ...including prompt engineering, API integrations,... 
    Platform
    Freelance
    Remote work

    Hiretruss

    New York, NY
    22 hours ago
  • $197.3k - $225.1k

     ...Lead AI Engineer At Capital One, we are creating responsible and reliable...  .... Our AI models and platforms empower teams across Capital...  ...training, large language model inference, similarity search,...  ...introduce state-of-the-art LLM optimization techniques to improve the performance... 
    Platform
    Full time
    Part time
    Local area

    Capital One

    New York, NY
    1 day ago
  •  ...A leading optimization technology firm in the United States is seeking a Senior AI Engineer to design and implement AI agents for optimization applications. The ideal candidate will have over 5 years of experience as a software engineer and expertise in prompt engineering... 
    Platform
    Senior

    Gurobi Optimization

    New York, NY
    22 hours ago
  • $197.3k - $225.1k

     ...Lead AI Engineer (AI Foundations, LLM Core and Agentic AI) Overview...  ...customers. Our AI models and platforms empower teams across Capital...  ..., large language model inference, similarity search, guardrails...  ...introduce state-of-the-art LLM optimization techniques to improve the... 
    Platform
    Full time
    Part time
    Local area

    Capital One

    New York, NY
    22 hours ago
  • $133.37k - $156.9k

     ...Job Description The AI Red Team Lead Engineer leads the execution and evolution...  ...focused on AI/ML systems, platforms, and integrations, in...  ...Training, evaluation, and inference pipelines Data ingestion...  ...cloud, containerized, and AI-hosting environments Proficiency... 
    Platform
    Temporary work
    Work experience placement
    Local area
    3 days per week

    U.S. Bank

    New York, NY
    3 days ago
  •  ...A leading optimization technology firm in the United States is seeking a Senior AI Engineer to enhance its platform with AI agents and machine learning. The ideal candidate will have 5+ years of software engineering experience, proficiency in languages such as Python or... 
    Platform
    Senior

    Medium

    New York, NY
    22 hours ago
  • $170k - $190k

     ...deliver the best AI-powered customer experience...  .... You will lead the design and...  ...handling, streaming inference, and audio quality...  ...Design and optimize low-latency inference...  ...the team, mentoring engineers and promoting best...  ..., experimentation platforms, or evaluation frameworks... 
    Platform
    Remote work

    ASAPP

    New York, NY
    4 days ago
  •  ...Mistral At Mistral AI, we believe in the...  ...high-performance, optimized, open-source and...  ...comprehensive AI platform is designed to...  ...The Job: Technical Lead, Applied AI Mistral...  ...of Applied AI Engineers, ensuring the...  ...for fine-tuning, inference, and deployment.... 
    Platform
    Work at office
    Visa sponsorship

    Mistral AI

    New York, NY
    4 days ago
  • $172.5k - $260.1k

     ...Category Software Engineering Job Details...  ...Salesforce is the #1 AI CRM, where humans...  ...career at the company leading workforce...  ...continuously evaluate, optimize, and improve agent...  ..., evaluation, and inference Transform raw interaction...  ...Collaborate with platform teams while owning... 
    Platform

    Salesforce.Com Inc

    New York, NY
    22 hours ago
  • $97.59k - $140k

     ...Sr. AI Engineer IT/Health IT/Informatics New York, NY • Full-Time...  ...technology to deploy, monitor, and optimize large language models and...  ...capabilities into the RPM platform and related systems. Job...  ...performance, and cost. Optimize inference performance and cost... 
    Platform
    Senior
    Full time
    Remote work

    NYU Langone Health

    New York, NY
    3 days ago
  • $197.3k - $225.1k

    Lead AI Engineer (AI Foundations, LLM Core and Agentic AI) Overview At...  ...customers. Our AI models and platforms empower teams across...  ...training, large language model inference, similarity search, guardrails...  ...state‑of‑the‑art LLM optimization techniques to improve the performance... 
    Platform
    Full time
    Local area

    Capital One National Association

    New York, NY
    22 hours ago
  •  ...A leading cloud technology company in the United States seeks an ML Performance Engineer Principal Lead to optimize inference performance across its platforms. The role involves evaluating techniques like quantization and hardware-aware scheduling. Ideal candidates will... 
    Platform

    Akamai

    New York, NY
    22 hours ago
  • $160k - $240k

     ...Senior Software Engineer - AI Inference Location New York...  ...Bloomberg AI Inference Platform provides production-grade...  ...infrastructure for hosting, deploying, and...  ...inference workloads. Lead integration of high-performance...  ..., and performance optimization. You'll need to... 
    Platform
    Senior
    Temporary work
    For contractors
    Work experience placement

    Bloomberg

    New York, NY
    4 days ago
  • $300k

     ...interpretable, and steerable AI systems. We want AI to...  ...researchers, engineers, policy experts, and business...  .... About The Role Our Inference team builds and...  ...running in multiple cloud platforms. Representative...  ...routing algorithms that optimize request distribution across... 
    Platform
    Senior
    Worldwide

    Menlo Ventures

    New York, NY
    22 hours ago
  •  ...About the job Lead AI Engineer Role : Lead AI & Analytics Engineer US Remote...  ...including orchestration, data modeling, and platforms such as Databricks, Snowflake, or GCP...  ...from multiple sources Develop and optimize batch and near real-time data... 
    Platform
    Remote work

    Inizio Partners

    New York, NY
    22 hours ago
  •  ...Lead AI Engineer for Banking Technology Discover your future at Citi Working at Citi is...  ...agentic workflows, and generative AI platforms to improve efficiency, automation, simplification...  ...fast time‑to‑value. Evaluation & Optimization: Define rigorous evaluation frameworks... 
    Platform

    Citi

    New York, NY
    4 days ago
  •  ...please read ourSenior Software Engineer page is loaded## Senior...  ...production-grade components for AI agents—tools, orchestration logic...  .... \* \*\*Performance Optimization:\*\* Profile and optimize agent...  ...and the OutSystems low-code platform, ensuring seamless data flow... 
    Platform
    Remote work
    Worldwide

    OutSystems

    New York, NY
    22 hours ago
  • $94.64k - $169.52k

     ...research and treatment. We are leading the way the world...  .... We are looking for a Lead AI Engineer to join our Analytics Services...  ...tools aimed at automating and optimizing clinical workflows, operational...  ...operational, and administrative platforms. Translate institutional... 
    Platform
    Remote work

    St. Jude Children's Research Hospital

    New York, NY
    22 hours ago
  • $176.72k - $265.08k

     ...highly skilled and pragmatic AI Lead to design, develop, and...  ...machine learning, software engineering and management best practices...  ...Implement LLM integration using platforms like OpenAI, Anthropic, and...  ...Task Automation & Workflow Optimization: Develop agents that can automate... 
    Platform
    Full time

    Citigroup Inc

    New York, NY
    5 days ago
  •  ...partner with a global logistics company leveraging AI, Machine Learning, and Data Engineering to optimize warehouse operations, predictive maintenance, and route...  ...and ML infrastructure . Experience in cloud AI platforms (AWS Sagemaker, GCP AI Platform, Azure ML). Experience... 
    Platform
    Long term contract
    Remote work

    Sphere Software

    New York, NY
    22 hours ago
  •  ...professional services firms, and leading consulting practices:...  ...As one of our first AI/ML hires, you'll have...  ...Shape our engineering culture We're live...  ...thousands of calls on our platform. There's lots of data...  ...gathering / logging → prompt optimizing Have a strong bias... 
    Platform
    Summer work
    Work at office
    10 hours per week

    Junior Company

    New York, NY
    1 day ago
  •  ...Overview Lead AI Product Engineer. Location: United States (Remote) - Only US citizen or Permanent...  ...to integrate AI capabilities into our platforms. Develop and refine prompt strategies...  ...product goals into actionable tasks optimized for both human and AI execution. Conduct... 
    Platform
    Permanent employment
    Remote work

    Mitratech

    New York, NY
    22 hours ago
  • $40 per hour

     ...Senior AI Engineer Doctronic | New York City ONSITE...  ...performance Real-time inference systems handling...  ..., fine-tuning, model optimization for specialized domains...  ...Access to leading AI technologies and medical...  ...certified clinicians. Our platform (doctronic.ai)... 
    Platform
    Senior

    Doctronic Inc

    New York, NY
    22 hours ago
  •  ...Senior Data Scientist/Lead AI Engineer Immediate need for a talented Senior Data Scientist/Lead AI Engineer. This is a fulltime opportunity...  ...~ Familiarity with tools like MLflow, Vertex AI, or similar platforms ~ Cloud & Data Platforms: ~ Experience with GCP (... 
    Platform
    Senior
    Full time
    Immediate start

    Pyramid Consulting

    New York, NY
    3 days ago
  • $170.8k - $234.8k

     ...community The Enterprise AI organization at Humana...  ...with world-leading experts, we are at the...  ...managers, data scientists, engineers, policy experts, and business...  ...experiences, and optimizing operational efficiency...  ..., scalable healthcare platforms. These platforms... 
    Platform
    Bi-weekly pay
    Full time
    Temporary work
    Apprenticeship
    Casual work
    Work at office
    Work from home
    Home office

    Humana

    New York, NY
    3 days ago

Do you want to receive more vacancies?

Subscribe and receive similar vacancies to Sr. Lead AI Engineer (Inference Optimization, FM hosting, AI Platform). Be the first to apply!