Sign up to access all features of our service.
  • Job search
  • Favorites
  • Create a CV
    New
  • Salaries
  • Subscriptions

Senior AI Infrastructure Engineer, Model Serving Platform

$216k - $270k

Scale AI

As a Software Engineer on the ML Infrastructure team, you will design and build platforms for scalable, reliable, and efficient serving of LLMs. Our platform powers cutting-edge research and production systems, supporting both internal and external use cases across various environments.

The ideal candidate combines strong ML fundamentals with deep expertise in backend system design. You'll work in a highly collaborative environment, bridging research and engineering to deliver seamless experiences to our customers and accelerate innovation across the company.

You will:
  • Build and maintain fault-tolerant, high-performance systems for serving LLMs workloads at scale.
  • Build an internal platform to empower LLM capability discovery.
  • Collaborate with researchers and engineers to integrate and optimize models for production and research use cases.
  • Conduct architecture and design reviews to uphold best practices in system design and scalability.
  • Develop monitoring and observability solutions to ensure system health and performance.
  • Lead projects end-to-end, from requirements gathering to implementation, in a cross-functional environment.


Ideally you'd have:
  • 5+ years of experience building large-scale, high-performance backend systems.
  • Strong programming skills in one or more languages (e.g., Python, Go, Rust, C++).
  • Experience with LLM serving and routing fundamentals (e.g. rate limiting, token streaming, load balancing, budgets, etc.)
  • Experience with LLM capabilities and concepts such as reasoning, tool calling, prompt templates, etc.
  • Experience with containers and orchestration tools (e.g., Docker, Kubernetes).
  • Familiarity with cloud infrastructure (AWS, GCP) and infrastructure as code (e.g., Terraform).
  • Proven ability to solve complex problems and work independently in fast-moving environments.


Nice to haves:
  • Experience with modern LLM serving frameworks such as vLLM, SGLang, TensorRT-LLM, or text-generation-inference.


Compensation packages at Scale for eligible roles include base salary, equity, and benefits. The range displayed on each job posting reflects the minimum and maximum target for new hire salaries for the position and may be inclusive of several career levels at Scale; it will be determined during the interview process based on work location and additional factors, including job-related skills, experience, qualifications, interview performance, and relevant education or training. Scale employees in eligible roles are also granted equity based compensation, subject to Board of Director approval. Your recruiter can share more about the specific salary range for your preferred location during the hiring process, and confirm whether the hired role will be eligible for equity grant. You'll also receive benefits including, but not limited to: comprehensive health, dental and vision coverage, retirement benefits, a learning and development stipend, and generous PTO. Additionally, this role may be eligible for additional benefits such as a commuter stipend.

Please reference the job posting's subtitle for where this position will be located. For pay transparency purposes, the base salary range for this full-time position in the locations of San Francisco, New York, Seattle is: $216,000—$270,000 USD

PLEASE NOTE: Our policy requires a 90-day waiting period before reconsidering candidates for the same role. This allows us to ensure a fair and thorough evaluation of all applicants.

About Us:

At Scale, our mission is to develop reliable AI systems for the world's most important decisions. Our products provide the high-quality data and full-stack technologies that power the world's leading models, and help enterprises and governments build, deploy, and oversee AI applications that deliver real impact. We work closely with industry leaders like Meta, Cisco, DLA Piper, Mayo Clinic, Time Inc., the Government of Qatar, and U.S. government agencies including the Army and Air Force. We are expanding our team to accelerate the development of AI applications.

We believe that everyone should be able to bring their whole selves to work, which is why we are proud to be an inclusive and equal opportunity workplace. We are committed to equal employment opportunity regardless of race, color, ancestry, religion, sex, national origin, sexual orientation, age, citizenship, marital status, disability status, gender identity or Veteran status.

We are committed to working with and providing reasonable accommodations to applicants with physical and mental disabilities. If you need assistance and/or a reasonable accommodation in the application or recruiting process due to a disability, please contact us at View email address on click.appcast.io. Please see the United States Department of Labor's Know Your Rights poster for additional information.

We comply with the United States Department of Labor's Pay Transparency provision .

PLEASE NOTE: We collect, retain and use personal data for our professional business purposes, including notifying you of job opportunities that may be of interest and sharing with our affiliates. We limit the personal data we collect to that which we believe is appropriate and necessary to manage applicants' needs, provide our services, and comply with applicable laws. Any information we collect in connection with your application will be treated in accordance with our internal policies and programs designed to protect personal data. Please see our privacy policy for additional information.

Vacancy posted 1 day ago
Similar jobs that could be interesting for youBased on the Senior AI Infrastructure Engineer, Model Serving Platform in New York, NY vacancy
  •  ...Scientific Data and AI company. We are catalyzing...  ...cloud, data, and AI infrastructure have converged on...  ...We’re looking for a Senior AI Platform Engineer to help design, build...  ...ingest, transform, and serve data for ML and analytics...  ...‑efficiency of AI models in production. Drive... 
    Platform
    Senior
    Immediate start
    Remote work
    Flexible hours

    TetraScience

    New York, NY
    2 days ago
  •  ...Description Job Title: Senior AI Platform Engineer (Kubernetes & LLMOps)...  ...engineering role focused on infrastructure, scalability, and developer...  ...pipelines, and fine-tuned models on OpenShift. Agentic...  ...management, air-gapped model serving, and immutable audit logs.... 
    Platform
    Senior

    Pelham Berkeley Search

    New York, NY
    3 days ago
  • $160k - $235k

     ...Senior AI Engineer, AI Platform Affinity stitches together billions of data points from massive datasets...  ...with document chunking, embedding models, and context window optimization ~...  ...information from unstructured data, serving embedding models to vectorize chunks,... 
    Platform
    Senior
    Work at office
    Remote work
    Worldwide
    Flexible hours
    2 days per week
    3 days per week

    Affinity

    New York, NY
    1 day ago
  • $110k - $140k

     ...high‑performance cloud infrastructure easy to use,...  ...accessible for enterprises and AI innovators around the...  ...skilled and experienced AI Platform Engineer to own the strategy...  ...curate open‑source models — Llama, Mistral, Qwen...  ...the AI platform layer serves all teams without becoming... 
    Platform
    Senior
    Work at office
    Immediate start
    Remote work
    Flexible hours

    Vultr

    New York, NY
    2 days ago
  • $229.9k - $262.4k

     ...Senior Lead AI Engineer (Gen AI Platform Services, Agentic AI) Overview: At Capital One, we...  ...investments in technology infrastructure and world-class talent —...  ...capabilities to reimagine how we serve our customers and...  ...millions of customers. Our AI models and platforms empower... 
    Platform
    Senior
    Full time
    Part time
    Local area

    Capital One

    New York, NY
    4 days ago
  • $229.9k - $262.4k

     ...Senior Lead AI Engineer (Gen AI Platform Services) Overview: At Capital One, we are creating...  ...in technology infrastructure and world-class talent —...  ...capabilities to reimagine how we serve our customers and businesses...  ...millions of customers. Our AI models and platforms empower... 
    Platform
    Senior
    Full time
    Part time
    Local area

    Capital One

    New York, NY
    1 day ago
  • $229.9k - $262.4k

    Senior Lead AI Engineer (Gen AI Platform Services, Agentic AI) Overview: At Capital One, we...  ...in technology infrastructure and world-class talent —...  ...capabilities to reimagine how we serve our customers and businesses...  ...millions of customers. Our AI models and platforms empower... 
    Platform
    Senior
    Full time
    Part time
    Local area
    Immediate start

    Capital One

    New York, NY
    10 hours ago
  • $229.9k - $262.4k

     ...Sr. Lead AI Engineer (GenAI Platform) Overview: At Capital One, we are creating...  ...investments in technology infrastructure and world-class talent -...  ...to reimagine how we serve our customers and businesses...  ...millions of customers. Our AI models and platforms empower... 
    Platform
    Senior
    Full time
    Part time
    Local area

    Capital One

    New York, NY
    8 hours ago
  • $229.9k - $262.4k

    Sr. Lead AI Engineer (GenAI Platform) Overview At Capital One, we are creating responsible...  ...investments in technology infrastructure and world‑class talent —...  ...to reimagine how we serve our customers and...  ...millions of customers. Our AI models and platforms empower teams... 
    Platform
    Senior
    Local area

    COMFORT SYSTEMS

    New York, NY
    5 days ago
  •  ...Neurawork GmbH & Co. KG sucht einen erfahrenen Engineer zur Entwicklung unserer AI-Agent-Plattform für regulierte Branchen. Du wirst an einer performanceorientierten Lösung arbeiten und maßgeblich an der Architektur beteiligt sein. Voll-remote ist möglich, allerdings ist... 
    Platform
    Senior
    Remote work

    Neurawork GmbH & Co. KG

    New York, NY
    1 day ago
  • A forward-thinking AI company is seeking a Senior AI Platform Engineer to lead the design and delivery of intelligent workflows. The role entails technical leadership, hands-on development of AI systems, and collaboration with product management. Ideal candidates have over... 
    Platform
    Senior
    Remote work

    Infinity

    New York, NY
    4 days ago
  • $180k - $200k

     ...Modern Campus is looking for a Senior Director of Development to oversee engineering execution and technical leadership across its product lines, including the Connected Curriculum. The role involves leading multiple teams, ensuring high quality and timely delivery of... 
    Platform
    Senior
    Remote work

    Modern Campus

    New York, NY
    2 days ago
  •  ...Supernal is seeking a Senior AI Platform Engineer to lead software delivery for AI Employee implementations. As part of this remote role, you will design and build core software systems while ensuring technical delivery and client success. The ideal candidate has at least... 
    Platform
    Senior
    Remote work

    Infinity

    New York, NY
    3 days ago
  • $220k - $240k

     ...A leading healthcare technology firm is seeking a Senior Engineer Manager to lead innovative AI-driven solutions impacting U.S. healthcare. The ideal candidate will have extensive technical leadership and backend development experience, particularly in Java or Rust. This... 
    Platform
    Senior
    Remote work

    The Rawlings Group

    New York, NY
    2 days ago
  •  ...A leading optimization technology firm in the United States is seeking a Senior AI Engineer to design and implement AI agents for optimization applications. The ideal candidate will have over 5 years of experience as a software engineer and expertise in prompt engineering... 
    Platform
    Senior

    Gurobi Optimization

    New York, NY
    2 days ago
  •  ...Provectus is seeking a senior hands-on engineer to take ownership of a sophisticated AI platform in production. This role entails maintaining and extending a complex system...  .... This position offers a remote-first work model, annual performance bonuses, and comprehensive... 
    Platform
    Senior
    Remote work

    Provectus

    New York, NY
    2 days ago
  •  ...Jitterbit is seeking a Senior AI Engineer to join our innovative team in the United States. The role focuses on building AI capabilities on a cutting-edge platform, using technologies like LLM and Azure AI. Applicants should have extensive experience in cloud-based application... 
    Platform
    Senior
    Remote work

    Jitterbit

    New York, NY
    2 days ago
  •  ...A leading AI procurement firm is seeking a Senior AI Platform Engineer to build and scale their core platform. The role requires strong backend development skills, proficiency in Node.js and React, and experience with AI systems. You'll work alongside founding engineers... 
    Platform
    Senior
    Remote work

    Negotiateai Inc.

    New York, NY
    2 days ago
  •  ...A leading crypto exchange is seeking a Senior Engineer to build a high-performance AI Service Platform. The role includes designing architectures for real-time fraud detection, developing middleware for AI services, and ensuring system reliability in volatile markets.... 
    Platform
    Senior

    Framework Ventures

    New York, NY
    2 days ago
  •  ...Smartsheet Inc. is seeking an AI Platform Engineer to lead the design of the core infrastructure for AI experiences. Candidates should have 8+ years of software engineering experience, particularly with LLMs, along with strong Python skills and experience with prompt... 
    Platform
    Senior
    Full time
    Remote work

    Smartsheet

    New York, NY
    2 days ago
  • A leading AI company is seeking a Senior AI Platform Engineer to lead the development of voice-first AI agents. This remote role requires experience in deploying complex systems and managing client deliveries. Responsibilities include building agent workflows, handling... 
    Platform
    Senior
    Remote work

    Infinity Constellation

    New York, NY
    4 days ago
  •  ...Cyclotron, Inc. is seeking a Sr. AI Governance Engineer to lead the governance of Microsoft AI and automation platforms. This role is client-facing and requires effective management of platform strategies and governance frameworks. The ideal candidate will have over 5... 
    Platform
    Senior
    Remote work

    Cyclotron, Inc.

    New York, NY
    3 days ago
  • A technology consulting firm in the United States seeks a Cloud Platform Engineer to enhance AI models by evaluating their performance and logic. This remote position requires proficiency in at least one programming language and a detail-oriented mindset. Responsibilities... 
    Platform
    Remote work
    Flexible hours

    DataAnnotation

    New York, NY
    8 hours ago
  • $30 per hour

    A technology company specializing in AI training is seeking a Cloud Platform Engineer to join their team. The position is remote and focuses on training and evaluating AI chatbots. The ideal candidate should be proficient in programming languages like JavaScript, Python... 
    Platform
    Hourly pay
    Remote work
    Flexible hours

    DataAnnotation

    Brooklyn, NY
    8 hours ago
  •  ...A leading education technology firm is seeking a Senior AI Software Engineer to design and deploy AI-enabled applications. This fully remote role involves developing scalable solutions using Python and AWS, collaborating with cross-functional teams to enhance learning... 
    Platform
    Senior
    Remote work

    Harnham

    New York, NY
    2 days ago
  • $148k - $216k

     ...Life360 is hiring a Senior Software Engineer II to enhance mobile software engineering with AI tools. This remote position focuses on developing infrastructure that integrates AI to improve quality and efficiency in the development lifecycle. The ideal candidate will help... 
    Platform
    Senior
    Remote work

    Life360

    New York, NY
    2 days ago
  • $150k - $180k

     ...A technology consulting firm in the United States is seeking a Senior Software Engineer to lead innovative projects in AI and agent-based systems. The role involves integrating LLMs with various platforms and APIs while working in a remote-first environment. Ideal candidates... 
    Platform
    Senior
    Remote work

    Effectual Services

    New York, NY
    2 days ago
  •  ...A leading optimization technology firm in the United States is seeking a Senior AI Engineer to enhance its platform with AI agents and machine learning. The ideal candidate will have 5+ years of software engineering experience, proficiency in languages such as Python... 
    Platform
    Senior

    Medium

    New York, NY
    2 days ago
  •  ...Granum is seeking an experienced AI Engineer to help reshape the future of their products through AI-driven capabilities. This role involves designing autonomous AI agents and architecting systems that enable seamless AI integration throughout the organization. Ideal candidates... 
    Platform
    Senior

    Granum

    New York, NY
    2 days ago
  •  ...A leading scientific data and AI company in the United States is seeking a Senior AI Platform Engineer to design and maintain cloud-based AI infrastructure. This role involves building scalable MLOps pipelines and collaborating with cross-functional teams to enhance AI... 
    Platform
    Senior
    Remote work

    TetraScience

    New York, NY
    2 days ago

Do you want to receive more vacancies?

Subscribe and receive similar vacancies to Senior AI Infrastructure Engineer, Model Serving Platform. Be the first to apply!