Senior AI Infrastructure Engineer, Model Serving Platform
$216k - $270k
Scale AI
As a Software Engineer on the ML Infrastructure team, you will design and build platforms for scalable, reliable, and efficient serving of LLMs. Our platform powers cutting-edge research and production systems, supporting both internal and external use cases across various environments.
The ideal candidate combines strong ML fundamentals with deep expertise in backend system design. You'll work in a highly collaborative environment, bridging research and engineering to deliver seamless experiences to our customers and accelerate innovation across the company.
You will:
- Build and maintain fault-tolerant, high-performance systems for serving LLM workloads at scale.
- Build an internal platform to empower LLM capability discovery.
- Collaborate with researchers and engineers to integrate and optimize models for production and research use cases.
- Conduct architecture and design reviews to uphold best practices in system design and scalability.
- Develop monitoring and observability solutions to ensure system health and performance.
- Lead projects end-to-end, from requirements gathering to implementation, in a cross-functional environment.
Qualifications:
- 5+ years of experience building large-scale, high-performance backend systems.
- Strong programming skills in one or more languages (e.g., Python, Go, Rust, C++).
- Experience with LLM serving and routing fundamentals (e.g., rate limiting, token streaming, load balancing, budgets).
- Experience with LLM capabilities and concepts such as reasoning, tool calling, and prompt templates.
- Experience with containers and orchestration tools (e.g., Docker, Kubernetes).
- Familiarity with cloud infrastructure (AWS, GCP) and infrastructure as code (e.g., Terraform).
- Proven ability to solve complex problems and work independently in fast-moving environments.
- Experience with modern LLM serving frameworks such as vLLM, SGLang, TensorRT-LLM, or text-generation-inference.
Compensation packages at Scale for eligible roles include base salary, equity, and benefits. The range displayed on each job posting reflects the minimum and maximum target for new hire salaries for the position and may be inclusive of several career levels at Scale; it will be determined during the interview process based on work location and additional factors, including job-related skills, experience, qualifications, interview performance, and relevant education or training. Scale employees in eligible roles are also granted equity-based compensation, subject to Board of Directors approval. Your recruiter can share more about the specific salary range for your preferred location during the hiring process, and confirm whether the hired role will be eligible for an equity grant. You'll also receive benefits including, but not limited to: comprehensive health, dental and vision coverage, retirement benefits, a learning and development stipend, and generous PTO. Additionally, this role may be eligible for additional benefits such as a commuter stipend.
Please reference the job posting's subtitle for where this position will be located. For pay transparency purposes, the base salary range for this full-time position in the locations of San Francisco, New York, Seattle is: $216,000 - $270,000 USD.
PLEASE NOTE: Our policy requires a 90-day waiting period before reconsidering candidates for the same role. This allows us to ensure a fair and thorough evaluation of all applicants.
About Us:
At Scale, our mission is to develop reliable AI systems for the world's most important decisions. Our products provide the high-quality data and full-stack technologies that power the world's leading models, and help enterprises and governments build, deploy, and oversee AI applications that deliver real impact. We work closely with industry leaders like Meta, Cisco, DLA Piper, Mayo Clinic, Time Inc., the Government of Qatar, and U.S. government agencies including the Army and Air Force. We are expanding our team to accelerate the development of AI applications.
We believe that everyone should be able to bring their whole selves to work, which is why we are proud to be an inclusive and equal opportunity workplace. We are committed to equal employment opportunity regardless of race, color, ancestry, religion, sex, national origin, sexual orientation, age, citizenship, marital status, disability status, gender identity or Veteran status.
We are committed to working with and providing reasonable accommodations to applicants with physical and mental disabilities. If you need assistance and/or a reasonable accommodation in the application or recruiting process due to a disability, please contact us. Please see the United States Department of Labor's Know Your Rights poster for additional information.
We comply with the United States Department of Labor's Pay Transparency provision.
PLEASE NOTE: We collect, retain and use personal data for our professional business purposes, including notifying you of job opportunities that may be of interest and sharing with our affiliates. We limit the personal data we collect to that which we believe is appropriate and necessary to manage applicants' needs, provide our services, and comply with applicable laws. Any information we collect in connection with your application will be treated in accordance with our internal policies and programs designed to protect personal data. Please see our privacy policy for additional information.


