Founding Machine Learning Infrastructure Engineer

Model AI

Location: Onsite in Palo Alto
Compensation: Competitive Salary + Equity

About Model AI

Model AI is building the infrastructure and application stack for the next generation of agentic AI systems. We believe token usage will grow exponentially over the coming years, but routing all inference through closed model providers will remain too expensive for many users and enterprises. Our thesis is that agentic applications require a vertically integrated stack: high-throughput, cost-efficient serving infrastructure paired with an application layer designed for long-running, agentic workloads.

Model AI is building the Agent Cloud, a serving and training infrastructure platform purpose-built for agentic workloads, long-context inference, and large-scale open-source model deployment. By combining infrastructure and application design, we aim to make open-source models significantly more performant, practical, and competitive.

About This Role

We are looking for an ML Systems Engineer to help build and optimize the core serving infrastructure behind Agent Cloud. This role focuses on high-performance inference across different accelerators. You will work on model serving performance, accelerator utilization, long-context inference, batching, scheduling, KV cache management, runtime efficiency, and cost reduction. This is a deeply technical role at the intersection of ML systems, infrastructure, and product.

Direct TPU experience is a strong plus, but not required. We care most about strong ML systems fundamentals, performance intuition, and the ability to ship reliable systems quickly.

What You'll Do

Optimize large-scale LLM inference and serving systems.
Improve total tokens per second, decode tokens per second, latency, throughput, and cost efficiency.
Work on serving infrastructure for open-source models across different types of accelerators.
Improve batching, scheduling, KV cache management, memory usage, and accelerator utilization.
Support long-context inference, including workloads targeting up to 1M context.
Debug performance bottlenecks across model execution, runtime, networking, and infrastructure.
Work with frameworks such as JAX/XLA, PyTorch, vLLM, SGLang, TensorRT-LLM, or related systems.
Collaborate closely with the application team to ensure infrastructure is optimized for agentic workloads, not just generic chatbot inference.
Help turn research prototypes into reliable, high-performance production systems.

Qualifications

Strong experience in ML systems, distributed systems, or high-performance computing.
Experience optimizing inference or training workloads for large models.
Familiarity with TPUs, GPUs, or other accelerators.
Experience with one or more of CUDA, Triton, NCCL, JAX/XLA, PyTorch internals, vLLM, SGLang, TensorRT-LLM, distributed inference, or distributed training.
Strong systems debugging skills.
Comfort working across model code, runtime, infrastructure, and product requirements.
High ownership and the ability to operate effectively in an early-stage startup environment.

Cultural Fit

Hands-on technical excellence and strong engineering judgment.
End-to-end ownership, from design to implementation to production outcomes.
Bias for action: ship quickly, learn from failures, and iterate.
High intensity during critical milestones, with a focus on real customer impact.
Ability to do deep, focused work and sustain execution.
Clear communication with teammates, customers, and stakeholders.
Comfort with ambiguity, rapid change, and wearing multiple hats.
Low ego, high integrity, high accountability, and strong collaboration.
Continuous learning and a belief that judgment, intelligence, and capability compound over time.

If you are excited to build the infrastructure and agent systems behind the next generation of AI applications, push open-source models to production-grade performance, and turn ambitious research ideas into real-world impact, Model AI is the place for you.

Vacancy posted 2 days ago

Similar jobs that could be interesting for you
Based on the Founding Machine Learning Infrastructure Engineer in Palo Alto, CA vacancy

Founding ML Engineer for AI-Driven Chip Design
Architect is seeking a Founding Member of the Technical Staff to enhance AI models for chip design... ...be responsible for developing Reinforcement Learning environments and improving model capabilities through rigorous engineering practices. The ideal candidate holds a PhD...
Suggested
Architect, Inc.
Palo Alto, CA
1 day ago
Founding ML Engineer - RL for Next-Gen Chip Design
Architect Labs is seeking a Founding Member of the Technical Staff to lead... ...-on research with production engineering, focusing on optimizing reinforcement learning environments and deploying industry... ...field, deep expertise in machine learning, and the ability to prototype...
Suggested
Architect Labs
Palo Alto, CA
4 days ago
Founding ML Systems Engineer - Infra & Inference
Model AI is looking for a Founding Machine Learning Infrastructure Engineer in Palo Alto to help optimize infrastructure for AI systems. In this role, you will focus on enhancing model serving performance and cost efficiency. The ideal candidate will have strong experience...
Suggested
Model AI
Palo Alto, CA
2 days ago
Founding ML Research Engineer RL for AI Chips (Equity)
...blends AI with Silicon with a founding team from Anthropic, Google... ...implementing the Reinforcement Learning environments and algorithms,... ...edge research and production engineering for chip designs, implementing... ..., specialization in Machine Learning, Deep Learning, or Artificial...
Suggested
Architect, Inc.
Palo Alto, CA
1 day ago
Senior Staff Machine Learning Infrastructure Engineer - Search & Discovery
...our Senior Staff Software Engineer, ML infra Engineer for Search... ...* Develop and scale data infrastructure that powers batch and real-... ...professional experience in applied machine learning * Experience in machine... ...if a candidate is found to have submitted false information...
Suggested
Full time
Temporary work
Flexible hours
Coupang
Mountain View, CA
3 days ago
Machine Learning Engineer, Offline Infrastructure (Entry-Level / New Grad)
...Regional Manager, Sales Engineering - Public Sector As a Regional Manager, Sales Engineering, you will lead a team of Sales Engineers and frontline leaders, driving technical execution, operational excellence, and team development across your region. You’ll act as a force...
Internship
United States Digital Space LLC
Mountain View, CA
1 day ago
Machine Learning Engineer
$160k - $225k
...manual campaign management. Founded by ad platform veterans... ...to expand our product and engineering teams, bringing our vision... ...clear playbook, building the infrastructure for autonomous, intelligent... ...writing the manual. As an early Machine Learning Engineer at MAI, you won't...
MAI
Mountain View, CA
2 days ago
Senior Machine Learning Engineer
...shaping the future. Hippocratic AI was co‑founded by CEO Munjal Shah and a team of... ...judge systems. This is a high-leverage engineering role where your work directly gates what... ...building robust data analysis and evaluation infrastructure, not just running experiments...
Work at office
Hippocratic AI
Palo Alto, CA
3 days ago
Staff Machine Learning Infrastructure Engineer
...performance in the industry. Dyna Robotics was founded by repeat founders Lindon Gao and... ...next frontier of AI-driven robotics! Learn more at dyna.co Position Overview: We are seeking an experienced Machine Learning Infrastructure Engineer to join our team and help scale our...
Local area
Dyna Robotics
Redwood City, CA
3 days ago
Senior Machine Learning Engineer, Recommendation & AI Applications
$195k - $230k
...About NewsBreak Founded in 2015, NewsBreak is the Content Intelligence platform... ...fulfill our mission: building the infrastructure layer for content intelligence.... ...Role We are looking for a Senior Machine Learning Engineer to help evolve our large-scale recommendation...
Full time
Local area
Work from home
NewsBreak
Mountain View, CA
4 days ago
Senior Machine Learning Engineer
...Founding Machine Learning Systems Engineer We are working with an early-stage AI systems company in Palo Alto building infrastructure for the next generation of agentic AI workloads. The company is developing a platform that combines high-performance model serving...
Work at office
Strativ Group
Palo Alto, CA
3 days ago
Machine Learning Engineer
...Machine Learning Engineer One of the first ML Engineers at a 25-person rocketship automating a $1... ...across the full stack. Our client was founded by technologists and operators who... ...collaborate with AI agents. Build MLOps infrastructure for training, fine‑tuning, and...
Work at office
Aionia Group
Mountain View, CA
2 days ago
Machine Learning Infrastructure Engineer
...industrial environments. Our ability to iterate quickly on large-scale models depends on world-class ML infrastructure. We’re looking for a Machine Learning Infrastructure Engineer to build the core systems that enable fast, reliable, and scalable model training—powering...
Mind Robotics Inc.
Palo Alto, CA
17 hours ago
Machine Learning Infrastructure Engineer
Location Palo Alto Employment Type Full time Location Type On-site Department Software Engineering We’re hiring Machine Learning Infrastructure Engineers to build the systems that make large-scale model training actually work. This role is for people who enjoy operating...
Full time
Garuda Ventures
Palo Alto, CA
1 day ago
Staff Machine Learning Engineer
$130k - $260k
Staff Machine Learning Engineer. Remote type: Hybrid. Locations: Palo Alto, CA; Seattle, WA. time... ...impact on local communities nationwide. Founded in 1936, GEICO is a member of the... ...seeking an accomplished Senior Staff ML Engineer who will serve as a technical leader for...
Hourly pay
Work experience placement
Local area
GEICO
Palo Alto, CA
4 days ago
Founding ML Engineer: Production Inference & Deployment
HiringCafe is seeking a Founding ML Engineer in Cupertino to transform AI and ML models into reliable production systems. You'll be responsible... ...role requires a hands-on approach, experience with deep learning model optimization, and the ability to design scalable serving...
HiringCafe
Cupertino, CA
1 day ago
Founding Scientific Data Scientist
...We are seeking an elite Founding Scientific Data Scientist... ...and multi-omics data with machine learning models (e.g., AlphaMissense... ...between wet-lab biology and AI engineering, translating complex client... ...behind to build early-stage infrastructure that truly matters. Shape...
For contractors
Oakwell Hampton Group
Palo Alto, CA
1 day ago
Director, Machine Learning Engineering
$150k - $300k
Director, Machine Learning Engineering... ...local communities nationwide. Founded in 1936, GEICO is a member... ...Product, Design, Data, and Infrastructure teams to deliver integrated...
Hourly pay
Temporary work
Work experience placement
Local area
GEICO
Palo Alto, CA
3 days ago
Principal Machine Learning Engineer
...Intuit is seeking a highly motivated and experienced Principal Machine Learning Engineer to join our Mid Market AI team. In this influential role, you will lead the design, development, and deployment of end-to-end AI/ML solutions that power the next generation of intelligent...
Intuit Inc.
Mountain View, CA
1 day ago
Staff Machine Learning Engineer
$197k - $266.5k
...Overview Come join Intuit as a Staff Machine Learning Engineer! In this role, you’ll be embedded inside a vibrant team of data scientists. You’ll be expected to help conceive, code, and deploy data science models at scale using the latest industry tools. Important...
Work experience placement
Shift work
Intuit Inc.
Mountain View, CA
1 day ago
Founding Engineer - Shape Voice AI & Data Platform
landa is seeking a Founding Engineer to design architecture, ship features, and work closely with the founders. This role offers the chance to impact product direction and engage in cutting-edge voice AI technology. Candidates should possess strong technical skills, be...
landa
Palo Alto, CA
2 days ago
Machine Learning Engineer II
$145k - $165k
...optimization. Our mission is to apply machine learning to enhance user experiences, foster trust... ...with distinct roles: Machine Learning Engineers (this role) who focus on modeling and... ...innovation Machine Learning Infrastructure Engineers who build the platforms and...
Work experience placement
Work at office
Tinder
Palo Alto, CA
3 days ago
Machine Learning Engineer
$106.9k - $229.4k
...What’s in it for you? Constant learning, skill growth, great... ...reshaping the landscape of Machine Learning across various domains... ...building a nimble and versatile engineering team to empower our Data... ...cloud with a focus on backend infrastructure. Experience in developing data...
Worldwide
Flexible hours
SAP Belgium NV/SA
Palo Alto, CA
3 days ago
Machine Learning Engineer
...View, CA (any hybrid work will be at the manager’s discretion). W2 Candidates only Position Summary Seeking an experienced Machine Learning Engineer to lead the development of prompt injection and prompt safety models that protect downstream agentic AI systems across...
The Fountain Group
Mountain View, CA
2 days ago
Senior Machine Learning Engineer
$133.95k - $245k
...at the center of everything we do. Expectations are high, and so are the rewards. We're looking for an exceptional Senior Machine Learning Engineer to help shape the future of our core platforms, products, and customer experiences. FinTech is one of the most complex and...
Work at office
Remote work
Flexible hours
Shift work
3 days per week
Unchain Data
Menlo Park, CA
2 days ago
Senior Machine Learning Engineer
$133.95k - $245k
...at the center of everything we do. Expectations are high, and so are the rewards. We're looking for an exceptional Senior Machine Learning Engineer to help shape the future of our core platforms, products, and customer experiences. You’ll take on a highly influential role...
Work at office
Flexible hours
Shift work
3 days per week
Robinhood
Menlo Park, CA
2 days ago
Machine Learning Engineer
...agents that reason, act, and continuously improve. As a Machine Learning Engineer, you won't just build models, you'll architect the entire... ...thrives in ambiguity and wants to shape foundational AI infrastructure from the ground up. You'll work at the intersection of LLMs...
Barker Staffing Solutions LLC
Mountain View, CA
2 days ago
Machine Learning Engineer, GAI Search Platform - Moveworks
Machine Learning Engineer, GAI Search Platform - Moveworks Job Description What You Will Do We are looking for Machine learning engineers to... ...platform team works closely with the ranking, product, design, infrastructure and data science teams to drive our agentic search...
Moveworks
Mountain View, CA
2 days ago
Machine Learning Engineering TL, Behavior Planning
$171k - $247k
...accessible for all. We are seeking an ML Engineering TL to join the Behavior Planning Team... ...large-scale models trained with Imitation Learning and Reinforcement Learning that enable... ...Qualifications: MS or PhD in Robotics, Machine Learning, Computer Science, or a related...
Work at office
Local area
3 days per week
Aurora Innovation
Mountain View, CA
3 days ago
Senior Machine Learning Engineer
$200k - $280k
...hands‑on, high‑ownership role for ML engineers who want to build production models... ...under real‑world constraints. As a Founding Senior Machine Learning Engineer at Retell, you’ll work... ...inform model iterations. Level Up Infrastructure – Design and maintain the ML infrastructure...
H1b
Work at office
Retell AI
Redwood City, CA
4 days ago
