Founding Machine Learning Infrastructure Engineer
Model AI
Founding Machine Learning Infrastructure Engineer Location: Onsite in Palo Alto Compensation: Competitive Salary + Equity About Model AI Model AI is building the infrastructure and application stack for the next generation of agentic AI systems. We believe token usage will grow exponentially over the coming years, but routing all inference through closed model providers will remain too expensive for many users and enterprises. Our thesis is that agentic applications require a vertically integrated stack: high-throughput, cost-efficient serving infrastructure paired with an application layer designed for long-running, agentic workloads. Model AI is building the Agent Cloud, a serving and training infrastructure platform purpose-built for agentic workloads, long-context inference, and large-scale open-source model deployment. By combining infrastructure and application design, we aim to make open-source models significantly more performant, practical, and competitive. About This Role We are looking for an ML Systems Engineer to help build and optimize the core serving infrastructure behind Agent Cloud. This role focuses on high-performance inference across different accelerators. You will work on model serving performance, accelerator utilization, long-context inference, batching, scheduling, KV cache management, runtime efficiency, and cost reduction. This is a deeply technical role at the intersection of ML systems, infrastructure, and product. Direct TPU experience is a strong plus, but not required. We care most about strong ML systems fundamentals, performance intuition, and the ability to ship reliable systems quickly. What You'll Do Optimize large-scale LLM inference and serving systems. Improve total tokens per second, decode tokens per second, latency, throughput, and cost efficiency. Work on serving infrastructure for open-source models across different types of accelerators. Improve batching, scheduling, KV cache management, memory usage, and accelerator utilization. Support long-context inference, including workloads targeting up to 1M context. Debug performance bottlenecks across model execution, runtime, networking, and infrastructure. Work with frameworks such as JAX/XLA, PyTorch, vLLM, SGLang, TensorRT-LLM, or related systems. Collaborate closely with the application team to ensure infrastructure is optimized for agentic workloads, not just generic chatbot inference. Help turn research prototypes into reliable, high-performance production systems. Qualifications Strong experience in ML systems, distributed systems, or high-performance computing. Experience optimizing inference or training workloads for large models. Familiarity with TPUs, GPUs, or other accelerators. Experience with one or more of CUDA, Triton, NCCL, JAX/XLA, PyTorch internals, vLLM, SGLang, TensorRT-LLM, distributed inference, or distributed training. Strong systems debugging skills. Comfort working across model code, runtime, infrastructure, and product requirements. High ownership and the ability to operate effectively in an early-stage startup environment. Cultural Fit Hands-on technical excellence and strong engineering judgment. End-to-end ownership, from design to implementation to production outcomes. Bias for action: ship quickly, learn from failures, and iterate. High intensity during critical milestones, with a focus on real customer impact. Ability to do deep, focused work and sustain execution. Clear communication with teammates, customers, and stakeholders. Comfort with ambiguity, rapid change, and wearing multiple hats. Low ego, high integrity, high accountability, and strong collaboration. Continuous learning and a belief that judgment, intelligence, and capability compound over time. If you are excited to build the infrastructure and agent systems behind the next generation of AI applications, push open-source models to production-grade performance, and turn ambitious research ideas into real-world impact, Model AI is the place for you. #J-18808-Ljbffr Model AI
- Architect is seeking a Founding Member of the Technical Staff to enhance AI models for chip design... ...be responsible for developing Reinforcement Learning environments and improving model capabilities through rigorous engineering practices. The ideal candidate holds a PhD...Suggested
- Architect Labs is seeking a Founding Member of the Technical Staff to lead... ...-on research with production engineering, focusing on optimizing reinforcement learning environments and deploying industry... ...field, deep expertise in machine learning, and the ability to prototype...Suggested
- Model AI is looking for a Founding Machine Learning Infrastructure Engineer in Palo Alto to help optimize infrastructure for AI systems. In this role, you will focus on enhancing model serving performance and cost efficiency. The ideal candidate will have strong experience...Suggested
- ...blends AI with Silicon with a founding team from Anthropic, Google... ...implementing the Reinforcement Learning environments and algorithms,... ...edge research and production engineering for chip designs, implementing... ..., specialization in Machine Learning, Deep Learning, or Artificial...Suggested
- ...our Senior Staff Software Engineer, ML infra Engineer for Search... ...* Develop and scale data infrastructure that powers batch and real-... ...professional experience in applied machine learning * Experience in machine... ...if a candidate is found to have submitted false information...SuggestedFull timeTemporary workFlexible hours
- ...Regional Manager, Sales Engineering - Public Sector As a Regional Manager, Sales Engineering, you will lead a team of Sales Engineers and frontline leaders, driving technical execution, operational excellence, and team development across your region. You’ll act as a force...Internship
$160k - $225k
...manual campaign management. Founded by ad platform veterans... ...to expand our product and engineering teams, bringing our vision... ...clear playbook, building the infrastructure for autonomous, intelligent... ...writing the manual. As an early Machine Learning Engineer at MAI, you won't...- ...shaping the future. Hippocratic AI was co‑founded by CEO Munjal Shah and a team of... ...judge systems. This is a high-leverage engineering role where your work directly gates what... ...building robust data analysis and evaluation infrastructure, not just running experiments...Work at office
- ...performance in the industry. Dyna Robotics was founded by repeat founders Lindon Gao and... ...next frontier of AI-driven robotics! Learn more at dyna.co Position Overview: We are seeking an experienced Machine Learning Infrastructure Engineer to join our team and help scale our...Local area
$195k - $230k
...About NewsBreak Founded in 2015, NewsBreak is the Content Intelligence platform... ...fulfill our mission: building the infrastructure layer for content intelligence.... ...Role We are looking for a Senior Machine Learning Engineer to help evolve our large-scale recommendation...Full timeLocal areaWork from home- ...Founding Machine Learning Systems Engineer We are working with an early-stage AI systems company in Palo Alto building infrastructure for the next generation of agentic AI workloads. The company is developing a platform that combines high-performance model serving...Work at office
- ...Machine Learning Engineer One of the first ML Engineers at a 25-person rocketship automating a $1... ...across the full stack. Our client was founded by technologists and operators who... ...collaborate with AI agents. Build MLOps infrastructure for training, fine‑tuning, and...Work at office
- ...industrial environments. Our ability to iterate quickly on large-scale models depends on world-class ML infrastructure. We’re looking for a Machine Learning Infrastructure Engineer to build the core systems that enable fast, reliable, and scalable model training—powering...
- Location Palo Alto Employment Type Full time Location Type On-site Department Software Engineering We’re hiring Machine Learning Infrastructure Engineers to build the systems that make large-scale model training actually work. This role is for people who enjoy operating...Full time
$130k - $260k
## Staff Machine Learning EngineerApplyremote type: Hybridlocations: Palo Alto, CA: Seattle, WAtime... ...impact on local communities nationwide.Founded in 1936, GEICO is a member of the... ...seeking an accomplished Senior Staff ML Engineer who will serve as a technical leader for...Hourly payWork experience placementLocal area- HiringCafe is seeking a Founding ML Engineer in Cupertino to transform AI and ML models into reliable production systems. You'll be responsible... ...role requires a hands-on approach, experience with deep learning model optimization, and the ability to design scalable serving...
- ...We are seeking an elite Founding Scientific Data Scientist... ...and multi-omics data with machine learning models (e.g., AlphaMissense... ...between wet-lab biology and AI engineering, translating complex client... ...behind to build early-stage infrastructure that truly matters. Shape...For contractors
$150k - $300k
Director, Machine Learning EngineeringSkip to main contentGEICO uses cookies... ..., Machine Learning Engineering page is loaded## Director,... ...local communities nationwide.Founded in 1936, GEICO is a member... ...Product, Design, Data, and Infrastructure teams to deliver integrated...Hourly payTemporary workWork experience placementLocal area- ...Intuit is seeking a highly motivated and experienced Principal Machine Learning Engineer to join our Mid Market AI team. In this influential role, you will lead the design, development, and deployment of end-to-end AI/ML solutions that power the next generation of intelligent...
$197k - $266.5k
...Overview Come join Intuit as a Staff Machine Learning Engineer! In this role, you’ll be embedded inside a vibrant team of data scientists. You’ll be expected to help conceive, code, and deploy data science models at scale using the latest industry tools. Important...Work experience placementShift work- landa is seeking a Founding Engineer to design architecture, ship features, and work closely with the founders. This role offers the chance to impact product direction and engage in cutting-edge voice AI technology. Candidates should possess strong technical skills, be...
$145k - $165k
...optimization. Our mission is to apply machine learning to enhance user experiences, foster trust... ...with distinct roles: Machine Learning Engineers (this role) who focus on modeling and... ...innovation Machine Learning Infrastructure Engineers who build the platforms and...Work experience placementWork at office$106.9k - $229.4k
...What’s in it for you? Constant learning, skill growth, great... ...reshaping the landscape of Machine Learning across various domains... ...building a nimble and versatile engineering team to empower our Data... ...cloud with a focus on backend infrastructure. Experience in developing data...WorldwideFlexible hours- ...View, CA (any hybrid work will be at the manager’s discretion). W2 Candidates only Position Summary Seeking an experienced Machine Learning Engineer to lead the development of prompt injection and prompt safety models that protect downstream agentic AI systems across...
$133.95k - $245k
...at the center of everything we do. Expectations are high, and so are the rewards. We're looking for an exceptional Senior Machine Learning Engineer to help shape the future of our core platforms, products, and customer experiences. FinTech is one of the most complex and...Work at officeRemote workFlexible hoursShift work3 days per week$133.95k - $245k
...at the center of everything we do. Expectations are high, and so are the rewards. We're looking for an exceptional Senior Machine Learning Engineer to help shape the future of our core platforms, products, and customer experiences. You’ll take on a highly influential role...Work at officeFlexible hoursShift work3 days per week- ...agents that reason, act, and continuously improve. As a Machine Learning Engineer , you won't just build models, you'll architect the entire... ...thrives in ambiguity and wants to shape foundational AI infrastructure from the ground up. You'll work at the intersection of LLMs...
- Machine Learning Engineer, GAI Search Platform - Moveworks Job Description What You Will Do We are looking for Machine learning engineers to... ...platform team works closely with the ranking, product, design, infrastructure and data science teams to drive our agentic search...
$171k - $247k
...accessible for all. We are seeking a ML Engineering TL to join the Behavior Planning Team... ...large-scale models trained with Imitation Learning and Reinforcement Learning that enable... ...Qualifications ~ MS or PhD in Robotics, Machine Learning, Computer Science, or a related...Work at officeLocal area3 days per week$200k - $280k
...hands‑on, high‑ownership role for ML engineers who want to build production models... ...under real‑world constraints. As a Founding Senior Machine Learning Engineer at Retell, you’ll work... ...inform model iterations. Level Up Infrastructure – Design and maintain the ML infrastructure...H1bWork at office
Do you want to receive more vacancies?
Subscribe and receive similar vacancies to Founding Machine Learning Infrastructure Engineer. Be the first to apply!
- machine learning engineer Palo Alto, CA
- senior ml engineer Palo Alto, CA
- computer vision machine learning engineer Palo Alto, CA
- ai ml engineer Palo Alto, CA
- machine learning software engineer Palo Alto, CA
- machine learning ai engineer Palo Alto, CA
- security infrastructure engineer Palo Alto, CA
- infrastructure engineer Palo Alto, CA
- lead infrastructure engineer Palo Alto, CA
- data infrastructure engineer Palo Alto, CA



