Machine Learning, Platform Engineer
$160k - $250kTogether AI
Machine Learning, Platform Engineer
San Francisco
About the Role
Our team focuses on enabling custom models and dedicated inference on Together. We are responsible for building a container platform, optimizing autoscaling, minimizing cold starts, achieving the best end-to-end model performance, and providing a best-in-class developer experience with great tooling. We often focus on video or audio generation across the stack: CUDA kernels, pytorch optimization, inference engines, container orchestration, queueing theory, etc. An ideal candidate will be great at profiling/optimization but know the word kubernetes, or be intimately familiar with multi-cluster scheduling and have some sense of ML bottlenecks.
Responsibilities
- New hires may work on multi-cluster orchestration, portfolio optimization, predictive autoscaling, control panes, model bring-up, model optimization, APIs for managing deployments, inference worker SDKs, and CLI tools.
- Analyze and improve the robustness and scalability of existing distributed systems, APIs, databases, and infrastructure
- Partner with product teams to understand functional requirements and deliver solutions that meet business needs
- Write clear, well-tested, and maintainable software and IaC for both new and existing systems
- Conduct design and code reviews, create developer documentation, and develop testing strategies for robustness and fault tolerance
Requirements
- 5+ years of demonstrated experience in building large scale, fault tolerant, distributed systems.
- Experience running serverless inference platforms, doing model bring-up on short notice, being on call, or running a cloud provider is a very big plus
- Good taste and ability to thoughtfully discuss how what you've built has failed over time
- Experience designing, analyzing and improving efficiency, scalability, and stability of various system resources
- Excellent understanding of low level operating systems concepts including concurrency, networking and storage, performance and scale
- Expert-level programmer in one or more of Python, Golang, Rust, C++, or Haskell
- Proficiency in writing and maintaining Infrastructure as Code (IaC) using tools like Terraform
- Experience with Kubernetes internals or other container orchestration systems
- Sound judgement for when to use and when to not use LLMs for code
- Bachelor's or Master's degree in Computer Science, Computer Engineering, or a related technical field, or equivalent practical experience
- Writing-heavy roles or companies are a plus
About Together AI
Together AI is a research-driven artificial intelligence company. We believe open and transparent AI systems will drive innovation and create the best outcomes for society, and together we are on a mission to significantly lower the cost of modern AI systems by co-designing software, hardware, algorithms, and models. We have contributed to leading open-source research, models, and datasets to advance the frontier of AI, and our team has been behind technological advancement such as FlashAttention, Hyena, FlexGen, and RedPajama. We invite you to join a passionate group of researchers and engineers in our journey in building the next generation AI infrastructure.
Compensation
We offer competitive compensation, startup equity, health insurance and other competitive benefits. The US base salary range for this full-time position is: $160,000 - $250,000 + equity + benefits. Our salary ranges are determined by location, level and role. Individual compensation will be determined by experience, skills, and job-related knowledge.
Equal Opportunity
Together AI is an Equal Opportunity Employer and is proud to offer equal employment opportunity to everyone regardless of race, color, ancestry, religion, sex, national origin, sexual orientation, age, citizenship, marital status, disability, gender identity, veteran status, and more.
Please see our privacy policy at
- ...teams to maintain rigid systems, Lightfield learns from how companies actually work,... ...that drives growth. We're building the CRM platform we always wished existed: fast, intelligent... ...and define best practices for software engineering in an AI-driven development landscape....SuggestedWork from home
$166k - $225k
...P-984 Founded in late 2020 by a small group of machine learning engineers and researchers, Mosaic AI enables companies to securely fine-tune... ...Compatible with all major cloud providers, the Mosaic AI platform provides maximum flexibility for AI development. Introduced...SuggestedLocal areaWorldwide- Job Title Disabled veteran A veteran who served on active duty in the U.S. military and is entitled to disability compensation (or who but for the receipt of military retired pay would be entitled to disability compensation) under laws administered by the Secretary of...Suggested
$151.8k - $265.35k
...all related technical fields, such as Machine Learning, Deep Learning, Computer Vision, and Natural... ...with world-class researchers and ML engineers to bring research ideas to production.... ...everyone to create through innovative platforms and tools that unleash creativity,...SuggestedTemporary workLocal areaWorldwide- ...Overview Pluralis Research is pioneering Protocol Learning-a fully decentralised way to train and deploy AI models that opens... ...to frontier-scale AI. We're looking for an ML Training Platform Engineer to architect, build, and scale the foundational infrastructure...SuggestedWork experience placement
$160k - $235k
...Senior Machine Learning Engineer, AI Platform Affinity stitches together billions of data points from massive datasets to create a powerful, accurate representation of the world's professional relationship graph. Based on this data, we offer our users the insights...Work at officeRemote workWorldwideFlexible hours2 days per week3 days per week$246.5k - $339k
...Faire Faire is a technology wholesale platform built on the belief that the future is... ...'re using the power of tech, data, and machine learning to connect this thriving community of... ...As a Staff Machine Learning Platform Engineer, you will help design, improve, and operate...Work experience placementWork at officeLocal areaRemote workMonday to FridayFlexible hours3 days per week$185k - $275k
Senior Machine Learning Engineer - GeoAI Platform Wherobots, Inc. San Francisco, California, United States | Information Technology About this position About Wherobots Wherobots was founded by the original creators of Apache Sedona to build the first fully‑managed, highly...Full timeWork at officeRemote workWork visa$204k - $259k
...Senior Machine Learning Engineer, Simulation Waymo is an autonomous driving technology company with the mission to be the world's most trusted... ...service and can also be applied to a range of vehicle platforms and product use cases. The Waymo Driver has provided over...Work experience placement$244k - $292k
...relationships. Yes, you can build an exciting business AND have real-life real-customer impact. We are seeking a Senior Machine Learning Engineer to join our team. This role will focus on developing and maintaining machine learning infrastructure and operations,...Local area$200k - $400k
...Troveo is building the next-generation data platform to train AI video models. Troveo offers the world... ..., and we are seeking an innovative strategic engineer to help us scale. Role Overview The Senior Machine Learning Engineer will play a central role in designing...Work experience placement$170k - $230k
...Machine Learning Engineer Help us solve fraud asap with Casap — where we're building the world's first AI-native disputes automation and fraud prevention platform. Our mission is to create a future where trust is a given and fraud is rare by empowering financial institutions...Full timeWork at officeImmediate startMonday to Friday- ...NLP Machine Learning Engineer Work on a dataset with millions of customer searches, labeled fashion products, and years of transaction and clickstream data. Work with Client's numerous in-house systems experts for data manipulation, model construction, training, and...
$118k - $176k
...innovation and creating the best experience for job seekers. (*Comscore, Total Visits, March 2025) Day to Day The Machine Learning Engineer I role partners closely with business partners across various functions to help execute strategic initiatives that increase...Work experience placementLocal area- ...personal digital experience where customers can shop, buy and learn everything Apple, wherever they are. Each customer should... ...for a passionate, highly motivated, and hands-on applied Machine Learning Engineer. This role will assist our Online Retail Decision Automation...Work experience placement
$135k - $210k
...inventories, bloom maps, and more! All this data lives in our cloud platform, FruitScope OS, that we've developed from the ground up to... ...the fruit they are seeing. We are looking for a Machine Learning Engineer to build creative, practical, and robust solutions to ML/...Full timeWork at officeWeekend work- ...About the Role We are hiring Machine Learning Engineers who want to work on frontier problems in vision and generative AI where standard... ...impact. If you are passionate about innovation and shaping the future of fashion, SPREEAI offers a platform to make your mark.InternshipImmediate start
$212k - $318.4k
...Senior Machine Learning Engineer (Search) Apple Maps and the thousands of applications it empowers are being used by millions every single day! As a fundamental tool for human activity, Maps technology is evolving and new techniques are emerging. We are looking for...Local areaRelocation$175k - $250k
...Machine Learning Engineer Kiddom is a groundbreaking educational platform that promotes student equity and growth by uniting high-quality instructional materials with dynamic digital learning. Through unparalleled curriculum management functionality, Kiddom empowers...Local areaFlexible hours- ...built on the belief that every website, app, game, brand, and human will have an AI persona. Genies is looking for a Senior Machine Learning Engineer to join our Avatar Technology team, focused on building the next generation of AI-driven animation systems. This role is...Full timeWork experience placementWork at office
- ...About the job Machine Learning Engineer-Life Sciences We are looking for a Machine Learning Engineer (Life Sciences) to help build our platform for training, evaluating, and deploying interpretable frontier AI systems, with an emphasis on scientific and biological...
- ...chain from the ground up—and we're looking for a Senior+ Machine Learning Engineer to help make it autonomous. We're not a software company selling... ...interesting applied AI work happening today. Our internal platform, PlantOS, uses the same reinforcement learning toolkits...Immediate startShift work
- ...retrieval over complex unstructured data. We are a team of engineers and scientists from Berkeley, CMU, Ecole Polytechnique, USACO,... ...principles and best practices. Experience or willingness to learn about scalability technologies like AWS/Azure, Docker, and Kubernetes...Summer workInternship
- ...You'll collaborate with construction veterans and world-class engineers to solve physical-world problems that simulations can't... ...alongside a talented team-we'd love to have you join us. Machine Learning Engineer: Perception Bedrock is bringing autonomy to the...Work at officeFlexible hours
$180k - $220k
...Machine Learning Engineer At Ouster, we build sensors and tools for engineers, roboticists, and researchers, so they can make the world safer and more efficient. We've transformed LIDAR from an analog device with thousands of components to an elegant digital device...Work experience placementLocal area$225k - $325k
...vision for 2026 is to build a modern CX platform where entire contact centers are... ...a hands-on, high-ownership role for ML engineers who want to build production models that... ...world constraints. As a Founding Senior Machine Learning Engineer at Retell, you'll work across...H1bWork at office$160k - $220k
...About the Role Together AI is looking for an ML Engineer who will develop systems and APIs that enable our customers to perform inference and fine tune LLMs. Relevant experience includes implementing runtime systems that perform inference at scale using AI/ML models...Full time$225k - $300k
...Machine Learning Engineer About Latent Health Healthcare today is only truly personalized for two groups: those with wealth and access, and those with physicians in their immediate family. For everyone else, care is fragmented and impersonal. Medical history...Work at officeImmediate start$150k - $220k
...Founding Machine Learning Engineer San Francisco Compensation ~ Estimated base salary $150K – $220K • Offers Equity • Offers Bonus... ...intelligence that powers Composite's proactive automation platform. You'll work at the intersection of LLM inference, browser...H1bWork at officeVisa sponsorshipSleeping nights$240.45k - $300.3k
...The goal of a Senior Machine Learning Engineer at Scale is to leverage techniques in the fields of generative AI, computer vision, reinforcement... ...specialization Experience working with cloud platforms (eg. AWS or GCP) and deploying machine learning models in...Full time
Do you want to receive more vacancies?
Subscribe and receive similar vacancies to Machine Learning, Platform Engineer. Be the first to apply!
- machine learning ai engineer San Francisco, CA
- machine learning engineer San Francisco, CA
- entry level machine learning engineer San Francisco, CA
- junior machine learning research engineer San Francisco, CA
- machine learning software engineer San Francisco, CA
- ai ml engineer San Francisco, CA
- senior ml engineer San Francisco, CA
- graduate machine learning engineer San Francisco, CA
- computer vision machine learning engineer San Francisco, CA
- data scientist machine learning engineer San Francisco, CA

