Machine Learning, Platform Engineer

$160k - $250k

Together AI

San Francisco

About the Role

Our team focuses on enabling custom models and dedicated inference on Together. We are responsible for building a container platform, optimizing autoscaling, minimizing cold starts, achieving the best end-to-end model performance, and providing a best-in-class developer experience with great tooling. We often focus on video or audio generation across the stack: CUDA kernels, PyTorch optimization, inference engines, container orchestration, queueing theory, etc. An ideal candidate will be great at profiling/optimization but know the word Kubernetes, or be intimately familiar with multi-cluster scheduling and have some sense of ML bottlenecks.

Responsibilities
  • New hires may work on multi-cluster orchestration, portfolio optimization, predictive autoscaling, control planes, model bring-up, model optimization, APIs for managing deployments, inference worker SDKs, and CLI tools.
  • Analyze and improve the robustness and scalability of existing distributed systems, APIs, databases, and infrastructure
  • Partner with product teams to understand functional requirements and deliver solutions that meet business needs
  • Write clear, well-tested, and maintainable software and IaC for both new and existing systems
  • Conduct design and code reviews, create developer documentation, and develop testing strategies for robustness and fault tolerance

Requirements
  • 5+ years of demonstrated experience in building large-scale, fault-tolerant distributed systems
  • Experience running serverless inference platforms, doing model bring-up on short notice, being on call, or running a cloud provider is a very big plus
  • Good taste and ability to thoughtfully discuss how what you've built has failed over time
  • Experience designing, analyzing and improving efficiency, scalability, and stability of various system resources
  • Excellent understanding of low-level operating system concepts, including concurrency, networking, storage, performance, and scale
  • Expert-level programmer in one or more of Python, Golang, Rust, C++, or Haskell
  • Proficiency in writing and maintaining Infrastructure as Code (IaC) using tools like Terraform
  • Experience with Kubernetes internals or other container orchestration systems
  • Sound judgement about when to use and when not to use LLMs for code
  • Bachelor's or Master's degree in Computer Science, Computer Engineering, or a related technical field, or equivalent practical experience
  • Experience in writing-heavy roles or companies is a plus

About Together AI

Together AI is a research-driven artificial intelligence company. We believe open and transparent AI systems will drive innovation and create the best outcomes for society, and together we are on a mission to significantly lower the cost of modern AI systems by co-designing software, hardware, algorithms, and models. We have contributed to leading open-source research, models, and datasets to advance the frontier of AI, and our team has been behind technological advancements such as FlashAttention, Hyena, FlexGen, and RedPajama. We invite you to join a passionate group of researchers and engineers on our journey to build the next generation of AI infrastructure.

Compensation

We offer competitive compensation, startup equity, health insurance and other competitive benefits. The US base salary range for this full-time position is: $160,000 - $250,000 + equity + benefits. Our salary ranges are determined by location, level and role. Individual compensation will be determined by experience, skills, and job-related knowledge.

Equal Opportunity

Together AI is an Equal Opportunity Employer and is proud to offer equal employment opportunity to everyone regardless of race, color, ancestry, religion, sex, national origin, sexual orientation, age, citizenship, marital status, disability, gender identity, veteran status, and more.

Please see our privacy policy at

Vacancy posted 4 days ago