Sign up to access all features of our service.
  • Job search
  • Favorites
  • Create a CV
    New
  • Salaries
  • Subscriptions

Machine Learning, Platform Engineer

$160k - $250k

Together AI

About the Role

Our team focuses on enabling custom models and dedicated inference on Together. We are responsible for building a container platform, optimizing autoscaling, minimizing cold starts, achieving the best end-to-end model performance, and providing a best-in-class developer experience with great tooling. We often focus on video or audio generation across the stack: CUDA kernels, pytorch optimization, inference engines, container orchestration, queueing theory, etc. An ideal candidate will be great at profiling/optimization but know the word kubernetes, or be intimately familiar with multi-cluster scheduling and have some sense of ML bottlenecks.
Responsibilities
  • New hires may work on multi-cluster orchestration, portfolio optimization, predictive autoscaling, control panes, model bring-up, model optimization, APIs for managing deployments, inference worker SDKs, and CLI tools.
  • Analyze and improve the robustness and scalability of existing distributed systems, APIs, databases, and infrastructure
  • Partner with product teams to understand functional requirements and deliver solutions that meet business needs
  • Write clear, well-tested, and maintainable software and IaC for both new and existing systems
  • Conduct design and code reviews, create developer documentation, and develop testing strategies for robustness and fault tolerance
Requirements
  • 5+ years of demonstrated experience in building large scale, fault tolerant, distributed systems.
  • Experience running serverless inference platforms, doing model bring-up on short notice, being on call, or running a cloud provider is a very big plus
  • Good taste and ability to thoughtfully discuss how what you've built has failed over time
  • Experience designing, analyzing and improving efficiency, scalability, and stability of various system resources
  • Excellent understanding of low level operating systems concepts including concurrency, networking and storage, performance and scale
  • Expert-level programmer in one or more of Python, Golang, Rust, C++, or Haskell
  • Proficiency in writing and maintaining Infrastructure as Code (IaC) using tools like Terraform
  • Experience with Kubernetes internals or other container orchestration systems
  • Sound judgement for when to use and when to not use LLMs for code
  • Bachelor's or Master's degree in Computer Science, Computer Engineering, or a related technical field, or equivalent practical experience
  • Writing-heavy roles or companies are a plus
About Together AI

Together AI is a research-driven artificial intelligence company. We believe open and transparent AI systems will drive innovation and create the best outcomes for society, and together we are on a mission to significantly lower the cost of modern AI systems by co-designing software, hardware, algorithms, and models. We have contributed to leading open-source research, models, and datasets to advance the frontier of AI, and our team has been behind technological advancement such as FlashAttention, Hyena, FlexGen, and RedPajama. We invite you to join a passionate group of researchers and engineers in our journey in building the next generation AI infrastructure.
Compensation

We offer competitive compensation, startup equity, health insurance and other competitive benefits. The US base salary range for this full-time position is: $160,000 - $250,000 + equity + benefits. Our salary ranges are determined by location, level and role. Individual compensation will be determined by experience, skills, and job-related knowledge.
Equal Opportunity

Together AI is an Equal Opportunity Employer and is proud to offer equal employment opportunity to everyone regardless of race, color, ancestry, religion, sex, national origin, sexual orientation, age, citizenship, marital status, disability, gender identity, veteran status, and more.

Please see our privacy policy at
Vacancy posted 4 days ago
Similar jobs that could be interesting for youBased on the Machine Learning, Platform Engineer in San Francisco, CA vacancy
  • Overview Pluralis Research is pioneering Protocol Learning - a fully decentralised way to train and deploy AI models that opens this...  ...to frontier‑scale AI. We’re looking for an ML Training Platform Engineer to architect, build, and scale the foundational infrastructure... 
    Suggested
    Work experience placement

    Pluralis Research

    San Francisco, CA
    4 days ago
  • $166k - $225k

    P-984 Founded in late 2020 by a small group of machine learning engineers and researchers, Mosaic AI enables companies to securely fine‑tune,...  ...Compatible with all major cloud providers, the Mosaic AI platform provides maximum flexibility for AI development. Introduced... 
    Suggested
    Local area
    Worldwide

    Cacheflow

    San Francisco, CA
    5 days ago
  •  ...teams to maintain rigid systems, Lightfield learns from how companies actually work,...  ...that drives growth. We’re building the CRM platform we always wished existed: fast, intelligent...  ...and define best practices for software engineering in an AI-driven development landscape.... 
    Suggested
    Work from home

    Lightfield

    San Francisco, CA
    5 days ago
  • $185k - $275k

    Senior Machine Learning Engineer - GeoAI Platform Wherobots, Inc. San Francisco, California, United States | Information Technology About this position About Wherobots Wherobots was founded by the original creators of Apache Sedona to build the first fully‑managed, highly... 
    Suggested
    Full time
    Work at office
    Remote work
    Work visa

    Wherobots, Inc

    San Francisco, CA
    5 days ago
  • $250k - $300k

     ...intelligence by becoming part of our community of problem solvers, technologists, clinicians, and innovators. The Role As a Machine Learning Engineer at Ambience, you will help guide our AI team’s technical direction, identifying the most impactful opportunities to build,... 
    Suggested
    Work at office

    Dormont Manufacturing Company

    San Francisco, CA
    1 day ago
  • $200k - $265k

     ...build and power ecosystems. Cantina, our flagship social AI platform, is just the beginning. If you’re excited about the...  ...us in building the future! About the Role As a Senior Machine Learning Engineer on the AI Image Generation (Imagine) team, you’ll design,... 
    Work at office

    Cantina

    San Francisco, CA
    7 days ago
  • $210k - $300k

     ...requirements What We Are Looking For 4+ years of experience in ML engineering or applied research Strong background in computer vision (pose...  ...to Have Experience with robotics, embodied AI, or imitation learning Publications in top ML/CV venues Experience with 3D vision or... 
    Home office

    Gerra Group

    San Francisco, CA
    4 days ago
  •  ...trading decisions are made. We’re hiring our Founding ML Engineer, the first full-time machine learning hire who will turn research and data into production...  ...you’ll help scale the system into a full production platform and define best practices for future hires.... 
    Full time
    Immediate start
    Relocation
    Visa sponsorship
    Relocation package

    Poesis LLC

    San Francisco, CA
    4 days ago
  •  ...We're assisting a well-funded startup with their search for Machine Learning Engineers. Their product helps AI teams turn complex documents into LLM-ready inputs with exceptional accuracy. This role will work onsite in the SF office. What you'll do: Train and deploy new... 
    Work at office

    DRH Search

    San Francisco, CA
    5 days ago
  • $200k - $400k

     ...Troveo Troveo is building the next-generation data platform to train AI video models. Troveo offers the...  ...investors, and we are seeking an innovative strategic engineer to help us scale. Role Overview The Senior Machine Learning Engineer will play a central role in designing,... 
    Work experience placement

    Troveo AI

    San Francisco, CA
    4 days ago
  • $200k - $280k

     ...Founding Senior Machine Learning Engineer Join to apply for the Founding Senior Machine Learning Engineer role at Retell AI. Base pay range $200,000 – $280,000 per year About Retell AI Retell AI is using first principles to reimagine the call center with... 
    H1b
    Work at office

    Retell AI

    San Francisco, CA
    4 days ago
  •  ...About the Role We're looking for founding Machine Learning Engineers (MLEs) to own and improve our core action models end-to-end - the intelligence that powers Composite's proactive automation platform. You’ll work at the intersection of LLM inference, browser understanding... 
    Sleeping nights

    Composite.ai

    San Francisco, CA
    5 days ago
  •  ...Job Description We’re looking for a Machine Learning Engineer to build and deploy production-grade AI systems. In this role, you’ll take models from research to real-world applications, designing, optimizing, and scaling systems that power critical workflows across the... 

    Eragon

    San Francisco, CA
    1 day ago
  •  ...We are a team of engineers and researchers with an ambitious mission: to move the world toward error-free software. We're doing this...  ...(Astral), and a number of other open source developers, machine learning researchers, and entrepreneurs. If you wish to learn more, read... 
    Relocation package

    Assert

    San Francisco, CA
    4 days ago
  • $150k - $200k

     ...& practice—real customer use‑cases with clear success criteria. Required Qualifications PhD or MS degree in Computer Science, Machine Learning, Robotics, or equivalent technical discipline. Deep expertise in machine learning fundamentals, reinforcement learning, and associated... 

    Deft AI, Inc.

    San Francisco, CA
    5 days ago
  • $200k - $300k

     ...skills and experience — talk with your recruiter to learn more. Base pay range $200,000.00/yr - $300,00...  ...message the job poster from Willing Tech Machine Learning Engineer – Scientific Visualisation Platform Location: Remote (US/Canada) Sector: Scientific... 
    Full time
    Remote work

    Willing Tech

    San Francisco, CA
    5 days ago
  •  ...cross‑functional team of data scientists, engineers and data product managers to...  ...preferably in AWS 2+ years of experience as a Machine Learning Engineer or relevant jobs Up‑to‑date...  ..., including workflow/model management platforms Great skills in Data Engineering including... 
    Work at office
    Local area

    Veeva Systems

    San Francisco, CA
    5 hours ago
  •  ...tools consistently fail. We are a small, fast-growing team of engineers in San Francisco powering Fortune 100 enterprises, YC startups,...  ...looking for 5 days in-office at our San Francisco office Eager to learn and adapt quickly Prior startup or founding experience is a... 
    Work at office
    Visa sponsorship
    Relocation package

    Trypulse

    San Francisco, CA
    4 days ago
  •  ...About This Role Join our AI team to build and deploy production machine learning systems for enterprise clients. You\'ll work on everything...  ...and cost Requirements 4+ years of experience in ML engineering Strong Python skills and ML framework experience (PyTorch, TensorFlow... 
    Full time
    Remote work
    Flexible hours

    ACI Infotech

    San Francisco, CA
    5 days ago
  • $160k - $250k

     ...if you are interested in joining the future of AI! Senior Machine Learning Engineer In order to execute our vision, we need to grow our team of...  ..., performant and secure code that can be shared across platforms Meaningfully contribute to the product and core backend systems... 

    Hive

    San Francisco, CA
    5 days ago
  • $216.7k - $303.4k

     ...Join the Ads team as a Machine Learning Engineer and become a key contributor to Reddit’s business. In this hands‑on role, you will be responsible for the full lifecycle of our ML systems, from initial research and modeling to deployment and optimization in production.... 

    Tensec

    San Francisco, CA
    4 days ago
  •  ...Title: Machine Learning Engineer Job Type: Contract Contract Length: 6 months Target Start Date: ASAP Work Location/Structure...  ...Opportunity Our client, a leader in Social Media and Content Platforms, is looking for a skilled Machine Learning Engineer to... 
    Contract work
    Immediate start
    Remote work

    DeWinter Group

    San Francisco, CA
    5 days ago
  •  ...scalable ML architectures, serving as a bridge between research and engineering reality. You will have direct influence on the company’s AI...  ..., model governance, and monitoring playbooks. Own the ML platform: feature store, training infrastructure, model registry, deployment... 

    Sierracorp

    San Francisco, CA
    4 days ago
  •  ...the next generation of consumer apps, games, and interactive platforms. A bit of context on how the team thinks about itself: the market...  ...as the market chases AI agents. The team is looking for engineers who want depth on that real ML work rather than the AI agent hype... 
    Relocation

    Ersilia

    San Francisco, CA
    1 day ago
  • $164.7k - $266k

     ...opportunity. Using Docusign’s Intelligent Agreement Management platform, companies can create, commit, and manage agreements with...  ...contract lifecycle management (CLM). What you'll do As a Machine Learning Engineer on the AI Platform team, you will design and build the... 
    Contract work
    Work at office
    Local area
    Remote work
    2 days per week

    Unavailable

    San Francisco, CA
    4 days ago
  •  ...of Bits, and Aptiv. About this Role We are seeking talented engineers intent on changing the security industry. If you have experience...  ...infrastructure Understanding of both modern and classic machine learning techniques Equally comfortable with Jupyter notebooks and building... 

    RunSybil

    San Francisco, CA
    5 days ago
  •  ...The Community You Will Join: The Growth Platform team’s vision is to drive long term...  ...and digital advertising, as well as the machine learning/AI and data platforms that feed into the...  ...You Will Make: As a machine learning engineer or scientist, your expertise will be pivotal... 
    Work experience placement
    Remote work
    Shift work

    airbnb, Inc.

    San Francisco, CA
    2 days ago
  • $132.3k - $245.7k

    About the Role We are looking for a passionate Staff Machine Learning Engineer to bridge the gap between cutting‑edge ML algorithm development...  ...personalization and search experiences across our streaming platforms. Codebase Architecture: Design modular, scalable ML... 
    Temporary work
    Local area

    WarnerMedia Services, LLC

    San Francisco, CA
    4 days ago
  • $110k - $180k

     ...particularly in making high-quality, personalised advice scalable. Our platform enables intelligent agents to operate across core advisory...  ...environment where innovation, collaboration, and continuous learning are highly valued The opportunity to work with a diverse and... 
    Work at office
    Relocation

    Arta Finance

    San Francisco, CA
    4 days ago
  • $163k - $245k

     ...innovation and creating the best experience for job seekers. (*Comscore, Total Visits, March 2025) Day to Day As a Machine Learning Engineer III, you will be a team lead. You will own one of the team's major workstreams, help drive technical direction for the team... 
    Work experience placement
    Local area

    Indeed

    San Francisco, CA
    2 days ago

Do you want to receive more vacancies?

Subscribe and receive similar vacancies to Machine Learning, Platform Engineer. Be the first to apply!