Senior Backend Engineer, Inference Platform

$160k - $250k

Together AI

San Francisco

About the Role

Together AI is building the Inference Platform that brings the most advanced generative AI models to the world. Our platform powers multi-tenant serverless workloads and dedicated endpoints, enabling developers, enterprises, and researchers to harness the latest LLMs, multimodal models, image, audio, video, and speech models at scale.

If you get a thrill from optimizing latency down to the last millisecond, this is your playground. You'll work hands-on with tens of thousands of GPUs (H100s, H200s, GB200s, and beyond), figuring out how to fully utilize every FLOP and every gigabyte of memory.

You'll collaborate directly with research teams to bring frontier models into production, making breakthroughs usable in the real world. Our team also works closely with the open source community, contributing to and leveraging projects like SGLang, vLLM, and NVIDIA Dynamo to push the boundaries of inference performance and efficiency.

  • Shape the core inference backbone that powers Together AI's frontier models.
  • Solve performance-critical challenges in global request routing, load balancing, and large-scale resource allocation.
  • Work with state-of-the-art accelerators (H100s, H200s, GB200s) at global scale.
  • Partner with world-class researchers to bring new model architectures into production.
  • Collaborate with and contribute to the open source community, shaping the tools that advance the industry.
  • A culture of deep technical ownership and high impact — where your work makes models faster, cheaper, and more accessible.
  • Competitive compensation, equity, and benefits.
Responsibilities
  • Build and optimize global and local request routing, ensuring low-latency load balancing across data centers and model engine pods.
  • Develop auto-scaling systems to dynamically allocate resources and meet strict SLOs across dozens of data centers.
  • Design systems for multi-tenant traffic shaping, tuning both resource allocation and request handling — including smart rate limiting and regulation — to ensure fairness and consistent experience across all users.
  • Engineer trade-offs between latency and throughput to serve diverse workloads efficiently.
  • Optimize prefix caching to reduce model compute and speed up responses.
  • Collaborate with ML researchers to bring new model architectures into production at scale.
  • Continuously profile and analyze system-level performance to identify bottlenecks and implement optimizations.
Requirements
  • 5+ years of demonstrated experience building large-scale, fault-tolerant, distributed systems and API microservices.
  • Strong background in designing, analyzing, and improving efficiency, scalability, and stability of complex systems.
  • Excellent understanding of low-level OS concepts: multi-threading, memory management, networking, and storage performance.
  • Expert-level programming in one or more of: Rust, Go, Python, or TypeScript.
  • Knowledge of modern LLMs and generative models and how they are served in production is a plus.
  • Experience working with the open source ecosystem around inference is highly valuable; familiarity with SGLang, vLLM, or NVIDIA Dynamo will be especially handy.
  • Experience with Kubernetes or container orchestration is a strong plus.
  • Familiarity with GPU software stacks (CUDA, Triton, NCCL) and HPC technologies (InfiniBand, NVLink, MPI) is a plus.
  • Bachelor's or Master's degree in Computer Science, Computer Engineering, or related field, or equivalent practical experience.
About Together AI

Together AI is a research-driven artificial intelligence company. We believe open and transparent AI systems will drive innovation and create the best outcomes for society, and together we are on a mission to significantly lower the cost of modern AI systems by co-designing software, hardware, algorithms, and models. We have contributed to leading open-source research, models, and datasets to advance the frontier of AI, and our team has been behind technological advancements such as FlashAttention, Hyena, FlexGen, and RedPajama. We invite you to join a passionate group of researchers and engineers on our journey to build the next generation of AI infrastructure.

Compensation

We offer competitive compensation, startup equity, health insurance and other competitive benefits. The US base salary range for this full-time position is: $160,000 - $250,000 + equity + benefits. Our salary ranges are determined by location, level and role. Individual compensation will be determined by experience, skills, and job-related knowledge.

Equal Opportunity

Together AI is an Equal Opportunity Employer and is proud to offer equal employment opportunity to everyone regardless of race, color, ancestry, religion, sex, national origin, sexual orientation, age, citizenship, marital status, disability, gender identity, veteran status, and more.

Please see our privacy policy at

Vacancy posted 5 days ago