Senior Backend Engineer, Inference Platform
$160k - $250kTogether AI
Senior Backend Engineer, Inference Platform
San Francisco
About the Role
Together AI is building the Inference Platform that brings the most advanced generative AI models to the world. Our platform powers multi-tenant serverless workloads and dedicated endpoints, enabling developers, enterprises, and researchers to harness the latest LLMs, multimodal models, image, audio, video, and speech models at scale.
If you get a thrill from optimizing latency down to the last millisecond, this is your playground. You'll work hands-on with tens of thousands of GPUs (H100s, H200s, GB200s, and beyond), figuring out how to fully utilize every FLOP and every gigabyte of memory.
You'll collaborate directly with research teams to bring frontier models into production, making breakthroughs usable in the real world. Our team also works closely with the open source community, contributing to and leveraging projects like SGLang, vLLM, and NVIDIA Dynamo to push the boundaries of inference performance and efficiency.
- Shape the core inference backbone that powers Together AI's frontier models.
- Solve performance-critical challenges in global request routing, load balancing, and large-scale resource allocation.
- Work with state-of-the-art accelerators (H100s, H200s, GB200s) at global scale.
- Partner with world-class researchers to bring new model architectures into production.
- Collaborate with and contribute to the open source community, shaping the tools that advance the industry.
- A culture of deep technical ownership and high impact — where your work makes models faster, cheaper, and more accessible.
- Competitive compensation, equity, and benefits.
Responsibilities
- Build and optimize global and local request routing, ensuring low-latency load balancing across data centers and model engine pods.
- Develop auto-scaling systems to dynamically allocate resources and meet strict SLOs across dozens of data centers.
- Design systems for multi-tenant traffic shaping, tuning both resource allocation and request handling — including smart rate limiting and regulation — to ensure fairness and consistent experience across all users.
- Engineer trade-offs between latency and throughput to serve diverse workloads efficiently.
- Optimize prefix caching to reduce model compute and speed up responses.
- Collaborate with ML researchers to bring new model architectures into production at scale.
- Continuously profile and analyze system-level performance to identify bottlenecks and implement optimizations.
Requirements
- 5+ years of demonstrated experience building large-scale, fault-tolerant, distributed systems and API microservices.
- Strong background in designing, analyzing, and improving efficiency, scalability, and stability of complex systems.
- Excellent understanding of low-level OS concepts: multi-threading, memory management, networking, and storage performance.
- Expert-level programming in one or more of: Rust, Go, Python, or TypeScript.
- Knowledge of modern LLMs and generative models and how they are served in production is a plus.
- Experience working with the open source ecosystem around inference is highly valuable; familiarity with SGLang, vLLM, or NVIDIA Dynamo will be especially handy.
- Experience with Kubernetes or container orchestration is a strong plus.
- Familiarity with GPU software stacks (CUDA, Triton, NCCL) and HPC technologies (InfiniBand, NVLink, MPI) is a plus.
- Bachelor's or Master's degree in Computer Science, Computer Engineering, or related field, or equivalent practical experience.
About Together AI
Together AI is a research-driven artificial intelligence company. We believe open and transparent AI systems will drive innovation and create the best outcomes for society, and together we are on a mission to significantly lower the cost of modern AI systems by co-designing software, hardware, algorithms, and models. We have contributed to leading open-source research, models, and datasets to advance the frontier of AI, and our team has been behind technological advancement such as FlashAttention, Hyena, FlexGen, and RedPajama. We invite you to join a passionate group of researchers and engineers in our journey in building the next generation AI infrastructure.
Compensation
We offer competitive compensation, startup equity, health insurance and other competitive benefits. The US base salary range for this full-time position is: $160,000 - $250,000 + equity + benefits. Our salary ranges are determined by location, level and role. Individual compensation will be determined by experience, skills, and job-related knowledge.
Equal Opportunity
Together AI is an Equal Opportunity Employer and is proud to offer equal employment opportunity to everyone regardless of race, color, ancestry, religion, sex, national origin, sexual orientation, age, citizenship, marital status, disability, gender identity, veteran status, and more.
Please see our privacy policy at
$170k - $195k
...make intelligent agents ubiquitous. We provide the agent engineering platform and open source frameworks developers need to ship reliable... ..., New York, Amsterdam, or London. We are looking for a Senior Backend Engineer to join us. In this role you will be building the...SeniorWorldwideFlexible hours$210k - $285k
...Job Description Job Description Senior Backend Engineer - $210K-$285K San Francisco, CA A rapidly growing AI-powered logistics technology... ...data systems, AI workflows, and real-time operational platforms. This is a high-impact opportunity for engineers who thrive...SeniorRelocation$200k - $275k
...Senior Backend Engineer (Infra/Platform/SRE) Title of Role: Senior Backend Engineer (Infra/Platform/SRE) Location: San Francisco, hybrid Company Stage of Funding: Series A - Design Services Office Type: Hybrid Salary: $200K-$275K Company Description...SeniorWork at office- ...Conversion is the AI-native marketing automation platform for modern software companies. Our... ...is based in San Francisco and includes engineers, designers, and operators from Airbnb,... .... Here's what we're looking for in a backend software engineer to join our core team:...Senior
$175k - $225k
...Senior Backend Engineer In person 5 days/week in San Francisco, Boston, MA, New York. We are looking for a Senior Backend Engineer to join... ...systems that power LangChain's observability and evals platform. You will work on the core services that allow developers to...SeniorWork at officeFlexible hours$210k - $240k
...Who Are We? Postman is the world's leading API platform, used by more than 45 million+ developers and 500,000 organizations... ...picture and our vision at Postman. The Opportunity As a Senior Backend Engineer on the Cloud Platform team, you will play a key role in...SeniorWork at officeFlexible hours3 days per week$121.5k - $145.5k
...seeking a seasoned Sr. Software Engineer in the North America Mobility... .... This role will sit in the Platform team that focuses on building... ...object oriented code in our backend services. Develop public... ...and drive alignment with other senior engineers. Write automated...SeniorRemote workFlexible hours- ...and finance in an intelligent platform. We're on a mission to make... .... We're looking for engineers who are excited about building... ...The Role We're seeking a Senior Backend Engineer to join our growing... ...generate and evaluate scenarios, infer scientific progress and risks...Senior
$159k - $278.25k
...only be sent from @Rippling.com addresses. About the Platform Team The Platform Engineering team is the invisible engine that powers all of... ...will do Own the design and implementation of core backend services and APIs that power critical product capabilities...SeniorWork at office3 days per week- ...Senior Staff Backend Platform Engineer Patreon is a media and community platform where over 300,000 creators give their biggest fans access to exclusive work and experiences. We offer creators a variety of ways to engage with their fans and build a lasting business...SeniorWork at officeLocal areaRemote workWorldwideFlexible hours2 days per week
$192k - $260k
...world's best data and AI infrastructure platform so our customers can use deep data insights... .... It offers real-time, low-latency inference, governance, monitoring, and lineage. As... ...SLAs and cost efficiency. As a Staff Engineer, you'll play a critical role in shaping...Local areaWorldwide- ...committed to building a world-class product and engineering culture in the Bay Area. While we trust... ...Build Core Services: Contribute to backend systems that power quoting, underwriting... ...: Build and maintain internal platforms, tooling, and CI/CD pipelines that enable...Senior
$200k
...software is bought and sold. We’re building an innovative platform that empowers buyers with transparent, interactive software... ...traditional B2B sales motion. Role Overview We are hiring a Senior Backend Engineer to help design and build the core of our next-generation...SeniorWork at officeRemote workFlexible hours$180k - $260k
...future of experience research won’t be powered by slow, siloed platforms. It will be fast, intelligent, and deeply integrated into... ...you. About the Role We’re looking for a talented Senior Backend Engineer to join our Platform Team. The Platform Team at Sprig is responsible...SeniorFull timeWork at officeFlexible hours- ...lasting impact. Learn more at Life as an Engineer at EvenUp Location & Work Model... .... About the Team Join EvenUp’s AI Platform team, dedicated to making large language... ...This role operates at the intersection of backend engineering, LLM integration, and hands-...SeniorFull timeTemporary workWork at officeLocal areaHome officeFlexible hours3 days per week
- ...inflection point and we7;re looking for a Senior/Staff Engineers to help scale us to a $XXM+ in ARR.... ...past 3-4 years, thanks to excellent platforms that not only support weekend projects... ...have the caliber to scale: Python backend (primarily Django) with Postgres Clerk...SeniorWork at officeFlexible hoursWeekend workAfternoon shift
$166k - $225k
...world's best data and AI infrastructure platform so our customers can use deep data insights... ...to improve their business. Founded by engineers — and customer obsessed — we leap at... ...look for: ~5+ years of experience in backend or infrastructure engineering ~ Strong...SeniorRemote jobLocal areaWorldwide$160k - $300k
...our mission is to revolutionize how engineering decisions are made, turning complexity... ...together. About the Role As a Senior / Staff Backend Engineer at Apiphany, you’ll design,... ...systems that power our intelligence platform. You’ll own backend low-latency services...SeniorWork at officeVisa sponsorshipFlexible hours$216k - $270k
...As a Software Engineer on the ML Infrastructure team, you will design and build platforms for scalable, reliable, and efficient serving... ...with deep expertise in backend system design. You'll work in... ...TensorRT-LLM, or text-generation-inference. Compensation packages...SeniorFull time- ...Senior Engineer We are looking for a senior engineer to join our team in San Francisco. As one of the founding members of our team, you... ...first AI-powered wealth manager. Design and own the core backend infrastructure, including our multi-agent LLM architecture that...Senior
- ...The Role We are looking for a Founding Backend Engineer to lead the design and development of the backend infrastructure that powers the Button protocol a next generation agentic finance app built to give retail traders the access, tools, and capital efficiency that...Senior
- ...Senior Backend Engineer We believe owning a home should feel as special as the moments that take place within them. If our mission inspires you, we'd love to hear from you. About the Role We're hiring a founding Senior Backend Engineer to help us design, develop...Senior
- ...About the Role Join the engineering team of a Series A B2B SaaS company building... ...customer communication platform for SMEs. You'll work on core backend technologies that power messaging,... ...delivering features end-to-end at senior level. Tech Stack Backend...SeniorVisa sponsorship
- ...gets bigger. What you'll do Build and ship scalable backend/platform systems Develop LLM agents that integrate with large... ...What we're looking for Strong backend and platform engineering experience (3+ years) Ability to work independently, learn...SeniorWork at officeFlexible hours
$230k - $265k
...the financial tools they need through the platforms they already sell on. We... ...Position We're looking for a software engineer to join Parafin's Infrastructure team and... ...experimentation, training, evaluation, inference, and retraining that power underwriting...SeniorWork from homeFlexible hours- ...Senior Backend Engineer We're building purpose-built infrastructure for running AI agents. Unlike traditional web apps, agents run for long... ...production systems — tracing, metrics, and alerting to keep the platform healthy Participate in on-call rotations and own...SeniorWork at officeFlexible hours
- ...Senior Backend Engineer As a Backend Engineer at Spherecast, you will be responsible for building the backbone to scale Agnes throughout the world - our AI Supply Chain Manager that decides what to produce, where to make it, and how to move it through factories, warehouses...SeniorTemporary work
- ...Senior Backend Engineer Hiring a senior backend engineer to join our team and focus on system architecture for our most popular products. Things You'll Be Working On Architecting and implementing backend systems in Typescript/Node is the core of what you'll be...SeniorWork at office3 days per week
$179k - $240k
...Senior Backend Engineer Title of Role: Senior Backend Engineer Location: San Francisco, onsite Company Stage of Funding: Corporate... ...the world's first causal AI Marketing Intelligence Platform. This organization is focused on leveraging advanced data analytics...SeniorWork at office- ...Senior Backend Engineer As a Senior Backend Engineer, you will design, build, and deploy the backend services that power our creative brainstorming and creation products. You will work with web and video standards to power our suite of web-based image and video creation...SeniorFull timeWork at office
Do you want to receive more vacancies?
Subscribe and receive similar vacancies to Senior Backend Engineer, Inference Platform. Be the first to apply!
- senior backend developer San Francisco, CA
- entry level back-end developer San Francisco, CA
- lead backend developer San Francisco, CA
- remote back end developer San Francisco, CA
- back-end developer San Francisco, CA
- backend software engineer San Francisco, CA
- client platform engineer San Francisco, CA
- platform engineer San Francisco, CA
- senior platform engineer San Francisco, CA
- platform engineering manager San Francisco, CA



