Senior Software Engineer - Together Cloud Infrastructure
$160k - $230kTogether AI
Senior Software Engineer - Together Cloud Infrastructure
San Francisco
About the Role
Together AI is building the AI Acceleration Cloud, an end-to-end platform for the full generative AI lifecycle, combining the fastest LLM inference engine with state-of-the-art AI cloud infrastructure.
As a Senior AI Infrastructure Engineer, you will play a key role in building the next generation AI cloud platform – a highly available, global, blazing-fast cloud infrastructure that virtualizes cutting-edge ML hardware (GB200s/GB300s, BlueField DPUs) and enables state-of-the-art ML practitioners with self-serve AI cloud services, such as on-demand + managed Kubernetes and Slurm clusters. This platform serves both our internal SaaS products (inference, fine-tuning) and our external cloud customers, spanning dozens of data centers across the world.
Responsibilities
- Design, build, and maintain performant, secure, and highly-available backend services/operators that run in our data centers and automate hardware management, such as Infiniband partitioning, in-DC parallel storage provisioning, and VM provisioning.
- Design and build out the IaaS software layer for a new GB200 data center with thousands of GPUs.
- Work on a global multi-exabyte high-performance object store, serving massive datasets for pretraining.
- Build advanced observability stacks for our customers with automated node lifecycle management for fault-tolerant distributed pretraining.
- Perform architecture and research work for decentralized AI workloads
- Work on the core, open-source Together AI platform
- Create services, tools, and developer documentation
- Create testing frameworks for robustness and fault-tolerance
To be successful, you'll need to be deeply technical and possess excellent communication, collaboration, and diplomacy skills. You have strong fundamental software development skills. In addition, you have strong systems knowledge and troubleshooting abilities.
Requirements
- 5+ years of professional software development experience and proficiency in at least one backend programming language (Golang desired)
- 5+ years experience writing high-performance, well-tested, production quality code
- Demonstrated experience with building and operating high-performance and/or globally distributed micro-service architectures across one or more cloud providers (AWS, Azure, GCP)
- Excellent communication skills – able to write clear design docs and work effectively with both technical and non-technical team members
- Deep experience with Kubernetes internals a big plus, such as implementing non-trivial Kubernetes operators, device/storage/network plugins, custom schedulers, or patches thereon or Kubernetes itself
- Deep experience with VMs/hypervisors a big plus, such as QEMU/KVM, cloud-hypervisor, VFIO, virtio, PCIE passthrough, Kubevirt, SR-IOV
- Deep experience with DC networking tech + solutions a big plus, such as VLAN, VXLAN, VPN, VPC, OVS/OVN
- Experience with Cluster API or similar a big plus
- Experience working on high-performance compute, networking, and/or storage a big plus
- Experience virtualizing GPUs and/or Infiniband a big plus
- Strong systems knowledge across compute, networking, and storage, including concurrency, memory management, performant I/O, and scale
- Experience with infrastructure automation tools (Terraform, Ansible), monitoring/observability stacks (Prometheus, Grafana), and CI/CD pipelines (GitHub Actions, ArgoCD)
- Experience building IaaS or PaaS systems at scale a plus
- Experience with DPUs/SmartNICs a plus
- GPU programming, NCCL, CUDA knowledge a plus
About Together AI
Together AI is a research-driven artificial intelligence company. We believe open and transparent AI systems will drive innovation and create the best outcomes for society, and together we are on a mission to significantly lower the cost of modern AI systems by co-designing software, hardware, algorithms, and models. We have contributed to leading open-source research, models, and datasets to advance the frontier of AI, and our team has been behind technological advancement such as FlashAttention, Hyena, FlexGen, and RedPajama. We invite you to join a passionate group of researchers in our journey in building the next generation AI infrastructure.
Compensation
We offer competitive compensation, startup equity, health insurance, and other benefits, as well as flexibility in terms of remote work. The US base salary range for this full-time position is: $160,000 - $230,000 + equity + benefits. Our salary ranges are determined by location, level and role. Individual compensation will be determined by experience, skills, and job-related knowledge.
Equal Opportunity
Together AI is an Equal Opportunity Employer and is proud to offer equal employment opportunity to everyone regardless of race, color, ancestry, religion, sex, national origin, sexual orientation, age, citizenship, marital status, disability, gender identity, veteran status, and more.
Please see our privacy policy at
- A leading AI technology company based in San Francisco is looking for a seasoned Software Engineer with expertise in cloud architecture to join their Infrastructure Engineering team. The successful candidate will lead the design of core services infrastructure, automate...SeniorSoftware
- ...technology firm in San Francisco is seeking an experienced Infrastructure Engineer to lead the design and evolution of core services.... ...environment. Candidates should have 6+ years in software engineering, expertise in cloud services, and a Bachelor's degree in a related...SeniorSoftware
- A technology company is seeking a Senior Software Engineer, Infrastructure, in San Francisco. This role focuses on building scalable cloud solutions while collaborating with various teams. Candidates should have 5-8 years of experience, strong software engineering fundamentals...SeniorSoftware
- A blockchain analytics company in San Francisco is seeking a Senior Software Engineer, ML Infrastructure to design and operate GPU-backed systems for AI. The ideal candidate will have 5+ years of experience in building distributed infrastructure and a bachelor’s degree...SeniorSoftware
$196k - $220.5k
...after playing games. Our Platform Infrastructure teams are responsible for building... ...reliable, efficient, and scalable. As a Senior Software Engineer on these teams, you will... ...people, whether listening to music together or grinding in competitive matches for...SeniorSoftwareFull timeRelocationRelocation package$174k - $252k
Senior Software Engineer, Infrastructure, Google Cloud Platforms Google - Sunnyvale, CA, USA; San Francisco, CA, USA Requirements Bachelor’s degree or equivalent practical experience. 5 years of experience with software development in one or more programming languages...SeniorSoftwareFull time$127k - $249k
...hiring an experienced Security Software Engineer (Staff or Senior) for our Infrastructure Security team to design and build... ...services within MongoDB Atlas multi-cloud infrastructure. The team sits... ...fundamentals, and how they work together in complex systems ~...SeniorSoftwareWork at officeLocal areaRemote workWorldwideFlexible hours- A tech company specializing in property design solutions is seeking a Senior Software Engineer, Infrastructure in San Francisco. You will collaborate across engineering teams to enhance cloud infrastructure and ensure systems are reliable and scalable. Ideal candidates...SeniorSoftwareWork at office
$200k - $265k
...leading healthcare technology firm is seeking a Senior Software Engineer to design and maintain the infrastructure that empowers healthcare providers. This role involves... ...in software engineering and experience with cloud platforms, containers, and databases. The position...SeniorSoftware$174k - $252k
Google Inc. is seeking a Senior Software Engineer for Infrastructure within Google Cloud to develop innovative technologies that enhance user interaction. The ideal candidate will possess a Bachelor's degree along with substantial hands-on experience in software development...SeniorSoftware- A healthcare technology company is seeking a Senior Software Engineer to design and maintain core platform infrastructure. This role involves significant responsibility... ...software engineering experience, particularly in cloud environments and modern technology stacks. The...SeniorSoftwareRemote work
$117.2k - $313.7k
...efforts. Job Category Software Engineering Job Details About... ...agents drive customer success together. Here, ambition meets... ...Software Engineer - Public Cloud (Senior/Lead/Principal) Note:... ...Impact: Deliver cloud infrastructure automation tools,...SeniorSoftware$189k - $330.75k
...IT, and Finance. It brings together all of the workforce systems... ...the role Rippling's Infrastructure organization builds the technical... ...our global footprint, the Cloud team is tasked with... ...complexity. As a Staff Software Engineer on the Cloud Infrastructure...SoftwareWork at office3 days per week$100k - $250k
A leading AI software company in San Francisco is seeking a Senior Infrastructure Engineer to build the infrastructure for AI software development. You will work on components like AI agents and app hosting while ensuring scalable services. The ideal candidate has strong...SeniorSoftware- Senior Software Engineer, Infrastructure & Platform Role Overview: As a Senior Software Engineer, Infrastructure & Platform at AfterQuery, you will design... ...Experience designing and operating systems in cloud environments (GCP or AWS) Experience with message queues...SeniorSoftware
- Rippling is hiring a Senior Staff Software Engineer in San Francisco to lead the development of large-scale distributed systems and platform initiatives... ...engineering, with expertise in building scalable infrastructure and a strong understanding of computer science...SeniorSoftware
$250k - $350k
Senior Software Engineer - Infrastructure/Platform — AfterQuery Location: San Francisco, CA (Onsite) Compensation: $250,000 - $350,000 base + competitive equity Visa Sponsorship: None available Experience Level: Senior (5+ years) Employment Type: Full-Time About AfterQuery...SeniorSoftwareFull timeVisa sponsorship$216k - $270k
...As a Software Engineer on the Machine Learning Infrastructure team, you will build the "Operating System" for our large-scale GPU clusters. You will architect... ...plugins for specialized hardware. ~ Familiarity with cloud infrastructure (AWS, GCP) and infrastructure as...SeniorSoftwareFull time$216k - $270k
...As a Software Engineer on the ML Infrastructure team, you will design and build platforms for scalable, reliable, and efficient serving of LLMs. Our platform... ...tools (e.g., Docker, Kubernetes). ~ Familiarity with cloud infrastructure (AWS, GCP) and infrastructure as code (e...SeniorSoftwareFull time- ...growing data company in San Francisco is seeking a Senior Engineer specializing in data infrastructure to drive the technical direction of their data platform... ...candidates will have 7+ years of experience in software engineering, strong data modeling instincts, and proficiency...SeniorSoftware
$170k - $220k
...Sift Sift is the data infrastructure platform for hardware engineering teams. Sift turns high-frequency... .... Collaborate with software engineers to optimize... ...infrastructure for both cloud and on-premise... ...and Thursdays —and come together for a full week every two...SeniorSoftwarePermanent employmentWork at officeRelocation- **Job Title:**Senior Manager, Software Engineering - Cloud Platform **Location:** New York, NY; San Francisco,... ...years of software engineering/cloud infrastructure experience with 3+ years of... ...impact so you can *do your best*. Together, we’ll bring the power of Agentforce...SeniorSoftwareWork experience placementShift work
$207k - $362.25k
...Finance. It brings together all of the... ...Rippling's Infrastructure organization is responsible... ...that allow 1,000+ engineers to build and ship... ...the way we build software is fundamentally... ...are looking for a Senior Staff Engineer to... ...including local and cloud environments,...SeniorSoftwareWork at officeLocal area3 days per week- ...audit. Put simply, we build software for the people who enable... ...lives easier by bringing together up to 50% of their work and... .... About the Role As a Senior Infrastructure Engineer at Fieldguide, you'll own... ...experience constructing complex cloud solutions using multiple...SeniorSoftwareRemote workWork from homeFlexible hours
- ...execution sandboxes that power Julius across cloud environments (AWS and GCP). We... ...operate secure, multi-tenant container infrastructure with fast startup and smart autoscaling.... ...tinkering with LLMs. Why Julius Small, senior team; massive impact surface; hard infra...SeniorSoftwareRemote work
- Databricks is seeking a Senior Software Engineer (Infrastructure) in San Francisco. You will be a core technical contributor to our IT Infrastructure team, building scalable solutions and enhancing our AWS infrastructure. The ideal candidate has over 5 years of experience...SeniorSoftware
$191k - $250k
...Descript, we believe that software engineers should own the reliability... ...ship to production, so as an Infrastructure Engineer, you will drive projects... ...of at least two of: public cloud infrastructure, Linux... ...collaboration that come from working together in person.Descript is an...SeniorSoftwareWork at officeRemote workFlexible hours- ...The Senior Infrastructure Engineer serves as a key contributor in the planning, design, construction,... .../Fiber, AutoCAD, and other relevant software and network testing tools. Proficiency... ...exclusively to the health sciences. We bring together the world's leading experts in...SeniorSoftwareWork experience placementLocal areaWorldwide
$140k - $225k
...assembling a diverse, world-class team-engineers, designers, researchers, and... ...across HP's portfolio. Together, we're developing intuitive, adaptive... ...About The Role As the Senior Software Engineer, Tooling and Development Infrastructure, you will play a critical role...SeniorSoftwareFull timeTemporary workLocal areaFlexible hours$200k - $400k
...Senior Data Infrastructure Engineer Decagon is the leading conversational AI platform empowering every brand to deliver concierge customer experiences... ...four focus areas: Core Infra: The foundational cloud stack—networking, compute, storage, security, and infrastructure...SeniorSoftwareFull timeWork at officeLocal area
Do you want to receive more vacancies?
Subscribe and receive similar vacancies to Senior Software Engineer - Together Cloud Infrastructure. Be the first to apply!
- graduate software developer San Francisco, CA
- rust software engineer San Francisco, CA
- senior software design engineer San Francisco, CA
- software engineer student San Francisco, CA
- software engineer amazon San Francisco, CA
- software developer positions San Francisco, CA
- software engineer full time San Francisco, CA
- software qa engineer San Francisco, CA
- new graduate software engineer San Francisco, CA
- junior software developer San Francisco, CA

