Sign up to access all features of our service.
  • Job search
  • Favorites
  • Create a CV
    New
  • Salaries
  • Subscriptions

Senior Software Engineer - Together Cloud Infrastructure

$160k - $230k

Together AI

Senior Software Engineer - Together Cloud Infrastructure

San Francisco

About the Role

Together AI is building the AI Acceleration Cloud, an end-to-end platform for the full generative AI lifecycle, combining the fastest LLM inference engine with state-of-the-art AI cloud infrastructure.

As a Senior AI Infrastructure Engineer, you will play a key role in building the next generation AI cloud platform – a highly available, global, blazing-fast cloud infrastructure that virtualizes cutting-edge ML hardware (GB200s/GB300s, BlueField DPUs) and enables state-of-the-art ML practitioners with self-serve AI cloud services, such as on-demand + managed Kubernetes and Slurm clusters. This platform serves both our internal SaaS products (inference, fine-tuning) and our external cloud customers, spanning dozens of data centers across the world.

Responsibilities
  • Design, build, and maintain performant, secure, and highly-available backend services/operators that run in our data centers and automate hardware management, such as Infiniband partitioning, in-DC parallel storage provisioning, and VM provisioning.
  • Design and build out the IaaS software layer for a new GB200 data center with thousands of GPUs.
  • Work on a global multi-exabyte high-performance object store, serving massive datasets for pretraining.
  • Build advanced observability stacks for our customers with automated node lifecycle management for fault-tolerant distributed pretraining.
  • Perform architecture and research work for decentralized AI workloads
  • Work on the core, open-source Together AI platform
  • Create services, tools, and developer documentation
  • Create testing frameworks for robustness and fault-tolerance

To be successful, you'll need to be deeply technical and possess excellent communication, collaboration, and diplomacy skills. You have strong fundamental software development skills. In addition, you have strong systems knowledge and troubleshooting abilities.

Requirements
  • 5+ years of professional software development experience and proficiency in at least one backend programming language (Golang desired)
  • 5+ years experience writing high-performance, well-tested, production quality code
  • Demonstrated experience with building and operating high-performance and/or globally distributed micro-service architectures across one or more cloud providers (AWS, Azure, GCP)
  • Excellent communication skills – able to write clear design docs and work effectively with both technical and non-technical team members
  • Deep experience with Kubernetes internals a big plus, such as implementing non-trivial Kubernetes operators, device/storage/network plugins, custom schedulers, or patches thereon or Kubernetes itself
  • Deep experience with VMs/hypervisors a big plus, such as QEMU/KVM, cloud-hypervisor, VFIO, virtio, PCIE passthrough, Kubevirt, SR-IOV
  • Deep experience with DC networking tech + solutions a big plus, such as VLAN, VXLAN, VPN, VPC, OVS/OVN
  • Experience with Cluster API or similar a big plus
  • Experience working on high-performance compute, networking, and/or storage a big plus
  • Experience virtualizing GPUs and/or Infiniband a big plus
  • Strong systems knowledge across compute, networking, and storage, including concurrency, memory management, performant I/O, and scale
  • Experience with infrastructure automation tools (Terraform, Ansible), monitoring/observability stacks (Prometheus, Grafana), and CI/CD pipelines (GitHub Actions, ArgoCD)
  • Experience building IaaS or PaaS systems at scale a plus
  • Experience with DPUs/SmartNICs a plus
  • GPU programming, NCCL, CUDA knowledge a plus
About Together AI

Together AI is a research-driven artificial intelligence company. We believe open and transparent AI systems will drive innovation and create the best outcomes for society, and together we are on a mission to significantly lower the cost of modern AI systems by co-designing software, hardware, algorithms, and models. We have contributed to leading open-source research, models, and datasets to advance the frontier of AI, and our team has been behind technological advancement such as FlashAttention, Hyena, FlexGen, and RedPajama. We invite you to join a passionate group of researchers in our journey in building the next generation AI infrastructure.

Compensation

We offer competitive compensation, startup equity, health insurance, and other benefits, as well as flexibility in terms of remote work. The US base salary range for this full-time position is: $160,000 - $230,000 + equity + benefits. Our salary ranges are determined by location, level and role. Individual compensation will be determined by experience, skills, and job-related knowledge.

Equal Opportunity

Together AI is an Equal Opportunity Employer and is proud to offer equal employment opportunity to everyone regardless of race, color, ancestry, religion, sex, national origin, sexual orientation, age, citizenship, marital status, disability, gender identity, veteran status, and more.

Please see our privacy policy at

Vacancy posted 5 days ago
Similar jobs that could be interesting for youBased on the Senior Software Engineer - Together Cloud Infrastructure in San Francisco, CA vacancy
  • A leading AI technology company based in San Francisco is looking for a seasoned Software Engineer with expertise in cloud architecture to join their Infrastructure Engineering team. The successful candidate will lead the design of core services infrastructure, automate... 
    Senior
    Software

    Hayden AI

    San Francisco, CA
    3 days ago
  •  ...technology firm in San Francisco is seeking an experienced Infrastructure Engineer to lead the design and evolution of core services....  ...environment. Candidates should have 6+ years in software engineering, expertise in cloud services, and a Bachelor's degree in a related... 
    Senior
    Software

    Hayden AI Technologies, Inc.

    San Francisco, CA
    3 days ago
  • A technology company is seeking a Senior Software Engineer, Infrastructure, in San Francisco. This role focuses on building scalable cloud solutions while collaborating with various teams. Candidates should have 5-8 years of experience, strong software engineering fundamentals... 
    Senior
    Software

    HOVER

    San Francisco, CA
    3 days ago
  • A blockchain analytics company in San Francisco is seeking a Senior Software Engineer, ML Infrastructure to design and operate GPU-backed systems for AI. The ideal candidate will have 5+ years of experience in building distributed infrastructure and a bachelor’s degree... 
    Senior
    Software

    TRM Labs

    San Francisco, CA
    2 days ago
  • $196k - $220.5k

     ...after playing games. Our Platform Infrastructure teams are responsible for building...  ...reliable, efficient, and scalable. As a Senior Software Engineer on these teams, you will...  ...people, whether listening to music together or grinding in competitive matches for... 
    Senior
    Software
    Full time
    Relocation
    Relocation package

    Discord

    San Francisco, CA
    1 day ago
  • $174k - $252k

    Senior Software Engineer, Infrastructure, Google Cloud Platforms Google - Sunnyvale, CA, USA; San Francisco, CA, USA Requirements Bachelor’s degree or equivalent practical experience. 5 years of experience with software development in one or more programming languages... 
    Senior
    Software
    Full time

    Google Inc.

    San Francisco, CA
    2 days ago
  • $127k - $249k

     ...hiring an experienced Security Software Engineer (Staff or Senior) for our Infrastructure Security team to design and build...  ...services within MongoDB Atlas multi-cloud infrastructure. The team sits...  ...fundamentals, and how they work together in complex systems ~... 
    Senior
    Software
    Work at office
    Local area
    Remote work
    Worldwide
    Flexible hours

    MongoDB

    San Francisco, CA
    4 days ago
  • A tech company specializing in property design solutions is seeking a Senior Software Engineer, Infrastructure in San Francisco. You will collaborate across engineering teams to enhance cloud infrastructure and ensure systems are reliable and scalable. Ideal candidates... 
    Senior
    Software
    Work at office

    HOVER

    San Francisco, CA
    3 days ago
  • $200k - $265k

     ...leading healthcare technology firm is seeking a Senior Software Engineer to design and maintain the infrastructure that empowers healthcare providers. This role involves...  ...in software engineering and experience with cloud platforms, containers, and databases. The position... 
    Senior
    Software

    Ambience Healthcare, Inc.

    San Francisco, CA
    4 days ago
  • $174k - $252k

    Google Inc. is seeking a Senior Software Engineer for Infrastructure within Google Cloud to develop innovative technologies that enhance user interaction. The ideal candidate will possess a Bachelor's degree along with substantial hands-on experience in software development... 
    Senior
    Software

    Google Inc.

    San Francisco, CA
    2 days ago
  • A healthcare technology company is seeking a Senior Software Engineer to design and maintain core platform infrastructure. This role involves significant responsibility...  ...software engineering experience, particularly in cloud environments and modern technology stacks. The... 
    Senior
    Software
    Remote work

    Ambience Healthcare

    San Francisco, CA
    1 day ago
  • $117.2k - $313.7k

     ...efforts. Job Category Software Engineering Job Details About...  ...agents drive customer success together. Here, ambition meets...  ...Software Engineer - Public Cloud (Senior/Lead/Principal) Note:...  ...Impact: Deliver cloud infrastructure automation tools,... 
    Senior
    Software

    Salesforce.Com Inc

    San Francisco, CA
    1 day ago
  • $189k - $330.75k

     ...IT, and Finance. It brings together all of the workforce systems...  ...the role Rippling's Infrastructure organization builds the technical...  ...our global footprint, the Cloud team is tasked with...  ...complexity. As a Staff Software Engineer on the Cloud Infrastructure... 
    Software
    Work at office
    3 days per week

    Rippling

    San Francisco, CA
    2 days ago
  • $100k - $250k

    A leading AI software company in San Francisco is seeking a Senior Infrastructure Engineer to build the infrastructure for AI software development. You will work on components like AI agents and app hosting while ensuring scalable services. The ideal candidate has strong... 
    Senior
    Software

    Hercules

    San Francisco, CA
    4 days ago
  • Senior Software Engineer, Infrastructure & Platform Role Overview: As a Senior Software Engineer, Infrastructure & Platform at AfterQuery, you will design...  ...Experience designing and operating systems in cloud environments (GCP or AWS) Experience with message queues... 
    Senior
    Software

    AfterQuery

    San Francisco, CA
    2 days ago
  • Rippling is hiring a Senior Staff Software Engineer in San Francisco to lead the development of large-scale distributed systems and platform initiatives...  ...engineering, with expertise in building scalable infrastructure and a strong understanding of computer science... 
    Senior
    Software

    Rippling

    San Francisco, CA
    3 days ago
  • $250k - $350k

    Senior Software Engineer - Infrastructure/Platform — AfterQuery Location: San Francisco, CA (Onsite) Compensation: $250,000 - $350,000 base + competitive equity Visa Sponsorship: None available Experience Level: Senior (5+ years) Employment Type: Full-Time About AfterQuery... 
    Senior
    Software
    Full time
    Visa sponsorship

    David Joseph & Company

    San Francisco, CA
    3 days ago
  • $216k - $270k

     ...As a Software Engineer on the Machine Learning Infrastructure team, you will build the "Operating System" for our large-scale GPU clusters. You will architect...  ...plugins for specialized hardware. ~ Familiarity with cloud infrastructure (AWS, GCP) and infrastructure as... 
    Senior
    Software
    Full time

    Scale AI

    San Francisco, CA
    2 days ago
  • $216k - $270k

     ...As a Software Engineer on the ML Infrastructure team, you will design and build platforms for scalable, reliable, and efficient serving of LLMs. Our platform...  ...tools (e.g., Docker, Kubernetes). ~ Familiarity with cloud infrastructure (AWS, GCP) and infrastructure as code (e... 
    Senior
    Software
    Full time

    Scale AI

    San Francisco, CA
    1 day ago
  •  ...growing data company in San Francisco is seeking a Senior Engineer specializing in data infrastructure to drive the technical direction of their data platform...  ...candidates will have 7+ years of experience in software engineering, strong data modeling instincts, and proficiency... 
    Senior
    Software

    Middesk

    San Francisco, CA
    4 days ago
  • $170k - $220k

     ...Sift Sift is the data infrastructure platform for hardware engineering teams. Sift turns high-frequency...  .... Collaborate with software engineers to optimize...  ...infrastructure for both cloud and on-premise...  ...and Thursdays —and come together for a full week every two... 
    Senior
    Software
    Permanent employment
    Work at office
    Relocation

    Sift Science

    San Francisco, CA
    2 days ago
  • **Job Title:**Senior Manager, Software Engineering - Cloud Platform **Location:** New York, NY; San Francisco,...  ...years of software engineering/cloud infrastructure experience with 3+ years of...  ...impact so you can *do your best*. Together, we’ll bring the power of Agentforce... 
    Senior
    Software
    Work experience placement
    Shift work

    Salesforce, Inc.

    San Francisco, CA
    3 days ago
  • $207k - $362.25k

     ...Finance. It brings together all of the...  ...Rippling's Infrastructure organization is responsible...  ...that allow 1,000+ engineers to build and ship...  ...the way we build software is fundamentally...  ...are looking for a Senior Staff Engineer to...  ...including local and cloud environments,... 
    Senior
    Software
    Work at office
    Local area
    3 days per week

    Rippling

    San Francisco, CA
    1 day ago
  •  ...audit. Put simply, we build software for the people who enable...  ...lives easier by bringing together up to 50% of their work and...  .... About the Role As a Senior Infrastructure Engineer at Fieldguide, you'll own...  ...experience constructing complex cloud solutions using multiple... 
    Senior
    Software
    Remote work
    Work from home
    Flexible hours

    Fieldguide

    San Francisco, CA
    5 days ago
  •  ...execution sandboxes that power Julius across cloud environments (AWS and GCP). We...  ...operate secure, multi-tenant container infrastructure with fast startup and smart autoscaling....  ...tinkering with LLMs. Why Julius Small, senior team; massive impact surface; hard infra... 
    Senior
    Software
    Remote work

    Julius

    San Francisco, CA
    2 days ago
  • Databricks is seeking a Senior Software Engineer (Infrastructure) in San Francisco. You will be a core technical contributor to our IT Infrastructure team, building scalable solutions and enhancing our AWS infrastructure. The ideal candidate has over 5 years of experience... 
    Senior
    Software

    I did my part and supported the Regular Toilet

    San Francisco, CA
    5 days ago
  • $191k - $250k

     ...Descript, we believe that software engineers should own the reliability...  ...ship to production, so as an Infrastructure Engineer, you will drive projects...  ...of at least two of: public cloud infrastructure, Linux...  ...collaboration that come from working together in person.Descript is an... 
    Senior
    Software
    Work at office
    Remote work
    Flexible hours

    Descript

    San Francisco, CA
    1 day ago
  •  ...The Senior Infrastructure Engineer serves as a key contributor in the planning, design, construction,...  .../Fiber, AutoCAD, and other relevant software and network testing tools. Proficiency...  ...exclusively to the health sciences. We bring together the world's leading experts in... 
    Senior
    Software
    Work experience placement
    Local area
    Worldwide

    University of California , San Francisco

    San Francisco, CA
    3 days ago
  • $140k - $225k

     ...assembling a diverse, world-class team-engineers, designers, researchers, and...  ...across HP's portfolio. Together, we're developing intuitive, adaptive...  ...About The Role As the Senior Software Engineer, Tooling and Development Infrastructure, you will play a critical role... 
    Senior
    Software
    Full time
    Temporary work
    Local area
    Flexible hours

    HP IQ

    San Francisco, CA
    2 days ago
  • $200k - $400k

     ...Senior Data Infrastructure Engineer Decagon is the leading conversational AI platform empowering every brand to deliver concierge customer experiences...  ...four focus areas: Core Infra: The foundational cloud stack—networking, compute, storage, security, and infrastructure... 
    Senior
    Software
    Full time
    Work at office
    Local area

    Decagon

    San Francisco, CA
    1 day ago

Do you want to receive more vacancies?

Subscribe and receive similar vacancies to Senior Software Engineer - Together Cloud Infrastructure. Be the first to apply!