Platform & AI Clusters Lead — Scale & SRE
Hamilton Barnes Associates Limited
A pioneering technology company in San Francisco is seeking a Head of Platform/AI Cluster Management to lead AI and platform initiatives. Responsibilities include overseeing scheduler management and optimizing performance across cross-functional teams. The ideal candidate should have extensive experience in cluster management, especially with Slurm and Kubernetes. This role offers a competitive salary of $500,000 gross per year. #J-18808-Ljbffr Hamilton Barnes Associates Limited
$300k
...startup in San Francisco seeks a Platform Engineer/Senior Site Reliability Engineer to manage their AI and cloud platform. You will design and maintain large-scale GPU clusters, create automation pipelines,... ...over 7 years of experience in SRE or DevOps, strong skills in...Platform- ...seasoned leader to oversee and grow software engineering teams within the Platform organization. This role focuses on building and scaling the foundational infrastructure that supports Anthropic's AI systems. The successful candidate will have over 10 years of engineering...Platform
$153k - $255k
Rippling in San Francisco is seeking a seasoned product leader for the Global Payroll platform. This role involves designing intuitive product experiences and defining AI-driven payroll capabilities to enhance workflows. Candidates should have over 6 years of product management...PlatformWork at office- ...Ambassador Program, orchestrate numerous events, and create engaging content across platforms. Ideal candidates have experience managing teams and a proven track record in large-scale event organization. Join to leverage real, impactful technology and shape a viral community...Platform
- ...helps teams launch and scale payment experiences in... ...infrastructure. Search and AI systems are becoming a... ...& AI Search Discovery Lead to help shape how the... ...), and the third-party platforms (G2, TrustRadius, YouTube... ...and own the topic cluster and authority map for Modern...PlatformLocal areaShift work
- ...strategic Staff/Senior Staff Site Reliability Engineer (SRE) to define the future of our cloud platform. This hybrid role requires office attendance at least... ...resilience, solve complex distributed systems at scale, and mentor others in engineering excellence. A comprehensive...PlatformWork at office
- Gimlet is building AI infrastructure and orchestration platforms for large-scale AI datacenters. This Infrastructure/Cluster Engineer role involves designing, building, and operating heterogeneous... ...engineering, platform engineering, SRE, HPC, or distributed systems Deep Linux...Platform
- Ready to lead innovation at the intersection of platforms and artificial intelligence? Join... ...in cloud, AI, and data-driven solutions... ...Head of Platform/AI Cluster Management to oversee... ...closely with infra, SRE, and network teams to... ...for large-scale compute. Hands-on background...Platform
$350k
...Reliability Engineer (SRE) San Francisco Thinking... ...and tools to make AI work for their unique needs... ...'re hiring to grow the platform alongside the Tinker... ...cloud services at scale (e.g., public cloud platforms... ...debugging, and tuning clusters handling heterogeneous...PlatformLocal areaVisa sponsorshipWork visaRelocation package$300k
...startup building out their AI and cloud platform, powered by thousands of H10... ...ready for experimentation, full-scale model training, or inference... ...of one of the largest GPU clusters in private deployment. If... ...7+ years of experience in SRE, DevOps, or Infrastructure Engineering...Platform$200k - $400k
...to grow vLLM as the world's AI inference engine and accelerate... ...We're looking for a hands‑on cluster administration engineer to... ...provision, operate, debug, and scale compute across providers. Your... ..., ML infrastructure, SRE, platform engineering, or infrastructure...PlatformRemote workVisa sponsorship$170k - $230k
...Reliability Engineer (SRE) Palo Alto / San... ...Mithril Mithril is an AI infrastructure platform built to make GPU... ...affordable for the world's leading enterprises, AI... ...will shape how Mithril scales its platform across a... ...comfortable managing clusters, deployments, and...PlatformWork at officeLocal area1 day per week- ...is seeking a Member of Technical Staff in AI Supercomputing to design, build, and operate... ...environment. You will enable fast, large-scale research by ensuring high-performance... ...has a strong background in operating GPU clusters, container orchestration, and deep learning...
- ...deployment leader to revolutionize banking through AI. You will deploy an innovative loan origination platform in Tier 1 banks, coordinating across engineering and... ...and a strong track record of handling large-scale projects. Join us to shape the future of banking and...Platform
- A leading technology firm is seeking a Talent Acquisition Manager to support the scaling of their AI and cloud infrastructure platform in San Francisco. The role involves hiring across various technology sectors including product, platform, and software engineering. Candidates...Platform
- ...in San Francisco, is hiring a Marketing Lead to shape their brand narrative and editorial... ...and short-form content across various platforms while ensuring alignment with sales strategies... .... Join us to influence our marketing function as we scale. #J-18808-Ljbffr Sumble IncPlatform
- Qcells is seeking a Senior DevOps & SRE Manager in San Francisco to lead the reliability and operational excellence of our multi-platform ecosystem. You will manage third-party providers... ...extensive experience in managing large-scale production environments and will work...Platform
- ...drive customer activation within the Cash App team. This role involves designing and analyzing experiments at scale for one of the largest consumer fintech platforms. As an essential team member, you will shape product strategy through data-driven insights and...Platform
$180k - $300k
This is a job that Jill, our AI Recruiter, is recruiting for on... ...to speak to Jack . SF Launch Lead ($180k-$300k + Equity) at VC-backed... ...reviewing roles to ensure platform liquidity and success. The... ...founder or early GTM leader who has scaled a startup from $0 to $1M+ ARR....PlatformLocal area- ...We are hiring for the role of Lead or Principal Account Solution... ...deep expertise in Generative AI. This Solution Engineer will also... ...impact Salesforce solutions. Platform and Data Expertise :... ...Salesforce solutions integrate at scale. Lead the discovery, analysis,...Platform
- ...in San Francisco. This role involves owning the inference platform that is crucial to AI interactions in the product. You will manage the whole... ...path and be responsible for optimizing traffic routing, cluster management, and performance. The ideal candidate has a strong...Platform
- The Stellar Development Foundation (SDF) is seeking a Director of Site Reliability Engineering to lead a dynamic SRE team. This senior role involves shaping engineering culture while improving production services and core infrastructure. With a focus on operational excellence...Platform
$194.83k - $204.58k
Hilbert AI Co. seeks an AI Growth Lead in San Francisco, CA to drive AI‑led growth by combining digital marketing... ...intelligent acquisition solutions at scale and increase the sales for our... ...high‑impact marketing actions across platforms. Complete regular competitor and...PlatformRemote work$190k - $253.75k
A leading data and AI platform company in San Francisco is seeking an Engineering Manager to lead a team responsible for critical components of its... ...candidate has over 10 years of experience with large-scale distributed systems and 3 years of engineering management....Platform- About Us 1mind is a platform that deploys multimodal Superhumans for... ...and product knowledge. They can lead unlimited, simultaneous... ...seamlessly into existing workflows, scale instantly, and drive measurable... ...We’re looking for an AI Research Lead to define and drive...PlatformFull timeLive inRelocationVisa sponsorship
$100k - $170k
Founding AI Implementation Lead -Minoa (San Francisco) San Francisco | $100-170k base + 0.3-1.0% equity... ...customer lifecycle, creating the AI platform for value intelligence. We work with... ...product should go next. Automate and scale. You'll manage dozens of accounts....Platform$150k - $170k
...About Us Crescendo is the first AI‑native contact center —... ...measurable outcomes at speed and scale. We don’t just ship software.... ...hiring multiple AI Innovation Leads to help embed practical, process... ...tools like Asana, Jira, CRM platforms, or operational systems Strengthen...PlatformFull timeContract workWork at officeRemote work- ...based in San Francisco is looking for an Engineering Manager to lead a team in transforming user experiences through experimentation... ...SDUI migration and ensuring the delivery of member-facing UI at scale. The company offers a hybrid work model, requiring in-office presence...PlatformWork at office3 days per week
- ...'re hiring a Leader for our AI / ML / Data Science team (US... ...with customers, and the rest leading and scaling the data science team (about... ...(representations, clustering, change‑point detection, weak... ...generation autonomous operations platform delivered as SaaS codifies the...PlatformFull timeLocal area
$250k - $270k
...This hybrid role demands a passion for building scalable systems and leading cross-functional teams. You will take on the Technical Lead Architect role overseeing Ironclad's scaling as it utilizes AI tools to enhance contract intelligence. Ideal candidates have over 7...PlatformContract workWork at office
Do you want to receive more vacancies?
Subscribe and receive similar vacancies to Platform & AI Clusters Lead — Scale & SRE. Be the first to apply!

