Sign up to access all features of our service.
  • Job search
  • Favorites
  • Create a CV
    New
  • Salaries
  • Subscriptions

Head of Platform/AI Cluster Management - System Integrator

Hamilton Barnes Associates Limited

Ready to lead innovation at the intersection of platforms and artificial intelligence? Join a pioneering technology company driving advancements in cloud, AI, and data-driven solutions across global markets. The organization is recognized for fostering innovation, scalability, and collaboration through cutting-edge platforms that empower enterprises to evolve intelligently. The team is hiring a Head of Platform/AI Cluster Management to oversee the strategic development, integration, and optimization of AI and platform initiatives. The role will focus on leading cross-functional teams, enhancing performance and scalability, and aligning technology strategy with long-term business goals. Shape the future of intelligent platforms and transformative innovation. Apply now! Responsibilities Own the scheduler/runtime layer (Slurm, Kubernetes, Ray), including multi-tenancy, quotas, and GPU/host fleet management. Lead cluster operations across images, CI/CD, repair/health, performance/telemetry, and incident response. Deliver platform services that ensure workload SLOs and reliable runtime execution. Define and implement namespace/tenancy design, node health automation, golden images, admission controls, on-call runbooks, and go-live gates. Collaborate closely with infra, SRE, and network teams to optimize workload placement and cluster efficiency. Provide hands-on expertise in NCCL behaviours, placement strategies, and congestion signal management. Requirements Deep expertise in cluster management, scheduling, and runtime environments for large-scale compute. Hands-on background with Slurm, Kubernetes, Ray, or similar orchestration platforms. Strong understanding of NCCL performance tuning, workload isolation, and congestion management. Experience scaling multi-tenant, GPU-heavy clusters with strict SLOs. Ability to thrive in a startup environment with full ownership over platform and cluster strategy. Salary $500,000 gross per year (Negotiable) #J-18808-Ljbffr

Vacancy posted 2 hours ago
Similar jobs that could be interesting for youBased on the Head of Platform/AI Cluster Management - System Integrator in San Francisco, CA vacancy
  •  ...Global Head Of Gsi Alliances Our client is one...  ...number one operating system in the cloud sold across...  ...popular development platforms. Our client’s renowned...  ...enterprises and telcos, and managed service providers at...  .... Global system integrators (GSI) play a vital role... 
    Platform

    MBR Partners

    San Francisco, CA
    3 days ago
  •  ...A leading incident response platform is seeking a Senior BDR Leader to scale their global...  ...developing coaching frameworks, utilizing AI tools to enhance productivity, and maintaining...  ...growth, this role requires experience in managing teams across geographies and successfully... 
    Platform

    Incident

    San Francisco, CA
    3 hours ago
  • $315k - $380k

     ...jobr.pro is seeking a Manager of Startups Applied AI Architects in San Francisco, California. In this role, you will lead a team to drive the adoption...  ...technologies among startups using the Claude Developer Platform. You will develop strategies and playbooks to ensure... 
    Platform

    Jobr

    San Francisco, CA
    3 hours ago
  • $170k - $240k

     ...ZipHQ, Inc. is seeking a hands-on leader for their Internal AI team in San Francisco. This role focuses on defining AI initiatives, collaborating with departments, and enhancing AI capabilities across the organization. The ideal candidate will have over 7 years of experience... 
    Platform
    Flexible hours

    ZipHQ, Inc.

    San Francisco, CA
    3 hours ago
  •  ...Incident.ioincident.io is the leading AI incident response platform, built to help teams dramatically...  ...and improve GTM productivity.Head of GTM Systems & Applied AIAs the Head of GTM Systems...  ...), ensuring systems stay lean, integrated, and aligned with GTM strategy.Identify... 
    Platform

    Incident

    San Francisco, CA
    1 day ago
  • $275k - $325k

     ...Head Of Applied AI San Francisco Bay Area, CA Why Weave Exists At...  ...inexpensively as possible. The Weave Platform streamlines regulatory...  ...Head of Applied AI that can manage and lead our Applied AI team...  ...robust and scalable systems with flexible and scalable UI... 
    Platform
    Work at office
    Remote work
    Flexible hours

    Weave, Inc.

    San Francisco, CA
    7 days ago
  •  ...Head Of Ai & Machine Learning As the Head of AI & Machine Learning...  ...of transformative AI systems, leveraging generative AI and...  ...scaling of a multi-agent AI platform to deliver sophisticated, end...  ...to-end solutions. Enhance integration with foundation models while... 
    Platform

    Pivotal Solutions Inc

    San Francisco, CA
    4 days ago
  • $164k - $261.5k

     ...Salesforce is the #1 AI CRM, where humans with...  ...are looking for a Senior Manager of Strategic Modeling who...  ..., but as a dynamic system of human and digital capabilities...  ...the enterprise’s first integrated capacity model—one that...  ...Python, or advanced BI platforms—and can wrangle... 
    Platform
    Shift work

    Centaur Labs

    San Francisco, CA
    2 hours ago
  •  ...Head Of GTM, AI Inference Hybrid At Cloudflare, we are on a mission...  ...AI inference platforms in the market. We aren't building...  ...serverless inference platform integrated into the world's largest developer...  ...in investment banking, management consulting, or a... 
    Platform
    Temporary work
    Flexible hours
    Shift work

    Cloudflare Inc

    San Francisco, CA
    6 days ago
  • $203k - $399k

     ...Mongo is seeking a Head of AI Platform, GM to lead the development and scaling of a new AI Applications Platform. With millions of developers...  ...is building the future to enable them to create, deploy and manage AI Applications at enterprise scale. As GM you will own the... 
    Platform
    Local area
    Remote work
    Worldwide
    Flexible hours
    Day shift

    MongoDB

    San Francisco, CA
    5 days ago
  •  ...autonomous development systems — shipping code,...  ...the Role As the Systems Integrators Partnerships Lead at Factory...  ...transformative AI-powered development solutions...  ...in AI, cloud platforms, developer tools, or other...  ...partner motion. Experience managing partner relationships... 
    Platform

    Factory

    San Francisco, CA
    3 hours ago
  •  ...Head Of Ai Agent Systems San Francisco About Wonderschool Wonderschool builds software...  ...in childcare, where we provide a platform that helps providers manage enrollment, operations,...  ...protect revenue and improve program integrity What Success Looks Like (6-1... 
    Platform
    Immediate start
    Shift work

    Wonderschool

    San Francisco, CA
    27 days ago
  •  ...Description: Role: Head AI Architect Location:...  ...implementing production grade AI/ML systems that scale across enterprise...  ...•Build Agents, Models & AI platforms: Fine-tune and customize LLMs...  ...•MLOps, Model Lifecycle Management •Cloud AI Infrastructure... 
    Platform

    TEPHRA

    San Francisco, CA
    2 days ago
  • $200k - $270k

     ...Head Of Developer Experience Los Angeles, San Francisco...  ...and avatar layer for AI agents and developer...  ..., and build on our platform. The AI agent ecosystem...  ...Claude Code) to drive integration partnerships and ensure...  ...relations, technical product management, or technical product... 
    Platform
    Work experience placement
    Remote work

    Heygen

    San Francisco, CA
    21 hours ago
  • $160k - $240k

     ...Head Of Developer Relations SF Bay Area (Hybrid) Parasail is redefining AI infrastructure by enabling seamless deployment across a...  ...alongside our CEO's founder platform, with content and community...  ...infra, backend or distributed systems work, open-source contributions... 
    Platform
    Work at office

    Parasail

    San Francisco, CA
    3 days ago
  • $212k

    Uber AI Solutions delivers high-quality scaled...  ...strength of building a platform for flexible work will...  ...a Solutions Architect Manager that will be the technical...  ...in Autonomous Systems, Robotics, and Generative...  ...pilots, and solve complex integrations alongside the team.... 
    Platform
    Full time
    Work at office
    Local area
    Remote work
    Flexible hours

    Uber

    San Francisco, CA
    3 days ago
  • $220k - $300k

     ...and our team is building its AI-native future. About the...  ...Sentry spans four areas: the integration and developer platform, docs, community, and...  ...of it. We recently had our Head of DevEx depart, and we’re...  ...role, with at least 3 years managing people Experience owning or... 
    Platform
    Hourly pay

    Sentry

    San Francisco, CA
    1 day ago
  • A pioneering technology company in San Francisco is seeking a Head of Platform/AI Cluster Management to lead AI and platform initiatives. Responsibilities include overseeing scheduler management and optimizing performance across cross-functional teams. The ideal candidate... 
    Platform

    Hamilton Barnes Associates Limited

    San Francisco, CA
    4 days ago
  •  ...and run a petabyte-scale Ceph cluster that we manage ourselves. We’ve raised $13....  ...founded by a team with deep systems and scaling experience,...  ...led by Jon Boyer, formerly Head of Sales at Zapier. We’re now...  ...infrastructure into a broader platform: running agent sandboxes at... 
    Platform

    Blacksmith

    San Francisco, CA
    1 day ago
  •  ...direction, builds the AI and computational infrastructure...  ...a building role: the Head of Bio AI will...  ...become durable technical systems. Help make build-vs...  ...production-grade, open platforms the scientific community...  ...Attributes Has managed small technical teams;... 
    Platform
    H1b
    Visa sponsorship
    3 days per week

    Astera Institute

    Emeryville, CA
    3 days ago
  •  ...OpenAI’s products and platform across SMB, enterprise,...  ...discover, evaluate, and adopt AI. We partner closely...  ...We’re looking for a Head of Demand Generation to...  ...other channels to ensure integrated execution across the...  ...the capabilities of AI systems and seek to safely deploy... 
    Platform
    Work at office
    Relocation package

    OpenAI

    San Francisco, CA
    1 day ago
  • $245k - $272k

     ...Head Of ML/AI Engineering Denver, CO;San Francisco, CA;New York, NY...  ...to build AI- and ML-powered systems that improve customer experiences...  ...AI, ML, risk modeling, and platform capabilities come together across...  ...do day-to-day: Lead, manage, and develop a broad AI/MLE... 
    Platform
    Full time
    Work at office
    Remote work
    2 days per week
    3 days per week

    Gusto

    San Francisco, CA
    2 days ago
  • $163.4k - $224.75k

    MKTQ227R35 The Senior Partner Marketing Manager, C&SI (Consulting & Systems Integrators), will accelerate Databricks’ growth with the world’s largest consulting firms by driving demand for our Data and AI Platform joint solutions and building brand awareness. You will... 
    Platform
    Full time
    Worldwide

    Databricks

    San Francisco, CA
    3 days ago
  •  ...deploy our highly capable AI products across their...  ...Partnerships team builds and manages a strategic global...  ...span technology partners, systems integrators, and strategic collaborators...  ...and impact of OpenAI’s platform. About the role We are seeking a Head of Partner Enablement to... 
    Platform
    Relocation package

    OpenAI

    San Francisco, CA
    2 hours ago
  • $210k - $270k

     ...requires human agents and AI in perfect balance, and...  ...is the only unified platform that orchestrates both...  ...in a single operating system. With AI Agents that resolve...  ...AI-powered workforce management that optimizes both...  ...The Role As the Head of AI Deployments, you... 
    Platform
    Work at office

    Assembled

    San Francisco, CA
    more than 2 months ago
  • $320k - $405k

     ...interpretable, and steerable AI systems. We want AI to be safe and...  ...capabilities can go hand in hand. Cluster Infra owns the full...  ...provisioning and lifecycle management across all major cloud providers...  ..., reliability, and cloud platforms (e.g., Kubernetes, IaC, AWS/... 
    Platform
    Work at office
    Visa sponsorship
    Flexible hours

    Anthropic

    San Francisco, CA
    1 day ago
  •  ...A leading AI productivity company based in San Francisco is seeking an experienced engineering manager to lead SRE, DBA, and DevOps teams. The role requires strong expertise in cloud infrastructure, CI/CD, and managing complex reliability practices. Ideal candidates should... 
    Platform

    Plaud

    San Francisco, CA
    2 hours ago
  •  ...Head Of Deployment Serval is an AI-native automation platform transforming how enterprises operate. We build intelligent agents...  ...processes and rigid legacy systems with adaptive, learning software...  ...code to extend Serval's platform, integrate APIs, and solve unique customer... 
    Platform
    Temporary work
    Immediate start

    Serval

    San Francisco, CA
    6 days ago
  •  ...hardware, by developing the first AI Hardware Engineer. Our goal...  ...AI-native hardware design platform. We have strong product pull,...  ...PCBs. We are looking for a Head of Revenue to own Flux's revenue...  ...serve upgrade paths Lead and manage Growth, Marketing, and... 
    Platform
    Internship
    Local area
    Shift work

    Flux Protocol

    San Francisco, CA
    4 days ago
  •  ...region. We are hiring a Head of Video Policy &...  ...human moderation and AI-driven systems, serve as a principal...  ...cultural considerations. * Manage high-complexity...  ...policy, online safety, integrity, and authenticity challenges...  ...of social media platforms. * Exceptional ability... 
    Platform

    Tik Tok

    San Francisco, CA
    3 days ago

Do you want to receive more vacancies?

Subscribe and receive similar vacancies to Head of Platform/AI Cluster Management - System Integrator. Be the first to apply!