Sign up to access all features of our service.
  • Job search
  • Favorites
  • Create a CV
    New
  • Salaries
  • Subscriptions

Senior Inference Platform Engineer - Data Center

$300k
Full-time

Join a stealth-mode hyperscale data center startup building an AI and cloud platform, powered by thousands of H100s, H200s, and B200s,  ready to go for experimentation, full-scale model training, or inference. 

Our client operates high-performance GPU clusters powering some of the most advanced AI workloads worldwide. They’re now building a serverless inference platform,  beginning with cost-efficient batch inference and expanding into low-latency, real-time inference and custom model hosting. This is a unique chance to join at an early stage and help define the architecture, scalability, and technical direction of that platform.

If you are interested in this opportunity, get in tuch!  You don't want to miss this opportunity! 

Key Responsibilities

  • Take ownership of the inference platform architecture, from batch to low-latency workloads.
  • Design, build, and optimise distributed inference systems to maximise GPU utilisation and minimise cold starts.
  • Integrate, tune, and operate inference engines such as vLLM, SGLang, and TensorRT-LLM across multiple model types.
  • Develop APIs, orchestration layers, and autoscaling logic to support both multi-tenant and dedicated deployments.
  • Collaborate with cross-functional teams to translate business and customer needs into robust technical solutions.
  • Stay up to date with the latest models, serving frameworks, and optimisation techniques, applying best practices in performance and efficiency.
  • Implement monitoring, alerting, and observability workflows for production systems.

Requirements:

  • 5+ years’ experience building large-scale, fault-tolerant distributed systems (ML inference, HPC, or similar).
  • Proficiency in Python, Go, Rust, or a comparable language.
  • Strong understanding of GPU software stacks (CUDA, Triton, NCCL) and Kubernetes orchestration.
  • Practical experience with model-serving frameworks such as vLLM, SGLang, TensorRT-LLM, or custom PyTorch deployments.
  • Knowledge of performance optimisation techniques, including batching, speculative decoding, quantisation, and caching.
  • Familiarity with Infrastructure-as-Code tools (Terraform, Helm) and low-level OS performance tuning.

Nice to Have

  • Experience with event-driven or serverless architectures.
  • Exposure to hybrid cloud or multi-cluster environments.
  • Contributions to open-source ML or inference systems projects.
  • Proven track record of cost optimisation in high-performance compute environments.

Benefits:

  • Equity

Salary:

  • $300,000 gross per year
Vacancy posted more than 2 months ago
Similar jobs that could be interesting for youBased on the Senior Inference Platform Engineer - Data Center in San Francisco, CA vacancy
  • $160k - $250k

     ...Senior Backend Engineer, Inference Platform San Francisco About the Role Together AI is building the Inference Platform that brings the most...  ...routing, ensuring low-latency load balancing across data centers and model engine pods. Develop auto-scaling systems... 
    Senior
    Full time
    Local area

    Together AI

    San Francisco, CA
    3 days ago
  • $139.2k - $174k

     ...applications.We are seeking a Senior Engineer 2 to play a key role in our...  ...for running AI workloads— inference, training, fine-tuning— at scale...  ...to ensure our global platform remains simple, reliable, and...  ...position is based on market data, relevant years of experience... 
    Senior
    Local area
    Remote work
    Worldwide
    Flexible hours

    DigitalOcean

    San Francisco, CA
    2 days ago
  • $200k - $220k

     ...experts across energy, manufacturing, data center construction, and cloud services. If...  ...This Role: Join Crusoe Energy as a Senior Data Engineer, an early and pivotal hire on our...  ...architect and build the foundational data platform infrastructure that powers Crusoe's AI... 
    Senior
    Full time
    Temporary work
    Work at office
    Remote work

    Crusoe

    San Francisco, CA
    3 days ago
  • $230k - $265k

    Parafin is seeking a Software Engineer to lead the evolution of their ML Platform, ensuring robust and scalable systems for data scientists. The role requires 5+ years of software...  ...platform functionalities, enhance real-time inference processes, and collaborate across teams... 
    Senior
    Remote job

    Parafin

    San Francisco, CA
    2 days ago
  • $180k - $280k

    A software monitoring company is looking for a software engineer for their Streaming Platform team in San Francisco. The role involves designing and operating components for real-time data processing, focusing on reliability and performance. The ideal candidate has experience... 
    Senior

    Sentry

    San Francisco, CA
    3 days ago
  • I did my part and supported the Regular Toilet is seeking a Senior Data Platform Engineer to join our innovative data team in San Francisco, California. In this role, you will collaborate closely with various teams to build top-tier data platform solutions, leveraging technologies... 
    Senior
    Work at office
    3 days per week

    I did my part and supported the Regular Toilet

    San Francisco, CA
    4 days ago
  • $202.5k - $247.5k

     ...Software Engineer III/Senior, Data Platform ngrok is an all-in-one cloud networking platform that secures, transforms, and routes traffic to services...  ...has grown into a universal gateway for API delivery, AI inference, device fleets, and site-to-site connectivity. It's the... 
    Senior
    Permanent employment
    Full time
    Live in
    Work at office
    Local area
    Remote work
    Home office
    Flexible hours

    ngrok

    San Francisco, CA
    2 days ago
  • OpenAI is seeking a signal integrity system design engineer in San Francisco to design and optimize AI supercomputer hardware. The ideal candidate should have over 10 years of industry experience and a strong background in system design knowledge, especially in signal... 
    Senior
    Relocation package

    OpenAI

    San Francisco, CA
    2 days ago
  • $96.8k - $306.4k

     ...Job Description Seeking a senior staff-level engineer with expertise in datacenter platform firmware. This role requires working across product, infrastructure...  ...5 About Us Only Oracle brings together the data, infrastructure, applications, and expertise to power... 
    Senior
    Temporary work
    Flexible hours

    Oracle

    San Francisco, CA
    1 day ago
  • A leading AI research firm in San Francisco is seeking a Data Center Controls Network Engineer to design and manage OT network architectures for high-density data centers. The ideal candidate has over 8 years of experience in controls engineering, industrial networking,... 
    Senior

    OpenAI

    San Francisco, CA
    2 days ago
  • Orbital in San Francisco is seeking a Senior Mechanical Engineer to design and optimize advanced modular data centers with a focus on innovative cooling applications. The role involves hands-on participation in research and development, emphasizing system design and performance... 
    Senior

    Orbital

    San Francisco, CA
    4 days ago
  • Baseten is hiring a Network Engineer (Data Centers) in San Francisco to design and own...  ...infrastructure for their GPU clusters. This senior role collaborates closely with hardware and platform teams, directly impacting model performance and inference. Responsibilities include... 
    Senior
    Flexible hours

    Baseten

    San Francisco, CA
    1 day ago
  •  ...Role We're looking for a Senior Staff Data Engineer to be the technical backbone of our Data & ML Platform team - the foundation...  ...supports model training and inference workflows reliably. Raise...  ...worldwide. With an AI-powered human-centered care model, Hinge Health... 
    Senior
    Work at office
    Local area
    Immediate start
    Remote work
    Worldwide
    3 days per week

    Hinge Health

    San Francisco, CA
    3 days ago
  • $167.2k - $209k

    A leading cloud service provider is seeking a Senior Engineer 2 for their AI Inference Data Plane team. This remote role focuses on designing and developing high-scale, resilient data plane services that enhance AI-driven applications. The ideal candidate will have strong... 
    Senior
    Remote job

    DigitalOcean

    San Francisco, CA
    19 hours ago
  • $141k - $175k

     ...to our members and to the clients and communities we serve. We are seeking a Senior Electrical Engineer to join one of our Seattle, California, or Texas offices with a focus on data center project delivery. This role involves leading electrical engineering design,... 
    Senior
    Remote work
    Flexible hours

    Arup

    San Francisco, CA
    3 days ago
  • $141k - $175k

    Arup is seeking a Senior Electrical Engineer for its San Francisco office, focused on Healthcare project delivery. This role entails leading...  ...Engineering, with a minimum of 8 years’ experience, especially in data center projects, and strong knowledge of US codes. Salary range... 
    Senior
    Work at office

    Arup

    San Francisco, CA
    2 days ago
  • Requirements A Bachelor’s degree in Mechanical Engineering or a related field is essential. Advanced degrees or certifications in mechanical...  ...in mechanical engineering with a focus on high-power data center products, including a proven track record in designing and managing... 
    Senior

    Etched

    San Francisco, CA
    4 days ago
  •  ...that turn raw compute into useful intelligence - the inference services that serve LLMs at scale and the data pipelines that feed them. One week you're hunting...  ...keeps you honest about both. Researchers and ML engineers will hand you workloads that barely run; you'll hand... 
    Flexible hours

    Adaption

    San Francisco, CA
    16 days ago
  • Intersect is looking for a Civil Engineer to provide civil and structural design guidance for grid-scale solar and data center projects in Canada. The role involves collaborating with various teams while ensuring code-compliant designs. Candidates should have a Bachelor... 
    Senior

    Intersect

    San Francisco, CA
    4 days ago
  • $237.6k - $288k

     ...alongside a team of experts across energy, manufacturing, data center construction, and cloud services. If you want to do the...  ...us at Crusoe. About the Role: Crusoe is seeking a Senior Engineering Manager, Data Plane Systems to lead the team responsible for... 
    Senior
    Temporary work

    Crusoe

    San Francisco, CA
    20 days ago
  • As a Senior Platform Engineer at vCluster Labs, you aren't just maintaining infrastructure; you are the backbone of our engineering velocity....  ...the full operational stack operators need to run their GPU data centers — managed Kubernetes, fast isolated tenant provisioning,... 
    Senior
    Remote work
    Flexible hours

    vCluster Labs

    San Francisco, CA
    1 day ago
  • $160k - $240k

     ...Senior Data Engineer San Francisco About the Role Together AI is looking for a Senior Data...  ...and globally distributed data centers. If you thrive in fast-paced environments...  ...defining and building early-stage data platforms for a rapidly scaling and data-... 
    Senior
    Full time

    Together AI

    San Francisco, CA
    2 days ago
  • $139.2k - $232.6k

    Jacobs is seeking a Senior Engineering Authority in San Francisco, CA to lead telecommunications and ICT systems in data center projects. You will be responsible for design delivery, managing subcontractors, and ensuring project quality through inspections and engineering... 
    Senior
    For subcontractor
    Flexible hours

    Jacobs

    San Francisco, CA
    4 days ago
  • $151.3k - $271.15k

    ## Senior Manager - Platform Engineering and OperationsApplylocations: San Francisco, CA, USAtime type: Full timeposted on: Posted Yesterdayjob requisition...  ...excellence across multi-cloud infrastructure, data centers, networking, developer platforms, and enterprise... 
    Senior

    Autodesk, Inc.

    San Francisco, CA
    4 days ago
  •  ...Monaco is building an AI-native revenue platform that replaces the fragmented GTM...  ...redefining what's possible when all the data lives under one roof in the age of AI...  ...software. We're looking for a Senior Data Platform Engineer to help build Monaco's data and ML platform... 
    Senior
    Work at office
    Shift work

    Monaco

    San Francisco, CA
    3 days ago
  •  ...Software Engineer We are looking for a foundational member of our engineering team: a highly motivated Software Engineer to own the design, creation, and evolution of our data platform. You will be part of the team that owns the data ingestion and management infrastructure... 
    Senior

    Crustdata (YC F24)

    San Francisco, CA
    19 hours ago
  • $172k - $229k

     ...petabytes of multimodal sensor data. Our next-generation...  ...framework, is the engine that powers this discovery. As a Senior Machine Learning Engineer...  ...while drastically reducing inference latency and memory...  ...a mission-critical data platform serving the entire ML organization... 
    Senior
    Work at office
    Remote work

    Motional

    San Francisco, CA
    23 days ago
  •  ...Experience designing data workflows, feature pipelines...  ...AI/ML training, inference, experimentation, and...  ...monitoring , Data Product Engineering: Proven experience...  ...g. Spark), cloud data platforms, and high-scale compute...  ...across the industry , The Senior Data Engineer will be... 
    Senior

    Komodo Health

    San Francisco, CA
    4 days ago
  • A leading data collaboration platform in San Francisco is seeking an experienced Software Engineer to lead the development of their next-generation data processing platform. The ideal candidate will have over 5 years in software engineering, experience with object-oriented... 
    Senior
    Remote job

    LiveRamp

    San Francisco, CA
    3 days ago
  • A leading data and AI company in San Francisco is seeking a Sr. Staff Software Engineer to join their executive leadership team. This role involves designing the Data Intelligence Platform, developing scalable tools and infrastructure, and collaborating with tech leaders... 
    Senior

    Menlo Ventures

    San Francisco, CA
    3 days ago

Do you want to receive more vacancies?

Subscribe and receive similar vacancies to Senior Inference Platform Engineer - Data Center. Be the first to apply!