Senior Inference Platform Engineer - Data Center

$300k
Full-time

Join a stealth-mode hyperscale data center startup building an AI and cloud platform powered by thousands of H100s, H200s, and B200s, ready for experimentation, full-scale model training, or inference.

Our client operates high-performance GPU clusters powering some of the most advanced AI workloads worldwide. They’re now building a serverless inference platform, beginning with cost-efficient batch inference and expanding into low-latency, real-time inference and custom model hosting. This is a unique chance to join at an early stage and help define the architecture, scalability, and technical direction of that platform.

If this role interests you, get in touch! You don't want to miss it.

Key Responsibilities:

  • Take ownership of the inference platform architecture, from batch to low-latency workloads.
  • Design, build, and optimise distributed inference systems to maximise GPU utilisation and minimise cold starts.
  • Integrate, tune, and operate inference engines such as vLLM, SGLang, and TensorRT-LLM across multiple model types.
  • Develop APIs, orchestration layers, and autoscaling logic to support both multi-tenant and dedicated deployments.
  • Collaborate with cross-functional teams to translate business and customer needs into robust technical solutions.
  • Stay up to date with the latest models, serving frameworks, and optimisation techniques, applying best practices in performance and efficiency.
  • Implement monitoring, alerting, and observability workflows for production systems.

Requirements:

  • 5+ years’ experience building large-scale, fault-tolerant distributed systems (ML inference, HPC, or similar).
  • Proficiency in Python, Go, Rust, or a comparable language.
  • Strong understanding of GPU software stacks (CUDA, Triton, NCCL) and Kubernetes orchestration.
  • Practical experience with model-serving frameworks such as vLLM, SGLang, TensorRT-LLM, or custom PyTorch deployments.
  • Knowledge of performance optimisation techniques, including batching, speculative decoding, quantisation, and caching.
  • Familiarity with Infrastructure-as-Code tools (Terraform, Helm) and low-level OS performance tuning.

Nice to Have:

  • Experience with event-driven or serverless architectures.
  • Exposure to hybrid cloud or multi-cluster environments.
  • Contributions to open-source ML or inference systems projects.
  • Proven track record of cost optimisation in high-performance compute environments.

Benefits:

  • Equity

Salary:

  • $300,000 gross per year
Vacancy posted more than 2 months ago
Location: San Francisco, CA