
Senior Cloud Inference Engineer

$320k

United States Digital Space LLC

United States Digital Space LLC is seeking a backend engineer for the Cloud Inference team. This role involves designing and building infrastructure that serves Claude across cloud service providers, optimizing for performance and cost. The ideal candidate will have significant software engineering experience with a major cloud platform. We offer an annual salary ranging from $320,000 to $485,000 USD and a hybrid working policy. This position is great for individuals who thrive in cross-functional environments and take ownership of their work.

Vacancy posted 5 days ago
Similar jobs that could be interesting for you
Based on the Senior Cloud Inference Engineer in San Francisco, CA vacancy
  • United States Digital Space LLC is seeking a Staff Software Engineer to build and operate safety mechanisms that protect AI systems on cloud platforms. The ideal candidate will have significant experience in software development, particularly in trust & safety, and be... 
    Senior

    United States Digital Space LLC

    San Francisco, CA
    5 days ago
  • $139.2k - $174k

    A leading cloud services provider is looking for a Senior Engineer 2 to join their AI Infrastructure Control Plane team. This role involves architecting high-quality software solutions for AI workloads while driving design and operational excellence. Candidates should... 
    Senior
    Remote work

    DigitalOcean

    San Francisco, CA
    1 day ago
  • $300k

     ...hyperscale data center startup building an AI and cloud platform, powered by thousands of H100s,...  ..., full-scale model training, or inference.  Our client operates high-performance...  ...Integrate, tune, and operate inference engines such as vLLM, SGLang, and TensorRT-LLM across... 
    Senior
    Permanent employment
    Worldwide
    San Francisco, CA
    more than 2 months ago
  • $160k - $250k

     ...About the Team Together AI is building the Inference Platform that brings the most advanced generative AI models to the world. Our platform...  ...low-latency load balancing across data centers and model engine pods. Develop auto-scaling systems to dynamically allocate resources... 
    Senior
    Full time
    Local area

    Together AI

    San Francisco, CA
    more than 2 months ago
  • $220k - $300k

     ...the United States to help them hire. Senior Software Engineer, Platform Location: San Francisco,...  ...services Managing large-scale cloud infrastructure Task orchestration and...  ...Optimize model serving infrastructure and inference pipelines Collaborate closely with... 
    Senior
    Work at office
    Remote work
    Visa sponsorship

    Recruiting from Scratch

    San Francisco, CA
    2 days ago
  •  ...founded by MIT CSAIL researchers. We are searching for a Senior Cloud Engineer (Multicloud) with deep, hands-on experience building,...  ...autoscaling and capacity planning. Support AI training and inference workloads in multicloud environments. Troubleshoot... 
    Senior
    Local area
    Immediate start
    Remote work
    Relocation package

    Pragmatike

    San Francisco, CA
    1 day ago
  • $230k - $265k

     ...need. About The Position We're looking for a software engineer to join Parafin's Infrastructure team and lead the evolution of...  ...systems for model experimentation, training, evaluation, inference, and retraining that power underwriting and other ML-driven products... 
    Senior
    Work from home
    Flexible hours

    Parafin Inc

    San Francisco, CA
    20 hours ago
  • Hamilton Barnes Associates Limited is seeking a Senior ML Infrastructure Engineer to help build and scale Kubernetes-based machine learning platforms...  ..., as well as hands-on experience with both training and inference infrastructure. The position offers a competitive salary... 
    Senior

    Hamilton Barnes Associates Limited

    San Francisco, CA
    5 days ago
  • $229.9k - $262.4k

     ...Senior Lead AI Engineer (Gen AI Platform Services, Agentic AI) Overview At Capital One, we are creating...  ...model training, large language model inference, similarity search, guardrails, model...  ...and responsible AI solutions on cloud platforms (e.g., AWS, Google Cloud, Azure... 
    Senior
    Local area

    Capital One National Association

    San Francisco, CA
    1 day ago
  • $229.9k - $262.4k

     ...Senior Lead AI Engineer (Gen AI Platform Services, Agentic AI) Overview: At Capital One, we...  ...model training, large language model inference, similarity search, guardrails, model...  ...scalable and responsible AI solutions on cloud platforms (e.g. AWS, Google Cloud, Azure... 
    Senior
    Full time
    Part time
    Local area

    Capital One Financial Corp

    San Francisco, CA
    2 days ago
  • $229.9k - $262.4k

     ...Senior Lead AI Engineer (Gen AI Platform Services) Overview: At Capital One, we are creating responsible...  ...model training, large language model inference, similarity search, guardrails, model...  ...and responsible AI solutions on cloud platforms (e.g. AWS, Google Cloud, Azure... 
    Senior
    Full time
    Part time
    Local area

    Capital One

    San Francisco, CA
    3 days ago
  • $320k

    About the Role The Cloud Inference team scales and optimizes Claude to serve the massive audiences of developers and enterprise companies...  ...execution, capacity management, and day‑to‑day operations. Our engineers are extremely high leverage: we simultaneously drive multiple... 
    Senior
    Visa sponsorship

    United States Digital Space LLC

    San Francisco, CA
    1 day ago
  •  ...Senior Lead Software Engineer Be an integral part of an agile team that's constantly pushing the envelope...  ...and deploy secure, scalable cloud platforms optimized for AI/ML workloads...  ...transformer architecture, ML training, and inference. Experience with Infrastructure as... 
    Senior
    For contractors

    Chase

    San Francisco, CA
    4 days ago
  •  ...all of humanity. About the Role We are looking for a Software Engineer to join our team to build tools and services that enable AI research...  ...‑ and dataset‑management platforms. Operated GPU‑backed inference or rollout workloads at very large scale. Equal Opportunity Employer... 
    Senior
    Work at office
    Relocation package

    The Consulting Solutions

    San Francisco, CA
    1 day ago
  • $230k - $265k

     ...looking for a seasoned software engineer to join Parafin's...  ...for small businesses. As a Senior Software Engineer, you'll be...  ...experience. Expertise with modern cloud data stacks: AWS (S3, RDS), Databricks...  ..., retraining), real-time inference. Contributions to internal... 
    Senior
    Work from home
    Flexible hours

    Parafin Inc

    San Francisco, CA
    20 hours ago
  • $200k - $260k

    About The Role Together AI is building the best inference infrastructure for voice applications. Our Voice AI platform powers...  ...best-in-class latency and reliability. We're looking for a Senior Platform Engineer to own the API and infrastructure layer for voice... 
    Senior
    Full time

    Together AI

    San Francisco, CA
    2 days ago
  • $320k - $405k

     ...growing group of committed researchers, engineers, policy experts, and business leaders working...  ...of thousands of nodes across multiple cloud providers and datacenters to train,...  ...CRDs Partner with research, training, and inference to understand workload shapes and turn their... 
    Senior

    Menlo Ventures

    San Francisco, CA
    3 days ago
  • $202.5k - $247.5k

     ...Inc. ngrok is an all‑in‑one cloud networking platform that secures...  ...gateway for API delivery, AI inference, device fleets, and site‑to‑...  ...builds the systems ngrok engineers rely on to build, deploy, and...  ...sponsorship. Compensation Job Title Senior Software Engineer Tier 1 (SF... 
    Senior
    Permanent employment
    Full time
    Work at office
    Local area
    Remote work
    Home office
    Flexible hours

    jobr.pro

    San Francisco, CA
    5 days ago
  • $185k - $275k

    Senior Machine Learning Engineer - GeoAI Platform Wherobots, Inc. San Francisco, California, United States...  ...managed, highly scalable geospatial cloud database and analytics platform:...  ...intersection of distributed systems, ML inference, and geospatial data infrastructure.... 
    Senior
    Full time
    Work at office
    Remote work
    Work visa

    Wherobots, Inc

    San Francisco, CA
    2 days ago
  • United States Digital Space LLC is looking for a Software Engineer to join the Launch Engineering team in San Francisco. You’ll design...  ...build deployment infrastructure for continuous and unattended inference deployment. The ideal candidate will have at least 5 years of... 
    Senior

    United States Digital Space LLC

    San Francisco, CA
    1 day ago
  • MakerMaker.AI is looking for a Senior Machine Learning Systems Engineer in San Francisco. In this role, you will build and operate production inference systems, optimizing for performance and reliability. The ideal candidate will have 3+ years of experience in production... 
    Senior

    MakerMaker.AI

    San Francisco, CA
    4 days ago
  • $220k

    Perplexity is looking for an engineer to join their team in San Francisco. You will work on building and operating the inference engine, supporting new models, migrating GPU kernels, and developing a Rust-based serving runtime. The ideal candidate has 3+ years of experience... 
    Senior

    Perplexity

    San Francisco, CA
    1 day ago
  • $261k - $326k

    A technology company specializing in AI infrastructure is seeking a Principal Engineer to enhance reliability and scalability of cloud systems. This role demands over 15 years of experience in production engineering or related fields and involves setting technical directions... 
    Senior

    Crusoe

    San Francisco, CA
    5 days ago
  • A leading cloud infrastructure company is seeking a Senior Engineer 2 to join their AI Inference Optimization team. The role involves leading the technical strategy for performance architecture and addressing complex performance issues ensuring industry-leading service... 
    Senior
    Remote job

    DigitalOcean

    San Francisco, CA
    2 days ago
  •  ...specialist to design and operate large-scale GPU infrastructure. This role requires expertise in deploying GPU systems for high-throughput inference and model performance optimization. The ideal candidate will have hands-on experience with modern inference frameworks and a solid... 
    Senior

    Reflection AI

    San Francisco, CA
    2 days ago
  • jobr.pro is seeking a Staff Engineer to lead technical direction for Inference Runtime. This senior IC role encompasses broad ownership of the runtime’s architecture and validation systems while collaborating across teams to drive performance and scalability. The ideal... 
    Senior
    Flexible hours

    jobr.pro

    San Francisco, CA
    1 day ago
  • $220k - $320k

    inference.net, a growing company in San Francisco, seeks an experienced engineer to optimize AI inference performance. The ideal candidate will have over 2 years of experience in ML systems and GPU programming. Key responsibilities include implementing optimization techniques... 
    Senior

    inference.net

    San Francisco, CA
    2 days ago
  • $314.8k - $359.3k

    Senior Distinguished AI Engineer (Agentic AI Platform) At Capital One, we are creating responsible and reliable...  ...and responsible AI solutions on cloud platforms (e.g., AWS, Google Cloud,...  ...or technologies (e.g., LLM inference, similarity search and vector DBs, guardrails... 
    Senior
    Work at office
    Local area

    Capital One National Association

    San Francisco, CA
    3 days ago
  • $150k - $170k

     ...currently looking for an experienced Senior Data Platform Engineer, with significant experience in building...  ...modern data platforms across public cloud environments. We are located in the...  ...including training, fine-tuning, and inference workflows Collaborate with AI/ML... 
    Senior
    Full time
    Remote work
    Flexible hours

    Ellipsis Health

    San Francisco, CA
    5 days ago
  • $245k - $295k

     ...manufacturing, data center construction, and cloud services. If you want to do the...  ...About the Role We are seeking a Senior Manager, Infrastructure Platform Engineering to lead a team building core...  ...GPU clusters, AI training, and inference workloads Working knowledge of... 
    Senior
    Temporary work
    Immediate start

    Crusoe

    San Francisco, CA
    1 day ago
