Sign up to access all features of our service.
  • Job search
  • Favorites
  • Create a CV
    New
  • Salaries
  • Subscriptions

Senior Cloud Inference Engineer

$320k

United States Digital Space LLC

United States Digital Space LLC is seeking a backend engineer for the Cloud Inference team. This role involves designing and building infrastructure that serves Claude across cloud service providers, optimizing for performance and cost. The ideal candidate will have significant software engineering experience with a major cloud platform. We offer an annual salary ranging from $320,000 to $485,000 USD and a hybrid working policy. This position is great for individuals who thrive in cross-functional environments and take ownership of their work. #J-18808-Ljbffr

Vacancy posted 1 day ago
Similar jobs that could be interesting for youBased on the Senior Cloud Inference Engineer in San Francisco, CA vacancy
  • $160k - $250k

     ...Senior Backend Engineer, Inference Platform San Francisco About the Role Together AI is building the Inference Platform that brings the most advanced generative AI models to the world. Our platform powers multi-tenant serverless workloads and dedicated endpoints... 
    Senior
    Full time
    Local area

    Together AI

    San Francisco, CA
    2 days ago
  •  ...United States Digital Space LLC is seeking a Staff Software Engineer to build and operate safety mechanisms that protect AI systems on cloud platforms. The ideal candidate will have significant experience in software development, particularly in trust & safety, and be... 
    Senior

    United States Digital Space LLC

    San Francisco, CA
    1 day ago
  • $139.2k - $174k

    A leading cloud services provider is looking for a Senior Engineer 2 to join their AI Infrastructure Control Plane team. This role involves architecting high-quality software solutions for AI workloads while driving design and operational excellence. Candidates should... 
    Senior
    Remote work

    DigitalOcean

    San Francisco, CA
    6 hours ago
  • $300k

     ...hyperscale data center startup building an AI and cloud platform, powered by thousands of H100s,...  ..., full-scale model training, or inference.  Our client operates high-performance...  ...Integrate, tune, and operate inference engines such as vLLM, SGLang, and TensorRT-LLM across... 
    Senior
    Permanent employment
    Worldwide
    San Francisco, CA
    more than 2 months ago
  •  ...computing power with our Open-Access AI Cloud. By aggregating computing resources across...  ...an innovative GPU marketplace and AI inference service that promise affordability and accessibility...  ...About the Role We're seeking a Platform Engineer to design and build the control plane... 
    Senior
    Worldwide

    Hyperbolic Labs

    San Francisco, CA
    1 day ago
  •  ..., founded by MIT CSAIL researchers. We are searching for a Senior Cloud Engineer (Multicloud) with deep, hands-on experience building, operating...  ...autoscaling and capacity planning. Support AI training and inference workloads in multicloud environments. Troubleshoot complex... 
    Senior
    Local area
    Immediate start
    Remote work
    Relocation package

    Pragmatike

    San Francisco, CA
    6 hours ago
  • $81.4k - $151.8k

     ...Senior Cloud Engineer – ML/AI Platform We are seeking a Senior Cloud Engineer with deep expertise in AWS and Azure AI/ML services to drive...  ...with AWS AI/ML services: SageMaker (training, pipelines, inference, JumpStart), Bedrock • Deep hands-on expertise with Azure... 
    Senior
    Contract work
    Part time
    Local area

    BMO Financial Group

    San Francisco, CA
    5 days ago
  • $200k - $260k

     ...Senior Platform Engineer, Voice AI San Francisco About the Role Together AI is building the best inference infrastructure for voice applications. Our Voice AI platform powers production-grade, real-time voice agents and applications — serving speech-to-text and... 
    Senior
    Full time

    Together AI

    San Francisco, CA
    1 day ago
  • $220k - $300k

     ...the United States to help them hire. Senior Software Engineer, Platform Location: San Francisco,...  ...services Managing large-scale cloud infrastructure Task orchestration and...  ...Optimize model serving infrastructure and inference pipelines Collaborate closely with... 
    Senior
    Work at office
    Remote work
    Visa sponsorship

    Recruiting from Scratch

    San Francisco, CA
    4 days ago
  • $230k - $265k

     ...need. About The Position We're looking for a software engineer to join Parafin's Infrastructure team and lead the evolution of...  ...systems for model experimentation, training, evaluation, inference, and retraining that power underwriting and other ML-driven products... 
    Senior
    Work from home
    Flexible hours

    Parafin Inc

    San Francisco, CA
    3 days ago
  • $320k

     ...Cloud Inference Engineer Anthropic's mission is to create reliable, interpretable, and steerable AI systems. We want AI to be safe and beneficial for our users and for society as a whole. Our team is a quickly growing group of committed researchers, engineers, policy... 
    Senior
    Work at office
    Visa sponsorship
    Flexible hours

    Colorwave Inc

    San Francisco, CA
    4 days ago
  • $320k - $405k

     ...growing group of committed researchers, engineers, policy experts, and business leaders working...  ...of thousands of nodes across multiple cloud providers and datacenters to train,...  ...CRDs Partner with research, training, and inference to understand workload shapes and turn their... 
    Senior

    Menlo Ventures

    San Francisco, CA
    6 days ago
  • $235k - $285k

     ...evolve notifications and AI inference into reusable platform capabilities...  ..., data, security, client engineering, video, and service...  ...worldwide. About the Role As the Senior Manager of Platform Development...  ...experience building and operating cloud‑native backend or platform... 
    Senior
    Worldwide
    Flexible hours

    Dormont Manufacturing Company

    San Francisco, CA
    4 days ago
  • $195k - $265k

     ...Senior Authorization Software Engineer (Platform) San Francisco or New York About Pallet Pallet is...  ...support model training, evaluation, and inference infrastructure Identify systemic...  ...~ Strong understanding of cloud infrastructure, containers, networking... 
    Senior
    Full time
    Temporary work
    Work at office
    Local area
    Remote work
    Flexible hours

    Pallet Service Corporation

    San Francisco, CA
    4 days ago
  • $261k - $326k

     ...A technology company specializing in AI infrastructure is seeking a Principal Engineer to enhance reliability and scalability of cloud systems. This role demands over 15 years of experience in production engineering or related fields and involves setting technical directions... 
    Senior

    Crusoe

    San Francisco, CA
    4 days ago
  • $202.5k - $247.5k

     ...Inc. ngrok is an all‑in‑one cloud networking platform that secures...  ...gateway for API delivery, AI inference, device fleets, and site‑to‑...  ...builds the systems ngrok engineers rely on to build, deploy, and...  ...sponsorship. Compensation Job Title Senior Software Engineer Tier 1 (SF... 
    Senior
    Permanent employment
    Full time
    Work at office
    Local area
    Remote work
    Home office
    Flexible hours

    jobr.pro

    San Francisco, CA
    3 days ago
  • $220k

     ...Perplexity is looking for an engineer to join their team in San Francisco. You will work on building and operating the inference engine, supporting new models, migrating GPU kernels, and developing a Rust-based serving runtime. The ideal candidate has 3+ years of experience... 
    Senior

    Perplexity

    San Francisco, CA
    4 days ago
  • $150k - $170k

     ...currently looking for an experienced Senior Data Platform Engineer, with significant experience in building...  ...modern data platforms across public cloud environments. We are located in the...  ...including training, fine-tuning, and inference workflows Collaborate with AI/ML teams... 
    Senior
    Full time
    Remote work
    Flexible hours

    Ellipsis Health

    San Francisco, CA
    1 day ago
  • $216k - $270k

     ...As a Software Engineer on the ML Infrastructure team, you will design and build platforms...  ...Docker, Kubernetes). ~ Familiarity with cloud infrastructure (AWS, GCP) and...  ...SGLang, TensorRT-LLM, or text-generation-inference. Compensation packages at Scale for... 
    Senior
    Full time

    Scale AI

    San Francisco, CA
    21 days ago
  • An innovative tech platform is seeking a Senior Principal Software Engineer to lead the development of its next-gen API Platform. The role involves defining...  ...development experience and extensive knowledge in API design, cloud platforms, and security protocols. #J-18808-Ljbffr
    Senior
    Remote work

    jobright.com

    San Francisco, CA
    1 day ago
  • $185k - $275k

    Senior Machine Learning Engineer - GeoAI Platform Wherobots, Inc. San Francisco, California, United States...  ...managed, highly scalable geospatial cloud database and analytics platform:...  ...intersection of distributed systems, ML inference, and geospatial data infrastructure.... 
    Senior
    Full time
    Work at office
    Remote work
    Work visa

    Wherobots, Inc

    San Francisco, CA
    5 days ago
  • MakerMaker.AI is looking for a Senior Machine Learning Systems Engineer in San Francisco. In this role, you will build and operate production inference systems, optimizing for performance and reliability. The ideal candidate will have 3+ years of experience in production... 
    Senior

    MakerMaker.AI

    San Francisco, CA
    2 days ago
  •  ...Qualifications CUDA + GPU inference optimization vLLM, SGLang, or TensorRT-LLM experience KV caching, paged attention, batching, token...  ...SF. Ship low latency, high throughput model serving on Luminal Cloud. Day To Day Responsibilities Deploy and tune models with optimizations... 

    SupportFinity

    San Francisco, CA
    6 hours ago
  •  ...A leading healthcare technology firm in San Francisco is seeking a DevSecOps leader to design and manage cloud infrastructure on GCP. You will implement robust security protocols and enhance developer productivity while safeguarding sensitive healthcare data. Ideal candidates... 
    Senior

    Mandolin

    San Francisco, CA
    6 hours ago
  •  ...About the Role We’re looking for a Senior Staff Data Engineer to be the technical backbone of our Data & ML Platform team — the foundation powering...  ...produces ML‑ready data and supports model training and inference workflows reliably. Raise the engineering bar across the... 
    Senior
    Work at office
    Local area
    Immediate start
    Remote work
    Worldwide
    3 days per week

    Hinge Health

    San Francisco, CA
    4 days ago
  • $229.9k - $262.4k

    Senior Lead AI Engineer (Gen AI Platform Services, Agentic AI) Overview At Capital One, we are creating...  ...model training, large language model inference, similarity search, guardrails, model...  ...and responsible AI solutions on cloud platforms (e.g., AWS, Google Cloud, Azure... 
    Senior
    Local area

    Capital One National Association

    San Francisco, CA
    5 days ago
  • A leading cloud infrastructure company is seeking a Senior Engineer 2 to join their AI Inference Optimization team. The role involves leading the technical strategy for performance architecture and addressing complex performance issues ensuring industry-leading service... 
    Senior
    Remote job

    DigitalOcean

    San Francisco, CA
    5 days ago
  • $220k - $320k

    inference.net, a growing company in San Francisco, seeks an experienced engineer to optimize AI inference performance. The ideal candidate will have over 2 years of experience in ML systems and GPU programming. Key responsibilities include implementing optimization techniques... 
    Senior

    inference.net

    San Francisco, CA
    5 days ago
  • jobr.pro is seeking a Staff Engineer to lead technical direction for Inference Runtime. This senior IC role encompasses broad ownership of the runtime’s architecture and validation systems while collaborating across teams to drive performance and scalability. The ideal... 
    Senior
    Flexible hours

    jobr.pro

    San Francisco, CA
    4 days ago
  •  ...specialist to design and operate large-scale GPU infrastructure. This role requires expertise in deploying GPU systems for high-throughput inference and model performance optimization. The ideal candidate will have hands-on experience with modern inference frameworks and a solid... 
    Senior

    Reflection AI

    San Francisco, CA
    5 days ago

Do you want to receive more vacancies?

Subscribe and receive similar vacancies to Senior Cloud Inference Engineer. Be the first to apply!