Staff Software Engineer, Cloud Inference Safeguards
$405kUnited States Digital Space LLC
About the company the company’s mission is to create reliable, interpretable, and steerable AI systems. We want AI to be safe and beneficial for our users and for society as a whole. Our team is a quickly growing group of committed researchers, engineers, policy experts, and business leaders working together to build beneficial AI systems. About the role We are seeking a Staff Software Engineer to build and operate the safety, oversight, and intervention mechanisms that protect Claude on third‑party cloud service provider (CSP) platforms. As the engineer responsible for Safeguards on those surfaces, you will ensure that every request served through our CSP partners is monitored for misuse, enforced against policy, and compliant with the data residency and privacy commitments that enterprise CSP customers expect. You will sit at the seam between the Safeguards organization and the Cloud Inference team: taking classifiers, detection signals, and enforcement policies developed by Safeguards and making them run reliably inside a CSP partner’s infrastructure at serving‑path latency and scale. You will own the architecture that lets our safeguards operate within those constraints without gaps. You will build, deploy and operate the multi‑layered defenses that catch unwanted model behavior in real time, the telemetry pipelines that give us situational awareness over CSP traffic, and the enforcement hooks that let us act quickly when something goes wrong. Your work will directly determine whether the company can ship frontier models on CSP platforms at the same safety bar we hold ourselves to on our first‑party API. Responsibilities: Build, deploy and operate real‑time safeguards infrastructure—classifiers, rate limits, enforcement actions, and intervention hooks—embedded directly in the third‑party CSP inference serving path Design and maintain the data residency and privacy architecture for safeguards signals on CSP platforms, ensuring we can detect abuse and monitor model behavior while honoring regionalization boundaries and enterprise contractual commitments Develop telemetry, logging, and evaluation pipelines that give Safeguards, Policy, and T&S operational teams situational awareness over CSP traffic and close the visibility gap between third‑party and first‑party serving Dive into the CSP serving stack to identify the lowest‑impact points to gather signals or introduce interventions without degrading latency, stability, or overall architecture Hold a high operational bar: own on‑call, drive root‑cause analyses and postmortems for safeguards incidents on CSP platforms, and build systems that reduce the human intervention required to keep Claude safe Work closely with Safeguards research, Policy & Enforcement, the Cloud Inference team, and CSP partner contacts to turn detection research and policy decisions into production enforcement that works inside a partner’s cloud You may be a good fit if you: Have a Bachelor’s degree in Computer Science, Software Engineering, or comparable experience Have 7+ years of experience in high‑scale, high‑reliability software development, ideally with exposure to trust & safety, anti‑abuse, fraud, or integrity systems Are proficient in Python and comfortable working across the stack—from request‑path services to data pipelines to internal tooling Think adversarially: you can see a system from a bad actor’s perspective, anticipate how they will respond to countermeasures, and design defenses in depth rather than single points of enforcement Have experience scaling infrastructure to accommodate rapid traffic growth while keeping latency and reliability within tight budgets Are deeply interested in the potential transformative effects of advanced AI systems and are committed to ensuring their safe development Have strong communication skills and can explain complex technical and risk tradeoffs to non‑technical stakeholders across Policy, Legal, and partner organizations Enjoy working in a fast‑paced, early environment; comfortable with adapting priorities as driven by the rapidly evolving AI space Strong candidates may also have experience with: Building trust and safety, anti‑spam, fraud, or abuse detection and mitigation mechanisms for AI/ML systems, or the infrastructure to support these systems at scale Machine learning serving infrastructure (GPUs/TPUs, inference servers, load balancing) and the operational realities of running models in production Major cloud platform internals—IAM, Network/service perimeter controls, regional resource constraints, cloud‑native logging/monitoring—or experience shipping software that runs inside a partner’s cloud rather than your own Data residency, privacy engineering, or compliance‑constrained architectures, particularly where telemetry has to stay within regional or contractual boundaries Working closely with operational and human‑review teams to build custom internal tooling, admin UX, and alerting Adversarial mindset: has shipped defenses against motivated attackers before, knows what it feels like when they adapt, and can sprint to close a gap before it becomes an incident Comfortable operating at the intersection of platform/infra engineering and trust & safety—neither a pure infra engineer nor a pure T&S engineer, but someone who can credibly do both Has shipped software that runs inside someone else’s infrastructure (partner cloud, embedded deployment, or similar) and knows how to get things done when you don’t control the whole stack Senior enough to own a cross‑team seam independently, drive consensus across orgs, and make latency/safety tradeoff calls without escalation TypeScript or Rust, and agentic coding tools such as Claude Code Annual Salary Range: $405,000–$485,000 USD Logistics Minimum education: Bachelor’s degree or an equivalent combination of education, training, and/or experience Required field of study: A field relevant to the role as demonstrated through coursework, training, or professional experience Minimum years of experience: Years of experience required will correlate with the internal job level requirements for the position Location‑based hybrid policy: Currently, we expect all staff to be in one of our offices at least 25% of the time. However, some roles may require more time in our offices. Visa sponsorship: We do #J-18808-Ljbffr United States Digital Space LLC
- United States Digital Space LLC is looking for a Software Engineer to join the Launch Engineering team in San Francisco. You’ll design and... ...build deployment infrastructure for continuous and unattended inference deployment. The ideal candidate will have at least 5 years...Suggested
$320k
...committed researchers, engineers, policy experts, and... .... About the role Our Inference team is responsible for... ...hardware running in multiple cloud platforms. Key... ...qualifications Significant software engineering experience... ..., we expect all staff to be in one of our offices...SuggestedWorldwideVisa sponsorship$405k
About the role Anthropic's Inference organization serves Claude to millions of users and... ...platform we add. We're looking for a Staff Engineer to be a technical lead for Inference Runtime... ...across all of them Have significant software engineering experience, with a strong...SuggestedWork at officeVisa sponsorshipFlexible hours- ...Location Type Hybrid Department Inference Model Serving Who are we? Our... ...is a team of researchers, engineers, designers, and more, who are... ...looking for Members of Technical Staff to join the Model Serving... ...GCP, Azure, AWS, OCI, multi-cloud on-prem / hybrid serving Experience...SuggestedFull timeWork experience placementWork at officeRemote workFlexible hours
- United States Digital Space LLC is seeking a Staff Software Engineer to build and operate safety mechanisms that protect AI systems on cloud platforms. The ideal candidate will have... ...In this role, you will design real-time safeguards, engage with research and policy teams,...Suggested
- Member of Technical Staff - Software Engineer Valthos | Posted Mar 3 Full-time Negotiable Advanced... ...software and biological AI systems to safeguard humanity. The same AI architectures that... ...users. Develop and maintain a secure cloud environment for Valthos data and...Full timeWork at office
$320k - $405k
...group of committed researchers, engineers, policy experts, and business... ...of nodes across multiple cloud providers and datacenters to... ...with research, training, and inference to understand workload shapes... ...Minimum qualifications Significant software engineering experience...- ...power with our Open-Access AI Cloud. By aggregating computing... ...innovative GPU marketplace and AI inference service that promise... ...Role We're seeking a Platform Engineer to design and build the control... ...infrastructure services Expert-level software engineering skills in Go (...Worldwide
$208k - $250k
...and evaluation pipelines that operate across Ripple's polyrepo engineering environment. Define and advance Ripple's Enterprise Agentic... ...performance tuning in hybrid environments, including managed inference endpoints and GPU‑based workloads. Excellent collaboration skills...Full timeLocal area$320k
United States Digital Space LLC is seeking a backend engineer for the Cloud Inference team. This role involves designing and building infrastructure... ...and cost. The ideal candidate will have significant software engineering experience with a major cloud platform. We offer...- ...role involves designing large-scale deployment architectures, solving AI inference challenges, and collaborating closely with customers' DevOps teams. Ideal candidates will have 3+ years in cloud infrastructure or DevOps, strong skills in Kubernetes, Docker, Terraform,...Flexible hours
- Qualifications CUDA + GPU inference optimization vLLM, SGLang, or TensorRT-LLM experience KV caching, paged attention, batching, token streaming... ...SF. Ship low latency, high throughput model serving on Luminal Cloud. Day To Day Responsibilities Deploy and tune models with...
$192k - $260k
...for hosting and serving frontier AI model inference for open source models like Llama, Qwen,... ...is necessary. We’re looking for engineers who have owned high scale operational sensitive... ...LLM APIs and runtimes at scale. As a Staff Engineer, you’ll play a critical role in...Local areaWorldwide$405k
...group of committed researchers, engineers, policy experts, and business leaders... ...seeking an exceptional Senior Staff Software Engineer to join the Claude... ...partnering closely with Research, Inference, Platform, Infrastructure, and Safeguards to ensure the Claude API is reliable...Work at officeRemote workVisa sponsorshipFlexible hours$320k
About the Role The Cloud Inference team scales and optimizes Claude to serve... ...day‑to‑day operations. Our engineers are extremely high leverage:... ...If You Have significant software engineering experience, with... ...: Currently, we expect all staff to be in one of our offices...Visa sponsorship- ...is looking for a Developer Platform Engineer to build and maintain their API platform for inference. This role involves defining user-facing... ...robust infrastructures across cloud providers. Ideal candidates have 5+ years of software engineering experience, are collaborative...
- ...billions in revenue About the Role As a Staff Software Engineer on the Consumer Experience team, you'... ...systems Solid understanding of cloud platforms (AWS, Azure, or GCP) and modern... ...including entity resolution and real‑time inference Experience building AI‑powered systems...Full timeFreelanceInternshipWork at officeRemote workFlexible hours
- The Consensus is looking for a Software Engineer to join our Inference Stack team in San Francisco. You will help develop the infrastructure that powers large-scale LLM inference, ensuring scalability and reliability in our systems. This role is ideal for engineers who...
$170k - $220k
...very best supply chain and enterprise software investors. We're live with manufacturers... ...A high level of professional software engineering experience with a strong focus on frontend... ...(generics, discriminated unions, type inference). • Next.js mastery: Production...- ...BASETEN Baseten powers mission-critical inference for the world's most dynamic AI companies... .... Join us and help build the platform engineers turn to to ship AI products. THE ROLE Baseten... ..., reliability, and ease of use. As a Software Engineer on the Inference Stack team,...Flexible hours
- ...tools consistently fail. We are a small, fast-growing team of engineers in San Francisco powering Fortune 100 enterprises, YC startups... ...plus About the Role Specialize in low-latency, high-throughput inference for OCR and multimodal models. Own profiling, batching, and...Work at officeVisa sponsorshipRelocation package
$325k
About the Team Our Inference team brings OpenAI's most capable research and technology to the world through our products. We empower consumers... ...progression via model inference. About the Role We're hiring engineers to scale and optimize OpenAI's inference infrastructure across...$175k - $225k
...security. Our team is led by veteran operators and engineers, alumni of Sonos, Paypal, Tesla, Apple, and... .... The Role We're looking for an AI Inference Engineer who lives at the boundary of high-performance software and physical hardware. In this role, you won't...Local areaRemote work$205k - $250k
...Catalyst, SoftBank, and 8VC. About the Role We are seeking a Backend Engineer to design and scale high-performance backend systems that... ...—whether through integrating external AI APIs, managing ML inference pipelines, or supporting data infrastructure for model training...Work experience placementPrivate practiceWork at office- ...building and deploying AI Inference and Generative AI... ...foundation models, prompt engineering, fine‑tuning, semantic... ...AI/ML orchestration software KServe, Knative,... ...one of the following Cloud technologies - Google... ...Cloudera is looking for a Staff Software Engineer to join...
$189k - $303k
...accessible for all. We’re searching for a Staff Software Engineer on the Autonomy Data: Continuous... ...millions of miles Own model training and inference pipelines for all core Autonomy models... ..., HDFS) Experience working in a cloud environment (e.g., AWS, GCP, Azure, etc...Local area- NextGenEnergyJobs is seeking a Staff Software Engineer to develop and enhance datasets and models for... ...dataset quality, training and inference pipelines, and collaborating with cross... ...especially in C, and experience with cloud environments and database management systems...
$190.9k - $232.8k
About This Role As a staff software engineer for GenAI Performance and Kernel, you will own the design, implementation, optimization, and correctness... ...of the high‑performance GPU kernels powering our GenAI inference stack. You will lead development of highly‑tuned, low‑level...Local areaWorldwide$189k - $303k
Staff Software Engineer, Continuous Learning The role involves developing and improving datasets and... ...techniques, as well as managing training and inference pipelines to enhance the Aurora Driver... ..., or HDFS. Experience working in a cloud environment such as AWS, GCP, or Azure...Work at office3 days per week- ...Staff Software Engineer Lunar is a stealth technology company building a new type of software platform for health systems. We are on a mission... ...you may tackle in your first 6 months: Modern cloud architecture, built from scratch: We have no legacy systems...Remote workFlexible hours3 days per week
Do you want to receive more vacancies?
Subscribe and receive similar vacancies to Staff Software Engineer, Cloud Inference Safeguards. Be the first to apply!
- cloud developer San Francisco, CA
- senior principal cloud computing engineer San Francisco, CA
- aws cloud infrastructure engineer San Francisco, CA
- cloud support engineer San Francisco, CA
- principal cloud computing engineer San Francisco, CA
- informatica cloud developer San Francisco, CA
- software engineer - cloud services San Francisco, CA
- cloud security engineer San Francisco, CA
- cloud architect San Francisco, CA
- big data cloud engineer San Francisco, CA

