Staff + Sr. Software Engineer, Inference
$300k
United States Digital Space LLC
About the company
The company’s mission is to create reliable, interpretable, and steerable AI systems. We want AI to be safe and beneficial for our users and for society as a whole. Our team is a quickly growing group of committed researchers, engineers, policy experts, and business leaders working together to build beneficial AI systems.

About the role
Our Inference team is responsible for building and maintaining the critical systems that serve Claude to millions of users worldwide. We bring Claude to life by serving our models via the industry's largest compute-agnostic inference deployments. We handle the entire stack, from intelligent request routing to fleet-wide orchestration across diverse AI accelerators. The team has a dual mandate: maximizing compute efficiency to serve explosive customer growth, and enabling breakthrough research by providing scientists with high-performance inference infrastructure.

Key responsibilities
- Build and maintain distributed inference systems.
- Design request routing, load balancing, and traffic management.
- Optimize compute utilization and manage elastic scaling.
- Deploy and integrate new AI accelerator platforms.
- Build deployment pipelines for new models.
- Provide infrastructure support for research teams.
- Tune performance using observability data.
- Manage multi-region deployments and geographic routing.

Representative projects
- Designing intelligent routing algorithms that optimize request distribution across thousands of accelerators.
- Autoscaling our compute fleet to dynamically match supply with demand across production, research, and experimental workloads.
- Building production-grade deployment pipelines for releasing new models to millions of users.
- Integrating new AI accelerator platforms to maintain our hardware-agnostic competitive advantage.
- Contributing to new inference features (e.g., structured sampling, prompt caching).
- Supporting inference for new model architectures.
- Analyzing observability data to tune performance based on real-world production workloads.
- Managing multi-region deployments and geographic routing for global customers.

Qualifications
- Significant software engineering experience, particularly with distributed systems.
- Results-oriented, with a bias toward flexibility and impact.
- Able to pick up slack, even when it falls outside the job description.
- Enjoys pair programming.
- Willing to learn more about machine learning systems and infrastructure.
- Thrives in environments where technical excellence directly drives business results and research breakthroughs.
- Cares about the societal impacts of the work.

Preferred experience
- High-performance, large-scale distributed systems.
- Implementing and deploying machine learning systems at scale.
- Load balancing, request routing, or traffic management systems.
- LLM inference optimization, batching, and caching strategies.
- Kubernetes and cloud infrastructure (AWS, GCP, Azure).
- Python or Rust.

Compensation
Annual Salary: $300,000 – $485,000 USD.

Logistics
- Minimum education: Bachelor’s degree or an equivalent combination of education, training, and experience.
- Required field of study: a field relevant to the role as demonstrated through coursework, training, or professional experience.
- Minimum years of experience: correlated with internal job level requirements for the position.
- Location-based hybrid policy: staff are expected to be in one of our offices at least 25% of the time.
- Visa sponsorship: we sponsor visas; we will make reasonable efforts to obtain a visa if an offer is made.
$320k
...group of committed researchers, engineers, policy experts, and business... ...Role Our mandate is to make inference deployment boring and... ...continuous and unattended. As a Software Engineer on the Launch Engineering... ...: Currently, we expect all staff to be in one of our offices... (Senior, Visa sponsorship, Shift work)
$320k
About the Role The Cloud Inference team scales and optimizes Claude... ...day-to-day operations. Our engineers are extremely high leverage:... ...Fit If You Have significant software engineering experience, with... ...policy: Currently, we expect all staff to be in one of our offices at... (Senior, Visa sponsorship)
$325k
...growing group of committed researchers, engineers, policy experts, and business leaders working... ...-- we're looking for reliability-minded software engineers and SREs Are curious and... ...hybrid policy: Currently, we expect all staff to be in one of our offices at least 25%... (Senior, Visa sponsorship)
$229.9k - $262.4k
...Sr. Lead AI Engineer (Inference Optimization, FM hosting, AI Platform) Overview: At Capital One, we are creating responsible and reliable... ...One. ~ Design, develop, test, deploy, and support AI software components including foundation model training, large language... (Senior, Full time, Part time, Local area)
- Staff / Sr. Staff Software Engineer (Frontend) San Francisco Bay Area, California, United States About Us: Tessell is a fast-growing company focused on data management. We're building cutting-edge products that will disrupt the data management industry. Our team is composed... (Senior)
- United States Digital Space LLC is looking for a Software Engineer to join the Launch Engineering team in San Francisco. You’ll design and... ...build deployment infrastructure for continuous and unattended inference deployment. The ideal candidate will have at least 5 years... (Senior)
- ...tools consistently fail. We are a small, fast-growing team of engineers in San Francisco powering Fortune 100 enterprises, YC startups... ...plus About the Role Specialize in low-latency, high-throughput inference for OCR and multimodal models. Own profiling, batching, and... (Work at office, Visa sponsorship, Relocation package)
- ...BASETEN Baseten powers mission-critical inference for the world's most dynamic AI companies... .... Join us and help build the platform engineers turn to to ship AI products. THE ROLE Baseten... ..., reliability, and ease of use. As a Software Engineer on the Inference Stack team,... (Flexible hours)
$325k
About the Team Our Inference team brings OpenAI's most capable research and technology to the world through our products. We empower consumers... ...progression via model inference. About the Role We're hiring engineers to scale and optimize OpenAI's inference infrastructure across...
$220k
Perplexity is looking for an engineer to join their team in San Francisco. You will work on building and operating the inference engine, supporting new models, migrating GPU kernels... ...candidate has 3+ years of experience in software engineering with a focus on ML inference... (Senior)
- A leading cloud infrastructure company is seeking a Senior Engineer 2 to join their AI Inference Optimization team. The role involves leading the technical strategy for performance architecture and addressing complex performance issues ensuring industry-leading service.... (Senior, Remote job)
- Software Engineer (AI Infrastructure / Training / Inference) About the Role We are hiring Software Engineers focused on AI Infrastructure to build the systems that enable frontier multimodal AI to operate reliably at production scale. This role exists because modern generative...
$181.1k - $318.4k
Sr. Applied AI Software Engineer- Vision Products Group & Siri San Francisco Bay Area, California, United States Software and Services Apple builds... ...and/or Machine Learning algorithms, including on-device inference, data-driven validation, requirement definition, and... (Senior, Relocation)
$320k
United States Digital Space LLC is seeking a backend engineer for the Cloud Inference team. This role involves designing and building infrastructure... ...and cost. The ideal candidate will have significant software engineering experience with a major cloud platform. We offer... (Senior)
$320k
...group of committed researchers, engineers, policy experts, and business... ...systems. About the role Our Inference team is responsible for... ...qualifications Significant software engineering experience, particularly... ...: Currently, we expect all staff to be in one of our offices... (Senior, Worldwide, Visa sponsorship)
$167.2k - $209k
A leading cloud service provider is seeking a Senior Engineer 2 for their AI Inference Data Plane team. This remote role focuses on designing and developing high-scale, resilient data plane services that enhance AI-driven applications. The ideal candidate will have strong... (Senior, Remote job)
$152k - $204k
...Job Posting Title: Sr Software Engineer - Ad Server Req ID: 10124428 Job Description: Disney Entertainment and ESPN Product & Technology Technology is at the heart of Disney’s past, present, and future. Disney Entertainment and ESPN Product & Technology... (Senior, Full time, Work experience placement)
$180k - $220k
...help us accelerate that reality. About the Role As a Sr. Infrastructure Engineer at AKASA, you’ll work closely with our Infrastructure and... ...issues before they impact customers. You'll collaborate with software engineers to embed reliability best practices across the... (Senior, Work at office, Local area, Remote work)
- The Consensus is looking for a Software Engineer to join our Inference Stack team in San Francisco. You will help develop the infrastructure that powers large-scale LLM inference, ensuring scalability and reliability in our systems. This role is ideal for engineers who...
- ...- Work with product owners to understand desired application capabilities and testing scenarios - Continuously improve software engineering practices - Work within and across Agile teams to design, develop, test, implement, and support technical solutions across... (Senior)
- • Performs as a key contributor to an engineering team that builds and supports exceptional products that provide innovative solutions to... ..., system analysis, and programming activities on application software; this may often require independent research and study • Develops... (Senior)
$150k - $180k
...patients. It’s an outdated, painful, administrative mess. Job Description We are seeking a highly skilled Senior Java Backend Software Engineer to join our dynamic team. The ideal candidate will have extensive experience in Java and a solid understanding of backend development... (Senior, Remote work)
- ...reliability, and hardware constraints. Software sits at the center of everything we ship... ...production. About the role As a Backend Software Engineer at Droyd, you’ll own core parts of the... ...ll build systems that support learning, inference, control, and fleet operations. You’ll... (Senior)
- ...Job 132621 - Sr. Software Engineer San Francisco, CA This is an early hire on an expanding engineering team committed to making continuous integration a core part of our organizational DNA. Position is empowered by and reports directly to the VP of Engineering. We are... (Senior, Full time)
- ...Senior Software Engineer, JazzX (New AI Venture at SAI Group) Role Overview: As a Senior Software Development Engineer, you'll lead the design, development, and management of critical services and components of an innovative enterprise AI platform to deliver transformative... (Senior, Work experience placement, Flexible hours)
$150k - $250k
...Sr Software Engineer Step into a high-impact Sr Software Engineer opportunity with a confidential client, where you will help drive meaningful results across Software. This role offers the chance to make a visible contribution in San Francisco, California, USA... (Senior)
- ...Sr Software Engineer Perfict Global is a leading IT consulting services provider focused on providing innovative and successful business workforce solutions to Fortune 500 companies. Our trained and experienced professionals constantly strive to bring together the... (Senior, Long term contract, Work at office, Work from home)
$181.1k - $318.4k
Staff/Sr. Machine Learning Engineer, Foundation Models - AI, Search & Knowledge Platforms San Francisco Bay Area, California, United States Machine... ...Work along side Foundation Model Research team to optimize inference for cutting edge model architectures. Work closely with... (Senior, Relocation)
- ...double-digits MoM, and expanding the core engineering team in SF. The surface area is big: realtime collaboration, GPU inference at scale, a modern TypeScript stack, and serving... ...enterprise The Role As the Senior Software Engineer – Backend (Systems /... (Senior)
$170k - $260k
...Sr. Software Engineer Job Summary At Pantomath, we are building the autopilot for the data-driven enterprise. Today, data teams are buried under operational toil: battling broken pipelines, schema drift, and silent quality failures. Each incident costs hours of manual... (Senior, Work at office, Remote work, Night shift)


