Staff + Sr. Software Engineer, Inference
$300kMenlo Ventures
About Anthropic Anthropic’s mission is to create reliable, interpretable, and steerable AI systems. We want AI to be safe and beneficial for our users and for society as a whole. Our team is a growing group of committed researchers, engineers, policy experts, and business leaders working together to build beneficial AI systems. About The Role Our Inference team builds and maintains the critical systems that serve Claude to millions of users worldwide. We handle the entire stack from intelligent request routing to fleet‑wide orchestration across diverse AI accelerators. The team’s dual mandate is to maximize compute efficiency to serve explosive customer growth and to enable breakthrough research by providing scientists with high‑performance inference infrastructure to develop next‑generation models. We tackle complex distributed systems challenges across multiple accelerator families and emerging AI hardware running in multiple cloud platforms. Representative Projects Designing intelligent routing algorithms that optimize request distribution across thousands of accelerators. Autoscaling compute fleet to dynamically match supply with demand across production, research, and experimental workloads. Building production‑grade deployment pipelines for releasing new models to millions of users. Integrating new AI accelerator platforms to maintain our hardware‑agnostic competitive advantage. Contributing to new inference features such as structured sampling and prompt caching. Supporting inference for new model architectures. Analyzing observability data to tune performance based on real‑world production workloads. Managing multi‑region deployments and geographic routing for global customers. Qualifications Significant software engineering experience, particularly with distributed systems. Results‑oriented with a bias toward flexibility and impact. Ability to take on tasks that go beyond the job description. Enjoys pair programming. Wants to learn more about machine learning systems and infrastructure. Thrives in environments where technical excellence drives both business results and research breakthroughs. Concerned about the societal impacts of the work. Preferred Qualifications High‑performance, large‑scale distributed systems experience. Implementing and deploying machine learning systems at scale. Experience with load balancing, request routing, or traffic management systems. LLM inference optimization, batching, and caching strategies. Kubernetes and cloud infrastructure (AWS, GCP, Azure). Python or Rust proficiency. Compensation Annual Salary: $300,000—$485,000 USD. #J-18808-Ljbffr Menlo Ventures
- ...Staff + Sr. Software Engineer, Inference Deployment San Francisco, CA | New York City, NY | Seattle, WA About Anthropic Anthropic's mission is to create reliable, interpretable, and steerable AI systems. We want AI to be safe and beneficial for our users and...SeniorWork at officeVisa sponsorshipFlexible hoursShift work
$300k
...Staff + Sr. Software Engineer, Inference San Francisco, CA | New York City, NY | Seattle, WA About Anthropic Anthropic's mission is to create reliable, interpretable, and steerable AI systems. We want AI to be safe and beneficial for our users and for society...SeniorWork at officeWorldwideVisa sponsorshipFlexible hours$325k
...Staff + Sr. Software Engineer, AI Reliability San Francisco, CA | New York City, NY | Seattle, WA About Anthropic Anthropic's mission is to create reliable, interpretable, and steerable AI systems. We want AI to be safe and beneficial for our users and for society...SeniorWork at officeVisa sponsorshipFlexible hours$160k - $240k
...Senior Software Engineer - AI Inference Location New York Business Area Engineering and CTO Ref # 10050779 Description & Requirements Our team: Join the team that is building the core infrastructure for AI at Bloomberg. The Bloomberg AI Inference...SeniorTemporary workFor contractorsWork experience placement- ...ABOUT BASETEN Baseten powers mission-critical inference for the world's most dynamic AI companies, like Cursor, Notion, OpenEvidence... ...Greylock, and Conviction. Join us and help build the platform engineers turn to to ship AI products. THE ROLE: Voice is...SuggestedFlexible hours
- ...specializes in developing custom hardware systems to accelerate AI inference. These inference systems offer significant performance and... ...to create the world's best AI inference systems. Senior Software Engineer – Machine Learning Systems & High-Performance LLM Inference...Senior
$190k - $220k
...integrations and join the team on‑call rotation. Collaborate with cross‑functional partners such as Product, Program, Design and Engineers, and directly with external partners. Provide input in team roadmap and technical direction. Build large scale backend solutions that...SeniorTemporary workWork experience placementCasual workLive inWork at officeRemote work$184.9k - $250.2k
...Studio from a catalog and provisioning engine into an intelligent, self-improving learning... ...and operates. We are hiring a Senior Software Development Engineer to drive the... ...machine learning model architecture and inference - 5+ years of highly scalable systems...SeniorInternshipFlexible hours- ...Software Engineer, Model Routing & Inference Engineering · Full-time · New York; San Francisco Our mission is to automate coding. The first step in our journey is to build the best tool for professional programmers, using a combination of inventive research, design...Full timeWork at office
$126k - $248k
...We are hiring a Senior Software Engineer to join our Server Security team. The Server Security team is a development-focused group within MongoDB's core engineering organization. Operating "close to the bottom of the stack," the team builds features that enable database...SeniorLocal areaRemote workWorldwideFlexible hours$139k - $242k
...Senior Software Engineer, Server Fleet Infrastructure Livingston, NJ / New York, NY / Sunnyvale, CA / Bellevue, WA CoreWeave is The Essential Cloud for AI™. Built for pioneers by pioneers, CoreWeave delivers a platform of technology, tools, and teams that enables innovators...SeniorTemporary workCasual workWork at officeRemote workFlexible hours$128.7k - $261.3k
...deployment platform within the autonomous vehicle sector. This role involves automating model deployment from training to on-vehicle inference and enhancing developer experience through robust tooling. Candidates should hold a relevant degree and possess significant...Senior$229.9k - $262.4k
Senior Lead AI Engineer (FM Hosting, LLM Inference) Overview: At Capital One, we are creating responsible and... ...develop, test, deploy, and support AI software components including foundation model... ...McLean, VA: $229,900 - $262,400 for Sr. Lead AI Engineer New York, NY: $2...SeniorFull timePart timeLocal areaImmediate start$320k
...group of committed researchers, engineers, policy experts, and business... ...Our mandate is to make inference deployment boring and unattended... ...continuous and unattended. As a Software Engineer on the Launch... ...policy: Currently, we expect all staff to be in one of our offices...Full timeWork at officeVisa sponsorshipFlexible hoursShift work$300k
...group of committed researchers, engineers, policy experts, and business... ...About the role Our Inference team is responsible for building... ...you: Have significant software engineering experience,... ...policy: Currently, we expect all staff to be in one of our offices...Full timeWork at officeWorldwideVisa sponsorshipFlexible hours- ...systems that can scale with global participation. As a Senior Software Engineer / Architect , you’ll design and build the core... ...role emphasizes systems architecture at scale: distributed inference pipelines, Pareto frontier computation, caching and load-balancing...Senior
$128.7k - $261.3k
...About the Team The Model Deployment & Inference Solutions team in GM AV deploys machine learning... ...currently performed manually by engineers. Build the developer experience that ML... ...Experience designing clean, well-tested software with clear interfaces and good abstractions...SeniorFlexible hoursShift work$200k - $250k
...’re seeking an experienced Senior MLOps Engineer to take ownership of how our machine learning... ...and scaling – for a custom-built inference platform powering a live conversational... ...build observability and alerting. Apply software engineering best practices including testing...SeniorRemote workFlexible hours$174k - $273k
...open platform integrates with nearly 650 software, data and consulting partners to power... ...Paulo. The Role We are currently seeking a Staff Software Architect to join the Office of... ...leadership, education and guidance to engineering and product teams across the organization...SeniorH1bWork at officeRemote workWorldwideVisa sponsorshipWork visa- ...Real Time Trading, Compliance & Risk Systems Engineer This opportunity is sitting in NYC - Hybrid 2-3x/week You will be responsible for designing, implementing & continuously evolving real time trading, compliance & risk systems on the engineering team. This team works...Senior
$197k - $290k
...Life360 is seeking a skilled Cloud Engineer specializing in AI systems to architect the inference pipeline that integrates location data and user insights within... ...197,000 and $290,000 USD, alongside comprehensive employee benefits for US-based staff. #J-18808-Ljbffr...Remote work- ...Blockchain to users around the world, through our leading products OKX, OKX Wallet, OKLink and more. About the team: As a mobile software engineer, you will build and maintain a core OKX app with millions of daily active users. You will work cross-functionally with design,...SeniorWork at office
- ...smooth, efficient clinical operations. Our best‑in‑class modern software solution for practice management is used by practices across... ...commitment to developer experience, that make being a Prospyr engineer delightful and allow you to stay in flow. Responsibilities Collaborate...SeniorRemote work
- Java/J2EE Frameworks (Spring MVC, Spring Batch, SpringBoot Microservices) Design patterns and principles, Web Services (REST, SOAP) Security & Integration technologies (SSO, MSSL, OAuth, JWT, etc.) Oracle PostgreSQL, NoSQL,JUNIT and Rest Assure Maven...Senior
- ...The Phia Group is looking for a Senior Software Engineer to join their team and take on the challenge of designing and building applications that support healthcare-related workflows. This remote position emphasizes AI-enabled development, focusing on improving software...SeniorRemote work
- ...Sr. Software Engineer A confidential organization is seeking a Senior Software Engineer to join a mature, high-performing engineering team. This position is ideal for a seasoned engineer who thrives in solving complex technical challenges, contributes to system design...SeniorContract work
- ...Framework Ventures is seeking a Senior/Staff Software Engineer to lead the technical design of a high-frequency trading platform. You will create innovative solutions for complex programming challenges and improve software architecture for efficiency. Responsibilities...Senior
- ...Senior Principal Software Engineer We're looking for a tech leader ready to take their career to new heights. Join the ranks of top talent... .... Leads deployment and optimization using Model Inference servers such as Triton Inference Server and vLLM for high-throughput...Senior
- ...About the Role We are looking for a backend-focused Senior Software Engineer to design, build, and scale the production systems that... ...serving frameworks, GPU infrastructure, batch vs. real-time inference) Familiarity with React or frontend technologies sufficient...Senior
$220k - $270k
...Senior Software Engineer USD $220,000 - $270,000 meaningful equity | New York | 5 days onsite Soda has partnered with an AI infrastructure... ...role. You'll work on core infrastructure problems around inference, orchestration, context evolution, and human-guided AI systems...Senior
Do you want to receive more vacancies?
Subscribe and receive similar vacancies to Staff + Sr. Software Engineer, Inference. Be the first to apply!
- graduate software developer New York, NY
- rust software engineer New York, NY
- senior software design engineer New York, NY
- software engineer student New York, NY
- software engineer amazon New York, NY
- software developer positions New York, NY
- software engineer full time New York, NY
- software qa engineer New York, NY
- new graduate software engineer New York, NY
- junior software developer New York, NY



