Software Engineer, Inference

$300k

Full-time

Anthropic

About Anthropic

Anthropic’s mission is to create reliable, interpretable, and steerable AI systems. We want AI to be safe and beneficial for our users and for society as a whole. Our team is a quickly growing group of committed researchers, engineers, policy experts, and business leaders working together to build beneficial AI systems.

About the role

Our Inference team is responsible for building and maintaining the critical systems that serve Claude to millions of users worldwide. We bring Claude to life by serving our models via the industry's largest compute-agnostic inference deployments. We are responsible for the entire stack from intelligent request routing to fleet-wide orchestration across diverse AI accelerators.

The team has a dual mandate: maximizing compute efficiency to serve our explosive customer growth, while enabling breakthrough research by giving our scientists the high-performance inference infrastructure they need to develop next-generation models. We tackle complex, distributed systems challenges across multiple accelerator families and emerging AI hardware running in multiple cloud platforms.

You may be a good fit if you:

Have significant software engineering experience, particularly with distributed systems

Are results-oriented, with a bias towards flexibility and impact

Pick up slack, even if it goes outside your job description

Enjoy pair programming (we love to pair!)

Want to learn more about machine learning systems and infrastructure

Thrive in environments where technical excellence directly drives both business results and research breakthroughs

Care about the societal impacts of your work

Strong candidates may also have experience with:

High-performance, large-scale distributed systems

Implementing and deploying machine learning systems at scale

Load balancing, request routing, or traffic management systems

LLM inference optimization, batching, and caching strategies

Kubernetes and cloud infrastructure (AWS, GCP)

Python or Rust

Representative projects:

Designing intelligent routing algorithms that optimize request distribution across thousands of accelerators

Autoscaling our compute fleet to dynamically match supply with demand across production, research, and experimental workloads

Building production-grade deployment pipelines for releasing new models to millions of users

Integrating new AI accelerator platforms to maintain our hardware-agnostic competitive advantage

Contributing to new inference features (e.g., structured sampling, prompt caching)

Supporting inference for new model architectures

Analyzing observability data to tune performance based on real-world production workloads

Managing multi-region deployments and geographic routing for global customers

Deadline to apply: None. Applications will be reviewed on a rolling basis.

The expected base compensation for this position is below. Our total compensation package for full-time employees includes equity, benefits, and may include incentive compensation.

Annual Salary:

$300,000 - $485,000 USD

Logistics

Education requirements: We require at least a Bachelor's degree in a related field or equivalent experience.

Location-based hybrid policy: Currently, we expect all staff to be in one of our offices at least 25% of the time. However, some roles may require more time in our offices.

Visa sponsorship: We do sponsor visas! However, we aren't able to successfully sponsor visas for every role and every candidate. But if we make you an offer, we will make every reasonable effort to get you a visa, and we retain an immigration lawyer to help with this.

We encourage you to apply even if you do not believe you meet every single qualification. Not all strong candidates will meet every single qualification as listed. Research shows that people who identify as being from underrepresented groups are more prone to experiencing imposter syndrome and doubting the strength of their candidacy, so we urge you not to exclude yourself prematurely and to submit an application if you're interested in this work. We think AI systems like the ones we're building have enormous social and ethical implications. We think this makes representation even more important, and we strive to include a range of diverse perspectives on our team.

How we're different

We believe that the highest-impact AI research will be big science. At Anthropic we work as a single cohesive team on just a few large-scale research efforts. And we value impact — advancing our long-term goals of steerable, trustworthy AI — rather than work on smaller and more specific puzzles. We view AI research as an empirical science, which has as much in common with physics and biology as with traditional efforts in computer science. We're an extremely collaborative group, and we host frequent research discussions to ensure that we are pursuing the highest-impact work at any given time. As such, we greatly value communication skills.

The easiest way to understand our research directions is to read our recent research. This research continues many of the directions our team worked on prior to Anthropic, including: GPT-3, Circuit-Based Interpretability, Multimodal Neurons, Scaling Laws, AI & Compute, Concrete Problems in AI Safety, and Learning from Human Preferences.

Come work with us!

Anthropic is a public benefit corporation headquartered in San Francisco. We offer competitive compensation and benefits, optional equity donation matching, generous vacation and parental leave, flexible working hours, and a lovely office space in which to collaborate with colleagues. Guidance on Candidates' AI Usage: Learn about our policy for using AI in our application process

Apply

Vacancy posted 11 hours ago

Similar jobs that could be interesting for youBased on the Software Engineer, Inference in New York, NY vacancy

Software Engineer, Inference Deployment
$320k
...growing group of committed researchers, engineers, policy experts, and business leaders working... ...the Role Our mandate is to make inference deployment boring and unattended.... ...deployment continuous and unattended. As a Software Engineer on the Launch Engineering team,...
Suggested
Full time
Work at office
Visa sponsorship
Flexible hours
Shift work
Anthropic
New York, NY
12 hours ago
Staff + Senior Software Engineer, Cloud Inference
$300k
...growing group of committed researchers, engineers, policy experts, and business leaders working... .... About the Role The Cloud Inference team scales and optimizes Claude to serve... ...Fit If You: Have significant software engineering experience, with a strong background...
Suggested
Full time
Work at office
Visa sponsorship
Flexible hours
Anthropic
New York, NY
12 hours ago
Senior Lead AI Engineer (FM Hosting, LLM Inference)
$229.9k - $262.4k
...Senior Lead AI Engineer (FM Hosting, LLM Inference) Overview: At Capital One, we are creating responsible and reliable AI systems, changing... ...One. Design, develop, test, deploy, and support AI software components including foundation model training, large language...
Suggested
Full time
Part time
Local area
Capital One
New York, NY
1 day ago
Fullstack Software Engineer
...Description We’re looking for a fullstack engineer who wants to build the interfaces, tools... ...systems. That includes building core inference workflows, creating intuitive UI for complex... ...function Engineering and Information Technology Software Development #J-18808-Ljbffr...
Suggested
Full time
Data Lab
New York, NY
3 days ago
Full Stack Software Engineer
...impactful. We’re a collaborative team of engineers, researchers, and designers who are... ...about AI and we’re looking for a Full Stack Software Engineer who’s excited to work at the intersection... ...—ready to handle large-scale NLP inference, analytics, and data flows. Build and...
Suggested
Second job
Alexandria Technology, Inc.
New York, NY
3 days ago
Software Engineer, Platform
...Beam is an ultrafast AI inference platform. We built a serverless runtime that launches... ...hire someone to help us with Platform Engineering work. We’re working on lots of fun problems... ...native technologies, and open source software Benefits Competitive salary and meaningful...
Work at office
Beam
New York, NY
3 days ago
Software Engineer, Observability
$109k - $145k
...power them. This team enables both internal engineers and customers to monitor, troubleshoot,... ...About the role: As a Software Engineer on the Observability team, you... ...GPU-based systems, large-scale training/inference workloads, or MLOps tooling Why...
Permanent employment
Full time
Temporary work
Casual work
Work at office
Remote work
Flexible hours
Coreweave
New York, NY
12 hours ago
Software Engineer
...This is a high-ownership generalist role. You'll work across 3 services spanning TypeScript/React, Python backends, and ML/CV inference, all running on Google Cloud and Modal. Comfort moving between product code and ML infrastructure is essential. Physician web portal...
Full time
Ataraxis AI
New York, NY
11 hours ago
Software Engineer, Machine Learning (Systems)
$240k
...defense layer for the AI age and are looking for an exceptional ML engineer to stabilize the system that turns raw signal into decisions —... ...turn them into consistent, trusted decisions. Define how inference works when inputs are incomplete, noisy, or conflicting....
Full time
Flexible hours
Sweep360
New York, NY
11 hours ago
Senior Software Engineer
...financial ecosystem. Role Description As a Senior Software Engineer , you'll be one of the early technical hires building the systems... ...customer-facing tools Develop infrastructure that serves inference and network analysis results in real time with high accuracy...
Full time
Cobalt Identity Systems
New York, NY
12 hours ago
Software Engineer, Backend
$300k - $320k
...growing group of committed researchers, engineers, policy experts, and business leaders working... ...: Anthropic is looking for backend software engineers to work across our product... ...our models. You'll partner closely with inference and safeguards to optimize the full...
Full time
Work at office
Visa sponsorship
Flexible hours
Anthropic
New York, NY
12 hours ago
Software Engineer
...isn't an AI wrapper slapped onto legacy software - we built a proprietary general ledger... .... About the role Hanover Park is an engineering-first company on a mission to build the... ...Trigger.dev for background jobs and AI inference. What we\'re looking for Need: Strong full...
Local area
Hanover Park
New York, NY
4 days ago
Software Engineer
$180k - $220k
...how value compounds across the platform. Engineers here treat AI as a force multiplier in... ...Join an AI-native engineering team as a Software Engineer. The engineering org is currently... ...that supports production-grade inference, evaluation, and monitoring. Work across...
Full time
For contractors
Work at office
Relocation
Syndesus, Inc.
New York, NY
3 days ago
Founding Software Engineer
$150k - $250k
...About The Team Mbodi builds the software layer that lets industrial robots learn new skills from instruction in minutes... ..., distributed agent orchestration, and compiled neural inference. As one of our early engineers, you’ll work on the core systems that translate camera...
Work at office
3 days per week
Mbodi AI
New York, NY
3 days ago
Software Engineer I
...As a Software Engineer I at Aledade, we maintain, improve, and expand our web application and data pipelines. We're looking for engineers... ...Expertise with statistical data techniques (such as causal inference, syntactic analysis, sampling methods, NLP etc), with experience...
Aledade,-Inc.-
New York, NY
2 days ago
Software Engineer, Observability
$109k - $145k
...power them. This team enables both internal engineers and customers to monitor, troubleshoot,... ...at massive scale. About the role: As a Software Engineer on the Observability team, you... ...GPU‑based systems, large‑scale training/inference workloads, or MLOps tooling Why...
Permanent employment
Temporary work
Casual work
Work at office
Flexible hours
CoreWeave
New York, NY
3 days ago
Software Engineer
$184k - $287.5k
...incorporating: ~Control-plane gateway ~Privacy-conscious inference router ~Declarative policy enforcement ~Specialized container... ...of a Bachelor's degree in Computer Science, Electrical Engineering, or a related technical field, or equivalent experience. ~8...
Remotive
New York, NY
2 days ago
Software Engineer
$120k - $140k
...the job poster from Harnham Recruitment Consultant | Data & Software Engineering at Harnham Software Engineer New York, NY (Hybrid) $120,000... ...Referrals increase your chances of interviewing at Harnham by 2x Inferred from the description for this job Medical insurance Vision...
Full time
Harnham
New York, NY
1 day ago
Software Engineer in Test
$38.45 - $62.5 per hour
...Software Development Engineer in Test (SDET) Location: Remote or Hybrid in TX (Dallas metro-area) Long-Term Contract (3-4 years) Pay Rate: $38.... ...interviewing at ConsultNet Technology Services and Solutions by 2x Inferred from the description for this job Medical insurance...
Long term contract
Contract work
Work experience placement
Remote work
ConsultNet Technology Services and Solutions
New York, NY
4 days ago
Software Development Engineer
$165.2k - $223.6k
...Amazon Web Services (AWS) is building a central pipeline of Software Development Engineer (SDE) talent for anticipated roles in 2026. This... ...fundamentals, including transformer architecture, training/inference lifecycles, and optimization techniques. Knowledge of Python...
Internship
Local area
Flexible hours
Day shift
Amazon
New York, NY
2 days ago
Senior Software Engineer
$240k - $260k
...Base pay range $240,000.00/yr - $260,000.00/yr Senior / Staff Software Engineer (AI + Secure Data Infrastructure) — Mission-Driven National... .... Launch agentic systems leveraging LLMs, embeddings, and inference into high-security production environments. Design and maintain...
Full time
FORTË
New York, NY
3 days ago
Sr Software Engineer
...generation of AI-assisted tooling for aerospace engine design. We’re looking for a growth-... ...~Design, build, deliver, and maintain software applications and services across ML,... ...scalable workflows that enable modeling, inference, and decision support in design and manufacturing...
Work experience placement
Remotive
New York, NY
4 days ago
Senior Software Engineer
$225k - $300k
...who do the same. Role Overview We’re looking for a fullstack engineer who wants to build the interfaces, tools, and infrastructure that... ...document-understanding systems. That includes building core inference workflows, creating intuitive UI for complex parsing tasks, and...
Data Lab
New York, NY
4 days ago
Senior Software Engineer
$150k - $200k
...RSUs Direct message the job poster from VANTA Partners Inc Software Engineer (Full Stack) – Build Cutting-Edge Tech with Impact Location:... ...increase your chances of interviewing at VANTA Partners Inc by 2x Inferred from the description for this job Medical insurance Vision...
Full time
Remote work
Flexible hours
VANTA PARTNERS Inc
New York, NY
3 days ago
Senior Software Engineer
$200k - $250k
...collaborative team and the chance to shape engineering culture from the ground up. This role... ...function Information Technology Industries Software Development, IT System Custom Software... ...of interviewing at Foxley Talent by 2x Inferred from the description for this job Medical...
Full time
Summer work
Internship
Work at office
Relocation
Visa sponsorship
Foxley Talent
New York, NY
3 days ago
Senior Software Engineer - Payments
$165k - $180k
...Join to apply for the Senior Software Engineer - Payments role at Brigit Join to apply for the Senior Software Engineer - Payments role at... ...Referrals increase your chances of interviewing at Brigit by 2x Inferred from the description for this job Medical insurance Vision...
Full time
Summer work
Internship
Work at office
Local area
Immediate start
Remote work
Flexible hours
Brigit
New York, NY
3 days ago
Senior Software Engineer
$190k - $240k
...BriefCatch is the leading legal writing software trusted by top litigators, judges, and legal... ...genuinely matter, and we’re looking for engineers who care about the same things The Role... ...for document processing and AI inference workloads. Architect and run cloud-native...
Live in
RiverPark Ventures
New York, NY
3 days ago
Senior Software Engineer
$150k - $200k
...Senior Software Engineer Full‑Time NYC - Hybrid Base pay range $150,000.00/yr - $200,000.00/yr Additional compensation types Annual Bonus... ...now! Seniority Level Entry level Employment type Full‑time Inferred from the description for this job Medical insurance 401(k) #J...
Full time
Flexible hours
Selby Jennings
New York, NY
2 days ago
Software Engineer, Enterprise AI
$216k - $270k
...platform that provides APIs for knowledge retrieval, inference, evaluation, and more. We are looking for a strong engineer to join our team and help us build and scale... ...candidate will have a strong understanding of software engineering principles and practices, as well...
Full time
Scale Ai, Inc.
New York, NY
12 hours ago
Software Engineer, Delivery
$120k - $240k
...is a deep-tech company of scientists and engineers, developing machine learning... ...Own the deployment of custom-tailored software solutions with small, high performing teams... ...learning concepts (eg. model training, model inference, hardware accelerations) What we offer...
Full time
Work experience placement
Work at office
Remote work
Flexible hours
Physicsx
New York, NY
12 hours ago

Do you want to receive more vacancies?

Subscribe and receive similar vacancies to Software Engineer, Inference. Be the first to apply!