Model Performance Software Engineer, Claude Code
$405k
United States Digital Space LLC
About the company

The company’s mission is to create reliable, interpretable, and steerable AI systems. We want AI to be safe and beneficial for our users and for society as a whole. Our team is a quickly growing group of committed researchers, engineers, policy experts, and business leaders working together to build beneficial AI systems.

About the Role

We're looking for a Staff Software Engineer to set technical direction at the intersection of engineering and research on the Claude Code team. In this role, you'll partner directly with the company's researchers and engineering leadership to shape how we measure, understand, and improve Claude's coding capabilities. You'll architect the systems, tooling, and evaluation infrastructure that determine how quickly our research can move, and you'll be accountable for the technical decisions that ripple across the team and beyond.

This is a senior individual contributor role for someone who has already built and owned systems at significant scale, and who is ready to operate as a technical leader: driving architecture, mentoring engineers, and influencing the direction of Claude Code itself.

Responsibilities

- Set technical direction for evaluation systems, research infrastructure, and internal tooling across the Claude Code team
- Architect eval frameworks that measure model capabilities across diverse coding tasks and scale with our research roadmap
- Lead the design of infrastructure that enables researchers to run experiments at scale, and make the foundational tradeoffs that shape how the team operates for years
- Identify the highest-leverage engineering investments, often before anyone has asked for them, and drive them to completion
- Serve as a senior technical bridge between product and research, using strong product intuition to influence which capabilities we prioritize and how we measure progress against them
- Mentor and raise the bar for other engineers on the team; review designs, unblock peers, and model the engineering standards we want to scale
- Partner with research leads to translate ambiguous research questions into durable engineering solutions
- Own critical systems end-to-end, from architecture through production reliability, and take responsibility for their long-term health

You may be a good fit if you:

- Have 10+ years of software engineering experience, with a track record of operating as a Staff or Principal engineer (or equivalent) at a high-caliber organization
- Have architected and owned complex, high-stakes systems: pipelines, infrastructure, or platforms that orchestrate many components, handle significant state and logic, and serve multiple teams
- Have a history of setting technical direction that others follow, through design docs, architectural decisions, or technical strategy that shaped how a team or org operates
- Thrive in high-intensity environments with fast iteration cycles, and have the judgment to know when to move fast and when to invest in durability
- Take full ownership of ambiguous, open-ended problems and drive them to completion with minimal direction
- Are a power user of agentic coding tools with deep intuition about model capabilities and limitations
- Can dive into unfamiliar technical domains (ML systems, research workflows, novel infrastructure) and get to the frontier quickly
- Care deeply about correctness and reliability, and have raised engineering standards on teams you've been part of
- Are energized by working at the boundary between engineering and AI research, and by the prospect of influencing both

Strong candidates may also have experience with:

- Designing or scaling eval/evaluation frameworks for ML systems
- Reinforcement learning infrastructure or training systems
- Leading technical initiatives in high-performance, demanding environments: trading firms, quant funds, frontier research labs, or fast-moving startups where intensity and technical excellence are the norm
- Research computing, scientific infrastructure, or developer platforms at scale
- A strong quantitative foundation (math, physics, or related fields)
- Expertise in Python and TypeScript

Salary

Annual Salary: $405,000 - $485,000 USD

Logistics

- Minimum education: Bachelor’s degree or an equivalent combination of education, training, and/or experience
- Required field of study: A field relevant to the role as demonstrated through coursework, training, or professional experience
- Minimum years of experience: Years of experience required will correlate with the internal job level requirements for the position
- Location-based hybrid policy: Currently, we expect all staff to be in one of our offices at least 25% of the time. However, some roles may require more time in our offices.
- Visa sponsorship: We sponsor visas for qualified candidates. We retain an immigration lawyer to assist with visa processes.

$320k
...committed researchers, engineers, policy experts, and... ...Role We’re looking for a Software Engineer to work at... ...engineering and research on the Claude Code team. In this role,... ...systems that measure model capabilities across... ...Working in high-performance, demanding environments...
Performance, Work experience placement, Work at office, Visa sponsorship, Flexible hours

$305k
...leading AI company in San Francisco, is looking for a Product Manager for Claude Code's model performance team. In this role, you'll drive end-to-end model launches and collaborate with engineers and researchers to optimize coding performance. Ideal candidates have experience...
Performance

$305k
Product Manager for Claude Code Anthropic's mission is to create reliable, interpretable... ...growing group of committed researchers, engineers, policy experts, and business leaders... ...As a Product Manager on Claude Code's model performance team, you will drive model launches...
Performance, Visa sponsorship

$320k
...core systems inside Bun and Claude Code: runtime internals, I/O paths,... ...and the Bun runtime Dig into performance problems across the stack: profiling... ...interface up through the JS engine to the agent layer... ...product engineers to make sure model capabilities translate into a...
Performance, Work experience placement, Work at office, Visa sponsorship, Flexible hours

- ...highly skilled Product Manager for Claude Code located in San Francisco. In this role, you will manage model launches, collaborate with engineering and researchers, and enhance developer... ...executing a roadmap for the future of coding performance.
Colorwave
Performance

$220k - $320k
...squeezing every last drop of performance out of GPUs, diving... ...specialized language models for companies that... ...funded ten-person team of engineers who work in-person in... ...team has been writing code for over 10 years, and... ...and run their own software companies. We are high...
Performance, Work at office

$166k - $225k
...their business. Databricks’ Model Serving product provides enterprises... ...efficiency. As a Senior Engineer, you’ll play a critical role... ...and trade-offs to optimize performance, throughput, autoscaling, and... ...Establish best practices for code quality, testing, and operational...
Performance, Local area, Worldwide

$192k - $260k
...business. Foundation Model Serving is the API Product... ...models like Claude and OpenAI GPT. For this... ...necessary. We’re looking for engineers who have owned high-scale... ...trade-offs to optimize performance, throughput, autoscaling... ...best practices for code quality, testing, and operational...
Performance, Local area, Worldwide

- ...the frontier of AI to bring cutting-edge models into production. With our recent $150M... ...customer demand. THE ROLE Baseten’s Model Performance (MP) team is responsible for ensuring... ...contributions to open-source inference engines (vLLM, TensorRT-LLM, SGLang, TGI) Knowledge...
Performance, Flexible hours

$405k
About the role Every engineer at Anthropic depends on the path from pull request to production. The Developer Productivity team... ...fast, predictable system that scales with the volume of code shipping into Claude and our research infrastructure. In this role, you'll be...
Visa sponsorship

$405k
...committed researchers, engineers, policy experts, and business... ...the people building Claude, and increasingly, for... ...re looking for a Staff Software Engineer to own our... ...Claude in the loop. Agentic coding is both how we operate... ...environment isolation model (sandboxes, ephemerally...
Local area, Visa sponsorship

$50 - $150 per hour
...San Francisco is seeking a Mid-Senior level contractor to improve large language model performance through software engineering expertise. The role involves leading projects, evaluating code quality, and collaborating with the team. Ideal candidates must have several...
Performance, Contract work, For contractors, Flexible hours

$320k - $405k
...committed researchers, engineers, policy experts, and business... ...About the role Claude.ai is one of the most-... ...Claude from a capable model into a product people genuinely... ...care about the performance characteristics that shape... ...a high bar for code quality and consistency...
Performance, Visa sponsorship, Shift work

- Software Engineer (Model Evaluation & Benchmarking) About the Role We are hiring Engineers focused on AI Model Evaluation to build the systems... ...automated benchmarking, dataset-driven testing, and performance validation pipelines. You will work at the intersection of...
Performance

- About the Team We’re hiring software engineers to make the company’s Model Performance teams more productive. These teams work on the systems, tooling, and infrastructure that help improve model performance across the company’s training and inference workloads at frontier...
Performance

- ...frontier of AI to bring cutting-edge models into production. We're growing... ...Join us and help build the platform engineers turn to to ship AI products. THE... ...intelligence? We are looking for a Software Engineer focused on ML performance to join our dynamic team. This role...
Performance, Flexible hours

$270k - $310k
...growing group of committed researchers, engineers, policy experts, and business leaders... ...and customers demonstrate and support Claude Code with confidence. You’ll own Claude Code... ...Looking For 7+ years in a technical role (software engineering, solutions engineering,...
Work at office, Flexible hours

$192k - $260k
...their business. Databricks’ Model Serving product provides enterprises... ...cost efficiency. As a Staff Engineer, you’ll play a critical role... ...and trade-offs to optimize performance, throughput, autoscaling, and... ...Establish best practices for code quality, testing, and...
Performance, Local area, Worldwide

$305k
Anthropic is looking for a Product Manager for Claude Code's model performance team in San Francisco. As a Product Manager, you will lead end-to-end... ...launches, implement evaluations, and collaborate with engineers and researchers. The ideal candidate has an engineering background...
Performance

$186.4k - $266.3k
...is responsible for building the high-performance, secure, and scalable systems that... ...What You'll Do: We are looking for a software engineer who is proactive, collaborative, and... ...Backend Development: Using AI-assisted coding tools (Claude Code, Cursor, etc.) to accelerate...
Performance, Work at office, 2 days per week

- ...computer vision to help physicians perform highly precise endovascular procedures... ...others. We are looking to hire a Software Engineer. What You'll Do: Develop... ...Python Advanced proficiency with AI coding tools (e.g., Claude Code, Cursor, or similar),...
Performance

$150k - $250k
...Software Engineer Back End As a Software Engineer Back End... ...build secure, high-performance backend services deployed... ..., driving high coding standards and scalable... ...tools like ChatGPT, Claude, Perplexity, and Cursor... ...to state-of-the-art models, generous usage of modern...
Performance, Work at office, 3 days per week

$180k - $260k
...the best product managers, software, and hardware talent at... ...help them hire. Software Engineer (Backend) Location:... ...reliability, observability, and performance Work directly with... ...using modern AI coding tools such as Cursor, Claude Code, Copilot, or similar...
Performance, H1b, Work at office, Remote work, Relocation package

$170k - $220k
...Jiffy.com is seeking a Software Development Engineer with deep hands-on... ...experience orchestrating AI coding agents to ship... ...deployed by coding agents (Claude Code, Codex, Cursor)... ..., API design, data modeling, distributed-system reliability, and performance. ~ Proficiency in...
Performance, H1b, Local area, Home office, Shift work

$153k - $376k
...prototype, translating designs into code, or iterating with AI. From... ...join us! As a Full Stack Engineer, you’ll have the opportunity... ...user experiences, optimizing performance in real-time collaborative... ...g., GitHub Copilot, ChatGPT, Claude) to write, debug, and...
Performance, Full time, Remote work, Work from home

$159.2k - $301.6k
...Senior Full-Stack Software Engineer We are looking for a Senior Full-Stack... ...interfaces and robust, high-performing backend systems. It involves... ...architecture and quality coding practices for the platform's... ...across the team. Use tools like Claude Code, Cursor, and Codex to...
Performance, Temporary work, Local area, Worldwide

$204.4k - $285.6k
...Senior Full Stack Software Engineer San Francisco (Brisbane),... ...to build polished, high-performance web applications using React... ...management patterns. Code Quality & Best Practices... ...leading language models and agentic tools like Claude Code and Codex, or similar...
Performance, Local area, Shift work

- ...We're looking for a senior software engineer to not only amplify our development... ..., Redis, LLMs (GPT, Claude, etc), Docker, and other modern... ...into scalable and efficient code. Participate in and... ...completion with a focus on performance, reliability, and security....
Performance, Local area, Remote work

- ...inherent to generative models. CTGT is the... ..., the CTGT Policy Engine (paired with GPT-1... ...Gemini 3 Pro Preview, Claude 4.5 Opus and 4.5... ...controllable, and performant in practice.... ...you will write the code that proves those... ...the fundamentals of software engineering, applied...
Performance

- ...waiting on design and engineering to do it. Today we deliver... ..., and optimizes performance in response to real-world... ...The role As a Software Engineer at Flint, you... ...web experiences, the code-generation and editing... ...dev tools regularly (Claude Code, Cursor, Copilot,...
Performance, Work at office, Shift work, Night shift

