Sign up to access all features of our service.
  • Job search
  • Favorites
  • Create a CV
    New
  • Salaries
  • Subscriptions

Staff + Sr. Software Engineer, Inference

$300k

Menlo Ventures

About Anthropic Anthropic’s mission is to create reliable, interpretable, and steerable AI systems. We want AI to be safe and beneficial for our users and for society as a whole. Our team is a growing group of committed researchers, engineers, policy experts, and business leaders working together to build beneficial AI systems. About The Role Our Inference team builds and maintains the critical systems that serve Claude to millions of users worldwide. We handle the entire stack from intelligent request routing to fleet‑wide orchestration across diverse AI accelerators. The team’s dual mandate is to maximize compute efficiency to serve explosive customer growth and to enable breakthrough research by providing scientists with high‑performance inference infrastructure to develop next‑generation models. We tackle complex distributed systems challenges across multiple accelerator families and emerging AI hardware running in multiple cloud platforms. Representative Projects Designing intelligent routing algorithms that optimize request distribution across thousands of accelerators. Autoscaling compute fleet to dynamically match supply with demand across production, research, and experimental workloads. Building production‑grade deployment pipelines for releasing new models to millions of users. Integrating new AI accelerator platforms to maintain our hardware‑agnostic competitive advantage. Contributing to new inference features such as structured sampling and prompt caching. Supporting inference for new model architectures. Analyzing observability data to tune performance based on real‑world production workloads. Managing multi‑region deployments and geographic routing for global customers. Qualifications Significant software engineering experience, particularly with distributed systems. Results‑oriented with a bias toward flexibility and impact. Ability to take on tasks that go beyond the job description. Enjoys pair programming. Wants to learn more about machine learning systems and infrastructure. Thrives in environments where technical excellence drives both business results and research breakthroughs. Concerned about the societal impacts of the work. Preferred Qualifications High‑performance, large‑scale distributed systems experience. Implementing and deploying machine learning systems at scale. Experience with load balancing, request routing, or traffic management systems. LLM inference optimization, batching, and caching strategies. Kubernetes and cloud infrastructure (AWS, GCP, Azure). Python or Rust proficiency. Compensation Annual Salary: $300,000—$485,000 USD. #J-18808-Ljbffr Menlo Ventures

Vacancy posted 3 days ago
Similar jobs that could be interesting for youBased on the Staff + Sr. Software Engineer, Inference in New York, NY vacancy
  •  ...Staff + Sr. Software Engineer, Inference Deployment San Francisco, CA | New York City, NY | Seattle, WA About Anthropic Anthropic's mission is to create reliable, interpretable, and steerable AI systems. We want AI to be safe and beneficial for our users and... 
    Senior
    Work at office
    Visa sponsorship
    Flexible hours
    Shift work

    anthropic

    New York, NY
    2 days ago
  • $300k

     ...Staff + Sr. Software Engineer, Inference San Francisco, CA | New York City, NY | Seattle, WA About Anthropic Anthropic's mission is to create reliable, interpretable, and steerable AI systems. We want AI to be safe and beneficial for our users and for society... 
    Senior
    Work at office
    Worldwide
    Visa sponsorship
    Flexible hours

    anthropic

    New York, NY
    2 days ago
  • $325k

     ...Staff + Sr. Software Engineer, AI Reliability San Francisco, CA | New York City, NY | Seattle, WA About Anthropic Anthropic's mission is to create reliable, interpretable, and steerable AI systems. We want AI to be safe and beneficial for our users and for society... 
    Senior
    Work at office
    Visa sponsorship
    Flexible hours

    Anthropic

    New York, NY
    2 days ago
  • $160k - $240k

     ...Senior Software Engineer - AI Inference Location New York Business Area Engineering and CTO Ref # 10050779 Description & Requirements Our team: Join the team that is building the core infrastructure for AI at Bloomberg. The Bloomberg AI Inference... 
    Senior
    Temporary work
    For contractors
    Work experience placement

    Bloomberg

    New York, NY
    2 days ago
  •  ...ABOUT BASETEN Baseten powers mission-critical inference for the world's most dynamic AI companies, like Cursor, Notion, OpenEvidence...  ...Greylock, and Conviction. Join us and help build the platform engineers turn to to ship AI products. THE ROLE: Voice is... 
    Suggested
    Flexible hours

    Baseten

    New York, NY
    3 days ago
  •  ...specializes in developing custom hardware systems to accelerate AI inference. These inference systems offer significant performance and...  ...to create the world's best AI inference systems. Senior Software Engineer – Machine Learning Systems & High-Performance LLM Inference... 
    Senior

    GrabJobs

    New York, NY
    4 days ago
  • $190k - $220k

     ...integrations and join the team on‑call rotation. Collaborate with cross‑functional partners such as Product, Program, Design and Engineers, and directly with external partners. Provide input in team roadmap and technical direction. Build large scale backend solutions that... 
    Senior
    Temporary work
    Work experience placement
    Casual work
    Live in
    Work at office
    Remote work

    Traveltechessentialist

    New York, NY
    3 days ago
  • $184.9k - $250.2k

     ...Studio from a catalog and provisioning engine into an intelligent, self-improving learning...  ...and operates. We are hiring a Senior Software Development Engineer to drive the...  ...machine learning model architecture and inference - 5+ years of highly scalable systems... 
    Senior
    Internship
    Flexible hours

    Amazon

    Jersey City, NJ
    2 days ago
  •  ...Software Engineer, Model Routing & Inference Engineering · Full-time · New York; San Francisco Our mission is to automate coding. The first step in our journey is to build the best tool for professional programmers, using a combination of inventive research, design... 
    Full time
    Work at office

    Anysphere

    New York, NY
    2 days ago
  • $126k - $248k

     ...We are hiring a Senior Software Engineer to join our Server Security team. The Server Security team is a development-focused group within MongoDB's core engineering organization. Operating "close to the bottom of the stack," the team builds features that enable database... 
    Senior
    Local area
    Remote work
    Worldwide
    Flexible hours

    MongoDB

    New York, NY
    21 hours ago
  • $139k - $242k

     ...Senior Software Engineer, Server Fleet Infrastructure Livingston, NJ / New York, NY / Sunnyvale, CA / Bellevue, WA CoreWeave is The Essential Cloud for AI™. Built for pioneers by pioneers, CoreWeave delivers a platform of technology, tools, and teams that enables innovators... 
    Senior
    Temporary work
    Casual work
    Work at office
    Remote work
    Flexible hours

    CoreWeave

    New York, NY
    1 day ago
  • $128.7k - $261.3k

     ...deployment platform within the autonomous vehicle sector. This role involves automating model deployment from training to on-vehicle inference and enhancing developer experience through robust tooling. Candidates should hold a relevant degree and possess significant... 
    Senior

    General Motors

    New York, NY
    3 days ago
  • $229.9k - $262.4k

    Senior Lead AI Engineer (FM Hosting, LLM Inference) Overview: At Capital One, we are creating responsible and...  ...develop, test, deploy, and support AI software components including foundation model...  ...McLean, VA: $229,900 - $262,400 for Sr. Lead AI Engineer New York, NY: $2... 
    Senior
    Full time
    Part time
    Local area
    Immediate start

    Capital One

    New York, NY
    1 day ago
  • $320k

     ...group of committed researchers, engineers, policy experts, and business...  ...Our mandate is to make inference deployment boring and unattended...  ...continuous and unattended. As a Software Engineer on the Launch...  ...policy: Currently, we expect all staff to be in one of our offices... 
    Full time
    Work at office
    Visa sponsorship
    Flexible hours
    Shift work

    Anthropic

    New York, NY
    18 hours ago
  • $300k

     ...group of committed researchers, engineers, policy experts, and business...  ...About the role Our Inference team is responsible for building...  ...you: Have significant software engineering experience,...  ...policy: Currently, we expect all staff to be in one of our offices... 
    Full time
    Work at office
    Worldwide
    Visa sponsorship
    Flexible hours

    Anthropic

    New York, NY
    18 hours ago
  •  ...systems that can scale with global participation. As a Senior Software Engineer / Architect , you’ll design and build the core...  ...role emphasizes systems architecture at scale: distributed inference pipelines, Pareto frontier computation, caching and load-balancing... 
    Senior

    Framework Ventures

    New York, NY
    3 days ago
  • $128.7k - $261.3k

     ...About the Team The Model Deployment & Inference Solutions team in GM AV deploys machine learning...  ...currently performed manually by engineers. Build the developer experience that ML...  ...Experience designing clean, well-tested software with clear interfaces and good abstractions... 
    Senior
    Flexible hours
    Shift work

    General Motors

    New York, NY
    3 days ago
  • $200k - $250k

     ...’re seeking an experienced Senior MLOps Engineer to take ownership of how our machine learning...  ...and scaling – for a custom-built inference platform powering a live conversational...  ...build observability and alerting. Apply software engineering best practices including testing... 
    Senior
    Remote work
    Flexible hours

    Wizard

    New York, NY
    3 days ago
  • $174k - $273k

     ...open platform integrates with nearly 650 software, data and consulting partners to power...  ...Paulo. The Role We are currently seeking a Staff Software Architect to join the Office of...  ...leadership, education and guidance to engineering and product teams across the organization... 
    Senior
    H1b
    Work at office
    Remote work
    Worldwide
    Visa sponsorship
    Work visa

    Addepar

    New York, NY
    3 days ago
  •  ...Real Time Trading, Compliance & Risk Systems Engineer This opportunity is sitting in NYC - Hybrid 2-3x/week You will be responsible for designing, implementing & continuously evolving real time trading, compliance & risk systems on the engineering team. This team works... 
    Senior

    BAMM Staffing

    New York, NY
    2 days ago
  • $197k - $290k

     ...Life360 is seeking a skilled Cloud Engineer specializing in AI systems to architect the inference pipeline that integrates location data and user insights within...  ...197,000 and $290,000 USD, alongside comprehensive employee benefits for US-based staff. #J-18808-Ljbffr... 
    Remote work

    Life360

    New York, NY
    3 days ago
  •  ...Blockchain to users around the world, through our leading products OKX, OKX Wallet, OKLink and more. About the team: As a mobile software engineer, you will build and maintain a core OKX app with millions of daily active users. You will work cross-functionally with design,... 
    Senior
    Work at office

    Framework Ventures

    New York, NY
    3 days ago
  •  ...smooth, efficient clinical operations. Our best‑in‑class modern software solution for practice management is used by practices across...  ...commitment to developer experience, that make being a Prospyr engineer delightful and allow you to stay in flow. Responsibilities Collaborate... 
    Senior
    Remote work

    Prospyr Medical

    New York, NY
    3 days ago
  • Java/J2EE Frameworks (Spring MVC, Spring Batch, SpringBoot Microservices) Design patterns and principles, Web Services (REST, SOAP) Security & Integration technologies (SSO, MSSL, OAuth, JWT, etc.) Oracle PostgreSQL, NoSQL,JUNIT and Rest Assure Maven...
    Senior

    Procyon TS

    New York, NY
    2 days ago
  •  ...The Phia Group is looking for a Senior Software Engineer to join their team and take on the challenge of designing and building applications that support healthcare-related workflows. This remote position emphasizes AI-enabled development, focusing on improving software... 
    Senior
    Remote work

    The Phia Group

    New York, NY
    3 days ago
  •  ...Sr. Software Engineer A confidential organization is seeking a Senior Software Engineer to join a mature, high-performing engineering team. This position is ideal for a seasoned engineer who thrives in solving complex technical challenges, contributes to system design... 
    Senior
    Contract work

    Blake Smith Staffing LLC

    New York, NY
    2 days ago
  •  ...Framework Ventures is seeking a Senior/Staff Software Engineer to lead the technical design of a high-frequency trading platform. You will create innovative solutions for complex programming challenges and improve software architecture for efficiency. Responsibilities... 
    Senior

    Framework Ventures

    New York, NY
    3 days ago
  •  ...Senior Principal Software Engineer We're looking for a tech leader ready to take their career to new heights. Join the ranks of top talent...  .... Leads deployment and optimization using Model Inference servers such as Triton Inference Server and vLLM for high-throughput... 
    Senior

    Chase

    New York, NY
    2 days ago
  •  ...About the Role We are looking for a backend-focused Senior Software Engineer to design, build, and scale the production systems that...  ...serving frameworks, GPU infrastructure, batch vs. real-time inference) Familiarity with React or frontend technologies sufficient... 
    Senior

    Pangram

    New York, NY
    21 hours ago
  • $220k - $270k

     ...Senior Software Engineer USD $220,000 - $270,000 meaningful equity | New York | 5 days onsite Soda has partnered with an AI infrastructure...  ...role. You'll work on core infrastructure problems around inference, orchestration, context evolution, and human-guided AI systems... 
    Senior

    SoDA

    New York, NY
    2 days ago

Do you want to receive more vacancies?

Subscribe and receive similar vacancies to Staff + Sr. Software Engineer, Inference. Be the first to apply!