Sign up to access all features of our service.
  • Job search
  • Favorites
  • Create a CV
    New
  • Salaries
  • Subscriptions

Senior Software Engineer II, Inference

$165k - $242k

CoreWeave

Senior Software Engineer II, Inference

Sunnyvale, CA / Bellevue, WA CoreWeave is The Essential Cloud for AI™. Built for pioneers by pioneers, CoreWeave delivers a platform of technology, tools, and teams that enables innovators to build and scale AI with confidence. Trusted by leading AI labs, startups, and global enterprises, CoreWeave combines superior infrastructure performance with deep technical expertise to accelerate breakthroughs and turn compute into capability. Founded in 2017, CoreWeave became a publicly traded company (Nasdaq: CRWV) in March 2025. Learn more at

What You'll Do:

Senior engineers are area owners who lead designs, raise engineering standards, and deliver measurable improvements to latency, throughput, and reliability across multiple services. You'll partner with product, orchestration, and hardware teams to evolve our Kubernetes-native inference platform and meet strict P99 SLAs at scale.

About the Role:
  • Lead design reviews and drive architecture within the team; decompose multi-service work into clear milestones.
  • Define and own SLIs/SLOs; ensure post-incident actions land and reliability improves release-over-release.
  • Implement advanced optimizations (e.g., micro-batch schedulers, speculative decoding, KV-cache reuse) and quantify impact.
  • Strengthen incident posture: capacity planning, autoscaling policy, graceful degradation, rollback/traffic-shift strategies.
  • Mentor IC1/IC2 engineers; review cross-team designs and elevate coding/testing standards.
  • Own an area spanning multiple services and teams (e.g., request routing & adaptive scheduling, cost-per-token analytics, GPU resource isolation).
Who You Are:
  • ~ 5-8 years industry experience building distributed systems or cloud services.
  • Strong coding in Python or Go (C++ a plus) and deep familiarity with networked systems and performance.
  • Optimize end-to-end ML system performance by developing and tuning CUDA kernels, reducing model latency, maximizing compute and memory bandwidth utilization, and leveraging custom accelerators for high-efficiency workloads
  • Hands-on experience with Kubernetes at production scale, CI/CD, and observability stacks (Prometheus, Grafana, OpenTelemetry).
  • Practical knowledge of inference internals: batching, caching, mixed precision (BF16/FP8), streaming token delivery.
  • Proven track record improving tail latency (P95/P99) and service reliability through metrics-driven work.

Preferred:

  • Contributions to inference frameworks (vLLM, Triton, TensorRT-LLM, Ray Serve, TorchServe).
  • Experience with CUDA kernels, NCCL/SHARP, RDMA/NUMA, or GPU interconnect topologies.
  • Leading multi-team initiatives or partnering with customers on mission-critical launches.

Wondering if you're a good fit? We believe in investing in our people, and value candidates who can bring their own diversified experiences to our teams – even if you aren't a 100% skill or experience match.

Why CoreWeave?

At CoreWeave, we work hard, have fun, and move fast! We're in an exciting stage of hyper-growth that you will not want to miss out on. We're not afraid of a little chaos, and we're constantly learning. Our team cares deeply about how we build our product and how we work together, which is represented through our core values:

  • Be Curious at Your Core
  • Act Like an Owner
  • Empower Employees
  • Deliver Best-in-Class Client Experiences
  • Achieve More Together

We support and encourage an entrepreneurial outlook and independent thinking. We foster an environment that encourages collaboration and provides the opportunity to develop innovative solutions to complex problems. As we get set for take off, the growth opportunities within the organization are constantly expanding. You will be surrounded by some of the best talent in the industry, who will want to learn from you, too. Come join us!

The base salary range for this role is $165,000 to $242,000. The starting salary will be determined based on job-related knowledge, skills, experience, and market location. We strive for both market alignment and internal equity when determining compensation. In addition to base salary, our total rewards package includes a discretionary bonus, equity awards, and a comprehensive benefits program (all based on eligibility).

What We Offer

The range we've posted represents the typical compensation range for this role. To determine actual compensation, we review the market rate for each candidate which can include a variety of factors. These include qualifications, experience, interview performance, and location.

In addition to a competitive salary, we offer a variety of benefits to support your needs, including:

  • Medical, dental, and vision insurance - 100% paid for by CoreWeave
  • Company-paid Life Insurance
  • Voluntary supplemental life insurance
  • Short and long-term disability insurance
  • Flexible Spending Account
  • Health Savings Account
  • Tuition Reimbursement
  • Ability to Participate in Employee Stock Purchase Program (ESPP)
  • Mental Wellness Benefits through Spring Health
  • Family-Forming support provided by Carrot
  • Paid Parental Leave
  • Flexible, full-service childcare support with Kinside
  • 401(k) with a generous employer match
  • Flexible PTO
  • Catered lunch each day in our office and data center locations
  • A casual work environment
  • A work culture focused on innovative disruption

Our Workplace

While we prioritize a hybrid work environment, remote work may be considered for candidates located more than 30 miles from an office, based on role requirements for specialized skill sets. New hires will be invited to attend onboarding at one of our hubs within their first month. Teams also gather quarterly to support collaboration.

California Consumer Privacy Act - California applicants only

CoreWeave is an equal opportunity employer, committed to fostering an inclusive and supportive workplace. All qualified applicants and candidates will receive consideration for employment without regard to race, color, religion, sex, disability, age, sexual orientation, gender identity, national origin, veteran status, or genetic information.

As part of this commitment and consistent with the Americans with Disabilities Act (ADA), CoreWeave will ensure that qualified applicants and candidates with disabilities are provided reasonable accommodations for the hiring process, unless such accommodation would cause an undue hardship. If reasonable accommodation is needed, please contact: View email address on click.appcast.io.

Export Control Compliance

This position requires access to export controlled information. To conform to U.S. Government export regulations applicable to that information, applicant must either be (A) a U.S. person, defined as a (i) U.S. citizen or national, (ii) U.S. lawful permanent resident (green card holder), (iii) refugee under 8 U.S.C. § 1157, or (iv) asylee under 8 U.S.C. § 1158, (B) eligible to access the export controlled information without a required export authorization, or (C) eligible and reasonably likely to obtain the required export authorization from the applicable U.S. government agency. CoreWeave may, for legitimate business reasons, decline to pursue any export licensing process.

Vacancy posted 1 day ago
Similar jobs that could be interesting for youBased on the Senior Software Engineer II, Inference in Sunnyvale, CA vacancy
  • $139k - $204k

     ...Senior Software Engineer I, Inference Sunnyvale, CA / Bellevue, WA CoreWeave is The Essential Cloud for AI™. Built for pioneers by pioneers, CoreWeave...  ...U.S. person, defined as a (i) U.S. citizen or national, (ii) U.S. lawful permanent resident (green card holder), (iii... 
    Senior
    Permanent employment
    Temporary work
    Casual work
    Work at office
    Remote work
    Flexible hours
    Shift work

    CoreWeave

    Sunnyvale, CA
    1 day ago
  • $180k - $225k

     ...gateway for API delivery, AI inference, device fleets, and...  ...our success! We like software that’s serious and...  ...runs entirely on AWS. Engineers develop by SSH’ing into...  .... Compensation Senior Software Engineer...  ...00 Software Engineer II Tier 1 (SF, LA, Seattle... 
    Senior
    Permanent employment
    Full time
    Work at office
    Local area
    Remote work

    GrabJobs

    San Jose, CA
    2 days ago
  • $152k - $241.5k

     ...most challenging problems. We're seeking talented and motivated engineers to join our TensorRT team in developing the industry-leading deep learning inference software for NVIDIA AI accelerators. As a Senior Software Engineer in the TensorRT team, you will be... 
    Senior

    NVIDIA

    Santa Clara, CA
    1 day ago
  • $152k - $241.5k

     ...NVIDIA is the platform upon which every new AI‑powered application is built. We are seeking a Senior Software Engineer – AI Inference to advance open‑source LLM serving by contributing directly to upstream inference engines like vLLM and SGLang-ensuring they run best‑in... 
    Senior

    NVIDIA

    Santa Clara, CA
    1 day ago
  • $184k - $287.5k

     ...NVIDIA seeks a Senior Software Engineer specializing in Deep Learning Inference for our growing team. As a key contributor, you will help design, build, and optimize the GPU-accelerated software that powers today's most sophisticated AI applications. Our team is responsible... 
    Senior
    Remote work

    NVIDIA

    Santa Clara, CA
    4 days ago
  •  ...RL training and SOTA LLM and Multimodal inference at scale across multi-GPU and multi-node...  ...You will collaborate across internal GPU software teams and engage with open-source...  ...THE PERSON: Skilled engineer with strong technical and analytical expertise... 
    Senior

    Advanced Micro Devices , Inc.

    Santa Clara, CA
    5 days ago
  • $152k - $241.5k

     ...We are now looking for a Senior Software Engineer for Deep Learning Inference! Would you like to make a big impact in Deep Learning by helping build a state-of-the-art inference framework for accelerating Deep Learning models, especially Large Language Models, on NVIDIA... 
    Senior

    NVIDIA

    Santa Clara, CA
    1 day ago
  • $139k - $204k

     ...Senior Software Engineer II, Applied Training CoreWeave is The Essential Cloud for AI™. Built for pioneers by pioneers, CoreWeave delivers a platform of technology, tools, and teams that enables innovators to build and scale AI with confidence. Trusted by leading AI... 
    Senior
    Permanent employment
    Temporary work
    Casual work
    Work at office
    Remote work
    Flexible hours

    CoreWeave

    Sunnyvale, CA
    1 day ago
  • $184k - $287.5k

     ...We are seeking highly skilled and motivated software engineers to join us and build AI inference systems that serve large-scale models with extreme efficiency. You’ll architect and implement high-performance inference stacks, optimize GPU kernels and compilers, drive... 
    Senior

    NVIDIA

    Santa Clara, CA
    1 day ago
  • $152k - $241.5k

     ...edge AI technology for safety-critical applications? Join NVIDIA's TensorRT team as a Senior Software Engineer, and be at the forefront of technology, enabling high-performance AI inference solutions for automotive safety and other specialized platforms. Your expertise... 
    Senior

    NVIDIA

    Santa Clara, CA
    4 days ago
  • $193.3k - $261.5k

     ...(AWS) builds AWS Neuron, the software development kit used to accelerate...  ...JAX enabling unparalleled ML inference and training performance....  ...-software boundary, our engineers build systematic infrastructure...  ...-sharing and mentorship. Our senior members enjoy one-on-one mentoring... 
    Senior
    Work experience placement
    Internship
    Local area
    Flexible hours

    Amazon

    Cupertino, CA
    3 days ago
  • $170k - $220k

     ...Senior Software Engineer We are seeking a senior software engineer to play a pivotal role in advancing our engineering efforts. The ideal candidate will have extensive experience in Android and Linux development, with a focus on Kotlin, Java, C++, and Python. This... 
    Senior

    Autoroboto

    Mountain View, CA
    1 day ago
  • $750 per month

     ...within the United States. We're looking for an experienced Senior Software Engineer to join our team and help eliminate the financial complexity...  ..., and highly-capable development team. As a Senior Engineer II, you should be comfortable leading complex, ambiguous projects... 
    Senior
    For contractors
    Work experience placement
    Freelance
    Currently hiring
    Remote work
    Work from home
    Flexible hours

    GrabJobs

    San Jose, CA
    4 days ago
  • $139k - $204k

     ...Senior Software Engineer, Cluster Orchestration CoreWeave is The Essential Cloud for AI™. Built for...  ...that powers AI training and inference at scale. This is an opportunity to help...  ...defined as a (i) U.S. citizen or national, (ii) U.S. lawful permanent resident (green... 
    Senior
    Permanent employment
    Temporary work
    Casual work
    Work at office
    Remote work
    Flexible hours

    CoreWeave

    Sunnyvale, CA
    1 day ago
  • A leading technology company in Santa Clara is seeking a Senior Deep Learning Software Engineer to design and build automated inference solutions. The ideal candidate will have extensive experience with deep learning techniques and software engineering. Key responsibilities... 
    Senior

    NVIDIA Corporation

    Santa Clara, CA
    1 day ago
  • $165k - $242k

     ...drives innovation.  What You’ll Do: Senior engineers are area owners who lead designs, raise...  ...hardware teams to evolve our Kubernetes-native inference platform and meet strict P99 SLAs at...  ...as a (i) U.S. citizen or national, (ii) U.S. lawful permanent resident (green card... 
    Senior
    Permanent employment
    Temporary work
    Casual work
    Work at office
    Remote work
    Flexible hours
    Shift work

    CoreWeave

    Sunnyvale, CA
    more than 2 months ago
  • $165k - $242k

     ...CRWV) in March 2025. Learn more at What You'll Do: As a Senior Software Engineer II (IC4) on the AI Workload Orchestration team, you will...  ...Kueue, Volcano, and Ray to support modern AI training and inference workflows. It complements SUNK (Slurm on Kubernetes) by... 
    Senior
    Permanent employment
    Temporary work
    Casual work
    Work at office
    Remote work
    Flexible hours

    CoreWeave

    Sunnyvale, CA
    6 days ago
  • $272k - $431.25k

     ...architecture and hands-on delivery across system software, drivers, and CUDA to make profiling...  .... Set technical direction for an engineering team; mentor engineers, drive technical...  ...Hands-on experience tuning ML training/inference loops based on deep profiling analysis,... 
    Senior

    NVIDIA

    Santa Clara, CA
    5 days ago
  • $70.3k - $205k

     ...Senior Engineer II-Software Microchip Technology Inc. is a leading provider of embedded control applications. Our product portfolio comprises general purpose and specialized 8-bit, 16-bit, and 32-bit microcontrollers, 32-bit microprocessors, field-programmable gate... 
    Senior

    Microchip Technology

    Los Gatos, CA
    1 day ago
  • $152k - $241.5k

     ...limits of real-time large language model inference? Join NVIDIA's TensorRT Edge-LLM team...  ...automotive and robotics. We build the software stack that enables Large Language, Vision...  ...Computer Science, Electrical/Computer Engineering, or a closely related field. ~4+ years... 
    Senior
    Remote work

    NVIDIA

    Santa Clara, CA
    4 days ago
  • $184k - $287.5k

     ...Senior Software Engineer For Compiler Team NVIDIA's GPUs are at the core of modern AI infrastructure, from training large-scale models to running inference in production. That position depends on software as much as hardware, and compiler engineering is a big part... 
    Senior
    Work experience placement

    NVIDIA

    Santa Clara, CA
    1 day ago
  •  ...About the role: As a Senior Software Engineer on Samsara’s Route Execution team, you’ll build the systems that power route planning, optimization, dispatch, and real-time tracking for fleets across logistics, field services, and delivery. You’ll work across the stack,... 
    Senior
    Immediate start
    Remote work

    GrabJobs

    San Jose, CA
    4 days ago
  • $152k - $241.5k

     ...searching for highly motivated, creative engineers to join the Platform Software team. You will work with a team of...  ...across engineering levels and senior management. Strong C/C++ and Python...  ...GPU SW stack, LLM training and inference, and Arm architecture performance —... 
    Senior

    NVIDIA

    Santa Clara, CA
    3 days ago
  •  ...Introduction At IBM Software, we transform client challenges into solutions. Building...  ...changes the world. On the HashiCorp engineering team, we build the Infrastructure Cloud...  ...and responsibilities We’re looking for Senior Engineers with a deep backend focus to join... 
    Senior
    Remote work

    IBM

    San Jose, CA
    1 day ago
  •  ...Senior Software Engineer In Test At Intuitive, we are united behind our mission: we believe that minimally invasive care is life-enhancing care...  ...government's licensing process can take 3 to 6+ months) or (ii) implement a Technology Control Plan ("TCP") (note:... 
    Senior
    Work experience placement
    Local area
    Flexible hours

    Intuitive

    Sunnyvale, CA
    1 day ago
  • $155.42k - $205.9k

     ...Description About the Team: The ML Inference Platform is part of the AV ML...  ...About the Role: We are seeking a Senior ML Infrastructure engineer to help build and scale robust platforms...  ...and implement core platform backend software components. Collaborate with ML... 
    Senior
    Local area
    Remote work
    Work from home
    Relocation
    Relocation package
    Flexible hours

    General Motors

    Mountain View, CA
    4 days ago
  • $139k - $242k

     ...Senior Software Engineer, Sandboxes & Virtualization Livingston, NJ / New York, NY / Sunnyvale, CA / Bellevue, WA / San Francisco, CA CoreWeave...  ...a U.S. person, defined as a (i) U.S. citizen or national, (ii) U.S. lawful permanent resident (green card holder), (iii) refugee... 
    Senior
    Permanent employment
    Temporary work
    Casual work
    Work at office
    Remote work
    Flexible hours

    CoreWeave

    Sunnyvale, CA
    1 day ago
  • $128.7k - $261.3k

     ...About the Team The Model Deployment & Inference Solutions team in GM AV deploys machine...  ...workflows currently performed manually by engineers. Build the developer experience that...  ...Experience designing clean, well-tested software with clear interfaces and good... 
    Senior
    Local area
    Remote work
    Work from home
    Relocation package
    Flexible hours
    Shift work

    General Motors

    Mountain View, CA
    1 day ago
  • $135.8k - $237.05k

     ...Mountain View, CA, USA Senior Backend Engineer, ML Inference Systems Location Mountain View, CA, USA Department AI & Machine Learning Requisition ID JOBREQ-2616050 Role description The opportunity Every day, we connect billions of players with... 
    Senior
    Work at office
    Worldwide
    Relocation package

    Unity Technologies

    Mountain View, CA
    4 days ago
  • A tech company in Mountain View, CA, seeks a Software Engineer II to manage the full lifecycle of software development, focusing on web applications and backend services. This role involves building modern, responsive web applications and backend web services, working... 
    Remote job

    Syllable Corporation

    Mountain View, CA
    5 days ago

Do you want to receive more vacancies?

Subscribe and receive similar vacancies to Senior Software Engineer II, Inference. Be the first to apply!