Senior Software Engineer II, Inference
$165k - $242kCoreWeave
Senior Software Engineer II, Inference
Sunnyvale, CA / Bellevue, WA CoreWeave is The Essential Cloud for AI™. Built for pioneers by pioneers, CoreWeave delivers a platform of technology, tools, and teams that enables innovators to build and scale AI with confidence. Trusted by leading AI labs, startups, and global enterprises, CoreWeave combines superior infrastructure performance with deep technical expertise to accelerate breakthroughs and turn compute into capability. Founded in 2017, CoreWeave became a publicly traded company (Nasdaq: CRWV) in March 2025. Learn more at
What You'll Do:
Senior engineers are area owners who lead designs, raise engineering standards, and deliver measurable improvements to latency, throughput, and reliability across multiple services. You'll partner with product, orchestration, and hardware teams to evolve our Kubernetes-native inference platform and meet strict P99 SLAs at scale.
About the Role:
- Lead design reviews and drive architecture within the team; decompose multi-service work into clear milestones.
- Define and own SLIs/SLOs; ensure post-incident actions land and reliability improves release-over-release.
- Implement advanced optimizations (e.g., micro-batch schedulers, speculative decoding, KV-cache reuse) and quantify impact.
- Strengthen incident posture: capacity planning, autoscaling policy, graceful degradation, rollback/traffic-shift strategies.
- Mentor IC1/IC2 engineers; review cross-team designs and elevate coding/testing standards.
- Own an area spanning multiple services and teams (e.g., request routing & adaptive scheduling, cost-per-token analytics, GPU resource isolation).
Who You Are:
- ~ 5-8 years industry experience building distributed systems or cloud services.
- Strong coding in Python or Go (C++ a plus) and deep familiarity with networked systems and performance.
- Optimize end-to-end ML system performance by developing and tuning CUDA kernels, reducing model latency, maximizing compute and memory bandwidth utilization, and leveraging custom accelerators for high-efficiency workloads
- Hands-on experience with Kubernetes at production scale, CI/CD, and observability stacks (Prometheus, Grafana, OpenTelemetry).
- Practical knowledge of inference internals: batching, caching, mixed precision (BF16/FP8), streaming token delivery.
- Proven track record improving tail latency (P95/P99) and service reliability through metrics-driven work.
Preferred:
- Contributions to inference frameworks (vLLM, Triton, TensorRT-LLM, Ray Serve, TorchServe).
- Experience with CUDA kernels, NCCL/SHARP, RDMA/NUMA, or GPU interconnect topologies.
- Leading multi-team initiatives or partnering with customers on mission-critical launches.
Wondering if you're a good fit? We believe in investing in our people, and value candidates who can bring their own diversified experiences to our teams – even if you aren't a 100% skill or experience match.
Why CoreWeave?
At CoreWeave, we work hard, have fun, and move fast! We're in an exciting stage of hyper-growth that you will not want to miss out on. We're not afraid of a little chaos, and we're constantly learning. Our team cares deeply about how we build our product and how we work together, which is represented through our core values:
- Be Curious at Your Core
- Act Like an Owner
- Empower Employees
- Deliver Best-in-Class Client Experiences
- Achieve More Together
We support and encourage an entrepreneurial outlook and independent thinking. We foster an environment that encourages collaboration and provides the opportunity to develop innovative solutions to complex problems. As we get set for take off, the growth opportunities within the organization are constantly expanding. You will be surrounded by some of the best talent in the industry, who will want to learn from you, too. Come join us!
The base salary range for this role is $165,000 to $242,000. The starting salary will be determined based on job-related knowledge, skills, experience, and market location. We strive for both market alignment and internal equity when determining compensation. In addition to base salary, our total rewards package includes a discretionary bonus, equity awards, and a comprehensive benefits program (all based on eligibility).
What We Offer
The range we've posted represents the typical compensation range for this role. To determine actual compensation, we review the market rate for each candidate which can include a variety of factors. These include qualifications, experience, interview performance, and location.
In addition to a competitive salary, we offer a variety of benefits to support your needs, including:
- Medical, dental, and vision insurance - 100% paid for by CoreWeave
- Company-paid Life Insurance
- Voluntary supplemental life insurance
- Short and long-term disability insurance
- Flexible Spending Account
- Health Savings Account
- Tuition Reimbursement
- Ability to Participate in Employee Stock Purchase Program (ESPP)
- Mental Wellness Benefits through Spring Health
- Family-Forming support provided by Carrot
- Paid Parental Leave
- Flexible, full-service childcare support with Kinside
- 401(k) with a generous employer match
- Flexible PTO
- Catered lunch each day in our office and data center locations
- A casual work environment
- A work culture focused on innovative disruption
Our Workplace
While we prioritize a hybrid work environment, remote work may be considered for candidates located more than 30 miles from an office, based on role requirements for specialized skill sets. New hires will be invited to attend onboarding at one of our hubs within their first month. Teams also gather quarterly to support collaboration.
California Consumer Privacy Act - California applicants only
CoreWeave is an equal opportunity employer, committed to fostering an inclusive and supportive workplace. All qualified applicants and candidates will receive consideration for employment without regard to race, color, religion, sex, disability, age, sexual orientation, gender identity, national origin, veteran status, or genetic information.
As part of this commitment and consistent with the Americans with Disabilities Act (ADA), CoreWeave will ensure that qualified applicants and candidates with disabilities are provided reasonable accommodations for the hiring process, unless such accommodation would cause an undue hardship. If reasonable accommodation is needed, please contact: View email address on click.appcast.io.
Export Control Compliance
This position requires access to export controlled information. To conform to U.S. Government export regulations applicable to that information, applicant must either be (A) a U.S. person, defined as a (i) U.S. citizen or national, (ii) U.S. lawful permanent resident (green card holder), (iii) refugee under 8 U.S.C. § 1157, or (iv) asylee under 8 U.S.C. § 1158, (B) eligible to access the export controlled information without a required export authorization, or (C) eligible and reasonably likely to obtain the required export authorization from the applicable U.S. government agency. CoreWeave may, for legitimate business reasons, decline to pursue any export licensing process.
$139k - $204k
...Senior Software Engineer I, Inference Sunnyvale, CA / Bellevue, WA CoreWeave is The Essential Cloud for AI™. Built for pioneers by pioneers, CoreWeave... ...U.S. person, defined as a (i) U.S. citizen or national, (ii) U.S. lawful permanent resident (green card holder), (iii...SeniorPermanent employmentTemporary workCasual workWork at officeRemote workFlexible hoursShift work$180k - $225k
...gateway for API delivery, AI inference, device fleets, and... ...our success! We like software that’s serious and... ...runs entirely on AWS. Engineers develop by SSH’ing into... .... Compensation Senior Software Engineer... ...00 Software Engineer II Tier 1 (SF, LA, Seattle...SeniorPermanent employmentFull timeWork at officeLocal areaRemote work$152k - $241.5k
...most challenging problems. We're seeking talented and motivated engineers to join our TensorRT team in developing the industry-leading deep learning inference software for NVIDIA AI accelerators. As a Senior Software Engineer in the TensorRT team, you will be...Senior$152k - $241.5k
...NVIDIA is the platform upon which every new AI‑powered application is built. We are seeking a Senior Software Engineer – AI Inference to advance open‑source LLM serving by contributing directly to upstream inference engines like vLLM and SGLang-ensuring they run best‑in...Senior$184k - $287.5k
...NVIDIA seeks a Senior Software Engineer specializing in Deep Learning Inference for our growing team. As a key contributor, you will help design, build, and optimize the GPU-accelerated software that powers today's most sophisticated AI applications. Our team is responsible...SeniorRemote work- ...RL training and SOTA LLM and Multimodal inference at scale across multi-GPU and multi-node... ...You will collaborate across internal GPU software teams and engage with open-source... ...THE PERSON: Skilled engineer with strong technical and analytical expertise...Senior
$152k - $241.5k
...We are now looking for a Senior Software Engineer for Deep Learning Inference! Would you like to make a big impact in Deep Learning by helping build a state-of-the-art inference framework for accelerating Deep Learning models, especially Large Language Models, on NVIDIA...Senior$139k - $204k
...Senior Software Engineer II, Applied Training CoreWeave is The Essential Cloud for AI™. Built for pioneers by pioneers, CoreWeave delivers a platform of technology, tools, and teams that enables innovators to build and scale AI with confidence. Trusted by leading AI...SeniorPermanent employmentTemporary workCasual workWork at officeRemote workFlexible hours$184k - $287.5k
...We are seeking highly skilled and motivated software engineers to join us and build AI inference systems that serve large-scale models with extreme efficiency. You’ll architect and implement high-performance inference stacks, optimize GPU kernels and compilers, drive...Senior$152k - $241.5k
...edge AI technology for safety-critical applications? Join NVIDIA's TensorRT team as a Senior Software Engineer, and be at the forefront of technology, enabling high-performance AI inference solutions for automotive safety and other specialized platforms. Your expertise...Senior$193.3k - $261.5k
...(AWS) builds AWS Neuron, the software development kit used to accelerate... ...JAX enabling unparalleled ML inference and training performance.... ...-software boundary, our engineers build systematic infrastructure... ...-sharing and mentorship. Our senior members enjoy one-on-one mentoring...SeniorWork experience placementInternshipLocal areaFlexible hours$170k - $220k
...Senior Software Engineer We are seeking a senior software engineer to play a pivotal role in advancing our engineering efforts. The ideal candidate will have extensive experience in Android and Linux development, with a focus on Kotlin, Java, C++, and Python. This...Senior$750 per month
...within the United States. We're looking for an experienced Senior Software Engineer to join our team and help eliminate the financial complexity... ..., and highly-capable development team. As a Senior Engineer II, you should be comfortable leading complex, ambiguous projects...SeniorFor contractorsWork experience placementFreelanceCurrently hiringRemote workWork from homeFlexible hours$139k - $204k
...Senior Software Engineer, Cluster Orchestration CoreWeave is The Essential Cloud for AI™. Built for... ...that powers AI training and inference at scale. This is an opportunity to help... ...defined as a (i) U.S. citizen or national, (ii) U.S. lawful permanent resident (green...SeniorPermanent employmentTemporary workCasual workWork at officeRemote workFlexible hours- A leading technology company in Santa Clara is seeking a Senior Deep Learning Software Engineer to design and build automated inference solutions. The ideal candidate will have extensive experience with deep learning techniques and software engineering. Key responsibilities...Senior
$165k - $242k
...drives innovation. What You’ll Do: Senior engineers are area owners who lead designs, raise... ...hardware teams to evolve our Kubernetes-native inference platform and meet strict P99 SLAs at... ...as a (i) U.S. citizen or national, (ii) U.S. lawful permanent resident (green card...SeniorPermanent employmentTemporary workCasual workWork at officeRemote workFlexible hoursShift work$165k - $242k
...CRWV) in March 2025. Learn more at What You'll Do: As a Senior Software Engineer II (IC4) on the AI Workload Orchestration team, you will... ...Kueue, Volcano, and Ray to support modern AI training and inference workflows. It complements SUNK (Slurm on Kubernetes) by...SeniorPermanent employmentTemporary workCasual workWork at officeRemote workFlexible hours$272k - $431.25k
...architecture and hands-on delivery across system software, drivers, and CUDA to make profiling... .... Set technical direction for an engineering team; mentor engineers, drive technical... ...Hands-on experience tuning ML training/inference loops based on deep profiling analysis,...Senior$70.3k - $205k
...Senior Engineer II-Software Microchip Technology Inc. is a leading provider of embedded control applications. Our product portfolio comprises general purpose and specialized 8-bit, 16-bit, and 32-bit microcontrollers, 32-bit microprocessors, field-programmable gate...Senior$152k - $241.5k
...limits of real-time large language model inference? Join NVIDIA's TensorRT Edge-LLM team... ...automotive and robotics. We build the software stack that enables Large Language, Vision... ...Computer Science, Electrical/Computer Engineering, or a closely related field. ~4+ years...SeniorRemote work$184k - $287.5k
...Senior Software Engineer For Compiler Team NVIDIA's GPUs are at the core of modern AI infrastructure, from training large-scale models to running inference in production. That position depends on software as much as hardware, and compiler engineering is a big part...SeniorWork experience placement- ...About the role: As a Senior Software Engineer on Samsara’s Route Execution team, you’ll build the systems that power route planning, optimization, dispatch, and real-time tracking for fleets across logistics, field services, and delivery. You’ll work across the stack,...SeniorImmediate startRemote work
$152k - $241.5k
...searching for highly motivated, creative engineers to join the Platform Software team. You will work with a team of... ...across engineering levels and senior management. Strong C/C++ and Python... ...GPU SW stack, LLM training and inference, and Arm architecture performance —...Senior- ...Introduction At IBM Software, we transform client challenges into solutions. Building... ...changes the world. On the HashiCorp engineering team, we build the Infrastructure Cloud... ...and responsibilities We’re looking for Senior Engineers with a deep backend focus to join...SeniorRemote work
- ...Senior Software Engineer In Test At Intuitive, we are united behind our mission: we believe that minimally invasive care is life-enhancing care... ...government's licensing process can take 3 to 6+ months) or (ii) implement a Technology Control Plan ("TCP") (note:...SeniorWork experience placementLocal areaFlexible hours
$155.42k - $205.9k
...Description About the Team: The ML Inference Platform is part of the AV ML... ...About the Role: We are seeking a Senior ML Infrastructure engineer to help build and scale robust platforms... ...and implement core platform backend software components. Collaborate with ML...SeniorLocal areaRemote workWork from homeRelocationRelocation packageFlexible hours$139k - $242k
...Senior Software Engineer, Sandboxes & Virtualization Livingston, NJ / New York, NY / Sunnyvale, CA / Bellevue, WA / San Francisco, CA CoreWeave... ...a U.S. person, defined as a (i) U.S. citizen or national, (ii) U.S. lawful permanent resident (green card holder), (iii) refugee...SeniorPermanent employmentTemporary workCasual workWork at officeRemote workFlexible hours$128.7k - $261.3k
...About the Team The Model Deployment & Inference Solutions team in GM AV deploys machine... ...workflows currently performed manually by engineers. Build the developer experience that... ...Experience designing clean, well-tested software with clear interfaces and good...SeniorLocal areaRemote workWork from homeRelocation packageFlexible hoursShift work$135.8k - $237.05k
...Mountain View, CA, USA Senior Backend Engineer, ML Inference Systems Location Mountain View, CA, USA Department AI & Machine Learning Requisition ID JOBREQ-2616050 Role description The opportunity Every day, we connect billions of players with...SeniorWork at officeWorldwideRelocation package- A tech company in Mountain View, CA, seeks a Software Engineer II to manage the full lifecycle of software development, focusing on web applications and backend services. This role involves building modern, responsive web applications and backend web services, working...Remote job
Do you want to receive more vacancies?
Subscribe and receive similar vacancies to Senior Software Engineer II, Inference. Be the first to apply!
- graduate software developer Sunnyvale, CA
- rust software engineer Sunnyvale, CA
- senior software design engineer Sunnyvale, CA
- software engineer student Sunnyvale, CA
- software engineer amazon Sunnyvale, CA
- software developer positions Sunnyvale, CA
- software engineer full time Sunnyvale, CA
- software qa engineer Sunnyvale, CA
- new graduate software engineer Sunnyvale, CA
- junior software developer Sunnyvale, CA

