Senior Software Engineer I, Inference
$139k - $204kCoreWeave
Job Description
Job Description
CoreWeave is The Essential Cloud for AI™. Built for pioneers by pioneers, CoreWeave delivers a platform of technology, tools, and teams that enables innovators to build and scale AI with confidence. Trusted by leading AI labs, startups, and global enterprises, CoreWeave combines superior infrastructure performance with deep technical expertise to accelerate breakthroughs and turn compute into capability. Founded in 2017, CoreWeave became a publicly traded company (Nasdaq: CRWV) in March 2025. Learn more at
What You'll Do:Senior engineers are area owners who lead designs, raise engineering standards, and deliver measurable improvements to latency, throughput, and reliability across multiple services. You'll partner with product, orchestration, and hardware teams to evolve our Kubernetes-native inference platform and meet strict P99 SLAs at scale.
About the role:- Lead design reviews and drive architecture within the team; decompose multi-service work into clear milestones.
- Define and own SLIs/SLOs; ensure post-incident actions land and reliability improves release-over-release.
- Implement advanced optimizations (e.g., micro-batch schedulers, speculative decoding, KV-cache reuse) and quantify impact.
- Strengthen incident posture: capacity planning, autoscaling policy, graceful degradation, rollback/traffic-shift strategies.
- Mentor IC1/IC2 engineers; review cross-team designs and elevate coding/testing standards.
- For IC4: own an area spanning multiple services and teams (e.g., request routing & adaptive scheduling, cost-per-token analytics, GPU resource isolation).
- IC3: ~3–5 years; IC4: ~5–8 years industry experience building distributed systems or cloud services.
- Computer Science or
- Strong coding in Python or Go (C++ a plus) and deep familiarity with networked systems and performance.
- Hands-on experience with Kubernetes at production scale, CI/CD, and observability stacks (Prometheus, Grafana, OpenTelemetry).
- Practical knowledge of inference internals: batching, caching, mixed precision (BF16/FP8), streaming token delivery.
- Proven track record improving tail latency (P95/P99) and service reliability through metrics-driven work.
- Bachelor's or Master's in CS, EE, or related field (or equivalent practical experience).
Preferred:
- Contributions to inference frameworks (vLLM, Triton, TensorRT-LLM, Ray Serve, TorchServe).
- Experience with CUDA kernels, NCCL/SHARP, RDMA/NUMA, or GPU interconnect topologies.
- Leading multi-team initiatives or partnering with customers on mission-critical launches.
Wondering if you're a good fit? We believe in investing in our people and value candidates who can bring their diverse experiences to our teams – even if you aren't a 100% skill or experience match.
Why CoreWeave?At CoreWeave, we work hard, have fun, and move fast! We're in an exciting stage of hyper-growth that you will not want to miss out on. We're not afraid of a little chaos, and we're constantly learning. Our team cares deeply about how we build our product and how we work together, which is represented through our core values:
- Be Curious at Your Core
- Act Like an Owner
- Empower Employees
- Deliver Best-in-Class Client Experiences
- Achieve More Together
We support and encourage an entrepreneurial outlook and independent thinking. We foster an environment that encourages collaboration and provides the opportunity to develop innovative solutions to complex problems. As we get set for takeoff, the growth opportunities within the organization are constantly expanding. You will be surrounded by some of the best talent in the industry, who will want to learn from you, too. Come join us!
The base salary range for this role is $139,000 to $204,000. The starting salary will be determined based on job-related knowledge, skills, experience, and market location. We strive for both market alignment and internal equity when determining compensation. In addition to base salary, our total rewards package includes a discretionary bonus, equity awards, and a comprehensive benefits program (all based on eligibility).
What We Offer
The range we've posted represents the typical compensation range for this role. To determine actual compensation, we review the market rate for each candidate which can include a variety of factors. These include qualifications, experience, interview performance, and location.
In addition to a competitive salary, we offer a variety of benefits to support your needs. The benefits below reflect our US-based offerings; for roles in other locations, benefits vary and are shared during the hiring process. These include:
- Medical, dental, and vision insurance - 100% paid for by CoreWeave
- Company-paid Life Insurance
- Voluntary supplemental life insurance
- Short and long-term disability insurance
- Flexible Spending Account
- Health Savings Account
- Tuition Reimbursement
- Ability to Participate in Employee Stock Purchase Program (ESPP)
- Mental Wellness Benefits through Spring Health
- Family-Forming support provided by Carrot
- Paid Parental Leave
- Flexible, full-service childcare support with Kinside
- 401(k) with a generous employer match
- Flexible PTO
- Catered lunch each day in our office and data center locations
- A casual work environment
- A work culture focused on innovative disruption
California Applicants
California Consumer Privacy Act
Equal Opportunity & Accommodations
CoreWeave is an equal opportunity employer, committed to fostering an inclusive and supportive workplace. All qualified applicants and candidates will receive consideration for employment without regard to race, color, religion, sex, disability, age, sexual orientation, gender identity, national origin, veteran status, or genetic information.
As part of this commitment and consistent with the Americans with Disabilities Act (ADA) , CoreWeave will ensure that qualified applicants and candidates with disabilities are provided reasonable accommodations for the hiring process, unless such accommodation would cause an undue hardship. If reasonable accommodation is needed, please contact: View email address on ziprecruiter.com.
Export Control Compliance
This position requires access to export controlled information. To conform to U.S. Government export regulations applicable to that information, applicant must either be (A) a U.S. person, defined as a (i) U.S. citizen or national, (ii) U.S. lawful permanent resident (green card holder), (iii) refugee under 8 U.S.C. § 1157, or (iv) asylee under 8 U.S.C. § 1158, (B) eligible to access the export controlled information without a required export authorization, or (C) eligible and reasonably likely to obtain the required export authorization from the applicable U.S. government agency. CoreWeave may, for legitimate business reasons, decline to pursue any export licensing process.
$165k - $242k
...Senior Software Engineer II, Inference Sunnyvale, CA / Bellevue, WA CoreWeave is The Essential Cloud for AI™. Built for pioneers by pioneers, CoreWeave delivers a platform of technology, tools, and teams that enables innovators to build and scale AI with confidence...SeniorPermanent employmentTemporary workCasual workWork at officeRemote workFlexible hoursShift work$139k - $204k
...Senior Software Engineer I, Inference Sunnyvale, CA / Bellevue, WA CoreWeave is The Essential Cloud for AI™. Built for pioneers by pioneers, CoreWeave delivers a platform of technology, tools, and teams that enables innovators to build and scale AI with confidence....SeniorPermanent employmentTemporary workCasual workWork at officeRemote workFlexible hoursShift work$152k - $241.5k
...most challenging problems. We're seeking talented and motivated engineers to join our TensorRT team in developing the industry-leading deep learning inference software for NVIDIA AI accelerators. As a Senior Software Engineer in the TensorRT team, you will be...Senior- ...advance your career. The Role As a senior member of the LLM inference framework team, you will be... ...sits at the intersection of inference engines, distributed systems, and GPU runtime... ...architectures and kernel development Software Engineering ~ Expertise in...Senior
$152k - $241.5k
...We are now looking for a Senior Software Engineer for Deep Learning Inference! Would you like to make a big impact in Deep Learning by helping build a state-of-the-art inference framework for accelerating Deep Learning models, especially Large Language Models, on NVIDIA...Senior- ...RL training and SOTA LLM and Multimodal inference at scale across multi-GPU and multi-node... ...You will collaborate across internal GPU software teams and engage with open-source... ...THE PERSON: Skilled engineer with strong technical and analytical expertise...Senior
$152k - $241.5k
...NVIDIA is the platform upon which every new AI‑powered application is built. We are seeking a Senior Software Engineer – AI Inference to advance open‑source LLM serving by contributing directly to upstream inference engines like vLLM and SGLang-ensuring they run best‑in...SeniorRemote work$184k - $287.5k
...We are seeking highly skilled and motivated software engineers to join us and build AI inference systems that serve large-scale models with extreme efficiency. You’ll architect and implement high-performance inference stacks, optimize GPU kernels and compilers, drive...Senior$152k - $241.5k
...edge AI technology for safety-critical applications? Join NVIDIA's TensorRT team as a Senior Software Engineer, and be at the forefront of technology, enabling high-performance AI inference solutions for automotive safety and other specialized platforms. Your expertise...Senior$193.3k - $261.5k
...(AWS) builds AWS Neuron, the software development kit used to accelerate... ...JAX enabling unparalleled ML inference and training performance.... ...-software boundary, our engineers build systematic infrastructure... ...-sharing and mentorship. Our senior members enjoy one-on-one mentoring...SeniorWork experience placementInternshipLocal areaFlexible hours$272k - $431.25k
...architecture and hands-on delivery across system software, drivers, and CUDA to make profiling... .... Set technical direction for an engineering team; mentor engineers, drive technical... ...Hands-on experience tuning ML training/inference loops based on deep profiling analysis,...Senior$152k - $241.5k
...limits of real-time large language model inference? Join NVIDIA's TensorRT Edge-LLM team... ...automotive and robotics. We build the software stack that enables Large Language, Vision... ...Computer Science, Electrical/Computer Engineering, or a closely related field. ~4+ years...SeniorRemote work$184k - $287.5k
...infrastructure, from training large-scale models to running inference in production. That position depends on software as much as hardware, and compiler engineering is a big part of what makes it work. We're hiring senior software engineers for a compiler team within NVIDIA...SeniorWork experience placement$152k - $241.5k
...searching for highly motivated, creative engineers to join the Platform Software team. You will work with a team of... ...across engineering levels and senior management. Strong C/C++ and Python... ...GPU SW stack, LLM training and inference, and Arm architecture performance —...Senior$128.7k - $261.3k
...About the Team The Model Deployment & Inference Solutions team in GM AV deploys machine... ...workflows currently performed manually by engineers. Build the developer experience that... ...Experience designing clean, well-tested software with clear interfaces and good...SeniorLocal areaRemote workWork from homeRelocation packageFlexible hoursShift work$150k - $250k
...Senior-Level Engineer Applied Intuition, Inc. is powering the future of physical AI. Founded in 2017 and now valued at $15 billion, the... ...scale GPU pipelines using CUDA for neural scene training, inference, and real-time rasterization — including data layout, memory...SeniorFull timeFor contractorsFor subcontractorCasual workWork at officeRemote workDay shift$135.8k - $237.05k
...Mountain View, CA, USA Senior Backend Engineer, ML Inference Systems Location Mountain View, CA, USA Department AI & Machine Learning Requisition ID JOBREQ-2616050 Role description The opportunity Every day, we connect billions of players with...SeniorWork at officeWorldwideRelocation package$165k - $242k
...Nasdaq: CRWV) in March 2025. Learn more at What You'll Do: Senior engineers are area owners who lead designs, raise engineering... ...orchestration, and hardware teams to evolve our Kubernetes-native inference platform and meet strict P99 SLAs at scale. About the role...SeniorPermanent employmentTemporary workCasual workWork at officeFlexible hoursShift work$170k - $216k
...developer velocity. We're looking for a software engineer to join the team to build and maintain the... ...will report to the Head of ML Platform- Senior Staff Software Engineer. You will: Develop Waymo's inference platform to make it scalable, high throughput...Full timeRemote work$152k - $241.5k
...We are looking for a Senior System Software Engineer to work on Dynamo-Triton Inference Server ( . NVIDIA is hiring software engineers for its GPU-accelerated deep learning software team. Academic and commercial groups around the world are using GPUs to power a revolution...Senior$193.3k - $261.5k
...AWS Neuron is the software stack powering AWS Inferentia and Trainium machine learning accelerators... ...to deliver high-performance, low-cost inference at scale. The Neuron Serving team... .... We are seeking a Software Development Engineer to lead and architect our next-...InternshipLocal areaFlexible hours$224k - $356.5k
...We are looking for a Senior Deep Learning Software Engineer to design and build our automated inference and deployment solution. As part of the team, you will be instrumental in defining a scalable architecture for DL inference with emphasis on ease-of-use and compute...Senior$168k - $270.25k
...NVIDIA GPU Architecture Group is seeking a senior software engineer to automate and optimize performance analysis workflows for AI training and inference workloads. You will not only perform analysis but also reshape how it's done, building tools and workflows that scale...SeniorWork experience placement$184k - $287.5k
...We are looking for a skilled Agentic AI Software Engineer to join our team. The ideal candidate is passionate about building autonomous,... ...across the agentic AI ecosystem to enable Day-0 NVIDIA model and inference support in agent orchestration platforms and tooling....Senior$152k - $241.5k
...applications and industries. Within our software stack, CUTLASS stands out as a popular... ...state-of-the-art deep learning models’ inference and training passes to identify key GPU... ...PhD degree in Computer Science, Computer Engineering, or related field (or equivalent...Senior$238k - $302k
...that are core to our autonomous driving software. We help our partners by offering the best... ...driving. We are looking for engineers with ML system expertise to help us train... ...and implement optimizations to improve inference speed and resource utilization. Collaborate...SeniorFull timeRemote work$184k - $287.5k
...We're looking for outstanding AI systems engineers to develop groundbreaking technologies in the inference systems software stack! We build innovative AI systems software to accelerate for AI inference. As a member of the team, you'll develop libraries, code generators...SeniorRemote work$193k - $291k
...Senior Software Engineer, Middleware Mountain View, California (HQ) Nuro is a self-driving technology company on a mission to make autonomy... ...or other robotics frameworks Robotics experience, ML inference optimization experience, computer architecture experience...Senior$152k - $241.5k
...We are seeking a Senior Software Engineer to drive integration of the NVIDIA Grove project within Dynamo and across a set of leading open-source... ...Grove features to work smoothly across training and inference stacks. Partner with framework owners to upstream changes...SeniorRemote work$168k - $270.25k
...Senior Engineer For Factory Infrastructure And Automation NVIDIA is the platform upon which... ...infrastructure and automation for NVIDIA Inference Microservices (NIMs). The right person... ...in heterogeneous hardware and software environments. You will influence and drive...Senior
Do you want to receive more vacancies?
Subscribe and receive similar vacancies to Senior Software Engineer I, Inference. Be the first to apply!
- software engineer full time Sunnyvale, CA
- startup software engineer Sunnyvale, CA
- rust software engineer Sunnyvale, CA
- work from home software developer Sunnyvale, CA
- software developer Sunnyvale, CA
- software development engineer aws Sunnyvale, CA
- software qa engineer Sunnyvale, CA
- ngo software engineer Sunnyvale, CA
- software engineer staff Sunnyvale, CA
- part time software developer Sunnyvale, CA

