Senior Software Engineer I, Inference
$139k - $204kDormont Manufacturing Co
What You’ll Do: Senior engineers are area owners who lead designs, raise engineering standards, and deliver measurable improvements to latency, throughput, and reliability across multiple services. You’ll partner with product, orchestration, and hardware teams to evolve our Kubernetes-native inference platform and meet strict P99 SLAs at scale. About the role: Lead design reviews and drive architecture within the team; decompose multi-service work into clear milestones. Define and own SLIs/SLOs; ensure post-incident actions land and reliability improves release-over-release. Implement advanced optimizations (e.g., micro-batch schedulers, speculative decoding, KV-cache reuse) and quantify impact. Strengthen incident posture: capacity planning, autoscaling policy, graceful degradation, rollback/traffic-shift strategies. Mentor IC1/IC2 engineers; review cross-team designs and elevate coding/testing standards. For IC4: own an area spanning multiple services and teams (e.g., request routing & adaptive scheduling, cost-per-token analytics, GPU resource isolation). Who You Are: IC3: ~3–5 years; IC4: ~5–8 years industry experience building distributed systems or cloud services. Strong coding in Python or Go (C++ a plus) and deep familiarity with networked systems and performance. Hands‑on experience with Kubernetes at production scale, CI/CD, and observability stacks (Prometheus, Grafana, OpenTelemetry). Practical knowledge of inference internals: batching, caching, mixed precision (BF16/FP8), streaming token delivery. Proven track record improving tail latency (P95/P99) and service reliability through metrics‑driven work. Preferred: Contributions to inference frameworks (vLLM, Triton, TensorRT-LLM, Ray Serve, TorchServe). Experience with CUDA kernels, NCCL/SHARP, RDMA/NUMA, or GPU interconnect topologies. Leading multi‑team initiatives or partnering with customers on mission‑critical launches. Benefits and Compensation Base salary range for this role is $139,000 to $204,000. The starting salary will be determined based on job‑related knowledge, skills, experience, and market location. In addition to base salary, the total rewards package includes a discretionary bonus, equity awards, and a comprehensive benefits program (all based on eligibility). Medical, dental, and vision insurance – 100% paid for by CoreWeave Company‑paid life insurance Voluntary supplemental life insurance Short and long‑term disability insurance Flexible Spending Account Health Savings Account Tuition Reimbursement Employee Stock Purchase Program (ESP) participation Mental wellness benefits via Spring Health Family‑Forming support via Carrot Paid parental leave Flexible, full‑service childcare support with Kinside 401(k) with generous employer match Flexible PTO Catered lunch each day in office and data center locations Casual work environment Work culture focused on innovative disruption Equal Opportunity Employer CoreWeave is an equal opportunity employer, committed to fostering an inclusive and supportive workplace. All qualified applicants and candidates will receive consideration for employment without regard to race, color, religion, sex, disability, age, sexual orientation, gender identity, national origin, veteran status, or genetic information. Americans with Disabilities Act (ADA) Compliance CoreWeave will ensure that qualified applicants and candidates with disabilities are provided reasonable accommodations for the hiring process, unless such accommodation would cause an undue hardship. If reasonable accommodation is needed, please contact: View email address on click.appcast.io. Export Control Compliance This position requires access to export controlled information. To conform to U.S. Government export regulations applicable to that information, the applicant must either be (A) a U.S. person, defined as a (i) U.S. citizen or national, (ii) U.S. lawful permanent resident (green card holder), (iii) refugee under 8 U.S.C. § 1157, or (iv) asylee under 8 U.S.C. § 1158, (B) eligible to access the export controlled information without a required export authorization, or (C) eligible and reasonably likely to obtain the required export authorization from the applicable U.S. government agency. CoreWeave may, for legitimate business reasons, decline to pursue any export licensing process. California applicants: California Consumer Privacy Act – California applicants only #J-18808-Ljbffr Dormont Manufacturing Co
$152k - $241.5k
Senior Software Engineer - Deep Learning Inference What you’ll be doing: Craft and develop robust inferencing software that can be scaled to multiple platforms for functionality and performance Develop components of TensorRT, NVIDIA’s SDK for high-performance deep learning...Senior- ...RL training and SOTA LLM and Multimodal inference at scale across multi-GPU and multi-node... ...You will collaborate across internal GPU software teams and engage with open-source... ...software ecosystem. THE PERSON: Skilled engineer with strong technical and analyticalexpertisein...Senior
$184k - $287.5k
Position Overview We are seeking highly skilled and motivated software engineers to join us and build AI inference systems that serve large-scale models with extreme efficiency. You’ll architect and implement high-performance inference stacks, optimize GPU kernels and...Senior- About the Role We are seeking a Senior Inference Engineer to accelerate the performance of Pika's AI-driven products. In this highly technical role, you will operate at the intersection of cutting‑edge inference acceleration, GPU parallelism, advanced model deployment,...SeniorWork at office3 days per week
$152k - $204k
...Nasdaq: CRWV) in March 2025. Learn more at What You'll Do: Senior engineers are area owners who lead designs, raise engineering... ...orchestration, and hardware teams to evolve our Kubernetes-native inference platform and meet strict P99 SLAs at scale. About the role...SeniorPermanent employmentTemporary workCasual workWork at officeFlexible hoursShift work- Cerebras Systems, Inc. is looking for a Senior Performance Engineer to enhance the performance benchmarking and competitive pricing models for their... ...candidate will have extensive experience with open-source inference frameworks and an understanding of ML systems. This role...Senior
- A leading technology company is seeking a Senior System Software Engineer to develop GPU-accelerated AI inference serving software. The ideal candidate will have over 5 years of experience with deep learning software, strong skills in Rust and C++, and a collaborative approach...Senior
- A leading technology company is seeking a Senior AI Software Engineer to join their team in Santa Clara, California. In this role, you will innovate and develop groundbreaking AI systems software for inference applications including deep learning framework optimizations...Senior
$184k - $287.5k
NVIDIA Gruppe in Santa Clara is seeking an AI Systems Engineer to innovate and develop cutting-edge technologies in the AI inference software stack. Candidates should hold a Master's degree and possess over 6 years of experience in ML/DL systems development. The role involves...Senior$152k - $241.5k
...seeking a talented individual to optimize and benchmark GenAI inference using the latest acceleration technologies. The role involves... ...Required qualifications include a relevant degree and significant software development experience in Python or C++. A deep understanding...Senior$184k - $287.5k
NVIDIA Gruppe is seeking talented AI systems engineers to advance innovative technologies in AI inference systems software. This role involves developing cutting-edge libraries, code generators, and kernel technologies for NVIDIA's architecture, emphasizing high-impact...Senior- ...Inc. is looking for a Sr. Member of Technical Staff to design software features that enhance system resiliency and high availability... ...distributed environments. The role includes developing scalable AI inference services and deploying cloud-based workflows. Ideal candidates...Senior
$165k - $242k
Dormont Manufacturing Co is seeking a Senior Engineer to lead designs and improve engineering standards. The role focuses on evolving our Kubernetes-native inference platform and ensuring reliability across multiple services. Qualified candidates should have 5-8 years experience...Senior$230k - $250k
Cerebras Systems is seeking a Sr. Member of Technical Staff in Sunnyvale, CA. This role involves designing resilient software features for cloud-based AI inference, leveraging AWS tools and services. Candidates should have a Master’s degree in Computer Science and experience...Senior$184k - $356.5k
NVIDIA Gruppe is looking for skilled software engineers to develop AI inference systems that operate with high efficiency. The role involves architecting high-performance inference frameworks and optimizing GPU processes. Ideal candidates should have extensive programming...Senior- United States Digital Space LLC in Palo Alto is looking for a Senior Engineer to develop a next-generation inference platform integrated with Atlas. This role involves building scalable infrastructure and collaborating with teams to enhance AI capabilities. Ideal candidates...Senior
$165k - $242k
A cloud service provider is seeking a Senior Software Engineer II for their Inference team in Sunnyvale, California. In this role, you'll lead design reviews, implement optimizations, and improve service reliability. The ideal candidate has extensive experience with distributed...Senior- Dormont Manufacturing Co is looking for a Senior ML Infrastructure Engineer to help build and scale robust platforms for ML inference workflows. You will collaborate with ML engineers and researchers while shaping the future of AI infrastructure at GM. The ideal candidate...SeniorRemote job
- Cerebras is seeking a Software Engineer to join our Inference Platform team in Sunnyvale, California. This role involves developing and leading projects that integrate cloud and ML components. You will contribute to shaping the technical direction and improve system performance...Senior
- Israelvcforum is looking for a Senior ML Infrastructure Engineer in Mountain View, California. This position... ...and scale robust platforms for ML inference workflows supporting GM’s AI efforts... ...serving strategies and handle backend software components. The position demands 5+...SeniorRemote job
$126k - $248k
About the Role We’re looking for a Senior Engineer to help build the next‑generation inference platform that supports embedding models used for semantic search,... ...backend or infrastructure systems at scale Strong software engineering skills in languages such as Go, Rust,...SeniorLocal area- We’re looking for a Senior Engineer to help build the next-generation inference platform that supports embedding models used for semantic search, retrieval, and... ...backend or infrastructure systems at scale Strong software engineering skills in languages such as Go, Rust,...SeniorLocal areaWorldwide
- ...limits of real‑time large language model inference? Join NVIDIA’s TensorRT Edge‑LLM team... ...automotive and robotics. We build the software stack that enables Large Language, Vision... ...Computer Science, Electrical/Computer Engineering, or a closely related field. 4+ years of...Senior
$152k - $241.5k
...Design, build, and optimize containerized inference execution for the latest 3D VLMs from... ...production-grade, highly optimized software (NIMs, NVIDIA Inference Microservices)*... ...Computer Science + 3 years, or Electrical Engineering, Bachelor of Science (or equivalent experience...Senior$128.7k - $261.3k
About the Team The Model Deployment & Inference Solutions team in GM AV deploys machine learning... ...currently performed manually by engineers. Build the developer experience that ML... ...Experience designing clean, well‑tested software with clear interfaces and good abstractions...SeniorLocal areaRemote workFlexible hoursShift work$170.6k - $261.3k
Overview As a Senior Software Engineer on the SimCore team, you will build and deploy applied AI/ML solutions that directly support simulation... ...models, and excel at building robust, high‑performance inference pipelines. This role is not focused on training foundation...SeniorFlexible hours$181.1k - $272.1k
Sunnyvale, California, United States Software and Services At Apple, new ideas have... ...workers. Description We are looking for a senior Swift engineer to join a small, focused team building... ...tool calling, and streaming on-device inference. Familiarity with MLX or similar on-...SeniorRelocation$155.42k - $205.9k
Job Description Senior ML Infrastructure Engineer (ML Inference Platform). About the Team The ML Inference Platform is part of the AV ML Infrastructure organization... ...be doing Design and implement core platform backend software components. Collaborate with ML engineers and...SeniorLocal areaRemote workRelocationRelocation packageFlexible hours$135.8k - $237.05k
...learning and real‑world impact converge at scale. We’re hiring a Senior Backend Engineer to build and operate the infrastructure those models depend... ...focus on the performance, reliability, and scalability of inference systems. Join us and help influence how billions of gaming...SeniorWork at officeWorldwideRelocation package$224k - $356.5k
We are looking for a Senior Deep Learning Software Engineer to design and build our automated inference and deployment solution. As part of the team, you will be instrumental in defining a scalable architecture for DL inference with emphasis on ease-of-use and compute...Senior
Do you want to receive more vacancies?
Subscribe and receive similar vacancies to Senior Software Engineer I, Inference. Be the first to apply!
- software engineer amazon Sunnyvale, CA
- experienced software developer Sunnyvale, CA
- federal - software developer Sunnyvale, CA
- software developer internship Sunnyvale, CA
- senior software engineer Sunnyvale, CA
- software developer fintech Sunnyvale, CA
- part time software developer remote Sunnyvale, CA
- software developer intern Sunnyvale, CA
- software data engineer Sunnyvale, CA
- software engineer Sunnyvale, CA
