Sign up to access all features of our service.
  • Job search
  • Favorites
  • Create a CV
    New
  • Salaries
  • Subscriptions

Senior Software Engineer I, Inference

$139k - $204k

Dormont Manufacturing Co

What You’ll Do: Senior engineers are area owners who lead designs, raise engineering standards, and deliver measurable improvements to latency, throughput, and reliability across multiple services. You’ll partner with product, orchestration, and hardware teams to evolve our Kubernetes-native inference platform and meet strict P99 SLAs at scale. About the role: Lead design reviews and drive architecture within the team; decompose multi-service work into clear milestones. Define and own SLIs/SLOs; ensure post-incident actions land and reliability improves release-over-release. Implement advanced optimizations (e.g., micro-batch schedulers, speculative decoding, KV-cache reuse) and quantify impact. Strengthen incident posture: capacity planning, autoscaling policy, graceful degradation, rollback/traffic-shift strategies. Mentor IC1/IC2 engineers; review cross-team designs and elevate coding/testing standards. For IC4: own an area spanning multiple services and teams (e.g., request routing & adaptive scheduling, cost-per-token analytics, GPU resource isolation). Who You Are: IC3: ~3–5 years; IC4: ~5–8 years industry experience building distributed systems or cloud services. Strong coding in Python or Go (C++ a plus) and deep familiarity with networked systems and performance. Hands‑on experience with Kubernetes at production scale, CI/CD, and observability stacks (Prometheus, Grafana, OpenTelemetry). Practical knowledge of inference internals: batching, caching, mixed precision (BF16/FP8), streaming token delivery. Proven track record improving tail latency (P95/P99) and service reliability through metrics‑driven work. Preferred: Contributions to inference frameworks (vLLM, Triton, TensorRT-LLM, Ray Serve, TorchServe). Experience with CUDA kernels, NCCL/SHARP, RDMA/NUMA, or GPU interconnect topologies. Leading multi‑team initiatives or partnering with customers on mission‑critical launches. Benefits and Compensation Base salary range for this role is $139,000 to $204,000. The starting salary will be determined based on job‑related knowledge, skills, experience, and market location. In addition to base salary, the total rewards package includes a discretionary bonus, equity awards, and a comprehensive benefits program (all based on eligibility). Medical, dental, and vision insurance – 100% paid for by CoreWeave Company‑paid life insurance Voluntary supplemental life insurance Short and long‑term disability insurance Flexible Spending Account Health Savings Account Tuition Reimbursement Employee Stock Purchase Program (ESP) participation Mental wellness benefits via Spring Health Family‑Forming support via Carrot Paid parental leave Flexible, full‑service childcare support with Kinside 401(k) with generous employer match Flexible PTO Catered lunch each day in office and data center locations Casual work environment Work culture focused on innovative disruption Equal Opportunity Employer CoreWeave is an equal opportunity employer, committed to fostering an inclusive and supportive workplace. All qualified applicants and candidates will receive consideration for employment without regard to race, color, religion, sex, disability, age, sexual orientation, gender identity, national origin, veteran status, or genetic information. Americans with Disabilities Act (ADA) Compliance CoreWeave will ensure that qualified applicants and candidates with disabilities are provided reasonable accommodations for the hiring process, unless such accommodation would cause an undue hardship. If reasonable accommodation is needed, please contact: View email address on click.appcast.io. Export Control Compliance This position requires access to export controlled information. To conform to U.S. Government export regulations applicable to that information, the applicant must either be (A) a U.S. person, defined as a (i) U.S. citizen or national, (ii) U.S. lawful permanent resident (green card holder), (iii) refugee under 8 U.S.C. § 1157, or (iv) asylee under 8 U.S.C. § 1158, (B) eligible to access the export controlled information without a required export authorization, or (C) eligible and reasonably likely to obtain the required export authorization from the applicable U.S. government agency. CoreWeave may, for legitimate business reasons, decline to pursue any export licensing process. California applicants: California Consumer Privacy Act – California applicants only #J-18808-Ljbffr Dormont Manufacturing Co

Vacancy posted 1 day ago
Similar jobs that could be interesting for youBased on the Senior Software Engineer I, Inference in Sunnyvale, CA vacancy
  • $152k - $241.5k

    Senior Software Engineer - Deep Learning Inference What you’ll be doing: Craft and develop robust inferencing software that can be scaled to multiple platforms for functionality and performance Develop components of TensorRT, NVIDIA’s SDK for high-performance deep learning... 
    Senior

    NVIDIA Gruppe

    Santa Clara, CA
    4 days ago
  •  ...RL training and SOTA LLM and Multimodal inference at scale across multi-GPU and multi-node...  ...You will collaborate across internal GPU software teams and engage with open-source...  ...software ecosystem. THE PERSON: Skilled engineer with strong technical and analyticalexpertisein... 
    Senior

    Advanced Micro Devices

    Santa Clara, CA
    1 day ago
  • $184k - $287.5k

    Position Overview We are seeking highly skilled and motivated software engineers to join us and build AI inference systems that serve large-scale models with extreme efficiency. You’ll architect and implement high-performance inference stacks, optimize GPU kernels and... 
    Senior

    NVIDIA Gruppe

    Santa Clara, CA
    4 days ago
  • About the Role We are seeking a Senior Inference Engineer to accelerate the performance of Pika's AI-driven products. In this highly technical role, you will operate at the intersection of cutting‑edge inference acceleration, GPU parallelism, advanced model deployment,... 
    Senior
    Work at office
    3 days per week

    PIKA Inc

    Palo Alto, CA
    3 days ago
  • $152k - $204k

     ...Nasdaq: CRWV) in March 2025. Learn more at What You'll Do: Senior engineers are area owners who lead designs, raise engineering...  ...orchestration, and hardware teams to evolve our Kubernetes-native inference platform and meet strict P99 SLAs at scale. About the role... 
    Senior
    Permanent employment
    Temporary work
    Casual work
    Work at office
    Flexible hours
    Shift work

    CoreWeave

    Sunnyvale, CA
    13 days ago
  • Cerebras Systems, Inc. is looking for a Senior Performance Engineer to enhance the performance benchmarking and competitive pricing models for their...  ...candidate will have extensive experience with open-source inference frameworks and an understanding of ML systems. This role... 
    Senior

    Cerebras Systems, Inc.

    Sunnyvale, CA
    2 days ago
  • A leading technology company is seeking a Senior System Software Engineer to develop GPU-accelerated AI inference serving software. The ideal candidate will have over 5 years of experience with deep learning software, strong skills in Rust and C++, and a collaborative approach... 
    Senior

    NVIDIA Corporation

    Santa Clara, CA
    5 days ago
  • A leading technology company is seeking a Senior AI Software Engineer to join their team in Santa Clara, California. In this role, you will innovate and develop groundbreaking AI systems software for inference applications including deep learning framework optimizations... 
    Senior

    NVIDIA Corporation

    Santa Clara, CA
    1 day ago
  • $184k - $287.5k

    NVIDIA Gruppe in Santa Clara is seeking an AI Systems Engineer to innovate and develop cutting-edge technologies in the AI inference software stack. Candidates should hold a Master's degree and possess over 6 years of experience in ML/DL systems development. The role involves... 
    Senior

    NVIDIA Gruppe

    Santa Clara, CA
    4 days ago
  • $152k - $241.5k

     ...seeking a talented individual to optimize and benchmark GenAI inference using the latest acceleration technologies. The role involves...  ...Required qualifications include a relevant degree and significant software development experience in Python or C++. A deep understanding... 
    Senior

    NVIDIA Gruppe

    Santa Clara, CA
    4 days ago
  • $184k - $287.5k

    NVIDIA Gruppe is seeking talented AI systems engineers to advance innovative technologies in AI inference systems software. This role involves developing cutting-edge libraries, code generators, and kernel technologies for NVIDIA's architecture, emphasizing high-impact... 
    Senior

    NVIDIA Gruppe

    Santa Clara, CA
    4 days ago
  •  ...Inc. is looking for a Sr. Member of Technical Staff to design software features that enhance system resiliency and high availability...  ...distributed environments. The role includes developing scalable AI inference services and deploying cloud-based workflows. Ideal candidates... 
    Senior

    Cerebras Systems, Inc.

    Sunnyvale, CA
    2 days ago
  • $165k - $242k

    Dormont Manufacturing Co is seeking a Senior Engineer to lead designs and improve engineering standards. The role focuses on evolving our Kubernetes-native inference platform and ensuring reliability across multiple services. Qualified candidates should have 5-8 years experience... 
    Senior

    Dormont Manufacturing Co

    Sunnyvale, CA
    1 day ago
  • $230k - $250k

    Cerebras Systems is seeking a Sr. Member of Technical Staff in Sunnyvale, CA. This role involves designing resilient software features for cloud-based AI inference, leveraging AWS tools and services. Candidates should have a Master’s degree in Computer Science and experience... 
    Senior

    Cerebras Systems

    Sunnyvale, CA
    3 days ago
  • $184k - $356.5k

    NVIDIA Gruppe is looking for skilled software engineers to develop AI inference systems that operate with high efficiency. The role involves architecting high-performance inference frameworks and optimizing GPU processes. Ideal candidates should have extensive programming... 
    Senior

    NVIDIA Gruppe

    Santa Clara, CA
    4 days ago
  • United States Digital Space LLC in Palo Alto is looking for a Senior Engineer to develop a next-generation inference platform integrated with Atlas. This role involves building scalable infrastructure and collaborating with teams to enhance AI capabilities. Ideal candidates... 
    Senior

    United States Digital Space LLC

    Palo Alto, CA
    4 days ago
  • $165k - $242k

    A cloud service provider is seeking a Senior Software Engineer II for their Inference team in Sunnyvale, California. In this role, you'll lead design reviews, implement optimizations, and improve service reliability. The ideal candidate has extensive experience with distributed... 
    Senior

    CoreWeave

    Sunnyvale, CA
    3 days ago
  • Dormont Manufacturing Co is looking for a Senior ML Infrastructure Engineer to help build and scale robust platforms for ML inference workflows. You will collaborate with ML engineers and researchers while shaping the future of AI infrastructure at GM. The ideal candidate... 
    Senior
    Remote job

    Dormont Manufacturing Co

    Sunnyvale, CA
    1 day ago
  • Cerebras is seeking a Software Engineer to join our Inference Platform team in Sunnyvale, California. This role involves developing and leading projects that integrate cloud and ML components. You will contribute to shaping the technical direction and improve system performance... 
    Senior

    Cerebras

    Sunnyvale, CA
    4 days ago
  • Israelvcforum is looking for a Senior ML Infrastructure Engineer in Mountain View, California. This position...  ...and scale robust platforms for ML inference workflows supporting GM’s AI efforts...  ...serving strategies and handle backend software components. The position demands 5+... 
    Senior
    Remote job

    Israelvcforum

    Mountain View, CA
    4 days ago
  • $126k - $248k

    About the Role We’re looking for a Senior Engineer to help build the next‑generation inference platform that supports embedding models used for semantic search,...  ...backend or infrastructure systems at scale Strong software engineering skills in languages such as Go, Rust,... 
    Senior
    Local area

    The Consulting Solutions

    Palo Alto, CA
    1 day ago
  • We’re looking for a Senior Engineer to help build the next-generation inference platform that supports embedding models used for semantic search, retrieval, and...  ...backend or infrastructure systems at scale Strong software engineering skills in languages such as Go, Rust,... 
    Senior
    Local area
    Worldwide

    MongoDB

    Palo Alto, CA
    3 days ago
  •  ...limits of real‑time large language model inference? Join NVIDIA’s TensorRT Edge‑LLM team...  ...automotive and robotics. We build the software stack that enables Large Language, Vision...  ...Computer Science, Electrical/Computer Engineering, or a closely related field. 4+ years of... 
    Senior

    NVIDIA Gruppe

    Santa Clara, CA
    4 days ago
  • $152k - $241.5k

     ...Design, build, and optimize containerized inference execution for the latest 3D VLMs from...  ...production-grade, highly optimized software (NIMs, NVIDIA Inference Microservices)*...  ...Computer Science + 3 years, or Electrical Engineering, Bachelor of Science (or equivalent experience... 
    Senior

    NVIDIA

    Santa Clara, CA
    4 days ago
  • $128.7k - $261.3k

    About the Team The Model Deployment & Inference Solutions team in GM AV deploys machine learning...  ...currently performed manually by engineers. Build the developer experience that ML...  ...Experience designing clean, well‑tested software with clear interfaces and good abstractions... 
    Senior
    Local area
    Remote work
    Flexible hours
    Shift work

    General Motors

    Mountain View, CA
    3 days ago
  • $170.6k - $261.3k

    Overview As a Senior Software Engineer on the SimCore team, you will build and deploy applied AI/ML solutions that directly support simulation...  ...models, and excel at building robust, high‑performance inference pipelines. This role is not focused on training foundation... 
    Senior
    Flexible hours

    General Motors

    Sunnyvale, CA
    1 day ago
  • $181.1k - $272.1k

    Sunnyvale, California, United States Software and Services At Apple, new ideas have...  ...workers. Description We are looking for a senior Swift engineer to join a small, focused team building...  ...tool calling, and streaming on-device inference. Familiarity with MLX or similar on-... 
    Senior
    Relocation

    Apple

    Sunnyvale, CA
    4 days ago
  • $155.42k - $205.9k

    Job Description Senior ML Infrastructure Engineer (ML Inference Platform). About the Team The ML Inference Platform is part of the AV ML Infrastructure organization...  ...be doing Design and implement core platform backend software components. Collaborate with ML engineers and... 
    Senior
    Local area
    Remote work
    Relocation
    Relocation package
    Flexible hours

    Dormont Manufacturing Co

    Sunnyvale, CA
    1 day ago
  • $135.8k - $237.05k

     ...learning and real‑world impact converge at scale. We’re hiring a Senior Backend Engineer to build and operate the infrastructure those models depend...  ...focus on the performance, reliability, and scalability of inference systems. Join us and help influence how billions of gaming... 
    Senior
    Work at office
    Worldwide
    Relocation package

    Dormont Manufacturing Co

    Mountain View, CA
    1 day ago
  • $224k - $356.5k

    We are looking for a Senior Deep Learning Software Engineer to design and build our automated inference and deployment solution. As part of the team, you will be instrumental in defining a scalable architecture for DL inference with emphasis on ease-of-use and compute... 
    Senior

    NVIDIA Gruppe

    Santa Clara, CA
    4 days ago

Do you want to receive more vacancies?

Subscribe and receive similar vacancies to Senior Software Engineer I, Inference. Be the first to apply!