Sign up to access all features of our service.
  • Job search
  • Favorites
  • Create a CV
    New
  • Salaries
  • Subscriptions

ML Runtime Optimization Engineer

$159.05k - $199.3k

Decisive Point

About the role We are looking for a software engineer with deep experience in optimizing ML models and deploying them on production‑grade embedded runtime environments. You’ll work across the entire ML framework stack (e.g. PyTorch, JAX, ONNX, TensorRT, CUDA, XLA, Triton). At Applied Intuition, you will: Drive ML performance optimization on multiple technologies for on‑road and off‑road ADAS / AD stacks targeting deployment on a variety of embedded compute platforms Develop compute usage strategies to optimize efficiency and latency of model inference for compute boards selected by our customers Work on model pruning and quantization, and support deployment on memory constrained platforms Collaborate closely with ML engineers and software developers on technical efforts to find and optimize efficient model architecture solutions Set up methodologies to profile the model performance on target embedded compute platforms and identify performance bottlenecks as part of stack integration We're looking for someone who has: Bachelors in Electrical Engineering or Computer Science, OR B.Sc. in Computer Science, Mathematics, Physics or a related field 3+ years of experience with ML accelerators, GPU, CPU, SoC architecture and micro‑architecture Strong software development skills with the focus on embedded programming Experience profiling and optimizing model performance on embedded compute platforms Experience in working with deep learning frameworks (e.g., PyTorch, JAX, ONNX, etc.) Nice to have: M.Sc or PhD in a ML related area Built an ML optimization framework from scratch before Deployed ML solutions to embedded chips for real time robotics applications Compensation at Applied Intuition for eligible roles includes base salary, equity, and benefits. Base salary is a single component of the total compensation package, which may also include equity in the form of options and/or restricted stock units, comprehensive health, dental, vision, life and disability insurance coverage, 401k retirement benefits with employer match, learning and wellness stipends, and paid time off. Note that benefits are subject to change and may vary based on jurisdiction of employment. Applied Intuition pay ranges reflect the minimum and maximum intended target base salary for new hire salaries for the position. The actual base salary offered to a successful candidate will additionally be influenced by a variety of factors including experience, credentials & certifications, educational attainment, skill level requirements, interview performance, and the level and scope of the position. For pay transparency purposes, the base salary range for this full‑time position in the location listed is: $159,053 - $199,295 USD annually. Applied Intuition is an equal opportunity employer and federal contractor or subcontractor. Consequently, the parties agree that, as applicable, they will abide by the requirements of 41 CFR 60‑1.4(a), 41 CFR 60‑300.5(a) and 41 CFR 60‑741.5(a) and that these regulations prohibit discrimination against qualified individuals based on their status as protected veterans or individuals with disabilities, and prohibit discrimination against all individuals based on race, color, religion, sex, sexual orientation, gender identity or national origin. These regulations require that covered prime contractors and subcontractors take affirmative action to employ and advance in employment individuals without regard to race, color, religion, sex, sexual orientation, gender identity, national origin, protected veteran status or disability. The parties also agree that, as applicable, they will abide by the requirements of Executive Order 13496 (29 CFR Part 471, Appendix A to Subpart A), relating to the notice of employee rights under federal labor laws. #J-18808-Ljbffr Decisive Point

Vacancy posted 1 day ago
Similar jobs that could be interesting for youBased on the ML Runtime Optimization Engineer in Sunnyvale, CA vacancy
  • $170.5k - $315.49k

    ## Inference Optimization Engineer (local / edge runtime)Applylocations: US, California, Santa Clara: US, Oregon, Hillsboro: US, California, Folsom: US, Arizona, Phoenixtime type: Full timeposted on: Posted Yesterdayjob requisition id: JR0284871# **Job Details:**## Job... 
    Suggested
    Internship
    Local area
    Immediate start
    Shift work

    Intel

    Santa Clara, CA
    23 hours ago
  • A technology company located in Santa Clara is seeking a CPU Physical Design Methodology and Optimization Engineer. In this role, you will work on enhancing CPU performance and efficiency by developing new design flows and collaborating with various teams. Candidates should... 
    Suggested

    Apple Inc.

    Santa Clara, CA
    4 days ago
  • $136k - $218.5k

     ...We’re looking for a Senior Power Architecture & Optimization Engineer to push the limits of energy efficiency using advanced analytics and AI,...  ...Develop and productionize power‑aware models and flows, including ML/RL‑based techniques for anomaly detection, dynamic power... 
    Suggested

    NVIDIA

    Santa Clara, CA
    1 day ago
  •  ...Senior Software & Machine Learning Engineer to join our Energy Optimization team. This role focuses on building...  ...integration applications Build automated ML pipelines for model training,...  ...performance, memory utilization, and runtime efficiency. Develop monitoring, simulation... 
    Suggested

    Pentangle Tech Services | P5 Group

    Palo Alto, CA
    1 day ago
  • $166k - $244k

    Google is looking for a Senior Software Engineer in Sunnyvale, CA to lead GPU performance optimizations for cutting-edge AI and machine learning technologies. This role offers the opportunity to work on innovative projects that impact billions of users around the globe.... 
    Suggested

    Google

    Sunnyvale, CA
    1 day ago
  •  ...supercomputer — feel like one seamless engine. Developers can write once,...  ...the Role We're looking for a Runtime Engineer to design and build...  ...you'll take the output of our optimizing compiler and make it execute —...  ...the evolving needs of ML engineers and drive improvements... 

    Lemurian Labs

    Santa Clara, CA
    1 day ago
  • $181.1k - $318.4k

    Staff Data Science Engineer, Siri Runtime Systems and Interaction Cupertino, California, United States Software and Services Apple is where individual...  ...monitoring Drive technical decisions and architecture for ML/AI initiatives Identify high-impact opportunities where data... 
    Relocation

    Apple Inc.

    Cupertino, CA
    23 hours ago
  •  ...AI platform, from chip to model, optimized for enterprise and government organizations...  ...assets. The Opportunity The Runtime team at Sambanova is a seasoned engineering team with a proven track record of...  ...data-flow applications such as ML training and inference and HPC applications... 
    Full time
    Temporary work
    Local area
    Flexible hours

    SambaNova

    Palo Alto, CA
    1 day ago
  • $120k - $200k

     ...to train and run the largest ML workloads for AGI. We primarily...  .../validation, compiler, and runtime teams Contribute to architectural...  ...and implement performance optimizations Requirements Bachelor of...  ...equivalent degree Excellent software engineering skills, with a focus on... 
    Full time
    Work at office
    3 days per week

    Acceler8 Talent

    Mountain View, CA
    2 days ago
  • Advanced Micro Devices in Santa Clara seeks a Senior ML Engineer focused on optimizing large language model inference runtimes. The role involves architecting distributed systems and enhancing performance across GPUs. Ideal candidates will have expertise in Python and C... 

    Advanced Micro Devices

    Santa Clara, CA
    4 days ago
  • $136k - $218.5k

    NVIDIA Gruppe is seeking a Senior Power Architecture & Optimization Engineer in Santa Clara, California. This role involves utilizing advanced analytics and AI to enhance energy efficiency of GPUs and SoCs. Candidates should have a solid background in power architecture... 

    NVIDIA Gruppe

    Santa Clara, CA
    2 days ago
  • A leading AI software company in California is seeking a Software Engineer to develop and enhance runtime stacks for scalable ML applications. The role involves working on system software and collaborating with various teams to support next-generation high-performance... 

    SambaNova

    Palo Alto, CA
    1 day ago
  • MatX is seeking a skilled software engineer to build custom silicon for AI language models in Mountain View, California. The role requires strong programming skills, with a focus on memory management and API design. Candidates will engage in building libraries for memory... 
    Remote job

    MatX

    Mountain View, CA
    23 hours ago
  • $147.4k - $272.1k

    Apple Inc. is seeking a Software Development Engineer for Siri Runtime Systems and Interaction in Cupertino, California. This role involves...  ...focusing on low-latency interactions and system performance optimization. Ideal candidates will have strong programming skills in... 

    Apple Inc.

    Cupertino, CA
    4 days ago
  • d-Matrix inc. is seeking a Staff Runtime Systems Engineer to join our team in Santa Clara, CA. This hybrid role involves working onsite three days...  ...firmware and software for multiprocessor systems, ensuring optimal runtime performance. Ideal candidates have a Bachelor's in... 
    3 days per week

    d-Matrix inc.

    Santa Clara, CA
    23 hours ago
  •  ...users to effortlessly run large‑scale ML applications, without the hassle of managing...  ...seeking a versatile and experienced engineer to join our SOTA Training Platform team...  ...translation, graph lowering, compiler optimizations, runtime integration, and performance tuning. Debug... 
    Internship

    Cerebras

    Sunnyvale, CA
    3 days ago
  •  ...learning users to effortlessly run large-scale ML applications, without the hassle of...  ...computation. About The Role As a Kernel Engineer on our team, you will develop high-...  .... Your focus will be on implementing, optimizing, and scaling deep learning operations to... 

    CEREBRAS SYSTEMS INC.

    Sunnyvale, CA
    4 days ago
  •  ...automation with Moveworks’ Reasoning Engine and natural language...  ...The Role We're building the runtime infrastructure that powers Moveworks...  ...in real time. This is not an ML role. This is a distributed...  .../per‑bot scoping, batch read optimization, and hot‑reload configuration... 
    Work at office
    Remote work
    Flexible hours

    Servicenow

    Mountain View, CA
    23 hours ago
  • Software Engineer, Agentic Systems - Moveworks Job Description We're building the runtime infrastructure that powers Moveworks' AI agents...  ...in real time. This is not an ML role. This is a distributed...  ...per-bot scoping, batch read optimization, and hot-reload configuration... 
    Work at office
    Remote work
    Flexible hours

    Moveworks

    Mountain View, CA
    1 day ago
  • $147.4k - $272.1k

    Software Development Engineer, Siri Runtime Systems and Interaction Cupertino, California, United States Software and Services Apple is where...  ...latency. You will focus on new features, system integration to optimize performance for low-latency interaction as well as... 
    Relocation

    Apple Inc.

    Cupertino, CA
    4 days ago
  •  ...responsible for building and optimizing production‑grade single-node and distributed inference runtimes for large language models on AMD...  ...the intersection of inference engines, distributed systems, and GPU...  ...PERSON You are a systems‑minded ML engineer who thinks in terms... 

    Advanced Micro Devices

    Santa Clara, CA
    4 days ago
  • $220.2k - $330.4k

     ...Technologies, Inc. Job Area: Engineering Group, Engineering Group > Systems...  ...; instrument, profile, and optimize models and pipelines end‑to‑...  ...that combine application, runtime, and platform considerations...  ...contributor with 10+ years in the AI/ML fields. EEO Statement:... 
    Work experience placement
    Work at office

    Qualcomm

    Santa Clara, CA
    23 hours ago
  • jobr.pro is looking for a Sr Software Engineer Navigation to bridge digital interfaces and machine learning for robotic product lines. The role focuses on designing and optimizing C++ runtime systems, integrating computer vision and robotic kinematics to enhance surgical... 

    jobr.pro

    Sunnyvale, CA
    1 day ago
  • Intel Corporation is seeking a Senior Compiler Engineer to develop and optimize compiler software for next-generation GPU architectures. The role involves collaborating on cutting-edge compiler technologies that enhance AI and high-performance computing performance. The... 

    Intel Corporation

    Santa Clara, CA
    14 hours ago
  •  ...solving some of the most complex challenges in the energy industry—optimizing electricity markets, integrating renewables, and ensuring grid reliability in real time. As a Senior Optimization Engineer, you will work at the intersection of advanced mathematics, software... 

    Hitachi Vantara Corporation

    Santa Clara, CA
    1 day ago
  •  ...looking for a highly skilled High Performance Computing (HPC) Engineer in Sunnyvale, California. The ideal candidate will have deep expertise...  ...quantum computing technologies, focusing on solving complex optimization problems. This role involves designing algorithms for large-... 

    Onesubsea

    Sunnyvale, CA
    2 days ago
  • $133.5k - $183.5k

    A global leader in materials engineering solutions is seeking a DFx Engineer IV in Santa Clara, CA. This role involves leading initiatives to optimize product cost and quality in semiconductor manufacturing while mentoring junior engineers. Responsibilities include analyzing... 
    Full time

    Applied Materials, Inc.

    Santa Clara, CA
    4 days ago
  • A telecommunications firm is seeking an experienced Inbuilding Optimization Engineer for projects in Las Vegas, NV. The role involves RF testing, optimization, and performance improvement with a focus on Ericsson LTE/5G equipment. Candidates should have a minimum of 7 years... 

    Atriano

    Sunnyvale, CA
    4 days ago
  • NVIDIA is seeking a Senior DL Algorithms Engineer to optimize LLM/Omni models and enhance performance across its software stack. The ideal candidate will have a PhD and 3+ years of experience in deep learning, specifically in inference. This role involves profiling, analyzing... 

    NVIDIA

    Santa Clara, CA
    23 hours ago
  • $120k - $250k

     ...compiler, and kernels so each layer benefits from the others. The runtime owns the host-side stack and the contracts that bind those...  ...and debuggers — perf counters, traces, and the Python surfaces ML engineers actually use — and hit measurable performance targets on... 
    Full time
    Contract work
    Work experience placement
    Local area
    Remote work
    Monday to Friday
    Flexible hours

    MatX

    Mountain View, CA
    23 hours ago

Do you want to receive more vacancies?

Subscribe and receive similar vacancies to ML Runtime Optimization Engineer. Be the first to apply!