AI Runtime Engineer

Oho Group

Runtime Engineer – AI Runtime & Execution

About the Role

We're looking for a Runtime Engineer to help build and optimise the execution layer that powers next-generation AI workloads. Working at the intersection of systems software, compiler technology, and hardware acceleration, you'll play a key role in ensuring that compiled models execute with maximum performance, scalability, and reliability across a range of computing architectures.

This is an exciting opportunity to work on low-level runtime systems, execution engines, and hardware-aware optimisation, collaborating closely with compiler, hardware, and product teams to shape the future of AI infrastructure.

Key Responsibilities

  • You will help design, build, and evolve a high-performance execution engine capable of supporting multiple hardware platforms and accelerator architectures.
  • You will optimise workload execution through advanced scheduling, partitioning, and parallelisation strategies that maximise hardware utilisation.
  • You will work directly with compiled workloads and binaries, profiling execution behaviour and identifying opportunities for performance improvements.
  • You will develop internal tooling, telemetry systems, and diagnostic frameworks that help uncover execution bottlenecks and system inefficiencies.
  • You'll be responsible for analysing runtime performance across physical hardware, ensuring models achieve optimal throughput, latency, and resource utilisation.
  • You will contribute to the development and evaluation of experimental runtime features, prototypes, and execution strategies that influence future platform capabilities.
  • You'll collaborate closely with compiler, hardware, and product teams to translate machine learning requirements into scalable runtime solutions.

Required Qualifications

  • You'll need strong experience developing runtime systems, execution engines, systems software, or hardware-facing infrastructure.
  • You should be highly proficient in modern C++ and comfortable working within large-scale performance-critical codebases.
  • You must have a strong understanding of concurrent programming, multi-threaded architectures, asynchronous execution, and workload scheduling.
  • You'll need a solid understanding of computer architecture, including memory hierarchies, cache behaviour, processor execution models, and low-level performance considerations.
  • Experience working close to operating system primitives, drivers, kernel-level functionality, or low-level systems programming is highly desirable.
  • You should be comfortable profiling, debugging, and optimising software running directly on physical hardware platforms.

Preferred Qualifications

  • Experience working with GPU computing technologies such as CUDA, ROCm, or other accelerator programming frameworks.
  • Exposure to machine learning frameworks and compiler technologies including Triton, PyTorch, JAX, MLIR, or similar ecosystems.
  • Understanding of distributed computing systems, HPC environments, or large-scale parallel processing architectures.
  • Experience building performance analysis, telemetry, or observability tooling for complex software systems.
  • Strong interest in compiler technology, hardware acceleration, and AI infrastructure.

Education

You should be educated to BS, MS, or PhD level in Computer Science, Computer Engineering, Electrical Engineering, or a related technical discipline, or possess equivalent industry experience.

Vacancy posted 12 hours ago