AI Runtime Engineer

Oho Group

Runtime Engineer – AI Runtime & Execution

About the Role

We're looking for a Runtime Engineer to help build and optimise the execution layer that powers next-generation AI workloads. Working at the intersection of systems software, compiler technology, and hardware acceleration, you'll play a key role in ensuring that compiled models execute with maximum performance, scalability, and reliability across a range of computing architectures.

This is an exciting opportunity to work on low-level runtime systems, execution engines, and hardware-aware optimisation, collaborating closely with compiler, hardware, and product teams to shape the future of AI infrastructure.

Key Responsibilities

You will help design, build, and evolve a high-performance execution engine capable of supporting multiple hardware platforms and accelerator architectures.
You'll get the chance to optimise workload execution through advanced scheduling, partitioning, and parallelisation strategies that maximise hardware utilisation.
You will work directly with compiled workloads and binaries, profiling execution behaviour and identifying opportunities for performance improvements.
This is an excellent opportunity for you to develop internal tooling, telemetry systems, and diagnostic frameworks that help uncover execution bottlenecks and system inefficiencies.
You'll be responsible for analysing runtime performance across physical hardware, ensuring models achieve optimal throughput, latency, and resource utilisation.
You will contribute to the development and evaluation of experimental runtime features, prototypes, and execution strategies that influence future platform capabilities.
You'll collaborate closely with compiler, hardware, and product teams to translate machine learning requirements into scalable runtime solutions.

Required Qualifications

You'll need strong experience developing runtime systems, execution engines, systems software, or hardware-facing infrastructure.
You should be highly proficient in modern C++ and comfortable working within large-scale performance-critical codebases.
You must have a strong understanding of concurrent programming, multi-threaded architectures, asynchronous execution, and workload scheduling.
You'll need a solid understanding of computer architecture, including memory hierarchies, cache behaviour, processor execution models, and low-level performance considerations.
Experience working close to operating system primitives, drivers, kernel-level functionality, or low-level systems programming is highly desirable.
You should be comfortable profiling, debugging, and optimising software running directly on physical hardware platforms.

Preferred Qualifications

Experience working with GPU computing technologies such as CUDA, ROCm, or other accelerator programming frameworks.
Exposure to machine learning frameworks and compiler technologies including Triton, PyTorch, JAX, MLIR, or similar ecosystems.
Understanding of distributed computing systems, HPC environments, or large-scale parallel processing architectures.
Experience building performance analysis, telemetry, or observability tooling for complex software systems.
Strong interest in compiler technology, hardware acceleration, and AI infrastructure.

Education

You should be educated to BS, MS, or PhD level in Computer Science, Computer Engineering, Electrical Engineering, or a related technical discipline, or possess equivalent industry experience.

Apply

Vacancy posted 12 hours ago

Similar jobs that could be interesting for youBased on the AI Runtime Engineer in San Mateo, CA vacancy

AI Inference Engineer
$110k - $270k
...and conventional C++ DSP and control code. Role The AI Inference Engineer in Quadric is the key bridge between the world of AI/LLM... ...and speed up the deployment Make Improvement to SDK and runtime Provide technical support and documents to customers and...
Suggested
Work at office
Local area
Immediate start
Flexible hours
2 days per week
quadric.io
Burlingame, CA
3 days ago
Senior AI Inference Engineer - Model Optimization & Deployment
$242k - $290k
...Model Optimization & Deployment Engineer The Perception team is pioneering the development... ...latency and maximize memory bandwidth on AI accelerators. Write production-level,... ..., Ray, DeepSpeed, Megatron-LM) and runtime efficiency optimization for GPU clusters....
Suggested
Temporary work
Relocation package
Zoox
San Mateo, CA
5 days ago
Senior AI/ML Engineer LLM & Agent Stack
...Senior AI/ML Engineer — LLM & Agent Stack Every production AI system, whether it's powering customer support, writing code, analyzing... ...like LangGraph, LangChain, vector stores, and specialized LLM runtimes. What You'll Do Architect and implement scalable agent...
Suggested
TrueFoundry
San Mateo, CA
5 days ago
Founding AI Engineer
$160k - $250k
...Title: Founding AI Engineer (Research & Systems) Target: PhDs & Research Masters from Stanford, MIT, Berkeley, CMU focused on AI, ML, NLP, Agents. Location: San Francisco, CA | On-Site Compensation: $160K - $250K | 0.8% - 2.0% Equity Visa Sponsorship: Available...
Suggested
H1b
Immediate start
Visa sponsorship
Aimhire
San Mateo, CA
1 day ago
AI Developer Productivity Engineer
$172k - $263k
...and applications. We define and enforce the best practices for engineering across the company. Our approach involves using established programming... ...solutions while establishing best practices for responsible AI integration in our development pipeline Qualifications...
Suggested
Temporary work
Relocation package
Zoox
Foster, CA
5 days ago
Managed Services - AI-Native Software Engineer - Senior Consultant
$86.5k - $142.7k
...who designs, prototypes and builds modern, AI‑enabled applications and digital products... ...building proofs‑of‑concept, and guiding engineering teams through complex technical decisions... ..., data, AI components, integrations and runtime platforms. • Actively prototype and spike...
Summer holiday
Flexible hours
EY
San Mateo, CA
3 days ago
AI Kernel Engineer
$110k - $270k
...code and conventional C++ DSP and control code. Role: The AI Kernel Engineer in Quadric plays the key role to enable a large number of AI... ...utilization • Make Improvement to Quadric toolchain, compiler and runtime • Provide technical support and documents to customers and...
Full time
Temporary work
Work from home
quadric, Inc
Burlingame, CA
more than 2 months ago
AI Inference Engineer
$110k - $270k
...and conventional C++ DSP and control code. Role: The AI Inference Engineer in Quadric is the key bridge between the world of AI/LLM... ...and speed up the deployment Make Improvement to SDK and runtime Provide technical support and documents to customers and...
Full time
Temporary work
Work from home
quadric, Inc
Burlingame, CA
more than 2 months ago
AI Forward-Deployed Engineer (Early Career/Entry Level)
$122k - $175k
...and a $93 Billion market opportunity in UCaaS, Contact Center and AI-powered adjacencies. We invest more than $250 million annually... ...come in. We're looking for an early-career AI Forward-Deployed Engineer to join our AIR Pro team and help build AI-powered customer experiences...
Full time
Internship
Work at office
Local area
Flexible hours
RingCentral
Belmont, CA
4 days ago
Senior AI Engineer
$152.7k - $249.2k
...re now scaling manufacturing and preparing for the launch of our commercial service. Overview We're looking for a Senior AI Engineer to help bring pragmatic, production-grade AI capabilities into the systems that power Joby's manufacturing and enterprise workflows...
Temporary work
Joby Aviation
San Carlos, CA
1 day ago
Sr. AI Engineer
...As an AI Interfaces Engineer, you will design and build agent-mediated interfaces across our product and our internal tooling. You will help establish a new pattern of AI-native frontend development, where prototypes evolve against real platform data over MCP and move...
Confidential Company
Belmont, CA
1 day ago
AI Engineer Sr.
...Vehicles, Scania and MAN. Here in the US, we are blending German engineering with American ingenuity. As ADMT, we develop and realize... ...the main focus areas: to know and understand the future of AI and to derive requirements on sensor technologies and compute architectures...
Local area
Worldwide
Omega Solutions Inc
Belmont, CA
2 days ago
Head of AI Engineering
...At The ReWork Group, we partner with high-growth startups and forward-thinking companies to build the future. As the Head of AI Engineering, you'll own their AI research agenda, leading the design of foundation-model and reinforcement-learning systems that reason...
The ReWork Group
San Mateo, CA
12 hours ago
AI Engineering Intern (unpaid)
...ABOUT HEKA Heka is a stealth-stage startup building AI infrastructure for clinical referral workflows. We're early but real.... ...Stanford, Harvard, and Cornell alumni with backgrounds spanning AI/ML engineering, quantitative research, and healthcare. Because we're small and...
Full time
Internship
Heka Intelligence
San Mateo, CA
22 hours ago
Senior AI Infrastructure Engineer - Computer Vision
...About Obvio AI Each year, more than 40,000 people in the U.S. leave home and never make it back due to traffic crashes. At Obvio... ...promoted and rolled back without pipeline downtime. Set the engineering standard. This is an early hire. You'll write the playbooks—...
Local area
Obvio
San Carlos, CA
1 day ago
AI Architect / AI Engineer / ML Architect
...Strategic/Functional Define and drive the organization's AI strategy and roadmap aligned with business goals Educate leadership and cross-functional teams on AI capabilities, use cases, and best practices Identify high-impact AI/ML use cases across...
Omega Solutions Inc
San Carlos, CA
5 days ago
Genesis AI - Staff Software Engineer, Controls
...Staff Software Engineer, Controls Company Genesis is a global physical AI lab and full-stack robotics company. We build generalist robots to unlock unlimited physical labor, allowing humans to focus on creativity, curiosity, and what they love. We recently raised...
OpenReq
San Carlos, CA
3 days ago
AI Engineer Associate
$120k - $135k
...next big thing" that nobody has thought of as yet, building it and selling it. This position should be held by a Computer Science Engineer, or equivalent, with experience in the development of Software and Software architectures. Possible Tasks within this Role...
Contract work
Volkswagen
Belmont, CA
4 days ago
Senior AI Agent Engineer Physical Data & Simulation
A leading AI development firm in California is seeking a talented developer to create LM/VLM-powered agents that generate physical data through advanced simulation and generative models. The role involves collaboration with simulation teams to design APIs and tools that...
GenesisAI
San Carlos, CA
3 days ago
AI Research Engineer
$200k - $350k
...Research Engineer | San Francisco | Full-Time Brief Overview Applied AI lab building world models for 3D game environments. Early-stage, well-backed, 3 weeks from beta launch. What You'll Own You'll take large ownership of the core generation and simulation...
Full time
Visa sponsorship
Relocation package
Flexible hours
Harnham
San Mateo, CA
3 days ago
AI/ML Engineer
...Job Title: Senior AI Engineer (AI COE) Location: San Mateo, CA, USA (Onsite) Hire Type - Fulltime Only (NO C2C) Why should you choose us? Rakuten Symphony is a Rakuten Group company, that provides global B2B services for the mobile telco industry and enables...
Full time
Worldwide
Rakuten Symphony
San Mateo, CA
12 hours ago
Full Stack AI Software Engineer
$216k - $283k
...Full Stack AI Software Engineer Organizations everywhere struggle under the crushing costs and complexities of "solutions" that promise to simplify their lives. To create a better experience for their customers and employees. To help them grow. Software is a choice...
Work at office
Flexible hours
3 days per week
Freshworks
San Mateo, CA
2 days ago
AI Applications Engineer
$110k - $270k
...machine learning graph, the Quadric GPNPU executes both NN graph code and conventional C++ DSP and control code. Role The AI Applications Engineer is the key bridge between development engineering and hands-on users in the field. The AI Application Engineer will [1]...
Work at office
Local area
Immediate start
Worldwide
Flexible hours
quadric.io
Burlingame, CA
4 days ago
Staff Machine Learning Engineer, Responsible AI Engineering
$169.1k - $270.8k
...to the world. Progress starts with you. Job Description AI Governance (AIG)Engineeringteam is part of the Data and AI... ...experience with a PhD * MS or Ph.D. degree in Computer Science, Engineering, or related field. * 8+ years' work experience in the software...
Work experience placement
Work at office
Local area
Visa
San Mateo, CA
4 days ago
Lead Applied AI / GenAI Engineer
$165k - $185k
...together to support the most exciting missions in the world! About the Role We're looking for an experienced engineer to design and deliver AI-powered applications, intelligent workflows, and automation solutions that drive measurable business impact. In this role...
Qualys
Foster, CA
5 days ago
Principal AI/ML Engineer, Reliability
$295.25k - $345.04k
...breadth of the Roblox stack. Availability of the platform is a key company goal. We are hiring our first Principal Machine Learning engineer within our team. As a Principal Machine Learning Engineer within Reliability, you will set the 3-5 year technical...
Full time
Work experience placement
Seasonal work
H1b
Work at office
Local area
Visa sponsorship
Monday to Friday
Roblox
San Mateo, CA
1 day ago
AI/ML Scientist Lead Engineer
...Physics AI Leader Luminary helps engineering companies be more competitive by getting to market faster, creating new, better products, and reducing development risk. We do this with our Physics AI platform, the fastest and easiest way to build and deploy models to understand...
Luminary Cloud, Inc.
San Mateo, CA
1 day ago
Tech Lead - AI Engineering
...We are seeking a Tech Lead with deep experience in AI/ML to shape architecture and lead delivery in a rapidly evolving domain. This role is ideal for a senior engineer from a leading AI lab (Anthropic, OpenAI, Google DeepMind, and the like) or Big Tech (Google, Microsoft...
T3
San Mateo, CA
1 day ago
Senior Principal AI Agent / ML Software Engineer (OCI)
$96.8k - $251.6k
...Job Description The Senior Principal AI Agent / ML Software Engineer is a Senior Staff-level, hands-on technical leadership role responsible... ...building or operating model serving, inference gateways, agent runtimes, workflow engines, developer platforms, or internal AI...
Temporary work
Flexible hours
Oracle
Redwood City, CA
2 days ago
Physical AI Engineering Consultant - Senior - Consulting - Open Location
$105.8k - $174.8k
...wherever you want it to go. Join EY and help to build a better working world. Technology – Data and Decision Science – AI Native Engineering Physical AI Engineering Consultant, Senior Consultant The opportunity Our Artificial Intelligence and Data team helps...
Full time
Work experience placement
Summer holiday
Flexible hours
EY
San Mateo, CA
5 days ago

Do you want to receive more vacancies?

Subscribe and receive similar vacancies to AI Runtime Engineer. Be the first to apply!