Runtime Engineer

$120k - $250k

MatX

Runtime Engineer

Mountain View, CA

What MatX Is Building

MatX is building custom silicon for large-language-model inference and training, with HW/SW co-design across ISA, RTL, simulator, compiler, and kernels so each layer benefits from the others. The runtime owns the host-side stack and the contracts that bind those teams together.

What You'll Do Here

Build the host-side interface library — device memory management, DMA, streams and events, sync primitives — that every compiler-emitted program runs on top of
Own and extend the executable format: the compiler→runtime contract, its versioning, the weight and quantization layouts that let compiler and runtime evolve independently
Design the custom-kernel ABI — calling convention, sync semantics, lifecycle — and the host-side marshaling layer (DLPack, the buffer protocol, numpy) that gets Python tensors to the device
Build Python bindings via PyO3, with a C-ABI shim as the alternative integration path for downstream consumers
Build the LLM inference serving stack — paged KV cache, continuous batching, request scheduling, token streaming — and the cluster orchestration primitives underneath it
Bring up interconnect topology from the host and own the failure-detection and clean-teardown path for stop-restructure-resume recovery across racks
Design what the chip exposes to host-side profilers and debuggers — perf counters, traces, and the Python surfaces ML engineers actually use — and hit measurable performance targets on runtime overhead and serving throughput

Who You Are

Strong experience in a systems programming language — Rust, C, C++, or Go — including memory management, allocator design, and FFI/ABI work
Have built Python interop layers in production (PyO3, ctypes, pybind11, or equivalent C-ABI bridging)
Have designed and maintained API or ABI contracts between teams — versioning, evolution, breaking-change discipline — not just consumed someone else's
Hands-on with at least one accelerator programming model (CUDA, ROCm, oneAPI Level Zero, TPU, or comparable) — enough to reason about device memory, async execution, and kernel launch
ML-systems literate — comfortable with the training and inference loop, what collectives do, what a tensor layout is. Research depth not required.

Bonus Points If You Have

LLM inference internals — vLLM, TensorRT-LLM, or SGLang (paged attention, scheduler design)
Rust at depth, including proc macros, unsafe with soundness reasoning, and complex lifetime/trait work
Custom allocator design (slab, paged, arena) or other low-level memory work
ML framework integration experience (PyTorch custom backends, JAX/XLA, ONNX runtime)
Profiler or tracing infrastructure work (perfetto, Nsight, or a custom stack)
Driver-adjacent or kernel-bypass work, or prior new-silicon bring-up

Compensation

The US base salary for this full-time position is determined based on a variety of factors including role, experience, location, job related skills, and relevant education and training. Career length is only a guideline for compensation.

Early Career - $120,000 - $250,000 + equity
Mid Career - $175,000 - $362,500 + equity
Senior Career - $250,000 - $475,000 + equity

What We Offer

A Stake In Our Success A flexible cash equity compensation mix that fits your needs
Health & Wellness Company subsidized Health, Dental, Vision, and Life insurance; Pre-tax Health Savings Accounts with generous company contribution (even if you don't)
Time To Recharge 4 weeks paid time off (accrued), 12 company holidays, and 3 weeks remote/flexible work per year
Support To Parents Up to 12 weeks of paid parental leave, regardless of your path to parenthood
Learning & Development $1,500 yearly towards your professional development e.g. conferences, courses, and other learning opportunities
Team Connection Team lunches, quarterly off-sites, and regular town halls
Financial Wellbeing 401K and/or Roth IRA, with 5% company contribution, even if you don't!
Flexible Spending Accounts Pre-tax spend accounts for medical, dental/vision, dependent care, parking, and transit expenses
Commute On Us For those commuting up to 1 hour, put your rideshare cost on our company card and reclaim the drive-time to get work done!
MatX E[x]tras $50 per month to use on the perks you care about most
Remote Perks We work remotely Monday & Friday, supported by home-tech setup, and remote wifi expense reimbursement

As part of our dedication to the diversity of our team and our focus on creating an inviting and inclusive work experience, MatX is committed to a policy of Equal Employment Opportunity and will not discriminate against an applicant or employee on the basis of race, color, religion, creed, national origin or ancestry, sex, gender, gender identity, gender expression, sexual orientation, age, physical or mental disability, medical condition, marital/domestic partner status, military and veteran status, genetic information or any other legally recognized protected basis under federal, state or local laws, regulations or ordinances.

This position requires access to information that is subject to U.S. export controls. This offer of employment is contingent upon the applicants capacity to perform job functions in compliance with U.S. export control laws without obtaining a license from U.S. export control authorities.

MatX does not accept unsolicited resumes from individual recruiters or third-party recruiting agencies in response to job postings. No fee will be paid to third parties who submit unsolicited candidates directly to our hiring managers or People team and any resumes submitted are deemed to be the property of MatX.

Apply

Vacancy posted 1 day ago

Similar jobs that could be interesting for youBased on the Runtime Engineer in Mountain View, CA vacancy

Senior Sensor & Cloud Runtime Security Engineer
A leading cybersecurity firm in Sunnyvale is seeking a Sr. Software Engineer for its Cloud Runtime Protection team. You will design and implement high-performance features to secure cloud-native and AI workloads. The position demands over 10 years of experience with C/...
Suggested
2 days per week
3 days per week
CrowdStrike Holdings, Inc.
Sunnyvale, CA
1 day ago
Sr. Software Engineer - Sensor - Cloud Runtime Protection (Hybrid)
$140k - $215k
...mission that matters? The future of cybersecurity starts with you. About the Role: This is a Software Development Engineer role on the Cloud Runtime Protection team that builds the core of the CrowdStrike Falcon platform: the lightweight sensor. You will be at the...
Suggested
Work experience placement
Work at office
Local area
2 days per week
3 days per week
CrowdStrike Holdings, Inc.
Sunnyvale, CA
17 hours ago
Runtime Systems Engineer for High-Performance AI
A leading AI software company in California is seeking a Software Engineer to develop and enhance runtime stacks for scalable ML applications. The role involves working on system software and collaborating with various teams to support next-generation high-performance compute...
Suggested
SambaNova
Palo Alto, CA
4 days ago
Staff Runtime Systems Engineer
...Staff Runtime Systems Engineer At d-Matrix, we are focused on unleashing the potential of generative AI to power the transformation of technology. We are at the forefront of software and hardware innovation, pushing the boundaries of what is possible. Our culture is...
Suggested
3 days per week
d-Matrix
Santa Clara, CA
1 day ago
Staff Data Science Engineer, Siri Runtime Systems and Interaction
$181.1k - $318.4k
...Staff Data Science Engineer, Siri Runtime Systems and Interaction Apple is where individual imaginations gather together, committing to the values that lead to great work. Every new product we build, service we create, or Apple Store experience we deliver is the result...
Suggested
Relocation
Apple
Cupertino, CA
17 hours ago
Software Development Engineer, Siri Runtime Systems and Interaction
$147.4k - $272.1k
...Software Development Engineer, Siri Runtime Systems and Interaction Apple is where individual imaginations gather together, committing to the values that lead to great work. Every new product we build, service we create, or Apple Store experience we deliver is the...
Relocation
Apple
Cupertino, CA
4 days ago
Siri Runtime Systems Engineer — On‑Device, Low-Latency
$147.4k - $272.1k
Apple Inc. is seeking a Software Development Engineer for Siri Runtime Systems and Interaction in Cupertino, California. This role involves designing and integrating next-generation Siri experiences, focusing on low-latency interactions and system performance optimization...
Apple Inc.
Cupertino, CA
2 days ago
Software Engineer, Siri Runtime Systems and Interaction
$147.4k - $272.1k
...Software Engineer, Siri Runtime Systems and Interaction Apple is where individual imaginations gather together, committing to the values that lead to great work. Every new product we build, service we create, or Apple Store experience we deliver is the result of us...
Relocation
Apple
Cupertino, CA
1 day ago
Runtime Engineer: High-Performance AI Compute
A technology company specializing in AI seeks a Runtime Engineer in Palo Alto, California. The role involves designing and implementing features for high-performance machine learning applications and supporting system software for next-generation silicon. Ideal candidates...
SambaNova
Palo Alto, CA
17 hours ago
Runtime Engineer
...ownership in perpetuity, so they can turn generative AI into one of their most valuable assets. The Opportunity The Runtime team at Sambanova is a seasoned engineering team with a proven track record of delivering cutting-edge system software solutions for AI and machine...
Full time
Temporary work
Local area
Flexible hours
SambaNova
Palo Alto, CA
4 days ago
Software Engineer, Metal Runtime (Core Systems)
$100k
...Software Engineer, Metal Runtime (Core Systems) Tenstorrent is leading the industry on cutting-edge AI technology, revolutionizing performance expectations, ease of use, and cost efficiency. With AI redefining the computing paradigm, solutions must evolve to unify innovations...
Tenstorrent
Santa Clara, CA
2 days ago
Staff Software Engineer , Anywhere Cloud - AI Systems & Runtimes
...Business Area: Engineering Seniority Level: Mid-Senior level Job Description: At Cloudera, we empower people to transform... ...). Deep understanding of LLM deployment challenges and runtimes (e.g., vLLM, ONNX, TorchServe, Triton). Familiarity with quantization...
Work from home
Flexible hours
Cloudera
Alviso, CA
17 hours ago
Runtime Engineer — Remote ML Inference & Systems
MatX is seeking a skilled software engineer to build custom silicon for AI language models in Mountain View, California. The role requires strong programming skills, with a focus on memory management and API design. Candidates will engage in building libraries for memory...
Remote job
MatX
Mountain View, CA
3 days ago
Senior AI Runtime & Systems Engineer (Embedded Linux)
d-Matrix, based in Santa Clara, CA, is seeking a Staff Runtime Systems Engineer to lead the development of runtime software for AI inference platforms. You'll be responsible for architecting and developing firmware for multiprocessor systems-on-chip, collaborating with...
3 days per week
d-Matrix
Santa Clara, CA
1 day ago
Principal Runtime Systems Engineer
...possibilities of AI. Location: Hybrid, working onsite at our Santa Clara, CA headquarters 3 days per week. The role: Runtime Systems Engineer, Principal What You Will Do: d-Matrix is developing an AI compute platform focusing on in-memory compute for AI...
3 days per week
d-Matrix
Santa Clara, CA
more than 2 months ago
Systems Engineer, Kernel
$165k - $242k
...Systems Engineer, Kernel Livingston, NJ / New York, NY / Sunnyvale, CA / Bellevue, WA CoreWeave is The Essential Cloud for AI™. Built... ...across: Virtualization (KubeVirt, QEMU, vFIO) Container runtimes (containerd, nydus, kubelet) HPC/AI workloads (CUDA,...
Permanent employment
Temporary work
Casual work
Work at office
Remote work
Flexible hours
CoreWeave
Sunnyvale, CA
2 days ago
Engineering Manager, Inference ML Runtime
...Engineering Manager, Inference ML Runtime Sunnyvale CA or Toronto Canada Cerebras Systems builds the world's largest AI chip, 56 times larger than GPUs. Our novel wafer-scale architecture provides the AI compute power of dozens of GPUs on a single chip, with the...
CEREBRAS SYSTEMS INC.
Sunnyvale, CA
4 days ago
Senior Distributed Systems Engineer
$200k - $400k
...Institute Of Foundation Models Engineer The Institute of Foundation Models (IFM) designs and operates ultra-scale GPU supercomputing... ...co-designed across model architecture, communication systems, runtime, and hardware topology. This role sits at the core of that effort...
Visa sponsorship
Institute of Foundation Models
Sunnyvale, CA
1 day ago
Software Engineer, Agentic Runtime
$170k - $265k
...employee, in every company. About the Role: The Agents Runtime team builds the low-latency, reliable, and secure foundation... ...impactful roadmap investments. You are: ~3+ years of software engineering experience building production distributed systems or cloud-...
Home office
Flexible hours
3 days per week
Glean.info
Mountain View, CA
4 days ago
Software Engineer, Agentic Runtime Glean - Mountain View, CA, US
$170k - $265k
...Software Engineer, Agentic Runtime Glean is the Work AI platform that helps everyone work smarter with AI. What began as the industry's most advanced enterprise search has evolved into a full-scale Work AI ecosystem, powering intelligent Search, an AI Assistant, and...
Home office
Flexible hours
3 days per week
Softbank Investment Advisers
Mountain View, CA
2 days ago
Runtime Engineer
...system — from a laptop to a supercomputer — feel like one seamless engine. Developers can write once, run anywhere, and get state-of-the-... ...in the process, join us. About the Role We're looking for a Runtime Engineer to design and build the multi-target runtime that sits...
Lemurian Labs
Santa Clara, CA
3 days ago
Senior Software Engineer - Operating Systems
...Software Engineer The NextGen OS team is focused on building Applied Intuition's operating system (OS) stack for future vehicles and... ...implement core OS components including: kernel, system services, runtime, application framework, BSPs, and hardware abstraction layers....
For contractors
For subcontractor
Applied Intuition
Sunnyvale, CA
1 day ago
Software Systems Engineer
$141.3k - $226k
...Job Description: Broadcom is looking for a Software Systems Engineer (P5) to join VMware Cloud Foundation's (VCF) AI and Advanced Services... ...of Kubernetes based Platform Services: Model Gallery, Model Runtime and ML API Gateway, Data Indexing and Retrieval, and Agent...
Full time
Work at office
Local area
Broadcom Corporation
Palo Alto, CA
4 days ago
Software Engineer, Agentic Systems - Moveworks
...ServiceNow's leading workflow automation with Moveworks' Reasoning Engine and natural language capabilities, we deliver the AI platform... ...work better for everyone. The Role We're building the runtime infrastructure that powers Moveworks' AI agents - the systems that...
Work at office
Remote work
Flexible hours
ServiceNow
Mountain View, CA
1 day ago
Siri Runtime Systems Engineer - Next-Gen Voice Interaction
$147.4k - $272.1k
A leading technology company in Cupertino is seeking a Software Engineer to develop the next generation of Siri's natural interaction platforms. You will engage in hands-on work to improve Siri capabilities, requiring strong expertise in Swift or Objective-C and solid...
Apple Inc.
Cupertino, CA
2 days ago
Software Engineer Agentic Systems Moveworks
...Job Description The Role We're building the runtime infrastructure that powers Moveworks' AI agents — the systems that orchestrate... ...time. This is not an ML role. This is a distributed systems engineering role at the heart of the agentic AI wave. Our AI agents...
Work at office
Remote work
Flexible hours
ServiceNow
Mountain View, CA
2 days ago
Senior System Performance Engineer
$128.7k - $261.3k
...Job Description Role As a ? Senior System Performance Engineer ?on GM's AV System Performance Team, you will design, build, and... ...and prototype new tools, techniques, and technologies to improve runtime performance and developer workflows. Design, implement,...
Local area
Remote work
Work from home
Relocation package
Flexible hours
General Motors
Sunnyvale, CA
3 days ago
Principal Machine Learning Engineer, AI Systems
$200k - $340k
...Distinguished Machine Learning Engineer, AI Systems Palo Alto, CA HP IQ is HP's new AI innovation lab. Combining startup agility... ...in support of HP's AI roadmap, owning decisions across models, runtimes, inference engines, and optimization. Lead on device AI strategy...
Full time
Temporary work
Local area
Flexible hours
HP Development Company, L.P.
Palo Alto, CA
2 days ago
Staff Inference ML Runtime Engineer
...agentic computation. About The Role The Inference ML Engineering team at Cerebras Systems is dedicated to enabling our fast generative... ...solution through simple APIs powered by a distributed runtime that runs on large clusters of our own hardware. Our mission is...
CEREBRAS SYSTEMS INC.
Sunnyvale, CA
17 hours ago
High Energy Laser Beam Control Electro-Optical Engineer - V (Sr. Stf.)
$139.6k - $246.1k
...development of high energy laser beam control systems or electro-optical systems of related complexity. • Experience with developing engineering documents, e.g. specifications, vendor RFQs, proposal technical volumes, drawings, briefings, assembly work instructions,...
Full time
Contract work
Temporary work
Work experience placement
Work at office
Remote work
Relocation
Flexible hours
Shift work
Lockheed Martin Corporation
Sunnyvale, CA
3 days ago

Do you want to receive more vacancies?

Subscribe and receive similar vacancies to Runtime Engineer. Be the first to apply!