Runtime Engineer
$120k - $250kMatX
Runtime Engineer
Mountain View, CA
What MatX Is Building
MatX is building custom silicon for large-language-model inference and training, with HW/SW co-design across ISA, RTL, simulator, compiler, and kernels so each layer benefits from the others. The runtime owns the host-side stack and the contracts that bind those teams together.
What You'll Do Here
- Build the host-side interface library — device memory management, DMA, streams and events, sync primitives — that every compiler-emitted program runs on top of
- Own and extend the executable format: the compiler→runtime contract, its versioning, the weight and quantization layouts that let compiler and runtime evolve independently
- Design the custom-kernel ABI — calling convention, sync semantics, lifecycle — and the host-side marshaling layer (DLPack, the buffer protocol, numpy) that gets Python tensors to the device
- Build Python bindings via PyO3, with a C-ABI shim as the alternative integration path for downstream consumers
- Build the LLM inference serving stack — paged KV cache, continuous batching, request scheduling, token streaming — and the cluster orchestration primitives underneath it
- Bring up interconnect topology from the host and own the failure-detection and clean-teardown path for stop-restructure-resume recovery across racks
- Design what the chip exposes to host-side profilers and debuggers — perf counters, traces, and the Python surfaces ML engineers actually use — and hit measurable performance targets on runtime overhead and serving throughput
Who You Are
- Strong experience in a systems programming language — Rust, C, C++, or Go — including memory management, allocator design, and FFI/ABI work
- Have built Python interop layers in production (PyO3, ctypes, pybind11, or equivalent C-ABI bridging)
- Have designed and maintained API or ABI contracts between teams — versioning, evolution, breaking-change discipline — not just consumed someone else's
- Hands-on with at least one accelerator programming model (CUDA, ROCm, oneAPI Level Zero, TPU, or comparable) — enough to reason about device memory, async execution, and kernel launch
- ML-systems literate — comfortable with the training and inference loop, what collectives do, what a tensor layout is. Research depth not required.
Bonus Points If You Have
- LLM inference internals — vLLM, TensorRT-LLM, or SGLang (paged attention, scheduler design)
- Rust at depth, including proc macros, unsafe with soundness reasoning, and complex lifetime/trait work
- Custom allocator design (slab, paged, arena) or other low-level memory work
- ML framework integration experience (PyTorch custom backends, JAX/XLA, ONNX runtime)
- Profiler or tracing infrastructure work (perfetto, Nsight, or a custom stack)
- Driver-adjacent or kernel-bypass work, or prior new-silicon bring-up
Compensation
The US base salary for this full-time position is determined based on a variety of factors including role, experience, location, job related skills, and relevant education and training. Career length is only a guideline for compensation.
- Early Career - $120,000 - $250,000 + equity
- Mid Career - $175,000 - $362,500 + equity
- Senior Career - $250,000 - $475,000 + equity
What We Offer
- A Stake In Our Success A flexible cash equity compensation mix that fits your needs
- Health & Wellness Company subsidized Health, Dental, Vision, and Life insurance; Pre-tax Health Savings Accounts with generous company contribution (even if you don't)
- Time To Recharge 4 weeks paid time off (accrued), 12 company holidays, and 3 weeks remote/flexible work per year
- Support To Parents Up to 12 weeks of paid parental leave, regardless of your path to parenthood
- Learning & Development $1,500 yearly towards your professional development e.g. conferences, courses, and other learning opportunities
- Team Connection Team lunches, quarterly off-sites, and regular town halls
- Financial Wellbeing 401K and/or Roth IRA, with 5% company contribution, even if you don't!
- Flexible Spending Accounts Pre-tax spend accounts for medical, dental/vision, dependent care, parking, and transit expenses
- Commute On Us For those commuting up to 1 hour, put your rideshare cost on our company card and reclaim the drive-time to get work done!
- MatX E[x]tras $50 per month to use on the perks you care about most
- Remote Perks We work remotely Monday & Friday, supported by home-tech setup, and remote wifi expense reimbursement
As part of our dedication to the diversity of our team and our focus on creating an inviting and inclusive work experience, MatX is committed to a policy of Equal Employment Opportunity and will not discriminate against an applicant or employee on the basis of race, color, religion, creed, national origin or ancestry, sex, gender, gender identity, gender expression, sexual orientation, age, physical or mental disability, medical condition, marital/domestic partner status, military and veteran status, genetic information or any other legally recognized protected basis under federal, state or local laws, regulations or ordinances.
This position requires access to information that is subject to U.S. export controls. This offer of employment is contingent upon the applicants capacity to perform job functions in compliance with U.S. export control laws without obtaining a license from U.S. export control authorities.
MatX does not accept unsolicited resumes from individual recruiters or third-party recruiting agencies in response to job postings. No fee will be paid to third parties who submit unsolicited candidates directly to our hiring managers or People team and any resumes submitted are deemed to be the property of MatX.
- A leading cybersecurity firm in Sunnyvale is seeking a Sr. Software Engineer for its Cloud Runtime Protection team. You will design and implement high-performance features to secure cloud-native and AI workloads. The position demands over 10 years of experience with C/...Suggested2 days per week3 days per week
$140k - $215k
...mission that matters? The future of cybersecurity starts with you. About the Role: This is a Software Development Engineer role on the Cloud Runtime Protection team that builds the core of the CrowdStrike Falcon platform: the lightweight sensor. You will be at the...SuggestedWork experience placementWork at officeLocal area2 days per week3 days per week- A leading AI software company in California is seeking a Software Engineer to develop and enhance runtime stacks for scalable ML applications. The role involves working on system software and collaborating with various teams to support next-generation high-performance compute...Suggested
- ...Staff Runtime Systems Engineer At d-Matrix, we are focused on unleashing the potential of generative AI to power the transformation of technology. We are at the forefront of software and hardware innovation, pushing the boundaries of what is possible. Our culture is...Suggested3 days per week
$181.1k - $318.4k
...Staff Data Science Engineer, Siri Runtime Systems and Interaction Apple is where individual imaginations gather together, committing to the values that lead to great work. Every new product we build, service we create, or Apple Store experience we deliver is the result...SuggestedRelocation$147.4k - $272.1k
...Software Development Engineer, Siri Runtime Systems and Interaction Apple is where individual imaginations gather together, committing to the values that lead to great work. Every new product we build, service we create, or Apple Store experience we deliver is the...Relocation$147.4k - $272.1k
Apple Inc. is seeking a Software Development Engineer for Siri Runtime Systems and Interaction in Cupertino, California. This role involves designing and integrating next-generation Siri experiences, focusing on low-latency interactions and system performance optimization...$147.4k - $272.1k
...Software Engineer, Siri Runtime Systems and Interaction Apple is where individual imaginations gather together, committing to the values that lead to great work. Every new product we build, service we create, or Apple Store experience we deliver is the result of us...Relocation- A technology company specializing in AI seeks a Runtime Engineer in Palo Alto, California. The role involves designing and implementing features for high-performance machine learning applications and supporting system software for next-generation silicon. Ideal candidates...
- ...ownership in perpetuity, so they can turn generative AI into one of their most valuable assets. The Opportunity The Runtime team at Sambanova is a seasoned engineering team with a proven track record of delivering cutting-edge system software solutions for AI and machine...Full timeTemporary workLocal areaFlexible hours
$100k
...Software Engineer, Metal Runtime (Core Systems) Tenstorrent is leading the industry on cutting-edge AI technology, revolutionizing performance expectations, ease of use, and cost efficiency. With AI redefining the computing paradigm, solutions must evolve to unify innovations...- ...Business Area: Engineering Seniority Level: Mid-Senior level Job Description: At Cloudera, we empower people to transform... ...). Deep understanding of LLM deployment challenges and runtimes (e.g., vLLM, ONNX, TorchServe, Triton). Familiarity with quantization...Work from homeFlexible hours
- MatX is seeking a skilled software engineer to build custom silicon for AI language models in Mountain View, California. The role requires strong programming skills, with a focus on memory management and API design. Candidates will engage in building libraries for memory...Remote job
- d-Matrix, based in Santa Clara, CA, is seeking a Staff Runtime Systems Engineer to lead the development of runtime software for AI inference platforms. You'll be responsible for architecting and developing firmware for multiprocessor systems-on-chip, collaborating with...3 days per week
- ...possibilities of AI. Location: Hybrid, working onsite at our Santa Clara, CA headquarters 3 days per week. The role: Runtime Systems Engineer, Principal What You Will Do: d-Matrix is developing an AI compute platform focusing on in-memory compute for AI...3 days per week
$165k - $242k
...Systems Engineer, Kernel Livingston, NJ / New York, NY / Sunnyvale, CA / Bellevue, WA CoreWeave is The Essential Cloud for AI™. Built... ...across: Virtualization (KubeVirt, QEMU, vFIO) Container runtimes (containerd, nydus, kubelet) HPC/AI workloads (CUDA,...Permanent employmentTemporary workCasual workWork at officeRemote workFlexible hours- ...Engineering Manager, Inference ML Runtime Sunnyvale CA or Toronto Canada Cerebras Systems builds the world's largest AI chip, 56 times larger than GPUs. Our novel wafer-scale architecture provides the AI compute power of dozens of GPUs on a single chip, with the...
$200k - $400k
...Institute Of Foundation Models Engineer The Institute of Foundation Models (IFM) designs and operates ultra-scale GPU supercomputing... ...co-designed across model architecture, communication systems, runtime, and hardware topology. This role sits at the core of that effort...Visa sponsorship$170k - $265k
...employee, in every company. About the Role: The Agents Runtime team builds the low-latency, reliable, and secure foundation... ...impactful roadmap investments. You are: ~3+ years of software engineering experience building production distributed systems or cloud-...Home officeFlexible hours3 days per week$170k - $265k
...Software Engineer, Agentic Runtime Glean is the Work AI platform that helps everyone work smarter with AI. What began as the industry's most advanced enterprise search has evolved into a full-scale Work AI ecosystem, powering intelligent Search, an AI Assistant, and...Home officeFlexible hours3 days per week- ...system — from a laptop to a supercomputer — feel like one seamless engine. Developers can write once, run anywhere, and get state-of-the-... ...in the process, join us. About the Role We're looking for a Runtime Engineer to design and build the multi-target runtime that sits...
- ...Software Engineer The NextGen OS team is focused on building Applied Intuition's operating system (OS) stack for future vehicles and... ...implement core OS components including: kernel, system services, runtime, application framework, BSPs, and hardware abstraction layers....For contractorsFor subcontractor
$141.3k - $226k
...Job Description: Broadcom is looking for a Software Systems Engineer (P5) to join VMware Cloud Foundation's (VCF) AI and Advanced Services... ...of Kubernetes based Platform Services: Model Gallery, Model Runtime and ML API Gateway, Data Indexing and Retrieval, and Agent...Full timeWork at officeLocal area- ...ServiceNow's leading workflow automation with Moveworks' Reasoning Engine and natural language capabilities, we deliver the AI platform... ...work better for everyone. The Role We're building the runtime infrastructure that powers Moveworks' AI agents - the systems that...Work at officeRemote workFlexible hours
$147.4k - $272.1k
A leading technology company in Cupertino is seeking a Software Engineer to develop the next generation of Siri's natural interaction platforms. You will engage in hands-on work to improve Siri capabilities, requiring strong expertise in Swift or Objective-C and solid...- ...Job Description The Role We're building the runtime infrastructure that powers Moveworks' AI agents — the systems that orchestrate... ...time. This is not an ML role. This is a distributed systems engineering role at the heart of the agentic AI wave. Our AI agents...Work at officeRemote workFlexible hours
$128.7k - $261.3k
...Job Description Role As a ? Senior System Performance Engineer ?on GM's AV System Performance Team, you will design, build, and... ...and prototype new tools, techniques, and technologies to improve runtime performance and developer workflows. Design, implement,...Local areaRemote workWork from homeRelocation packageFlexible hours$200k - $340k
...Distinguished Machine Learning Engineer, AI Systems Palo Alto, CA HP IQ is HP's new AI innovation lab. Combining startup agility... ...in support of HP's AI roadmap, owning decisions across models, runtimes, inference engines, and optimization. Lead on device AI strategy...Full timeTemporary workLocal areaFlexible hours- ...agentic computation. About The Role The Inference ML Engineering team at Cerebras Systems is dedicated to enabling our fast generative... ...solution through simple APIs powered by a distributed runtime that runs on large clusters of our own hardware. Our mission is...
$139.6k - $246.1k
...development of high energy laser beam control systems or electro-optical systems of related complexity. • Experience with developing engineering documents, e.g. specifications, vendor RFQs, proposal technical volumes, drawings, briefings, assembly work instructions,...Full timeContract workTemporary workWork experience placementWork at officeRemote workRelocationFlexible hoursShift work
Do you want to receive more vacancies?
Subscribe and receive similar vacancies to Runtime Engineer. Be the first to apply!

