Runtime Engineer

$120k - $250k

MatX

Mountain View, CA

What MatX Is Building

MatX is building custom silicon for large-language-model inference and training, with HW/SW co-design across ISA, RTL, simulator, compiler, and kernels so each layer benefits from the others. The runtime owns the host-side stack and the contracts that bind those teams together.

What You'll Do Here
  • Build the host-side interface library — device memory management, DMA, streams and events, sync primitives — that every compiler-emitted program runs on top of
  • Own and extend the executable format: the compiler→runtime contract, its versioning, the weight and quantization layouts that let compiler and runtime evolve independently
  • Design the custom-kernel ABI — calling convention, sync semantics, lifecycle — and the host-side marshaling layer (DLPack, the buffer protocol, numpy) that gets Python tensors to the device
  • Build Python bindings via PyO3, with a C-ABI shim as the alternative integration path for downstream consumers
  • Build the LLM inference serving stack — paged KV cache, continuous batching, request scheduling, token streaming — and the cluster orchestration primitives underneath it
  • Bring up interconnect topology from the host and own the failure-detection and clean-teardown path for stop-restructure-resume recovery across racks
  • Design what the chip exposes to host-side profilers and debuggers — perf counters, traces, and the Python surfaces ML engineers actually use — and hit measurable performance targets on runtime overhead and serving throughput
Who You Are
  • Strong experience in a systems programming language — Rust, C, C++, or Go — including memory management, allocator design, and FFI/ABI work
  • Have built Python interop layers in production (PyO3, ctypes, pybind11, or equivalent C-ABI bridging)
  • Have designed and maintained API or ABI contracts between teams — versioning, evolution, breaking-change discipline — not just consumed someone else's
  • Hands-on with at least one accelerator programming model (CUDA, ROCm, oneAPI Level Zero, TPU, or comparable) — enough to reason about device memory, async execution, and kernel launch
  • ML-systems literate — comfortable with the training and inference loop, what collectives do, what a tensor layout is. Research depth not required.
Bonus Points If You Have
  • LLM inference internals — vLLM, TensorRT-LLM, or SGLang (paged attention, scheduler design)
  • Rust at depth, including proc macros, unsafe with soundness reasoning, and complex lifetime/trait work
  • Custom allocator design (slab, paged, arena) or other low-level memory work
  • ML framework integration experience (PyTorch custom backends, JAX/XLA, ONNX runtime)
  • Profiler or tracing infrastructure work (perfetto, Nsight, or a custom stack)
  • Driver-adjacent or kernel-bypass work, or prior new-silicon bring-up
Compensation

The US base salary for this full-time position is determined based on a variety of factors, including role, experience, location, job-related skills, and relevant education and training. Career length is only a guideline for compensation.

  • Early Career - $120,000 - $250,000 + equity
  • Mid Career - $175,000 - $362,500 + equity
  • Senior Career - $250,000 - $475,000 + equity
What We Offer
  • A Stake In Our Success: A flexible cash and equity compensation mix that fits your needs
  • Health & Wellness: Company-subsidized Health, Dental, Vision, and Life insurance; Pre-tax Health Savings Accounts with generous company contribution (even if you don't)
  • Time To Recharge: 4 weeks paid time off (accrued), 12 company holidays, and 3 weeks remote/flexible work per year
  • Support To Parents: Up to 12 weeks of paid parental leave, regardless of your path to parenthood
  • Learning & Development: $1,500 yearly towards your professional development, e.g. conferences, courses, and other learning opportunities
  • Team Connection: Team lunches, quarterly off-sites, and regular town halls
  • Financial Wellbeing: 401K and/or Roth IRA, with 5% company contribution, even if you don't!
  • Flexible Spending Accounts: Pre-tax spend accounts for medical, dental/vision, dependent care, parking, and transit expenses
  • Commute On Us: For those commuting up to 1 hour, put your rideshare cost on our company card and reclaim the drive time to get work done!
  • MatX E[x]tras: $50 per month to use on the perks you care about most
  • Remote Perks: We work remotely Monday & Friday, supported by a home-tech setup and remote Wi-Fi expense reimbursement

As part of our dedication to the diversity of our team and our focus on creating an inviting and inclusive work experience, MatX is committed to a policy of Equal Employment Opportunity and will not discriminate against an applicant or employee on the basis of race, color, religion, creed, national origin or ancestry, sex, gender, gender identity, gender expression, sexual orientation, age, physical or mental disability, medical condition, marital/domestic partner status, military and veteran status, genetic information or any other legally recognized protected basis under federal, state or local laws, regulations or ordinances.

This position requires access to information that is subject to U.S. export controls. This offer of employment is contingent upon the applicant's capacity to perform job functions in compliance with U.S. export control laws without obtaining a license from U.S. export control authorities.

MatX does not accept unsolicited resumes from individual recruiters or third-party recruiting agencies in response to job postings. No fee will be paid to third parties who submit unsolicited candidates directly to our hiring managers or People team and any resumes submitted are deemed to be the property of MatX.

Vacancy posted 1 day ago