Runtime Engineer
Lemurian Labs
About Us

At Lemurian Labs, we're reimagining the foundations of computing to make AI accessible to everyone. Our mission is to remove the limits of scale, hardware, and cost that hold back innovation, so the people solving humanity's hardest problems can move faster.

We're building a new kind of software stack: a hardware-agnostic platform that makes every system, from a laptop to a supercomputer, feel like one seamless engine. Developers can write once, run anywhere, and get state-of-the-art performance across any chip, any cloud, at any scale. It's a complete rethink of how software and hardware interact, designed for the era beyond Moore's Law.

We're not looking for the comfortable or the conventional; we're looking for the bold. The engineers who crave frontier problems, who want to bend the limits of what's possible, who see infrastructure not as a constraint but as a canvas. If you want to build the foundation for the next era of AI and change what humanity can achieve in the process, join us.

About the Role

We're looking for a Runtime Engineer to design and build the multi-target runtime that sits at the heart of our AI compiler stack. This is a systems-level role where you'll take the output of our optimizing compiler and make it execute efficiently, correctly, and at scale across a diverse landscape of hardware targets. You'll work on low-level parallelization, kernel scheduling, and performance analysis, and collaborate closely with our compiler and product teams to push the boundaries of what's possible on modern AI hardware.

What You'll Do

- Design, develop, maintain, and improve our multi-target runtime.
- Apply the latest techniques in parallelization and partitioning to automate kernel generation and exploit highly optimized execution paths.
- Rapidly prototype new runtime ideas and explore them in a data-driven way.
- Benchmark and analyze the outputs produced by our optimizing compiler on target hardware.
- Build tools to collect and analyze performance bottlenecks.
- Work closely with our product team to understand the evolving needs of ML engineers and drive improvements in runtime architecture.

Requirements

Essential Skills and Experience

- BS degree in Computer Science, Computer Engineering, or equivalent practical experience.
- 4+ years of experience working with compilers or runtime systems.
- Deep understanding of asynchronous and concurrent programming.
- 4+ years of experience with C/C++ (C++14 or newer).
- Understanding of hardware architecture: vector vs. scalar registers and instructions, memory hierarchies.
- Knowledge of operating system kernel development or hypervisor development.

Preferred Skills and Experience

- Master's or PhD in Computer Science, Computer Engineering, or equivalent.
- Experience developing or maintaining GPU compute libraries such as CUDA or ROCm.
- Experience with GPU programming and optimization.
- Background in high-performance computing (HPC).
- Knowledge of deep learning frameworks such as PyTorch, JAX, or Triton.
- Experience programming large compute clusters.

Why Join Lemurian Labs

- Build the runtime that makes next-generation AI infrastructure actually go fast.
- Work across the full stack, from hardware intrinsics to compiler output to distributed execution.
- Join a team that approaches infrastructure as a canvas, not a constraint.
- Competitive compensation including equity, medical/dental/vision, retirement savings, and wellness benefits.

Lemurian Labs is an equal opportunity employer. We celebrate diversity and are committed to creating an inclusive environment for all employees, regardless of gender identity, race, ethnicity, sexual orientation, disability status, age, or background.

Compensation depends on experience and geographic location and will be narrowed during the interview process. Additional benefits include equity, company bonus opportunities, medical, dental, and vision coverage, a retirement savings plan, and supplemental wellness benefits.
Lemurian Labs in Santa Clara seeks a Runtime Engineer to design and develop a multi-target runtime for their AI compiler stack. This role involves low-level parallelization and collaboration with compiler and product teams to enhance performance across diverse hardware...



