Staff Engineer, Frontier AI Inference

$350k

Mirendil

Mirendil in San Francisco is searching for an engineer to develop and optimize inference systems for cutting-edge AI models. You will handle the complete inference stack, enhancing performance and reliability. The role involves partnering with teams to deploy new architectures and implement optimizations such as quantization and caching strategies. With a focus on innovation, you will contribute to groundbreaking AI research. A competitive base salary of $350,000–$500,000 USD along with equity and benefits is offered.

Vacancy posted 2 days ago

Similar jobs that could be interesting for you
Based on the Staff Engineer, Frontier AI Inference in San Francisco, CA vacancy

Staff ML Systems Engineer — Frontier AI Infra
...company is seeking a Member of Technical Staff to focus on cutting-edge AI research and development. The role... ...building and scaling training and inference infrastructure, designing ML kernels... ...an exciting opportunity in a frontier AI research environment with a diverse...
Suggested
Mirendil
San Francisco, CA
2 days ago
Staff Engineer, GPU AI Inference & RL Infrastructure
B Capital is seeking a skilled engineer for GPU infrastructure in San Francisco. This role... ...operating high-performance systems for model inference, synthetic data generation, and... ...and a passion for working in cutting-edge AI. Benefits include top-tier compensation,...
Suggested
B Capital
San Francisco, CA
3 days ago
Staff/Senior Inference Engineer for Scalable AI
$300k
United States Digital Space LLC is seeking a skilled software engineer to join the Inference team in San Francisco. You will be responsible for building and maintaining systems that serve Claude to millions of users. The role emphasizes maximizing compute efficiency and...
Suggested
Work at office
United States Digital Space LLC
San Francisco, CA
5 days ago
Staff Engineer, Scalable AI Inference Infrastructure
$200k - $400k
A leading AI technology company located in San Francisco is seeking an infrastructure engineer to build distributed systems for their AI inference engine. The role involves designing systems that ensure minimal latency and maximum reliability. Candidates should have a...
Suggested
Visa sponsorship
Inferact
San Francisco, CA
3 days ago
Staff ML Engineer, Frontier AI
$250k - $350k
...never set out to be just another scribe. We’re building the AI intelligence platform that restores humanity to healthcare... ...Perkins — and we’re just getting started. The Role: As a Staff ML Engineer on the Frontier AI team at Ambience, you'll own the hardest model quality...
Suggested
Work at office
Immediate start
Remote work
Flexible hours
3 days per week
Ambience Healthcare, Inc.
San Francisco, CA
3 days ago
Staff GenAI Inference Engineer: Optimize LLM Serving Latency
$190.9k - $232.8k
A leading data and AI company is seeking a Staff Software Engineer for GenAI inference to lead the architecture and optimization of the inference engine. The role requires expertise in CUDA, GPU programming, and distributed systems design. Ideal candidates will have a strong...
Menlo Ventures
San Francisco, CA
4 days ago
Staff Engineer, AI Inference & Distributed Systems
Sail Research in San Francisco is seeking a talented engineer to design and implement robust systems that ensure fast and cost-efficient AI inference at global scale. You will be responsible for building high-performance schedulers and optimizing global routing while focusing...
Sail Research
San Francisco, CA
2 days ago
Member of Technical Staff - Edge Inference Engineer
About Liquid AI: Spun out of MIT CSAIL, we build general-purpose AI systems... ...us get there. The Opportunity: Our Edge Inference team compiles Liquid Foundation Models into... ...Desired Experience: Embedded software engineering experience or work on resource-constrained...
Liquid AI
San Francisco, CA
4 days ago
Staff Engineer, AI Inference Infrastructure & Orchestration
Acceler8 Talent is seeking an early engineer to join their team focused on developing compiler and runtime infrastructure for next-generation AI systems. This role emphasizes ownership, collaboration with deeply technical peers, and contribution to efficient workload execution...
Acceler8 Talent
San Francisco, CA
1 day ago
Staff Infrastructure Engineer, Cluster Infrastructure
$320k - $405k
...interpretable, and steerable AI systems. We want AI to... ...committed researchers, engineers, policy experts, and... ...infrastructure and frontier capabilities can go hand... ...response to failure. As a Staff engineer on this team,... ...and internal research, inference and product teams to...
Visa sponsorship
United States Digital Space LLC
San Francisco, CA
1 day ago
Staff ML Engineer, Frontier AI — Healthcare AI Innovator
A healthcare technology company in San Francisco is seeking a Staff ML Engineer to tackle complex model quality challenges in clinical AI products. The ideal candidate has over 5 years of experience in ML engineering, deep learning expertise, and a strong commitment to...
Ambience Healthcare, Inc.
San Francisco, CA
3 days ago
Staff GenAI Kernel & Performance Engineer
A leading data and AI company in San Francisco seeks a Staff Software Engineer to lead kernel-level performance engineering for GenAI workloads. The role... ...chance to work with a talented team focused on pushing the frontier of inference performance.
Databricks
San Francisco, CA
1 day ago
Staff Research Engineer, Model Efficiency
...training and deploying frontier models for developers and... ...who are building AI systems to power magical... ...a team of researchers, engineers, designers, and more, who... ...AI systems can do — but inference is still the bottleneck... ...preferred locations. As a Staff Research Engineer, you...
Full time
Work at office
Remote work
Flexible hours
Cohere
San Francisco, CA
5 days ago
Staff ML Engineer, Frontier AI — Healthcare AI Leader
A leading healthcare AI company is seeking a Staff ML Engineer to address complex model quality issues in clinical AI products. The role requires deep expertise in reinforcement learning and the ability to drive research from inception to production. This position is based...
Ambience Healthcare
San Francisco, CA
3 days ago
Staff ML Inference Engineer — Model Efficiency (Remote)
Jaide Health is seeking an engineer for their Model Efficiency team in San Francisco. The role focuses on building reliable ML systems... ...plus strong skills in C++ or Python and insights into the LLM inference ecosystem. A commitment to diversity and inclusive work...
Remote job
Jaide Health
San Francisco, CA
5 days ago
Senior/Staff Security Engineer
...justice gap using technology and AI. We empower personal injury... ...impact. Learn more at Life as an Engineer at EvenUp EvenUp’s security... ...claimed to date. As a Senior/Staff Security Engineer at EvenUp, you... ..., model poisoning, membership inference, adversarial perturbation,...
Temporary work
Work at office
Local area
Home office
Flexible hours
3 days per week
EvenUp Inc.
San Francisco, CA
3 days ago
Staff Engineer, vLLM Inference & DevRel
$200k - $400k
Inferact is looking for a Developer Relations Engineer in San Francisco, California, to help developers utilize vLLM for AI inference. This unique role involves teaching technical concepts, creating educational content, and engaging with the AI infrastructure community....
Remote work
Inferact
San Francisco, CA
1 day ago
Staff Engineer, Engineering Productivity & AI Quality
$253k - $308k
Harper is an AI-native commercial insurance company, based in San Francisco and built from scratch... ...insurance. They join because they want to be on the frontier of the AI transition, doing the most consequential work...
Part time
Work at office
Relocation
Harper Group
San Francisco, CA
4 days ago
Staff Engineer, Mid-Training Infra for Large-Scale AI
A cutting-edge AI research firm in San Francisco is seeking talent to build and optimize GPU infrastructure for large-scale model inference and training workloads. The ideal candidate will have hands-on experience with GPU systems and optimization techniques, actively...
Reflection
San Francisco, CA
5 days ago
Staff Engineer - LLM Inference & Serving at Scale
$150k - $300k
Prime Intellect is looking for a skilled ML Systems Engineer to build and optimize LLM serving infrastructure and inference systems. This hybrid role involves contributing... ...platforms, and a desire to work on cutting-edge AI infrastructure. They offer a cash compensation...
Relocation package
Prime Intellect
San Francisco, CA
1 day ago
Senior Staff Engineer Inference Runtime — Flexible Hours
jobr.pro is seeking a Staff Engineer to lead technical direction for Inference Runtime. This senior IC role encompasses broad ownership of the runtime’s architecture and validation systems while collaborating across teams to drive performance and scalability. The ideal...
Flexible hours
jobr.pro
San Francisco, CA
1 day ago
Staff Infra Engineer - Global GPU ML Inference
...Token Company in San Francisco is seeking a Member of Technical Staff for their infrastructure team. In this role, you will own the... ...compression API and build global low-latency, high-throughput GPU ML inference infrastructure. The ideal candidate will have solid experience...
Visa sponsorship
The Token Company
San Francisco, CA
1 day ago
Member of Technical Staff, Hardware, Compiler Engineer
$200k - $420k
...mission is to create personal AI owned and shaped by each individual... ...: personal hardware for local inference, custom training infrastructure, next‑generation UIs, and frontier deep learning research. Who we are: We are scientists, engineers, and builders from the industry...
Local area
Visa sponsorship
Work visa
Relocation package
Flexible hours
River AI
San Francisco, CA
2 days ago
AI Inference Engineer (Member of Technical Staff)
...laid out for you 3+ years of professional software engineering experience with meaningful work on ML inference or high-performance systems Familiarity with at least... ...before users do. Respond to and learn from production incidents
Perplexity AI
San Francisco, CA
5 days ago
Staff Machine Learning Engineer
$273k - $345k
...re changing that. Atoms builds Physical AI— real-world robots for the industries that... ...they work at scale. We are roboticists, engineers, operators, and builders. We believe the... ...vehicle edge hardware. Profile real-time inference pipelines to identify and eliminate CPU,...
Full time
Internship
Work at office
Flexible hours
ATOMS Careers page
San Francisco, CA
1 day ago
Staff Database Engineer
$190.72k - $290k
...how legal and professional services operate. By combining frontier agentic AI, an enterprise-grade platform, and deep domain expertise... ...we're just getting started. Role Overview: As a Staff Database Engineer on the Engineering team at Harvey, you will define how...
Relocation package
Harvey
San Francisco, CA
2 days ago
Member of Technical Staff, DX & Data Tooling Engineer
$200k
...alone. Our approach combines frontier-scale pre‑training, domain‑specific... ...RL, ultra‑long context, and inference‑time compute to achieve this... ...organization, packaging, and engineering best practices What we’re... ...Not Required Deep ML/AI expertise (this is a tooling...
Work at office
Relocation
Visa sponsorship
Magic Inc
San Francisco, CA
3 days ago
Staff Engineer - ML Inference & Model Efficiency
A leading AI research firm in San Francisco is seeking a Member of Technical Staff specialized in Model Efficiency. In this role, you will enhance LLM inference systems by tackling performance issues and collaborating with cross-functional teams. Ideal candidates have...
Remote work
Cohere
San Francisco, CA
2 days ago
Staff Full Stack Engineer
...About David AI: David AI is the first audio data research company... ...by a team of former Scale AI engineers and operators. In less than a... ...us on our mission to push the frontier of audio AI. About our Engineering... ...models. About this role: As a Staff Full Stack Engineer at David...
Work at office
David AI
San Francisco, CA
18 hours ago
Staff IT Security Engineer
$150k - $226k
Amplitude is seeking an experienced Staff IT Security Engineer to design and build controls that define how Amplitude leverages frontier AI tooling at scale. This is a high‑scope, hands‑on position focused entirely on corporate and enterprise security, specifically tackling...
Work at office
Home office
Flexible hours
Amplitude
San Francisco, CA
1 day ago
