Staff Engineer, Frontier AI Inference
$350kMirendil
Mirendil in San Francisco is searching for an engineer to develop and optimize inference systems for cutting-edge AI models. You will handle the complete inference stack, enhancing performance and reliability. The role involves partnering with teams to deploy new architectures and implement optimizations such as quantization and caching strategies. With a focus on innovation, you will contribute to groundbreaking AI research. A competitive base salary of $350,000–$500,000 USD along with equity and benefits is offered. #J-18808-Ljbffr Mirendil
- ...company is seeking a Member of Technical Staff to focus on cutting-edge AI research and development. The role... ...building and scaling training and inference infrastructure, designing ML kernels... ...an exciting opportunity in a frontier AI research environment with a diverse...Suggested
- B Capital is seeking a skilled engineer for GPU infrastructure in San Francisco. This role... ...operating high-performance systems for model inference, synthetic data generation, and... ...and a passion for working in cutting-edge AI. Benefits include top-tier compensation,...Suggested
$300k
United States Digital Space LLC is seeking a skilled software engineer to join the Inference team in San Francisco. You will be responsible for building and maintaining systems that serve Claude to millions of users. The role emphasizes maximizing compute efficiency and...SuggestedWork at office$200k - $400k
A leading AI technology company located in San Francisco is seeking an infrastructure engineer to build distributed systems for their AI inference engine. The role involves designing systems that ensure minimal latency and maximum reliability. Candidates should have a...SuggestedVisa sponsorship$250k - $350k
...never set out to be just another scribe. We’re building the AI intelligence platform that restores humanity to healthcare... ...Perkins — and we’re just getting started. The Role: As a Staff ML Engineer on the Frontier AI team at Ambience, you'll own the hardest model quality...SuggestedWork at officeImmediate startRemote workFlexible hours3 days per week$190.9k - $232.8k
A leading data and AI company is seeking a Staff Software Engineer for GenAI inference to lead the architecture and optimization of the inference engine. The role requires expertise in CUDA, GPU programming, and distributed systems design. Ideal candidates will have a strong...- Sail Research in San Francisco is seeking a talented engineer to design and implement robust systems that ensure fast and cost-efficient AI inference at global scale. You will be responsible for building high-performance schedulers and optimizing global routing while focusing...
- Overview About Liquid AI Spun out of MIT CSAIL, we build general-purpose AI systems... ...us get there. The Opportunity Our Edge Inference team compiles Liquid Foundation Models into... ...Desired Experience Embedded software engineering experience or work on resource-constrained...
- Acceler8 Talent is seeking an early engineer to join their team focused on developing compiler and runtime infrastructure for next-generation AI systems. This role emphasizes ownership, collaboration with deeply technical peers, and contribution to efficient workload execution...
$320k - $405k
...interpretable, and steerable AI systems. We want AI to... ...committed researchers, engineers, policy experts, and... ...infrastructure and frontier capabilities can go hand... ...response to failure. As a Staff engineer on this team,... ...and internal research, inference and product teams to...Visa sponsorship- A healthcare technology company in San Francisco is seeking a Staff ML Engineer to tackle complex model quality challenges in clinical AI products. The ideal candidate has over 5 years of experience in ML engineering, deep learning expertise, and a strong commitment to...
- A leading data and AI company in San Francisco seeks a Staff Software Engineer to lead kernel-level performance engineering for GenAI workloads. The role... ...chance to work with a talented team focused on pushing the frontier of inference performance. #J-18808-Ljbffr Databricks
- ...training and deploying frontier models for developers and... ...who are building AI systems to power magical... ...a team of researchers, engineers, designers, and more, who... ...AI systems can do — but inference is still the bottleneck... ...preferred locations. As a Staff Research Engineer, you...Full timeWork at officeRemote workFlexible hours
- A leading healthcare AI company is seeking a Staff ML Engineer to address complex model quality issues in clinical AI products. The role requires deep expertise in reinforcement learning and the ability to drive research from inception to production. This position is based...
- Jaide Health is seeking an engineer for their Model Efficiency team in San Francisco. The role focuses on building reliable ML systems... ...plus strong skills in C++ or Python and insights into the LLM inference ecosystem. A commitment to diversity and inclusive work...Remote job
- ...justice gap using technology and AI. We empower personal injury... ...impact. Learn more at Life as an Engineer at EvenUp EvenUp’s security... ...claimed to date. As a Senior/Staff Security Engineer at EvenUp, you... ..., model poisoning, membership inference, adversarial perturbation,...Temporary workWork at officeLocal areaHome officeFlexible hours3 days per week
$200k - $400k
Inferact is looking for a Developer Relations Engineer in San Francisco, California, to help developers utilize vLLM for AI inference. This unique role involves teaching technical concepts, creating educational content, and engaging with the AI infrastructure community....Remote work$253k - $308k
Staff Engineer, Engineering Productivity & AI Quality Harper is an AI-native commercial insurance company, based in San Francisco and built from scratch... ...insurance. They join because they want to be on the frontier of the AI transition, doing the most consequential work...Part timeWork at officeRelocation- A cutting-edge AI research firm in San Francisco is seeking talent to build and optimize GPU infrastructure for large-scale model inference and training workloads. The ideal candidate will have hands-on experience with GPU systems and optimization techniques, actively...
$150k - $300k
Prime Intellect is looking for a skilled ML Systems Engineer to build and optimize LLM serving infrastructure and inference systems. This hybrid role involves contributing... ...platforms, and a desire to work on cutting-edge AI infrastructure. They offer a cash compensation...Relocation package- jobr.pro is seeking a Staff Engineer to lead technical direction for Inference Runtime. This senior IC role encompasses broad ownership of the runtime’s architecture and validation systems while collaborating across teams to drive performance and scalability. The ideal...Flexible hours
- ...Token Company in San Francisco is seeking a Member of Technical Staff for their infrastructure team. In this role, you will own the... ...compression API and build global low-latency, high-throughput GPU ML inference infrastructure. The ideal candidate will have solid experience...Visa sponsorship
$200k - $420k
...mission is to create personal AI owned and shaped by each individual... ...: personal hardware for local inference, custom training infrastructure, next‑generation UIs, and frontier deep learning research. Who we are We are scientists, engineers, and builders from the industry...Local areaVisa sponsorshipWork visaRelocation packageFlexible hours- ...laid out for you 3+ years of professional software engineering experience with meaningful work on ML inference or high-performance systems Familiarity with at least... ...before users do. Respond to and learn from production incidents #J-18808-Ljbffr Perplexity AI
$273k - $345k
...re changing that. Atoms builds Physical AI— real-world robots for the industries that... ...they work at scale. We are roboticists, engineers, operators, and builders. We believe the... ...vehicle edge hardware. Profile real-time inference pipelines to identify and eliminate CPU,...Full timeInternshipWork at officeFlexible hours$190.72k - $290k
...how legal and professional services operate. By combining frontier agentic AI, an enterprise-grade platform, and deep domain expertise... ...we're just getting started. Role Overview As a Staff Database Engineer on the Engineering team at Harvey, you will define how...Relocation package$200k
...alone. Our approach combines frontier-scale pre‑training, domain‑specific... ...RL, ultra‑long context, and inference‑time compute to achieve this... ...organization, packaging, and engineering best practices What we’re... ...Not Required Deep ML/AI expertise (this is a tooling...Work at officeRelocationVisa sponsorship- A leading AI research firm in San Francisco is seeking a Member of Technical Staff specialized in Model Efficiency. In this role, you will enhance LLM inference systems by tackling performance issues and collaborating with cross-functional teams. Ideal candidates have...Remote work
- ...About David AI David AI is the first audio data research company... ...by a team of former Scale AI engineers and operators. In less than a... ...us on our mission to push the frontier of audio AI. About our Engineering... ...models. About this role As a Staff Full Stack Engineer at David...Work at office
$150k - $226k
Amplitude is seeking an experienced Staff IT Security Engineer to design and build controls that define how Amplitude leverages frontier AI tooling at scale. This is a high‑scope, hands‑on position focused entirely on corporate and enterprise security, specifically tackling...Work at officeHome officeFlexible hours
Do you want to receive more vacancies?
Subscribe and receive similar vacancies to Staff Engineer, Frontier AI Inference. Be the first to apply!
- assistant civil engineer San Francisco, CA
- engineering aide San Francisco, CA
- assistant mechanical engineer San Francisco, CA
- assistant engineering manager San Francisco, CA
- project engineer assistant project manager San Francisco, CA
- senior staff systems engineer San Francisco, CA
- staff automation engineer San Francisco, CA
- staff design engineer San Francisco, CA
- staff security engineer San Francisco, CA
- staff engineer San Francisco, CA

