Sign up to access all features of our service.
  • Job search
  • Favorites
  • Create a CV
    New
  • Salaries
  • Subscriptions

Staff Engineer, Frontier AI Inference

$350k

Mirendil

Mirendil in San Francisco is searching for an engineer to develop and optimize inference systems for cutting-edge AI models. You will handle the complete inference stack, enhancing performance and reliability. The role involves partnering with teams to deploy new architectures and implement optimizations such as quantization and caching strategies. With a focus on innovation, you will contribute to groundbreaking AI research. A competitive base salary of $350,000–$500,000 USD along with equity and benefits is offered. #J-18808-Ljbffr Mirendil

Vacancy posted 2 days ago
Similar jobs that could be interesting for youBased on the Staff Engineer, Frontier AI Inference in San Francisco, CA vacancy
  •  ...company is seeking a Member of Technical Staff to focus on cutting-edge AI research and development. The role...  ...building and scaling training and inference infrastructure, designing ML kernels...  ...an exciting opportunity in a frontier AI research environment with a diverse... 
    Suggested

    Mirendil

    San Francisco, CA
    2 days ago
  • B Capital is seeking a skilled engineer for GPU infrastructure in San Francisco. This role...  ...operating high-performance systems for model inference, synthetic data generation, and...  ...and a passion for working in cutting-edge AI. Benefits include top-tier compensation,... 
    Suggested

    B Capital

    San Francisco, CA
    3 days ago
  • $300k

    United States Digital Space LLC is seeking a skilled software engineer to join the Inference team in San Francisco. You will be responsible for building and maintaining systems that serve Claude to millions of users. The role emphasizes maximizing compute efficiency and... 
    Suggested
    Work at office

    United States Digital Space LLC

    San Francisco, CA
    5 days ago
  • $200k - $400k

    A leading AI technology company located in San Francisco is seeking an infrastructure engineer to build distributed systems for their AI inference engine. The role involves designing systems that ensure minimal latency and maximum reliability. Candidates should have a... 
    Suggested
    Visa sponsorship

    Inferact

    San Francisco, CA
    3 days ago
  • $250k - $350k

     ...never set out to be just another scribe. We’re building the AI intelligence platform that restores humanity to healthcare...  ...Perkins — and we’re just getting started. The Role: As a Staff ML Engineer on the Frontier AI team at Ambience, you'll own the hardest model quality... 
    Suggested
    Work at office
    Immediate start
    Remote work
    Flexible hours
    3 days per week

    Ambience Healthcare, Inc.

    San Francisco, CA
    3 days ago
  • $190.9k - $232.8k

    A leading data and AI company is seeking a Staff Software Engineer for GenAI inference to lead the architecture and optimization of the inference engine. The role requires expertise in CUDA, GPU programming, and distributed systems design. Ideal candidates will have a strong... 

    Menlo Ventures

    San Francisco, CA
    4 days ago
  • Sail Research in San Francisco is seeking a talented engineer to design and implement robust systems that ensure fast and cost-efficient AI inference at global scale. You will be responsible for building high-performance schedulers and optimizing global routing while focusing... 

    Sail Research

    San Francisco, CA
    2 days ago
  • Overview About Liquid AI Spun out of MIT CSAIL, we build general-purpose AI systems...  ...us get there. The Opportunity Our Edge Inference team compiles Liquid Foundation Models into...  ...Desired Experience Embedded software engineering experience or work on resource-constrained... 

    Liquid AI

    San Francisco, CA
    4 days ago
  • Acceler8 Talent is seeking an early engineer to join their team focused on developing compiler and runtime infrastructure for next-generation AI systems. This role emphasizes ownership, collaboration with deeply technical peers, and contribution to efficient workload execution... 

    Acceler8 Talent

    San Francisco, CA
    1 day ago
  • $320k - $405k

     ...interpretable, and steerable AI systems. We want AI to...  ...committed researchers, engineers, policy experts, and...  ...infrastructure and frontier capabilities can go hand...  ...response to failure. As a Staff engineer on this team,...  ...and internal research, inference and product teams to... 
    Visa sponsorship

    United States Digital Space LLC

    San Francisco, CA
    1 day ago
  • A healthcare technology company in San Francisco is seeking a Staff ML Engineer to tackle complex model quality challenges in clinical AI products. The ideal candidate has over 5 years of experience in ML engineering, deep learning expertise, and a strong commitment to... 

    Ambience Healthcare, Inc.

    San Francisco, CA
    3 days ago
  • A leading data and AI company in San Francisco seeks a Staff Software Engineer to lead kernel-level performance engineering for GenAI workloads. The role...  ...chance to work with a talented team focused on pushing the frontier of inference performance. #J-18808-Ljbffr Databricks

    Databricks

    San Francisco, CA
    1 day ago
  •  ...training and deploying frontier models for developers and...  ...who are building AI systems to power magical...  ...a team of researchers, engineers, designers, and more, who...  ...AI systems can do — but inference is still the bottleneck...  ...preferred locations. As a Staff Research Engineer, you... 
    Full time
    Work at office
    Remote work
    Flexible hours

    Cohere

    San Francisco, CA
    5 days ago
  • A leading healthcare AI company is seeking a Staff ML Engineer to address complex model quality issues in clinical AI products. The role requires deep expertise in reinforcement learning and the ability to drive research from inception to production. This position is based... 

    Ambience Healthcare

    San Francisco, CA
    3 days ago
  • Jaide Health is seeking an engineer for their Model Efficiency team in San Francisco. The role focuses on building reliable ML systems...  ...plus strong skills in C++ or Python and insights into the LLM inference ecosystem. A commitment to diversity and inclusive work... 
    Remote job

    Jaide Health

    San Francisco, CA
    5 days ago
  •  ...justice gap using technology and AI. We empower personal injury...  ...impact. Learn more at Life as an Engineer at EvenUp EvenUp’s security...  ...claimed to date. As a Senior/Staff Security Engineer at EvenUp, you...  ..., model poisoning, membership inference, adversarial perturbation,... 
    Temporary work
    Work at office
    Local area
    Home office
    Flexible hours
    3 days per week

    EvenUp Inc.

    San Francisco, CA
    3 days ago
  • $200k - $400k

    Inferact is looking for a Developer Relations Engineer in San Francisco, California, to help developers utilize vLLM for AI inference. This unique role involves teaching technical concepts, creating educational content, and engaging with the AI infrastructure community.... 
    Remote work

    Inferact

    San Francisco, CA
    1 day ago
  • $253k - $308k

    Staff Engineer, Engineering Productivity & AI Quality Harper is an AI-native commercial insurance company, based in San Francisco and built from scratch...  ...insurance. They join because they want to be on the frontier of the AI transition, doing the most consequential work... 
    Part time
    Work at office
    Relocation

    Harper Group

    San Francisco, CA
    4 days ago
  • A cutting-edge AI research firm in San Francisco is seeking talent to build and optimize GPU infrastructure for large-scale model inference and training workloads. The ideal candidate will have hands-on experience with GPU systems and optimization techniques, actively... 

    Reflection

    San Francisco, CA
    5 days ago
  • $150k - $300k

    Prime Intellect is looking for a skilled ML Systems Engineer to build and optimize LLM serving infrastructure and inference systems. This hybrid role involves contributing...  ...platforms, and a desire to work on cutting-edge AI infrastructure. They offer a cash compensation... 
    Relocation package

    Prime Intellect

    San Francisco, CA
    1 day ago
  • jobr.pro is seeking a Staff Engineer to lead technical direction for Inference Runtime. This senior IC role encompasses broad ownership of the runtime’s architecture and validation systems while collaborating across teams to drive performance and scalability. The ideal... 
    Flexible hours

    jobr.pro

    San Francisco, CA
    1 day ago
  •  ...Token Company in San Francisco is seeking a Member of Technical Staff for their infrastructure team. In this role, you will own the...  ...compression API and build global low-latency, high-throughput GPU ML inference infrastructure. The ideal candidate will have solid experience... 
    Visa sponsorship

    The Token Company

    San Francisco, CA
    1 day ago
  • $200k - $420k

     ...mission is to create personal AI owned and shaped by each individual...  ...: personal hardware for local inference, custom training infrastructure, next‑generation UIs, and frontier deep learning research. Who we are We are scientists, engineers, and builders from the industry... 
    Local area
    Visa sponsorship
    Work visa
    Relocation package
    Flexible hours

    River AI

    San Francisco, CA
    2 days ago
  •  ...laid out for you 3+ years of professional software engineering experience with meaningful work on ML inference or high-performance systems Familiarity with at least...  ...before users do. Respond to and learn from production incidents #J-18808-Ljbffr Perplexity AI

    Perplexity AI

    San Francisco, CA
    5 days ago
  • $273k - $345k

     ...re changing that. Atoms builds Physical AI— real-world robots for the industries that...  ...they work at scale. We are roboticists, engineers, operators, and builders. We believe the...  ...vehicle edge hardware. Profile real-time inference pipelines to identify and eliminate CPU,... 
    Full time
    Internship
    Work at office
    Flexible hours

    ATOMS Careers page

    San Francisco, CA
    1 day ago
  • $190.72k - $290k

     ...how legal and professional services operate. By combining frontier agentic AI, an enterprise-grade platform, and deep domain expertise...  ...we're just getting started. Role Overview As a Staff Database Engineer on the Engineering team at Harvey, you will define how... 
    Relocation package

    Harvey

    San Francisco, CA
    2 days ago
  • $200k

     ...alone. Our approach combines frontier-scale pre‑training, domain‑specific...  ...RL, ultra‑long context, and inference‑time compute to achieve this...  ...organization, packaging, and engineering best practices What we’re...  ...Not Required Deep ML/AI expertise (this is a tooling... 
    Work at office
    Relocation
    Visa sponsorship

    Magic Inc

    San Francisco, CA
    3 days ago
  • A leading AI research firm in San Francisco is seeking a Member of Technical Staff specialized in Model Efficiency. In this role, you will enhance LLM inference systems by tackling performance issues and collaborating with cross-functional teams. Ideal candidates have... 
    Remote work

    Cohere

    San Francisco, CA
    2 days ago
  •  ...About David AI David AI is the first audio data research company...  ...by a team of former Scale AI engineers and operators. In less than a...  ...us on our mission to push the frontier of audio AI. About our Engineering...  ...models. About this role As a Staff Full Stack Engineer at David... 
    Work at office

    David AI

    San Francisco, CA
    18 hours ago
  • $150k - $226k

    Amplitude is seeking an experienced Staff IT Security Engineer to design and build controls that define how Amplitude leverages frontier AI tooling at scale. This is a high‑scope, hands‑on position focused entirely on corporate and enterprise security, specifically tackling... 
    Work at office
    Home office
    Flexible hours

    Amplitude

    San Francisco, CA
    1 day ago

Do you want to receive more vacancies?

Subscribe and receive similar vacancies to Staff Engineer, Frontier AI Inference. Be the first to apply!