Staff Engineer, AI Inference & Distributed Systems

Sail Research

Sail Research in San Francisco is seeking a talented engineer to design and implement robust systems that ensure fast and cost-efficient AI inference at global scale. You will be responsible for building high-performance schedulers and optimizing global routing while focusing on deep observability of our systems. The ideal candidate has a strong background in distributed systems and is eager to engage in complex challenges. Enjoy a vibrant work environment with excellent meals and a collaborative team atmosphere.

Vacancy posted 3 days ago
Similar jobs that could be interesting for you
Based on the Staff Engineer, AI Inference & Distributed Systems in San Francisco, CA vacancy
  •  ...in San Francisco is seeking a Member of Technical Staff to design and build distributed systems for AI workloads. The role involves developing scheduling...  ...APIs. Ideal candidates should have strong software engineering skills and experience with distributed systems. This... 
    Suggested

    Gimlet Labs

    San Francisco, CA
    1 day ago
  • Inferact Inc. is seeking exceptional generalist engineers to work on the vLLM stack from low-level GPU kernels to high-level distributed systems. This fully remote position emphasizes autonomy and effective asynchronous communication across time zones. Engineers will take... 
    Suggested
    Remote job

    Inferact Inc.

    San Francisco, CA
    2 days ago
  • $150k - $350k

     ...Inc. is seeking a Member of Technical Staff to focus on distributed systems in San Francisco, California. This...  ...and building the core platform for AI workloads, developing resource management...  ...should have strong software engineering fundamentals and experience with distributed... 
    Suggested

    Gimlet Labs, Inc.

    San Francisco, CA
    1 day ago
  • $200k - $400k

    A leading AI technology company located in San Francisco is seeking an infrastructure engineer to build distributed systems for their AI inference engine. The role involves designing systems that ensure minimal latency and maximum reliability. Candidates should have a... 
    Suggested
    Visa sponsorship

    Inferact

    San Francisco, CA
    4 days ago
  • $150k - $250k

    Asari AI in San Francisco is looking for a skilled individual to build the supercomputing infrastructure that runs AI agents,...  ...performance workloads. Your role will involve designing cloud compute, distributed systems, and sandboxed tooling to ensure efficiency and scalability.... 
    Suggested

    Asari AI

    San Francisco, CA
    2 days ago
  • deCircle is seeking an engineer to design and implement core systems for its agentic AI platform. This role involves building production systems, ensuring reliable...  ...has over 3 years of experience in backend or distributed systems engineering, strong skills in languages... 

    deCircle

    San Francisco, CA
    2 days ago
  • $190.9k - $232.8k

    A leading data and AI company is seeking a Staff Software Engineer for GenAI inference to lead the architecture and optimization of the inference engine. The...  ...requires expertise in CUDA, GPU programming, and distributed systems design. Ideal candidates will have a strong... 

    Menlo Ventures

    San Francisco, CA
    5 days ago
  •  ...leading streaming service is seeking a Staff Software Engineer to enhance ML infrastructure. The role involves designing scalable systems, mentoring engineers, and collaborating...  ...over 8 years of experience in building distributed systems, strong skills in AWS, and knowledge... 

    Tubi Tv

    San Francisco, CA
    1 day ago
  • B Capital is seeking a skilled engineer for GPU infrastructure in San Francisco. This...  ...designing and operating high-performance systems for model inference, synthetic data generation, and...  ...a passion for working in cutting-edge AI. Benefits include top-tier compensation... 

    B Capital

    San Francisco, CA
    4 days ago
  • $220k

    We build and run the inference engine behind every Perplexity query and deploy dozens of model...  ...CUTLASS, or similar). Any other deep systems programming experience is a plus. You...  ...You've built and operated production distributed systems under real load - ideally performance... 

    Perplexity

    San Francisco, CA
    3 days ago
  • Overview About Liquid AI Spun out of MIT CSAIL, we build general-purpose AI systems that run efficiently across deployment targets...  .... The Opportunity Our Edge Inference team compiles Liquid Foundation...  ...Experience Embedded software engineering experience or work on resource-... 

    Liquid AI

    San Francisco, CA
    5 days ago
  • About Liquid AI Spun out of MIT CSAIL, we build general-purpose AI systems that run efficiently across deployment targets, from...  ...team is building the distributed systems that power our next-generation...  ...priorities and delivers: Our best engineers align with team priorities... 

    Gravity Engineering Services Pvt Ltd.

    San Francisco, CA
    2 days ago
  • $180k - $250k

     ...innovative model serving architectures while working with the Applied ML team and customers. The ideal candidate has expertise in systems programming and deep understanding of cutting-edge ML infrastructure. Compensation ranges from $180,000 to $250,000, along with equity... 

    fal

    San Francisco, CA
    1 day ago
  • $279.2k - $390.9k

     ...recommendations. We design and maintain systems for ML data ingestion, low-...  ...ML Indexing & Retrieval engine, integrating capabilities...  ...operational excellence in large-scale distributed systems. Mentor and guide...  ...summarized by artificial intelligence (AI). You will have the... 
    For contractors
    Work experience placement
    Flexible hours

    Tensec

    San Francisco, CA
    3 days ago
  • $200k - $240k

     ...blockchain analytics and AI solutions to help law...  ...world for all. The AI Engineering Team is chartered with...  ...(LLMs) and agentic systems. Our mission is to build...  .... As a Senior or Staff ML Systems Engineer -...  ...Francisco, TRM operates as a distributed-first company with... 
    Remote work
    Worldwide

    TRM Labs

    San Francisco, CA
    5 days ago
  • $140.6k - $173.1k

     ...Team/Role We are seeking a seasoned Staff Software Engineer in the North America Mobility organization...  ...team that focuses on building AI Platform to support the feature development...  ...at the vanguard of generative AI, distributed systems, and fintech, you will empower WEX to... 
    Remote work
    Flexible hours

    WEX

    San Francisco, CA
    4 days ago
  •  ...biomedical scientist. We believe AI agents will fundamentally...  ...world class researchers and engineers across AI and biology. Backed...  ...building the next generation of AI systems for the life sciences. About...  ...’ll design and implement the distributed systems, compute environments... 

    deCircle

    San Francisco, CA
    2 days ago
  • jobr.pro is seeking a Staff Engineer to lead technical direction for Inference Runtime. This senior IC role encompasses broad...  ...’s architecture and validation systems while collaborating across teams...  ...engineering, particularly with distributed systems, and demonstrates... 
    Flexible hours

    jobr.pro

    San Francisco, CA
    2 days ago
  • Acceler8 Talent is seeking an early engineer to join their team focused on developing compiler and runtime infrastructure for next-generation AI systems. This role emphasizes ownership, collaboration with deeply technical peers, and contribution to efficient workload execution... 

    Acceler8 Talent

    San Francisco, CA
    1 day ago
  • $250k - $285k

     ...Staff Product Security Engineer Crusoe is on a mission to accelerate the abundance...  ...vertically integrated AI infrastructure company built...  ..., infrastructure, and distributed AI systems. This is a highly technical...  ...stack, including MLOps, inference architectures, vector... 
    Temporary work

    Crusoe

    San Francisco, CA
    9 days ago
  •  ...field 7+ years of backend engineering experience with 3+ years in...  ...Experience with large‑scale distributed systems Proficiency in Knowledge Graph...  ...for an exceptional Search/AI Engineer with experience in...  ...language, resolve ambiguity, and infer user intent Design and... 

    Workato

    San Francisco, CA
    3 days ago
  • A fintech company is looking for engineers to join their team in San Francisco. The ideal candidates will work on AI solutions for banking compliance, focusing on building effective end-to-end systems. You will contribute to developing agents that automate crucial compliance... 

    Sphinx

    San Francisco, CA
    4 days ago
  • $208k - $282k

     ...Staff Data Engineer At Komodo Health, our mission is to reduce...  ...the U.S. healthcare system — by combining de-...  ...Rust, C++, and emerging AI-enabled engineering...  ...technical depth across SQL, distributed data processing,...  ...AI/ML training, inference, experimentation, and... 
    Work experience placement
    Local area
    Flexible hours

    Komodo Health

    San Francisco, CA
    5 days ago
  •  ...The Difference You Will Make: As a staff software engineer, you will lead two areas that are critical...  ...-scale, widely adopted conversational AI platform at Airbnb, You will also...  ...data science teams to develop backend systems and enhance AI prompt effectiveness.... 
    Work experience placement
    Flexible hours

    airbnb, Inc.

    San Francisco, CA
    1 day ago
  • $160k - $300k

     ...pioneering foundational AI company for physical...  ...to revolutionize how engineering decisions are made, turning...  ...Role As a Senior / Staff Infrastructure...  ...You’ll ensure our systems meet strict requirements...  ...experience (Python, APIs, distributed systems) Exposure to... 
    Work at office
    Visa sponsorship
    Flexible hours

    Apiphany

    San Francisco, CA
    1 day ago
  • Jaide Health is seeking an engineer for their Model Efficiency team in San Francisco. The role focuses on building reliable ML systems while enhancing core performance metrics across model...  ...or Python and insights into the LLM inference ecosystem. A commitment to diversity... 
    Remote job

    Jaide Health

    San Francisco, CA
    1 day ago
  •  ...enterprises who are building AI systems to power magical...  ...team of researchers, engineers, designers, and more,...  ...systems can do — but inference is still the bottleneck...  ...approach, we strategically distribute teams based on...  ...preferred locations. As a Staff Research Engineer, you... 
    Full time
    Work at office
    Remote work
    Flexible hours

    Cohere

    San Francisco, CA
    1 day ago
  • Blok is seeking a Member of Technical Staff to join our San Francisco team. This...  ...designing and implementing the core systems that power our AI initiatives. You'll build the infrastructure...  ...experience with complex software engineering and a strong background in behavior... 

    Blok

    San Francisco, CA
    2 days ago
  •  ...Connectly are building an AI-powered sales...  ...and San Francisco, and distributed teams across Latin America...  ...and visionary Staff AI Engineer to lead Connectly's...  ...technical strategy for AI systems and infrastructure....  ...distributed systems, inference optimization, caching... 
    Temporary work
    Flexible hours

    Connectly

    San Francisco, CA
    16 hours ago
  •  ...in San Francisco is seeking a skilled Security Engineer to lead the design and implementation of complex security systems. The ideal candidate will possess at least 8 years...  ...opportunities within a collaborative team committed to AI safety.

    Menlo Ventures

    San Francisco, CA
    3 days ago
