Sign up to access all features of our service.
  • Job search
  • Favorites
  • Create a CV
    New
  • Salaries
  • Subscriptions

Robotics GPU Inference Engineer — Hybrid (Relocation)

OpenAI

OpenAI is seeking a GPU Inference Engineer based in San Francisco, CA. In this high-impact role, you'll optimize inference performance and scalability for Robotics research, driving engineering efforts to enhance model serving and system efficiency. The ideal candidate will have expertise in model performance optimization, kernel-level systems, and low-level performance tuning. The position offers a hybrid work model of 3 days in the office per week and relocation assistance for new employees. #J-18808-Ljbffr OpenAI

Vacancy posted 4 days ago
Similar jobs that could be interesting for youBased on the Robotics GPU Inference Engineer — Hybrid (Relocation) in San Francisco, CA vacancy
  • Our Robotics team is focused on unlocking general‑purpose robotics...  ...Role We’re looking for a GPU Inference Engineer to contribute to...  ...San Francisco, CA. We use a hybrid work model of 3 days in the office per week and offer relocation assistance to new employees... 
    Suggested
    Work at office
    Relocation package

    OpenAI

    San Francisco, CA
    4 days ago
  • $300k

    GPU Optimisation Engineer — Real-Time Inference Want to push GPU performance to its limits — not in theory, but in production systems handling real-time...  ...Meaningful stock Location: San Francisco preferred (relocation and visa sponsorship can be provided) If you care... 
    Relocation
    Visa sponsorship
    Free visa

    Techire Ai

    San Francisco, CA
    3 days ago
  • $300k

     ...technology firm in San Francisco seeks a GPU Optimisation Engineer to maximize GPU performance in real-...  ..., and a knack for optimizing inference latency for large generative models....  ...rather than backfilling previous roles. Relocation and visa support is available. #J-188... 
    Suggested
    Visa sponsorship
    Relocation package

    Trades Workforce Solutions

    San Francisco, CA
    4 days ago
  • FriendliAI is seeking a GPU Kernel Engineer in San Francisco to design and optimize GPU kernels for AI inference. This role requires expertise in CUDA, C++, and performance-critical systems. You will work on cutting-edge GPU technology and contribute to a highly collaborative... 
    Suggested

    FriendliAI

    San Francisco, CA
    1 day ago
  • $220k - $320k

    A tech startup specializing in AI inference seeks a skilled professional to optimize their inference stack. Candidates should have over...  ...inference. Join an innovative team in downtown San Francisco, hybrid options available for local candidates. #J-18808-Ljbffr Inference
    Suggested
    Local area

    Inference

    San Francisco, CA
    18 hours ago
  • $100k - $120k

    Coda Robotics is scaling the compute infrastructure that powers next...  ...models. As training and inference workloads grow, we need kernel...  ...a team of kernel and system engineers focused on performance-critical...  ...kernels for CPU (AVX/ARM NEON), GPU (CUDA/ROCm), and hardware... 

    Coda Robotics

    San Francisco, CA
    4 days ago
  • $220k - $320k

    inference.net, a growing company in San Francisco, seeks an experienced engineer to optimize AI inference performance. The ideal candidate will have over 2 years of experience in ML systems and GPU programming. Key responsibilities include implementing optimization techniques... 

    inference.net

    San Francisco, CA
    1 day ago
  • $225k

    OpenAI is seeking a skilled RTL Engineer to design components for their custom AI accelerator in San Francisco, CA. This...  ...and strong problem-solving abilities. OpenAI offers a hybrid work model, supporting relocation for new employees, and competitive compensation between... 
    Relocation

    OpenAI

    San Francisco, CA
    18 hours ago
  •  ...unicorn founders and senior engineers with deep expertise in 3D, generative...  ...for a Founding Engineer, ML Inference with deep expertise in high-...  ...Working knowledge of GPU hardware (NVIDIA) and the ability...  ...committed to helping you relocate to the US throughout this process... 
    Relocation
    Visa sponsorship
    Relocation package

    Reactor

    San Francisco, CA
    2 days ago
  • $280k

     ...group of committed researchers, engineers, policy experts, and business...  ...breakthrough innovations in GPU performance and systems...  ...capabilities and dramatically improve inference efficiency. Working at the...  ...position Location-based hybrid policy: Currently, we expect... 
    Work at office
    Visa sponsorship
    Flexible hours

    Anthropic

    San Francisco, CA
    18 hours ago
  • A leading AI acceleration company in San Francisco is seeking a GPU Kernel Engineer to optimize performance for machine learning models. You will be responsible for designing high-performance GPU kernels and using advanced techniques to boost computation efficiency. Ideal... 

    Baseten

    San Francisco, CA
    1 day ago
  • A technology infrastructure company in San Francisco is seeking an experienced engineer to manage and operate GPU clusters. The role requires over 5 years of hands-on experience, a deep understanding of hardware systems, and a passion for automating fleet operations. You... 

    The San Francisco Compute Company

    San Francisco, CA
    18 hours ago
  • $350k

     ...group of committed researchers, engineers, policy experts, and business...  ...the Role Anthropic's inference fleet serves Claude to millions...  ...plus Familiarity with GPU/TPU/accelerator performance concepts...  ...position Location-based hybrid policy: Currently, we expect... 
    Work at office
    Visa sponsorship
    Flexible hours

    Anthropic

    San Francisco, CA
    4 days ago
  • $150k - $250k

     ...About the job User Interface Developer (Hybrid UX Engineer / Interactive Systems Developer)...  ...to OPT) Stage: Seed-stage surgical robotics company | $14M raised Who Are We?...  ...base salary plus meaningful equity. Relocation assistance ($5K-$10K). Opportunity... 
    Work at office
    Worldwide
    Visa sponsorship
    Relocation package
    Flexible hours

    Jenn Nguyen and Friends

    San Francisco, CA
    3 days ago
  • $315k

     ...group of committed researchers, engineers, policy experts, and business...  ...research, training, and inference. A significant portion of this...  ...experience. Location-based hybrid policy: Currently, we expect...  ...be aware of? Are you open to relocation for this role? * Select... What... 
    Relocation
    Contract work
    For contractors
    For subcontractor
    Work at office
    Visa sponsorship
    Work visa
    Flexible hours

    Menlo Ventures

    San Francisco, CA
    18 hours ago
  • Neier Inc. is seeking a Robotics Engineer to enhance smart automation solutions. You will diagnose and repair Fanuc robots, commission new...  ...flexible vacation. This full-time position is available immediately in San Francisco and requires relocation. #J-18808-Ljbffr Neier Inc.
    Relocation
    Full time
    Immediate start
    Flexible hours

    Neier Inc.

    San Francisco, CA
    1 day ago
  •  ...Job Description Machine Learning Engineer, Inference Want to solve realtime inference problems where milliseconds genuinely matter?...  ...latency constraints. Think streaming inference, scheduler design, GPU utilisation, concurrency optimisation, dynamic batching, and... 
    Remote work
    Flexible hours

    techire ai

    San Francisco, CA
    1 day ago
  •  ...Sciforium Gpu Kernel Engineer Sciforium is an AI infrastructure company developing next-generation multimodal AI models and a proprietary...  ...high-level ML frameworks used for large-scale training and inference. This role is ideal for someone who thrives at the intersection... 
    Flexible hours

    Sciforium

    San Francisco, CA
    3 days ago
  • $160k - $230k

     ...LLM Inference Frameworks and Optimization Engineer San Francisco, Singapore, Amsterdam About the Role At Together.ai, we are building state-of-the...  ...will focus on low-latency, high-throughput inference, GPU/accelerator optimizations, and software-hardware co-design... 
    Full time

    Together AI

    San Francisco, CA
    23 days ago
  •  ...expertise in model innovation and systems engineering paired with a design-minded product...  ...About the Role We're hiring an Inference Engineer to advance our mission of building...  ...foundation models using Transformers, SSMs and hybrid models. Work closely with our... 
    Work at office
    Visa sponsorship
    Flexible hours

    Cartesia, Inc.

    San Francisco, CA
    1 day ago
  •  ...Member of Technical Staff focused on building and optimizing ML inference systems in San Francisco. The role involves designing end-to-...  ...real-world workloads. Candidates should have strong software engineering skills, experience with ML inference systems, and proficiency... 

    Acceler8 Talent

    San Francisco, CA
    18 hours ago
  •  ...re hiring a Machine Learning Engineer to lead our progression from...  ...learning to make our teleoperated robots increasingly autonomous over...  ...devices with real‑time inference constraints. Collaborate with...  ...to be located or willing to relocate to San Francisco, CA. Opportunity... 
    Relocation
    Remote work
    Worldwide
    Long distance
    Flexible hours

    Avatar Robotics

    San Francisco, CA
    1 day ago
  •  ...we’d love to speak with you. ABOUT THE ROLE As a Robot Perception Engineer on the Smart Robotics team at Bright Machines, you will...  ...and optimize imaging configurations Support model inference optimization for GPU deployment using CUDA, TensorRT, and related... 

    Bright Machines

    San Francisco, CA
    17 days ago
  •  ...ABOUT THE ROLE You build and operate the inference systems that serve our models in...  ...with running real workloads. This is an engineering role, not a research role. You'll measure...  ...measured before you change Experience with GPU‑accelerated inference at scale (multi‑GPU... 

    MakerMaker.AI

    San Francisco, CA
    3 days ago
  • $167.2k - $209k

     ...generation of AI-driven applications. We are seeking a Senior Engineer 2 to join our AI Inference Data Plane team. In this role, you will be a key...  ...or Ray Serve. Hardware & Interconnects: Understanding of GPU‑level optimisation and experience with interconnect technologies... 
    Local area
    Remote work
    Worldwide
    Flexible hours

    DigitalOcean

    San Francisco, CA
    5 days ago
  • $167.2k - $209k

     ...builders in the world. DigitalOcean is seeking a Senior Engineer 2 to play a key technical role in our AI Inference Optimization team. DigitalOcean aims to be the...  ...optimizations at the inference engine and GPU kernel layers, ensuring our infrastructure extracts... 
    Local area
    Remote work
    Worldwide
    Flexible hours

    DigitalOcean

    San Francisco, CA
    4 days ago
  • $160k - $230k

    Together AI is seeking an Inference Frameworks and Optimization Engineer in San Francisco, California. The role focuses on designing and optimizing distributed...  ...in deep learning inference frameworks, proficiency in GPU programming, and strong collaboration skills.... 

    Together AI

    San Francisco, CA
    18 hours ago
  • ABOUT BASETEN Baseten powers mission‑critical inference for the world's most dynamic AI companies, like...  ...Conviction. Join us and help build the platform engineers turn to to ship AI products. THE ROLE We’re seeking a GPU Kernel Engineer to join our team at the cutting... 
    Flexible hours

    Baseten

    San Francisco, CA
    3 days ago
  • MakerMaker, based in San Francisco, is seeking a highly skilled kernel engineer to write and optimize GPU kernels that enhance performance for training and inference. This role involves deep, low-level work to close the significant performance gap that exists in modern... 

    MakerMaker

    San Francisco, CA
    4 days ago
  •  ...data projects, and establishing a data-driven culture. Ideal candidates should have over 10 years of experience in data roles, be autonomous, and have a strong SQL background. The position supports a hybrid work model with relocation assistance. #J-18808-Ljbffr OpenAI
    Relocation
    Relocation package

    OpenAI

    San Francisco, CA
    3 days ago

Do you want to receive more vacancies?

Subscribe and receive similar vacancies to Robotics GPU Inference Engineer — Hybrid (Relocation). Be the first to apply!