Edge Transformer Inference Tech Lead

OpenAI

A leading AI research firm in San Francisco is seeking a Technical Lead to join its Future of Computing Research team. The role involves evaluating silicon platforms and optimizing model architectures, and it follows a hybrid work model. Ideal candidates have expertise in evaluating workloads on accelerators, understanding transformer models, and leading teams focused on performance-critical software. The position offers relocation assistance and is centered on deploying cutting-edge AI technology responsibly and effectively.

Vacancy posted 4 days ago
Similar jobs that could be interesting for you
Based on the Edge Transformer Inference Tech Lead in San Francisco, CA vacancy
  •  ...About the Role As a Technical Lead on the Future of Computing...  ...) for on-device and edge deployment of OpenAI models....  ...ensure efficient execution of transformer workloads. Build and lead...  ...for implementing the low-level inference stack, including kernel development... 
    Transformer
    Work at office
    Relocation package

    OpenAI

    San Francisco, CA
    5 days ago
  •  ...autoregressive and diffusion transformers and familiarity with custom‑kernels...  ...this critical role, you will lead the development and enable...  ...You will work with cutting‑edge ML models that may consist of...  ...highly distributed training/inference setups, apply roofline analysis... 
    Transformer

    Waymo

    San Francisco, CA
    5 days ago
  •  ...We're looking for an ML Inference Engineer with deep expertise in high-performance ML engineering...  ..., and shaping Reactor's competitive edge in ultra-low-latency, high-throughput...  ...hardware (NVIDIA) Strong understanding of transformer architectures and modern ML optimization... 
    Transformer
    Full time
    Visa sponsorship
    Relocation package

    Reactor.am

    San Francisco, CA
    4 days ago
  •  ...About the Job We are seeking a highly technical Inference Engine Engineer to optimize the performance and efficiency of our core...  ...Design and optimize custom GPU kernels for AI (e.g., transformer and diffusion) workloads Contribute to the development of FriendliAI... 
    Transformer
    Worldwide
    Flexible hours

    FriendliAI Corp

    San Francisco, CA
    3 days ago
  • Member of Technical Staff, ML Infrastructure & Inference Overview We are a cutting-edge AI infrastructure company building a scalable cloud platform...  ...serving infrastructure Strong understanding of transformer architectures and attention mechanisms Experience with... 
    Transformer

    Acceler8 Talent

    San Francisco, CA
    4 days ago
  •  ...Inference Engine Engineer We build and run the inference engine behind every Perplexity query and deploy dozens of model architectures...  ...engineer to join us. What You Will Work On Support transformer-based retrieval, text-generation, and multimodal models in our... 
    Transformer

    Perplexity AI

    San Francisco, CA
    2 days ago
  • $264.8k - $331k

     ...enable our next generation LLM training, inference and data curation. If you are...  ...frameworks and tools such as CUDA, Pytorch, transformers, flash attention, etc. Strong written...  ...technologies that power the world's leading models, and help enterprises and governments... 
    Transformer
    Full time

    Scale AI

    San Francisco, CA
    14 days ago
  • $58 - $63 per hour

     ...Research Intern, Inference (Fall 2026) San Francisco About The...  ...critical intersection of cutting-edge model architectures, high-...  ...Python Familiarity with Transformer architectures and recent developments...  ...systems. Publications at leading conferences in machine... 
    Transformer
    Hourly pay
    Internship

    Together AI

    San Francisco, CA
    5 days ago
  •  ...team to build and ship cutting edge models and experiences. We're funded by leading investors at Index Ventures and...  ...the Role We're hiring an Inference Engineer to advance our mission...  ...cutting edge foundation models using Transformers, SSMs and hybrid models.... 
    Transformer
    Work at office
    Visa sponsorship
    Flexible hours

    Cartesia, Inc.

    San Francisco, CA
    5 days ago
  •  ...Staff Technical Lead for Inference & ML Performance San Francisco fal is the generative media ecosystem powering the next generation of...  ...work directly impacts our ability to rapidly deliver cutting-edge creative solutions to users, from individual creators to global... 

    Fal

    San Francisco, CA
    4 days ago
  •  ...'re looking for a Founding Engineer, ML Inference with deep expertise in high-performance...  ...performance, and shaping the competitive edge in ultra-low-latency, high-throughput environments...  ...as needed Strong understanding of transformer architectures and modern ML model... 
    Transformer
    Relocation
    Visa sponsorship
    Relocation package

    Reactor

    San Francisco, CA
    1 day ago
  •  ...foundational data infrastructure for an edge-first world — a world where intelligence...  ...intelligence. Why This Role Matters As Lead Edge AI Engineer, you will own Source's...  ...— from federated learning and on-device inference to adaptive compute pipelines running on... 
    Local area

    Source, Inc.

    San Francisco, CA
    2 days ago
  •  ...Tech Lead, Data & Inference Engineer Massachusetts, Massachusetts, United States About the Job Tech Lead, Data & Inference Engineer...  ...They have raised twelve million dollars in funding and are transforming how business to business marketers reach their ideal customers... 
    Full time

    Catalyst Labs, LLC

    San Francisco, CA
    1 day ago
  •  ...exceptional people to help us get there. The Opportunity Our Edge Inference team compiles Liquid Foundation Models into optimized machine...  ...device AI possible. You will work directly with the technical lead on problems that require deep understanding of both ML architectures... 

    Liquid AI

    San Francisco, CA
    2 days ago
  •  ...to carry out our mission from industry‑leading investors. We are obsessed with rapid...  ...modalities Deep debug failure modes in transformer and diffusion policy field deployments...  ...policies for real‑time (~10hz) inference on edge hardware What you bring Experience deploying... 
    Transformer
    Temporary work

    Kovari

    San Francisco, CA
    4 days ago
  •  ...Staff+ Software Engineer, Inference Runtime Remote-Friendly (Travel...  ...Staff Engineer to be a technical lead for Inference Runtime: the...  ...their own specialization, and edge cases stitch back into the core...  ...scheduling environments ~ Prior tech lead experience on a developer... 
    Work at office
    Remote work
    Visa sponsorship
    Flexible hours

    Anthropic

    San Francisco, CA
    5 days ago
  •  ...including GPU orchestration, large-scale inference systems, performance optimization, and developer...  ...and brand at the forefront of fashion-tech innovation. Your design work will...  ...love of design, luxury fashion, and cutting-edge tech, you'll have the freedom to do it here... 
    Internship
    Immediate start

    SpreeAI

    San Francisco, CA
    5 days ago
  •  ...cloud deployment — ensuring our cutting-edge computer vision and multi-modal AI systems...  ...optimize model serving for low-latency inference at scale. You'll work closely with our...  ...learning models such as auto-regressive transformers and familiarity with inference optimization... 
    Transformer
    Work at office
    3 days per week

    Claryo

    San Francisco, CA
    5 days ago
  •  ...and low-level systems engineering. Beyond inference, you'll profile and optimize the entire...  ...with ML model architectures (transformers, CNNs) and the ability to reason about computational...  ...autonomous vehicles, robotics, or IoT/edge devices Deep knowledge of CUDA, TensorRT... 
    Transformer
    Local area

    Humble Robotics

    San Francisco, CA
    5 days ago
  • $190k - $250k

     ...Staff Software Engineer / Tech Lead, ML Infrastructure Heartflow is a medical technology...  ...cause of death worldwide, using cutting-edge technology. The flagship product—an AI-...  ...core ML environment for both training and inference. We design our infrastructure to not... 
    Full time
    Work at office
    Local area
    Worldwide
    Relocation

    HeartFlow

    San Francisco, CA
    1 day ago
  •  ...machine learning systems that power real-time perception and inference across our edge-cloud platform. This role owns the training, deployment,...  ...architectures and their deployment tradeoffs (YOLO, transformers, CNNs, real-time detection/tracking). Hands-on experience... 
    Transformer

    Specter Services LLC

    San Francisco, CA
    5 days ago
  •  ...Vision encoders, etc.) onto edge devices, especially mobile NPUs...  ..., memory, power/thermal), lead model-side optimization strategy...  ...with at least one of: LLM inference optimization (quantization, attention...  ...understanding across transformers / conformers / diffusion-vocoders... 
    Transformer
    Full time

    CAPSA

    San Francisco, CA
    1 day ago
  • $160k - $230k

     ...LLM Inference Frameworks and Optimization Engineer San Francisco, Singapore, Amsterdam...  ...Techniques: Deep understanding of Transformer architectures and LLM/VLM/Diffusion model...  ...algorithms, and models. We have contributed to leading open-source research, models, and... 
    Transformer
    Full time

    Together AI

    San Francisco, CA
    22 days ago
  • $238k - $302k

     ...Engineering Manager. You will: Lead a top-tier applied ML team focused on building...  ...AI. Lead the development of cutting edge Deep Learning and machine learning models...  ...as TensorFlow, PyTorch, Hugging Face's transformers, along with expertise in deep learning... 
    Transformer
    Full time
    Remote work

    Waymo

    San Francisco, CA
    5 days ago
  • $175k - $225k

     ...with participation from other leading venture capital firms. The...  ...We're looking for an AI Inference Engineer who lives at the boundary...  ...sophisticated models and transforming them into lightning-fast, production...  ...-ready engines running on edge devices in homes across the... 
    Local area
    Remote work

    Sauron

    San Francisco, CA
    1 day ago
  •  ...Montreal Employment Type Full time Location Type Hybrid Department Inference Model Serving Who are we? Our mission is to scale intelligence...  ...and work environment Work closely with a team on the cutting edge of AI research Weekly lunch stipend, in-office lunches & snacks... 
    Full time
    Work experience placement
    Work at office
    Remote work
    Flexible hours

    Jaide Health

    San Francisco, CA
    4 days ago
  • $248.8k - $311k

     ...Technical Lead Manager, Physical AI San Francisco, CA Scale AI is the data engine...  ...you will bridge the gap between cutting-edge Machine Learning research and physical robot...  ...in PyTorch, with deep knowledge of Transformer architectures, Attention mechanisms, and... 
    Transformer
    Full time

    Scale AI

    San Francisco, CA
    5 days ago
  • $250k - $350k

     ...society. Role Overview We’re looking for a Tech Lead for 3D Modeling & Reconstruction to set...  ...product partners to translate cutting‑edge ideas into reliable, scalable...  ...representations, training pipelines, and inference systems, ensuring they integrate cleanly... 

    WORLD LABS

    San Francisco, CA
    5 days ago
  • $342k

     ...infrastructure execution-translating cutting‑edge compute roadmaps into scalable,...  ...We are seeking a CPU & Storage Technical Lead to define and drive the server compute and...  ...storage systems are optimized for training, inference, and supporting services. You will work... 
    Local area

    OpenAI

    San Francisco, CA
    4 days ago
  •  ...product engineering team to build and ship cutting edge models and experiences. We're funded by leading investors at Index Ventures and Lightspeed Venture...  ...for training. Partner closely with research and inference teams so data systems are co-designed with training... 
    Work at office
    Visa sponsorship
    Flexible hours

    Cartesia, Inc.

    San Francisco, CA
    3 days ago
