Edge Transformer Inference Tech Lead
OpenAI
A leading AI research firm in San Francisco is seeking a Technical Lead to join its Future of Computing Research team. This role involves evaluating silicon platforms and optimizing model architectures, and it follows a hybrid work arrangement. Ideal candidates have expertise in evaluating workloads on accelerators, understanding transformer models, and leading teams focused on performance-critical software. The position offers relocation assistance and is centered on deploying cutting-edge AI technology responsibly and effectively.
- ...About the Role As a Technical Lead on the Future of Computing... ...) for on-device and edge deployment of OpenAI models.... ...ensure efficient execution of transformer workloads. Build and lead... ...for implementing the low-level inference stack, including kernel development...TransformerWork at officeRelocation package
- ...autoregressive and diffusion transformers and familiarity with custom‑kernels... ...this critical role, you will lead the development and enable... ...You will work with cutting‑edge ML models that may consist of... ...highly distributed training/inference setups, apply roofline analysis...Transformer
- ...We're looking for an ML Inference Engineer with deep expertise in high-performance ML engineering... ..., and shaping Reactor's competitive edge in ultra-low-latency, high-throughput... ...hardware (NVIDIA) Strong understanding of transformer architectures and modern ML optimization...TransformerFull timeVisa sponsorshipRelocation package
- ...About the Job We are seeking a highly technical Inference Engine Engineer to optimize the performance and efficiency of our core... ...Design and optimize custom GPU kernels for AI (e.g., transformer and diffusion) workloads Contribute to the development of FriendliAI...TransformerWorldwideFlexible hours
- Member of Technical Staff, ML Infrastructure & Inference Overview We are a cutting-edge AI infrastructure company building a scalable cloud platform... ...serving infrastructure Strong understanding of transformer architectures and attention mechanisms Experience with...Transformer
- ...Inference Engine Engineer We build and run the inference engine behind every Perplexity query and deploy dozens of model architectures... ...engineer to join us. What You Will Work On Support transformer-based retrieval, text-generation, and multimodal models in our...Transformer
$264.8k - $331k
...enable our next generation LLM training, inference and data curation. If you are... ...frameworks and tools such as CUDA, Pytorch, transformers, flash attention, etc. Strong written... ...technologies that power the world's leading models, and help enterprises and governments...TransformerFull time$58 - $63 per hour
...Research Intern, Inference (Fall 2026) San Francisco About The... ...critical intersection of cutting-edge model architectures, high-... ...Python Familiarity with Transformer architectures and recent developments... ...systems. Publications at leading conferences in machine...TransformerHourly payInternship
- ...team to build and ship cutting edge models and experiences. We're funded by leading investors at Index Ventures and... ...the Role We're hiring an Inference Engineer to advance our mission... ...cutting edge foundation models using Transformers, SSMs and hybrid models....TransformerWork at officeVisa sponsorshipFlexible hours
- ...Staff Technical Lead for Inference & ML Performance San Francisco fal is the generative media ecosystem powering the next generation of... ...work directly impacts our ability to rapidly deliver cutting-edge creative solutions to users, from individual creators to global...
- ...'re looking for a Founding Engineer, ML Inference with deep expertise in high-performance... ...performance, and shaping the competitive edge in ultra-low-latency, high-throughput environments... ...as needed Strong understanding of transformer architectures and modern ML model...TransformerRelocationVisa sponsorshipRelocation package
- ...foundational data infrastructure for an edge-first world — a world where intelligence... ...intelligence. Why This Role Matters As Lead Edge AI Engineer , you will own Source's... ...— from federated learning and on-device inference to adaptive compute pipelines running on...Local area
- ...Tech Lead, Data & Inference Engineer Massachusetts, Massachusetts, United States About the Job Tech Lead, Data & Inference Engineer... ...They have raised twelve million dollars in funding and are transforming how business to business marketers reach their ideal customers...Full time
- ...exceptional people to help us get there. The Opportunity Our Edge Inference team compiles Liquid Foundation Models into optimized machine... ...device AI possible. You will work directly with the technical lead on problems that require deep understanding of both ML architectures...
- ...to carry out our mission from industry‑leading investors. We are obsessed with rapid... ...modalities Deep debug failure modes in transformer and diffusion policy field deployments... ...policies for real‑time (~10hz) inference on edge hardware What you bring Experience deploying...TransformerTemporary work
- ...Staff+ Software Engineer, Inference Runtime Remote-Friendly (Travel... ...Staff Engineer to be a technical lead for Inference Runtime: the... ...their own specialization, and edge cases stitch back into the core... ...scheduling environments ~ Prior tech lead experience on a developer...Work at officeRemote workVisa sponsorshipFlexible hours
- ...including GPU orchestration, large-scale inference systems, performance optimization, and developer... ...and brand at the forefront of fashion-tech innovation. Your design work will... ...love of design, luxury fashion, and cutting-edge tech, you'll have the freedom to do it here...InternshipImmediate start
- ...cloud deployment — ensuring our cutting-edge computer vision and multi-modal AI systems... ...optimize model serving for low-latency inference at scale. You'll work closely with our... ...learning models such as auto-regressive transformers and familiarity with inference optimization...TransformerWork at office3 days per week
- ...and low-level systems engineering. Beyond inference, you'll profile and optimize the entire... ...with ML model architectures (transformers, CNNs) and the ability to reason about computational... ...autonomous vehicles, robotics, or IoT/edge devices Deep knowledge of CUDA, TensorRT...TransformerLocal area
$190k - $250k
...Staff Software Engineer / Tech Lead, ML Infrastructure Heartflow is a medical technology... ...cause of death worldwide, using cutting-edge technology. The flagship product—an AI-... ...core ML environment for both training and inference. We design our infrastructure to not...Full timeWork at officeLocal areaWorldwideRelocation
- ...machine learning systems that power real-time perception and inference across our edge-cloud platform. This role owns the training, deployment,... ...architectures and their deployment tradeoffs (YOLO, transformers, CNNs, real-time detection/tracking). Hands-on experience...Transformer
- ...Vision encoders, etc.) onto edge devices, especially mobile NPUs... ..., memory, power/thermal), lead model-side optimization strategy... ...with at least one of: LLM inference optimization (quantization, attention... ...understanding across transformers / conformers / diffusion-vocoders...TransformerFull time
$160k - $230k
...LLM Inference Frameworks and Optimization Engineer San Francisco, Singapore, Amsterdam... ...Techniques: Deep understanding of Transformer architectures and LLM/VLM/Diffusion model... ...algorithms, and models. We have contributed to leading open-source research, models, and...TransformerFull time$238k - $302k
...Engineering Manager. You will: Lead a top-tier applied ML team focused on building... ...AI. Lead the development of cutting edge Deep Learning and machine learning models... ...as TensorFlow, PyTorch, Hugging Face's transformers, along with expertise in deep learning...TransformerFull timeRemote work$175k - $225k
...with participation from other leading venture capital firms. The... ...We're looking for an AI Inference Engineer who lives at the boundary... ...sophisticated models and transforming them into lightning-fast, production... ...-ready engines running on edge devices in homes across the...Local areaRemote work
- ...Montreal Employment Type Full time Location Type Hybrid Department Inference Model Serving Who are we? Our mission is to scale intelligence... ...and work environment Work closely with a team on the cutting edge of AI research Weekly lunch stipend, in-office lunches & snacks...Full timeWork experience placementWork at officeRemote workFlexible hours
$248.8k - $311k
...Technical Lead Manager, Physical AI San Francisco, CA Scale AI is the data engine... ...you will bridge the gap between cutting-edge Machine Learning research and physical robot... ...in PyTorch, with deep knowledge of Transformer architectures, Attention mechanisms, and...TransformerFull time$250k - $350k
...society. Role Overview We’re looking for a Tech Lead for 3D Modeling & Reconstruction to set... ...product partners to translate cutting‑edge ideas into reliable, scalable... ...representations, training pipelines, and inference systems, ensuring they integrate cleanly...$342k
...infrastructure execution-translating cutting‑edge compute roadmaps into scalable,... ...We are seeking a CPU & Storage Technical Lead to define and drive the server compute and... ...storage systems are optimized for training, inference, and supporting services. You will work...Local area
- ...product engineering team to build and ship cutting edge models and experiences. We're funded by leading investors at Index Ventures and Lightspeed Venture... ...for training. Partner closely with research and inference teams so data systems are co-designed with training...Work at officeVisa sponsorshipFlexible hours

