Engineering Manager (AI Inference)
$300k - $385kPantera Capital
Location San Francisco Employment Type Full time Department AI Compensation $300K – $385K • Offers Equity U.S. Benefits Full-time U.S. employees enjoy a comprehensive benefits program including equity, health, dental, vision, retirement, fitness, commuter and dependent care accounts, and more. International Benefits Full-time employees outside the U.S. enjoy a comprehensive benefits program tailored to their region of residence. USD salary ranges apply only to U.S.-based positions. International salaries are set based on the local market. Final offer amounts are determined by multiple factors, including experience and expertise, and may vary from the amounts listed above. About the Role We are looking for an Inference Engineering Manager to lead our AI Inference team. This is a unique opportunity to build and scale the infrastructure that powers Perplexity's products and APIs, serving millions of users with state-of-the-art AI capabilities. You will own the technical direction and execution of our inference systems while building and leading a world-class team of inference engineers. Our current stack includes Python, PyTorch, Rust, C++, and Kubernetes. You will help architect and scale the large-scale deployment of machine learning models behind Perplexity's Comet, Sonar, Search, Deep Research products. Why Perplexity? Build SOTA systems that are the fastest in the industry with cutting-edge technology High-impact work on a smaller team with significant ownership and autonomy Opportunity to build 0-to-1 infrastructure from scratch rather than maintaining legacy systems Work on the full spectrum: reducing cost, scaling traffic, and pushing the boundaries of inference Direct influence on technical roadmap and team culture at a rapidly growing company Responsibilities Lead and grow a high-performing team of AI inference engineers Develop APIs for AI inference used by both internal and external customers Architect and scale our inference infrastructure for reliability and efficiency Benchmark and eliminate bottlenecks throughout our inference stack Drive large sparse/MoE model inference at rack scale, including sharding strategies for massive models Push the frontier with building inference systems to support sparse attention, disaggregated pre-fill/decoding serving, etc. Improve the reliability and observability of our systems and lead incident response Own technical decisions around batching, throughput, latency, and GPU utilization Partner with ML research teams on model optimization and deployment Recruit, mentor, and develop engineering talent Establish team processes, engineering standards, and operational excellence Qualifications 5+ years of engineering experience with 2+ years in a technical leadership or management role Deep experience with ML systems and inference frameworks (PyTorch, TensorFlow, ONNX, TensorRT, vLLM) Strong understanding of LLM architecture: Multi-Head Attention, Multi/Grouped-Query Attention, and common layers Experience with inference optimizations: batching, quantization, kernel fusion, FlashAttention Familiarity with GPU characteristics, roofline models, and performance analysis Experience deploying reliable, distributed, real-time systems at scale Track record of building and leading high-performing engineering teams Experience with parallelism strategies: tensor parallelism, pipeline parallelism, expert parallelism Strong technical communication and cross-functional collaboration skills Nice to Have Experience with CUDA, Triton, or custom kernel development Background in training infrastructure and RL workloads Experience with Kubernetes and container orchestration at scale Published work or contributions to inference optimization research Compensation Range: $300K - $385K #J-18808-Ljbffr
- ...AI Chopping Block, Inc. is seeking an Engineering Manager to lead and grow its Model Inference team in San Francisco. This pivotal role involves architecting high-performance inference systems and collaborating with various teams to impact healthcare delivery. Ideal candidates...Suggested
- ...A leading investment firm in San Francisco seeks an Inference Engineering Manager to lead its AI inference team. The ideal candidate will have over 5 years of engineering experience, including 2+ years in a leadership capacity, and deep expertise in ML systems, particularly...Suggested
$425k
...reliable, interpretable, and steerable AI systems. We want AI to be safe and... ...group of committed researchers, engineers, policy experts, and business... ...use of our compute resources, be it inference or training. As an Engineering Manager on these teams you will be responsible...SuggestedContract workFor contractorsFor subcontractorWork at officeRelocationVisa sponsorshipWork visaFlexible hours- ...The Role Our generative AI-powered products are transforming the practice of medicine—and the inference systems that power them need to be fast, reliable, and world-class. We’re looking for an Engineering Manager to lead and grow our Model Inference team. The Inference...SuggestedHourly payFull timeFlexible hours
$405k
...interpretable, and steerable AI systems. We want AI to be safe... ...of committed researchers, engineers, policy experts, and business... ...shouldn't have been shed. The Inference Routing team owns this layer.... ...Have 5+ years of engineering management experience, ideally with at...SuggestedWork at officeVisa sponsorshipFlexible hoursShift work- ...inventive research, design, and engineering. Our organization is very... ...will lead the Model Routing & Inference team at Cursor, owning the inference... ...platform that powers every AI interaction in the product.... ...direction for cluster management, inference optimization, and...
$405k
...Engineering Manager - Privacy Infrastructure San Francisco, CA | Seattle, WA About Anthropic... ...reliable, interpretable, and steerable AI systems. We want AI to be safe and beneficial... ...architectures for AI training and inference, foundational data governance and...Work at officeVisa sponsorshipFlexible hours$405k
...interpretable, and steerable AI systems. We want AI to be safe... ...of committed researchers, engineers, policy experts, and business... ...that sits in front of every inference call Anthropic serves. As Claude... ...request multiplexing, connection management), rate limiting and...Temporary workWork at officeVisa sponsorshipFlexible hours$148.5k - $266.2k
...Machine Learning Engineering Manager, Model Delivery page is loaded## Machine Learning Engineering... ...include 2D/3D generative models and other AI capabilities used across Autodesk... ...performance, and cost improvements for inference and serving, including capacity planning...Remote work- ...Weekend (formerly Volley) is the leading developer of voice AI games for smart TVs. Our games attract millions of users every... ...San Francisco. Role Summary We’re looking for an experienced Engineering Manager to lead our AI Game Engine team. You will lead a high performing...Work at officeWork from homeRelocationVisa sponsorshipFlexible hours
$206.02k - $257.52k
...Flexport, a leader in global trade solutions, is building a new engineering team in San Francisco. This team will own the client's rates... ...years of engineering experience, and expertise in leading teams in AI-driven automation. The role offers a competitive salary ranging...Shift work- ...Rad AI is seeking a Senior Engineering Manager to lead core product engineering teams focused on advancing healthcare through AI. This leadership role involves owning the execution of product roadmaps, guiding technical architecture, and driving cross-functional collaboration...
$230k - $270k
...Assembled is seeking an Engineering Manager for their Forecasting and Scheduling team in San Francisco. This role involves setting the technical... ...PMs and engineers to reimagine workforce management for the AI era. The ideal candidate will have a strong leadership...- ...Mercor is defining the future of work. We partner with leading AI labs and enterprises to provide the human intelligence... ...Francisco, NYC, or London offices. About the Role We’re hiring Engineering Managers to lead teams within our Applied AI organization. Applied AI...Relocation package
- ...A leading AI research and deployment company in San Francisco seeks an experienced engineering manager to lead the development of software systems that prevent harmful misuse of AI models. You will guide a team in building detection pipelines and mitigation solutions...
- ...close the justice gap using technology and AI. We empower personal injury lawyers and... ...lasting impact. Learn more at Life as an Engineer at EvenUp Location & Work Model... ...in personal injury law. As Engineering Manager for Document Generation, you will lead a...Full timeTemporary workWork at officeLocal areaHome officeFlexible hours3 days per week
- ...A leading AI cloud provider in San Francisco seeks an experienced engineering manager to lead a team focused on cloud platform development. The successful candidate will possess over 10 years of experience in software engineering, including managerial roles, and will be...
- ...deeper understanding in healthcare. Our AI-powered platform was purpose-built for medical... ..., PhDs, creatives, technologists, and engineers working together to empower people and... ...launches. We are looking for an engineering manager to drive improvements in developer...Hourly payFull timeLocal areaFlexible hours
$240k - $280k
...Runway Financial is seeking a leader for our engineering team during a critical growth phase. In this role, you'll build a high-performing team, establish engineering excellence, and facilitate product delivery. The ideal candidate has a track record in high-growth environments...- ...Crusoe Energy Systems LLC in San Francisco is seeking a Senior Engineering Manager to lead their SDN Management Plane team. This role involves... ...compensation and benefits, alongside a unique opportunity to be part of a pioneering AI infrastructure company. #J-18808-Ljbffr...
- ...A leading analytics platform based in San Francisco is searching for an Engineering Manager to own the Guides & Surveys product. This role involves leading a dynamic team and driving product development to meet customer needs. Candidates should have over 5 years of engineering...
$208.45k - $364.8k
...jobr.pro is seeking an Engineering Manager to lead a cross-functional team focused on user-facings. This role involves strong leadership, data-driven decision making, and collaboration with Product, Design, and Research teams. The ideal candidate has over 8 years of software...- ...Sentry is seeking an Engineering Manager for Dev Infra to lead a talented team dedicated to enhancing developer productivity through innovative tooling. As an Engineering Manager, you will drive the evolution of the platform and nurture talent while collaborating across...
- ...EvenUp in San Francisco is seeking an Engineering Manager for Document Generation. You will lead a team to develop AI-native workflows that enhance legal document creation for personal injury law. The role requires strong technical expertise, strategic leadership, and...Full timeFlexible hours
- ...Networks in San Francisco is looking for an experienced Software Engineering Manager to lead a team focused on next-generation storage... ...management. A competitive salary and a nurturing work environment await those eager to shape the future of AI. #J-18808-Ljbffr...
$260.1k - $360k
...highest standards of security and governance. AI is redefining what it means to build... ...intersection of AI, product, and platform engineering. While some of the problems involve LLMs... ...BRING 3+ years of experience leading and managing engineering teams Experience designing or...- ...Menlo Ventures is hiring an Engineering Manager for our Client Foundation team in San Francisco. The role involves leading a small team focused on enhancing frontend velocity through AI-driven development and closely partnering with tech leads. The position offers a competitive...
- A leading technology firm in San Francisco is seeking an Engineering Manager for the Brex Assistant, a consumer-facing conversational AI product. This role involves leading a team to optimize customer interactions around spend approval and financial decision-making. Candidates...Work at officeRemote work
- A leading data and AI company seeks a Senior Engineering Manager for Customer Experience Intelligence in San Francisco. You’ll lead a team to enhance AI-driven customer interactions across the platform. Ideal candidates have over 10 years of experience, with a focus on...
$250k - $350k
...Superhuman is looking for an engineering manager to join the Land & Expand team in San Francisco. This role involves leading a team of 6 engineers and driving growth through collaboration with sales and marketing. The ideal candidate has strong ownership, technical proficiency...
Do you want to receive more vacancies?
Subscribe and receive similar vacancies to Engineering Manager (AI Inference). Be the first to apply!

