Sign up to access all features of our service.
  • Job search
  • Favorites
  • Create a CV
    New
  • Salaries
  • Subscriptions

Engineering Manager (AI Inference)

$300k - $385k

Pantera Capital

Location San Francisco Employment Type Full time Department AI Compensation $300K – $385K • Offers Equity U.S. Benefits Full-time U.S. employees enjoy a comprehensive benefits program including equity, health, dental, vision, retirement, fitness, commuter and dependent care accounts, and more. International Benefits Full-time employees outside the U.S. enjoy a comprehensive benefits program tailored to their region of residence. USD salary ranges apply only to U.S.-based positions. International salaries are set based on the local market. Final offer amounts are determined by multiple factors, including experience and expertise, and may vary from the amounts listed above. About the Role We are looking for an Inference Engineering Manager to lead our AI Inference team. This is a unique opportunity to build and scale the infrastructure that powers Perplexity's products and APIs, serving millions of users with state-of-the-art AI capabilities. You will own the technical direction and execution of our inference systems while building and leading a world-class team of inference engineers. Our current stack includes Python, PyTorch, Rust, C++, and Kubernetes. You will help architect and scale the large-scale deployment of machine learning models behind Perplexity's Comet, Sonar, Search, Deep Research products. Why Perplexity? Build SOTA systems that are the fastest in the industry with cutting-edge technology High-impact work on a smaller team with significant ownership and autonomy Opportunity to build 0-to-1 infrastructure from scratch rather than maintaining legacy systems Work on the full spectrum: reducing cost, scaling traffic, and pushing the boundaries of inference Direct influence on technical roadmap and team culture at a rapidly growing company Responsibilities Lead and grow a high-performing team of AI inference engineers Develop APIs for AI inference used by both internal and external customers Architect and scale our inference infrastructure for reliability and efficiency Benchmark and eliminate bottlenecks throughout our inference stack Drive large sparse/MoE model inference at rack scale, including sharding strategies for massive models Push the frontier with building inference systems to support sparse attention, disaggregated pre-fill/decoding serving, etc. Improve the reliability and observability of our systems and lead incident response Own technical decisions around batching, throughput, latency, and GPU utilization Partner with ML research teams on model optimization and deployment Recruit, mentor, and develop engineering talent Establish team processes, engineering standards, and operational excellence Qualifications 5+ years of engineering experience with 2+ years in a technical leadership or management role Deep experience with ML systems and inference frameworks (PyTorch, TensorFlow, ONNX, TensorRT, vLLM) Strong understanding of LLM architecture: Multi-Head Attention, Multi/Grouped-Query Attention, and common layers Experience with inference optimizations: batching, quantization, kernel fusion, FlashAttention Familiarity with GPU characteristics, roofline models, and performance analysis Experience deploying reliable, distributed, real-time systems at scale Track record of building and leading high-performing engineering teams Experience with parallelism strategies: tensor parallelism, pipeline parallelism, expert parallelism Strong technical communication and cross-functional collaboration skills Nice to Have Experience with CUDA, Triton, or custom kernel development Background in training infrastructure and RL workloads Experience with Kubernetes and container orchestration at scale Published work or contributions to inference optimization research Compensation Range: $300K - $385K #J-18808-Ljbffr

Vacancy posted 1 day ago
Similar jobs that could be interesting for youBased on the Engineering Manager (AI Inference) in San Francisco, CA vacancy
  •  ...AI Chopping Block, Inc. is seeking an Engineering Manager to lead and grow its Model Inference team in San Francisco. This pivotal role involves architecting high-performance inference systems and collaborating with various teams to impact healthcare delivery. Ideal candidates... 
    Suggested

    AI Chopping Block, Inc.

    San Francisco, CA
    6 hours ago
  •  ...A leading investment firm in San Francisco seeks an Inference Engineering Manager to lead its AI inference team. The ideal candidate will have over 5 years of engineering experience, including 2+ years in a leadership capacity, and deep expertise in ML systems, particularly... 
    Suggested

    Pantera Capital

    San Francisco, CA
    7 hours ago
  • $425k

     ...reliable, interpretable, and steerable AI systems. We want AI to be safe and...  ...group of committed researchers, engineers, policy experts, and business...  ...use of our compute resources, be it inference or training. As an Engineering Manager on these teams you will be responsible... 
    Suggested
    Contract work
    For contractors
    For subcontractor
    Work at office
    Relocation
    Visa sponsorship
    Work visa
    Flexible hours

    Menlo Ventures

    San Francisco, CA
    6 hours ago
  •  ...The Role Our generative AI-powered products are transforming the practice of medicine—and the inference systems that power them need to be fast, reliable, and world-class. We’re looking for an Engineering Manager to lead and grow our Model Inference team. The Inference... 
    Suggested
    Hourly pay
    Full time
    Flexible hours

    AI Chopping Block, Inc.

    San Francisco, CA
    1 day ago
  • $405k

     ...interpretable, and steerable AI systems. We want AI to be safe...  ...of committed researchers, engineers, policy experts, and business...  ...shouldn't have been shed. The Inference Routing team owns this layer....  ...Have 5+ years of engineering management experience, ideally with at... 
    Suggested
    Work at office
    Visa sponsorship
    Flexible hours
    Shift work

    Anthropic

    San Francisco, CA
    4 days ago
  •  ...inventive research, design, and engineering. Our organization is very...  ...will lead the Model Routing & Inference team at Cursor, owning the inference...  ...platform that powers every AI interaction in the product....  ...direction for cluster management, inference optimization, and... 

    Anysphere

    San Francisco, CA
    7 hours ago
  • $405k

     ...Engineering Manager - Privacy Infrastructure San Francisco, CA | Seattle, WA About Anthropic...  ...reliable, interpretable, and steerable AI systems. We want AI to be safe and beneficial...  ...architectures for AI training and inference, foundational data governance and... 
    Work at office
    Visa sponsorship
    Flexible hours

    Anthropic

    San Francisco, CA
    2 days ago
  • $405k

     ...interpretable, and steerable AI systems. We want AI to be safe...  ...of committed researchers, engineers, policy experts, and business...  ...that sits in front of every inference call Anthropic serves. As Claude...  ...request multiplexing, connection management), rate limiting and... 
    Temporary work
    Work at office
    Visa sponsorship
    Flexible hours

    Anthropic

    San Francisco, CA
    3 days ago
  • $148.5k - $266.2k

     ...Machine Learning Engineering Manager, Model Delivery page is loaded## Machine Learning Engineering...  ...include 2D/3D generative models and other AI capabilities used across Autodesk...  ...performance, and cost improvements for inference and serving, including capacity planning... 
    Remote work

    Autodesk

    San Francisco, CA
    6 hours ago
  •  ...Weekend (formerly Volley) is the leading developer of voice AI games for smart TVs. Our games attract millions of users every...  ...San Francisco. Role Summary We’re looking for an experienced Engineering Manager to lead our AI Game Engine team. You will lead a high performing... 
    Work at office
    Work from home
    Relocation
    Visa sponsorship
    Flexible hours

    Volley (volley.com)

    San Francisco, CA
    7 hours ago
  • $206.02k - $257.52k

     ...Flexport, a leader in global trade solutions, is building a new engineering team in San Francisco. This team will own the client's rates...  ...years of engineering experience, and expertise in leading teams in AI-driven automation. The role offers a competitive salary ranging... 
    Shift work

    Flexport

    San Francisco, CA
    6 hours ago
  •  ...Rad AI is seeking a Senior Engineering Manager to lead core product engineering teams focused on advancing healthcare through AI. This leadership role involves owning the execution of product roadmaps, guiding technical architecture, and driving cross-functional collaboration... 

    Rad AI

    San Francisco, CA
    6 hours ago
  • $230k - $270k

     ...Assembled is seeking an Engineering Manager for their Forecasting and Scheduling team in San Francisco. This role involves setting the technical...  ...PMs and engineers to reimagine workforce management for the AI era. The ideal candidate will have a strong leadership... 

    A S S E M B L E D

    San Francisco, CA
    6 hours ago
  •  ...Mercor is defining the future of work. We partner with leading AI labs and enterprises to provide the human intelligence...  ...Francisco, NYC, or London offices. About the Role We’re hiring Engineering Managers to lead teams within our Applied AI organization. Applied AI... 
    Relocation package

    Mercor Inc

    San Francisco, CA
    6 hours ago
  •  ...A leading AI research and deployment company in San Francisco seeks an experienced engineering manager to lead the development of software systems that prevent harmful misuse of AI models. You will guide a team in building detection pipelines and mitigation solutions... 

    OpenAI

    San Francisco, CA
    6 hours ago
  •  ...close the justice gap using technology and AI. We empower personal injury lawyers and...  ...lasting impact. Learn more at Life as an Engineer at EvenUp Location & Work Model...  ...in personal injury law. As Engineering Manager for Document Generation, you will lead a... 
    Full time
    Temporary work
    Work at office
    Local area
    Home office
    Flexible hours
    3 days per week

    EvenUp Inc.

    San Francisco, CA
    3 days ago
  •  ...A leading AI cloud provider in San Francisco seeks an experienced engineering manager to lead a team focused on cloud platform development. The successful candidate will possess over 10 years of experience in software engineering, including managerial roles, and will be... 

    Lambda

    San Francisco, CA
    7 hours ago
  •  ...deeper understanding in healthcare. Our AI-powered platform was purpose-built for medical...  ..., PhDs, creatives, technologists, and engineers working together to empower people and...  ...launches. We are looking for an engineering manager to drive improvements in developer... 
    Hourly pay
    Full time
    Local area
    Flexible hours

    Abridge AI

    San Francisco, CA
    6 hours ago
  • $240k - $280k

     ...Runway Financial is seeking a leader for our engineering team during a critical growth phase. In this role, you'll build a high-performing team, establish engineering excellence, and facilitate product delivery. The ideal candidate has a track record in high-growth environments... 

    Runway Financial

    San Francisco, CA
    6 hours ago
  •  ...Crusoe Energy Systems LLC in San Francisco is seeking a Senior Engineering Manager to lead their SDN Management Plane team. This role involves...  ...compensation and benefits, alongside a unique opportunity to be part of a pioneering AI infrastructure company. #J-18808-Ljbffr... 

    Crusoe Energy Systems LLC

    San Francisco, CA
    1 day ago
  •  ...A leading analytics platform based in San Francisco is searching for an Engineering Manager to own the Guides & Surveys product. This role involves leading a dynamic team and driving product development to meet customer needs. Candidates should have over 5 years of engineering... 

    Amplitude

    San Francisco, CA
    6 hours ago
  • $208.45k - $364.8k

     ...jobr.pro is seeking an Engineering Manager to lead a cross-functional team focused on user-facings. This role involves strong leadership, data-driven decision making, and collaboration with Product, Design, and Research teams. The ideal candidate has over 8 years of software... 

    Jobr

    San Francisco, CA
    6 hours ago
  •  ...Sentry is seeking an Engineering Manager for Dev Infra to lead a talented team dedicated to enhancing developer productivity through innovative tooling. As an Engineering Manager, you will drive the evolution of the platform and nurture talent while collaborating across... 

    Sentry

    San Francisco, CA
    6 hours ago
  •  ...EvenUp in San Francisco is seeking an Engineering Manager for Document Generation. You will lead a team to develop AI-native workflows that enhance legal document creation for personal injury law. The role requires strong technical expertise, strategic leadership, and... 
    Full time
    Flexible hours

    EvenUp Inc.

    San Francisco, CA
    6 hours ago
  •  ...Networks in San Francisco is looking for an experienced Software Engineering Manager to lead a team focused on next-generation storage...  ...management. A competitive salary and a nurturing work environment await those eager to shape the future of AI. #J-18808-Ljbffr... 

    DataDirect Networks Inc

    San Francisco, CA
    6 hours ago
  • $260.1k - $360k

     ...highest standards of security and governance. AI is redefining what it means to build...  ...intersection of AI, product, and platform engineering. While some of the problems involve LLMs...  ...BRING 3+ years of experience leading and managing engineering teams Experience designing or... 

    Retool

    San Francisco, CA
    7 hours ago
  •  ...Menlo Ventures is hiring an Engineering Manager for our Client Foundation team in San Francisco. The role involves leading a small team focused on enhancing frontend velocity through AI-driven development and closely partnering with tech leads. The position offers a competitive... 

    Menlo Ventures

    San Francisco, CA
    7 hours ago
  • A leading technology firm in San Francisco is seeking an Engineering Manager for the Brex Assistant, a consumer-facing conversational AI product. This role involves leading a team to optimize customer interactions around spend approval and financial decision-making. Candidates... 
    Work at office
    Remote work

    Brex

    San Francisco, CA
    1 day ago
  • A leading data and AI company seeks a Senior Engineering Manager for Customer Experience Intelligence in San Francisco. You’ll lead a team to enhance AI-driven customer interactions across the platform. Ideal candidates have over 10 years of experience, with a focus on... 

    Menlo Ventures

    San Francisco, CA
    4 days ago
  • $250k - $350k

     ...Superhuman is looking for an engineering manager to join the Land & Expand team in San Francisco. This role involves leading a team of 6 engineers and driving growth through collaboration with sales and marketing. The ideal candidate has strong ownership, technical proficiency... 

    I did my part and supported the Regular Toilet

    San Francisco, CA
    7 hours ago

Do you want to receive more vacancies?

Subscribe and receive similar vacancies to Engineering Manager (AI Inference). Be the first to apply!