
Product Engineer - Training Platform

Baseten

ABOUT BASETEN

Baseten powers mission-critical inference for the world's most dynamic AI companies, like Cursor, Notion, OpenEvidence, Abridge, Clay, Gamma and Writer. By uniting applied AI research, flexible infrastructure, and seamless developer tooling, we enable companies operating at the frontier of AI to bring cutting-edge models into production. We're growing quickly and recently raised our $300M Series E, backed by investors including BOND, IVP, Spark Capital, Greylock, and Conviction. Join us and help build the platform engineers turn to to ship AI products.

THE ROLE

We’re looking for a customer-obsessed software engineer to come ship with us. You’ll own features like multi-node training and products like serverless reinforcement learning (RL) from conception to MVP (and from MVP to GA!). You’ll work through the stack, architecting solutions from API and UI down to our infrastructure layer. You’ll fine-tune models yourself to develop an understanding of user workflows. You’ll work closely with research engineers leveraging state-of-the-art training techniques to build experiences that accelerate model development and solve real pain points. If you’re excited to dive deep into training, let’s talk!

THE PRODUCT

Take a look at what we’ve built so far:
  • Overview of the product so far
  • Training docs overview
  • Story of the Training product
  • Research we've done

EXAMPLE INITIATIVES

Checkpointing Pipeline: Our checkpointing pipeline starts with automated checkpointing, a feature that ensures that versions of models created during training are automatically backed up to the cloud. Users can then deploy checkpoints seamlessly into inference servers through point-and-click integrations with inference frameworks like vLLM and Baseten’s Inference Stack. This enables customers to quickly evaluate the performance of their checkpoints with real traffic.

Multinode training: Multinode training lets customers easily run training jobs across multiple compute nodes, so they can train large models like GLM 4.7 and DeepSeek. We’ve built deeply at the Kubernetes layer to ensure that scheduling, startup, inter-node communication, and shutdown happen seamlessly under the hood and as the user expects.

Training DX: Customers come to train on Baseten because it helps them get to value fast. To do this, we ensure that the features we ship aren’t just fast, but are easy to iterate with. We enhanced Baseten’s metrics from pod-level GPU summaries to per-GPU and per-node views. We’ve built a CLI experience that caters to terminal users, and UI experiences that enable users to seamlessly manage their training jobs.

RESPONSIBILITIES

  • Iterate like crazy.
  • Design ergonomic APIs and abstractions to model complex resources and lifecycles.
  • Work throughout the stack (API layer, backend and database implementation, infra layer; frontend is a plus) to implement features.
  • Fine-tune and deploy models to develop intuition around training workflows.
  • Partner closely with model developers and world-class research engineers to understand the requirements and pain points of post-training workflows.
  • Drive long-term improvements to system reliability and development velocity.
  • Fix bugs and resolve customer issues with urgency.

REQUIREMENTS

  • 5+ years of experience building software applications
  • Deep knowledge of the web stack, databases, and distributed systems
  • Experience developing developer tooling or infrastructure products for external or internal users
  • Good taste in product, particularly developer-oriented tools
  • Interest in ML/AI infrastructure and willingness to learn
  • Driven by high agency and ownership
  • Strong communication skills with the ability to bridge technical depth and business needs

NICE TO HAVE

  • Experience launching features and products through different release cycles (MVP, Beta, GA, etc.)
  • Experience with model development methods and paradigms, like Supervised Fine-Tuning, Reinforcement Learning, Synthetic Data Generation, LoRA, Full Finetunes, etc.
  • Familiarity or experience with the open source training stack and frameworks (NCCL, PyTorch, Megatron, NemoRL, VeRL, Axolotl, HF Trainer) and distributed training techniques (FSDP, DeepSpeed)
  • Experience developing AI products, tooling, or agents
  • Frontend fluency

BENEFITS

  • Competitive compensation, including meaningful equity
  • 100% coverage of medical, dental, and vision insurance for employees and dependents
  • Generous PTO policy, including a company-wide Winter Break (our offices are closed from Christmas Eve to New Year's Day!)
  • Paid parental leave
  • Company-facilitated 401(k)
  • Exposure to a variety of ML startups, offering unparalleled learning and networking opportunities

Apply now to embark on a rewarding journey in shaping the future of AI! If you are a motivated individual with a passion for machine learning and a desire to be part of a collaborative and forward-thinking team, we would love to hear from you.

At Baseten, we are committed to fostering a diverse and inclusive workplace. We provide equal employment opportunities to all employees and applicants without regard to race, color, religion, gender, sexual orientation, gender identity or expression, national origin, age, genetic information, disability, or veteran status.

Vacancy posted 4 days ago