Sign up to access all features of our service.
  • Job search
  • Favorites
  • Create a CV
    New
  • Salaries
  • Subscriptions

Staff Product Manager, Training Infrastructure - Weights & Biases

$188k - $275k

Weights & Biases

About The Role Training a frontier model is as much an infrastructure problem as it is a modeling problem. When a run fails at hour 47 of a 72-hour job, the question is almost never what’s wrong with the code — it’s what happened on the cluster. Node failures, network timeouts, storage bottlenecks, and scheduling decisions determine whether training runs succeed or fail, how much they cost, and how fast teams can iterate. The CoreWeave acquisition changes that. W&B can now build products that connect application-level experiment tracking with cluster health, hardware telemetry, and compute orchestration — a surface area that’s hard to assemble without owning both the developer tools and the infrastructure. Reporting to the Director of Product Management, this Staff Product Manager will own the strategy and execution for the products at this intersection: W&B Launch (job submission and compute orchestration), infrastructure insights that leverage CoreWeave’s unique telemetry and platform capabilities, and new product concepts — including agentic and automated research workflows — that only exist because W&B and CoreWeave are now one company. Sandboxes (managed development environments) are also part of this surface, but the core opportunity is building infrastructure-native developer tools that take advantage of the combined platform. You’ll be working directly with frontier model builders — teams training on thousands of GPUs — to understand what they need and ship it. If you’re energized by solving infrastructure challenges that directly accelerate deep learning workflows, collaborating with the world’s best AI teams, and delivering tools that move the industry frontier forward, this role is for you. What You’ll Do Define and ship infrastructure-native products that only W&B + CoreWeave can build. This is the core of the role. The integration between W&B’s experiment tracking and CoreWeave’s infrastructure platform opens a new product surface — giving researchers visibility and control that spans their code, their runs, and the machines those runs execute on. You’ll identify the highest-value opportunities in this design space, build conviction around them, and drive the roadmap from concept through launch and iteration. Own W&B Launch and evolve it for the CoreWeave era. Launch is W&B’s job submission and compute orchestration product. Today it helps teams submit training jobs to Kubernetes clusters and cloud providers. As part of a vertically integrated AI platform, Launch can go further — with deeper infrastructure awareness and the ability to serve as the orchestration layer for agentic and automated research workflows. You’ll set the vision for what that looks like and ship it. Spend real time with frontier AI teams and learn how they actually work. Foundation model builders are your primary customers. You’ll spend significant time in their workflows — understanding how they manage infrastructure, debug failures, orchestrate computation, and increasingly automate their research processes. These conversations will directly shape the product roadmap. Build the bridge between W&B product and CoreWeave platform engineering. This role sits at the seam between two organizations that are becoming one. You’ll partner with CoreWeave’s infrastructure and platform teams to identify integration opportunities, and with W&B’s engineering teams to build them into products researchers actually want to use. Your ability to translate between infrastructure engineers and ML researchers will be critical. Find and resolve friction points in the job submission, compute management, and infrastructure observability workflows, maintaining W&B’s high bar for craft and quality. Explore new product concepts at the hardware-software boundary. The design space opened by the CoreWeave integration is large — spanning profiling, environments, automation, agents, and more. You’ll evaluate which new concepts have the highest impact for frontier AI teams and drive the most promising ones from zero to one. Who You Are Experience: You have 7+ years as a product manager working on developer tools, deep learning infrastructure, or ML engineer platforms. End-to-end ownership: You’ve taken a technical product—whether a model, orchestration feature, or developer tool—all the way to production, including defining success criteria, measuring effectiveness, and iterating to improve impact and user happiness. Technical fluency: You’re comfortable discussing Kubernetes and SLURM, job scheduling, hyperparameter optimization algorithms, and API design with engineers and AI researchers. Cross-functional influence: You’re skilled at managing and influencing AI engineers, software engineers, and stakeholders—with or without official authority—to align teams and drive outcomes. Clear communication: A major part of this role is seeking feedback from many directions and channeling it into a single, executable stream. You bring clarity and organization to complex, fast-moving environments. Care and taste: You have strong opinions about the products you use and a high bar for developer experience. Preferred Experience with deep learning frameworks (PyTorch, JAX), distributed training, or LLM development workflows. Hands-on experience with SLURM or Kubernetes for GPU-intensive workloads. Prior exposure to W&B or other experiment-tracking and model management platforms. Understanding of hyperparameter optimization methods and evaluation workflows for large models. Wondering if You’re a Good Fit? We Believe In Investing In Our People And Value Candidates Who Can Bring Their Own Diversified Experiences To Our Teams—even If You Aren’t a 100% Skill Or Experience Match. Here Are a Few Qualities We’ve Found Compatible With Our Team Outgoing and user-focused. You’ll love this role if you enjoy connecting with real users day to day, helping them solve issues, and sharing patterns for making the most of our products Self-directed and willing to take risks. At our speed and scale, product managers need to proactively find solutions, improve processes, and collaborate with team members and engaged users. Your initiative will really shine here. Hands-on with AI. You experiment with generative AI to improve your own workflow and build user-facing prototypes. You keep up to date every day with what’s new in the AI / LLM world. Passionate about developer experience. You care deeply about building tools that help AI practitioners accelerate their iteration and discovery. You know small friction points compound, and smoothing them out is what makes a delightful and winning developer tool. Why Us? We work hard, have fun, and move fast! We’re in an exciting stage of hyper-growth that you will not want to miss out on. We’re not afraid of a little chaos, and we’re constantly learning. Our team cares deeply about how we build our product and how we work together, which is represented through our core values: Be Curious at Your Core Act Like an Owner Empower Employees Deliver Best-in-Class Client Experiences Achieve More Together We support and encourage an entrepreneurial outlook and independent thinking. We foster an environment that encourages collaboration and provides the opportunity to develop innovative solutions to complex problems. As we get set for takeoff, the growth opportunities within the organization are constantly expanding. You will be surrounded by some of the best talent in the industry, who will want to learn from you, too. Come join us! The base salary range for this role is $188,000 to $275,000. The starting salary will be determined based on job-related knowledge, skills, experience, and market location. We strive for both market alignment and internal equity when determining compensation. In addition to base salary, our total rewards package includes a discretionary bonus, equity awards, and a comprehensive benefits program (all based on eligibility). What We Offer The range we’ve posted represents the typical compensation range for this role. To determine actual compensation, we review the market rate for each candidate which can include a variety of factors. These include qualifications, experience, interview performance, and location. In addition to a competitive salary, we offer a variety of benefits to support your needs, including: Medical, dental, and vision insurance - 100% paid for by CoreWeave Company-paid Life Insurance Voluntary supplemental life insurance Short and long-term disability insurance Flexible Spending Account Health Savings Account Tuition Reimbursement Ability to Participate in Employee Stock Purchase Program (ESPP) Mental Wellness Benefits through Spring Health Family-Forming support provided by Carrot Paid Parental Leave Flexible, full-service childcare support with Kinside 401(k) with a generous employer match Flexible PTO Catered lunch each day in our office and data center locations A casual work environment A work culture focused on innovative disruption Our Workplace While we prioritize a hybrid work environment, remote work may be considered for candidates located more than 30 miles from an office, based on role requirements for specialized skill sets. New hires will be invited to attend onboarding at one of our hubs within their first month. Teams also gather quarterly to support collaboration California Consumer Privacy Act - California applicants only CoreWeave is an equal opportunity employer, committed to fostering an inclusive and supportive workplace. All qualified applicants and candidates will receive consideration for employment without regard to race, color, religion, sex, disability, age, sexual orientation, gender identity, national origin, veteran status, or genetic information. As part of this commitment and consistent with the Americans with Disabilities Act (ADA), CoreWeave will ensure that qualified applicants and candidates with disabilities are provided reasonable accommodations for the hiring process, unless such accommodation would cause an undue hardship. If reasonable accommodation is needed, please contact: View email address on click.appcast.io. Export Control Compliance This position requires access to export controlled information. To conform to U.S. Government export regulations applicable to that information, applicant must either be (A) a U.S. person, defined as a U.S. citizen or national, (B) eligible to access the export controlled information without a required export authorization, or (C) eligible and reasonably likely to obtain the required export authorization from the applicable U.S. government agency. CoreWeave may, for legitimate business reasons, decline to pursue any export licensing process. #J-18808-Ljbffr Weights & Biases

Vacancy posted 4 days ago
Similar jobs that could be interesting for youBased on the Staff Product Manager, Training Infrastructure - Weights & Biases in Sunnyvale, CA vacancy
  •  ...on our team! Role overview We are seeking a Senior/Staff Product Manager to lead instrument and software product strategy, development...  ...messaging. Own and support field adoption by developing training, technical content, and customer-facing resources (e.g... 
    Training

    Countable Labs

    Palo Alto, CA
    2 days ago
  • $192k - $278k

    Senior Product Manager, Infrastructure Reliability Apply info_outline info_outline X In accordance with Washington state law, we are highlighting...  ...job-related skills, experience, and relevant education or training. Your recruiter can share more about the specific salary... 
    Training
    Full time
    Temporary work

    Google Inc.

    Sunnyvale, CA
    1 day ago
  • $166.5k - $291.4k

     ...us as we pursue our purpose to make the world work better for everyone. The Cloud Infrastructure Systems Engineering team at ServiceNow is currently seeking a Staff Product Manager to help build the products and infrastructure that’s the foundation of the Now Platform... 
    Suggested
    Work at office
    Remote work
    Flexible hours

    ServiceNow

    Santa Clara, CA
    3 days ago
  • $180k - $200k

     ...Product Manager Product Manager is responsible for working with Selector customers across...  ...operational issues across complex network, infrastructure, and cloud deployments. The ideal...  ...teams with product documentation, training material, competitive narratives, and... 
    Training
    Night shift

    Selector

    Santa Clara, CA
    1 day ago
  • $157.8k - $236.8k

     ...We are looking for an experienced Product Manager to work in our Unity Ads Product team...  ...reporting, and workflows. Ensure data infrastructure and insights are accurate, comprehensive...  ...Global Employee Assistance Program | Training and development programs | Volunteering... 
    Training
    Work at office
    Worldwide
    Relocation package
    Shift work

    Unity

    Mountain View, CA
    4 days ago
  • $144.33k - $240.55k

     ...centers. Job Description As a Staff Product Manager on the SSD marketing team, you will...  ...center operators) to translate evolving infrastructure needs into differentiated product...  ...career growth and personal development training Open-minded management Empowering... 
    Training
    Local area
    Worldwide
    Flexible hours

    Kioxia

    Alviso, CA
    1 day ago
  •  ...Staff Product Manager, Endoscopes It started with a simple idea: what if surgery could be less invasive and recovery less painful? Nearly...  ...messaging with global portfolio marketing, regional marketing, training, and commercial teams. Ensure regional nuances and global... 
    Training
    Local area
    Worldwide
    Flexible hours

    Intuitive

    Sunnyvale, CA
    4 days ago
  •  ...Staff Product Manager – Compute Architecture It started with a simple idea: what if surgery could be less invasive and recovery less painful...  ...job duties noted above Required Education and Training Minimum Bachelors degree Minimum 4 years in a medical... 
    Training
    Temporary work
    Local area
    Worldwide
    Flexible hours

    Intuitive

    Sunnyvale, CA
    4 days ago
  • $204k - $247k

     ...As the only vertically integrated AI infrastructure company built from the ground up, we...  ...Crusoe Cloud is seeking a visionary Staff Product Manager to spearhead the development of our next...  ...-to-end model lifecycle, including training, orchestration, and deployment. Market... 
    Training
    Full time
    Temporary work

    Crusoe

    Sunnyvale, CA
    1 day ago
  • $208.73k - $253k

     ...Product Manager, Compute Crusoe is on a mission to accelerate the abundance of energy and...  ...As the only vertically integrated AI infrastructure company built from the ground up, we own...  ...orchestration layers that power AI training and inference workloads. This role owns... 
    Training
    Temporary work

    Crusoe

    Sunnyvale, CA
    3 days ago
  • $240k - $334k

    Group Product Manager, ACI Infrastructure Apply X In accordance with Washington state law, we are highlighting our comprehensive benefits package,...  ...job‑related skills, experience, and relevant education or training. Your recruiter can share more about the specific salary... 
    Training
    Full time
    Temporary work
    Worldwide

    Google Inc.

    Sunnyvale, CA
    2 days ago
  • $187k - $262k

     ...You will be responsible for the end-to-end product lifecycle of tools that empower our...  ...workflows Prioritize for Impact : Rigorously manage the product roadmap, balancing the...  ...experience, qualifications, relevant education or training, and market conditions. These ranges may... 
    Training
    Work at office
    Local area
    Immediate start
    3 days per week

    Aurora Innovation

    Mountain View, CA
    5 days ago
  • $177k - $239.5k

    Staff Product Manager - Intuit UI Platform job at Intuit. Mountain View, CA. Overview The Development Platform team at Intuit builds foundational...  ...Platform Adoption: Engage cross‑functional teams through training, documentation, and feedback loops to maximize platform... 
    Training
    Local area
    Shift work

    Itlearn360

    Mountain View, CA
    2 days ago
  •  ...the Role We are seeking a Staff Technical Program Manager (TPM) to lead AV ML Infrastructure programs for our autonomous driving...  ...ML infrastructure - including training pipelines, model lifecycle...  ...are scalable, efficient, and production-ready to support end-to-end model... 
    Training
    Local area
    Work from home
    Relocation
    Relocation package

    General Motors

    Sunnyvale, CA
    2 days ago
  • Staff Program Manager- Product Management Community job at Intuit. Mountain View, CA. Overview The Program...  ...in driving organizational change Bias for action and track record of driving...  ...cohorts engaged and informed, onboarding training, ongoing coaching of RPM and managers... 
    Training
    Rotational program
    Internship
    Local area

    Payfuture Technologies

    Mountain View, CA
    2 days ago
  • Weights & Biases in Sunnyvale is seeking a Staff Product Manager to lead the integration of developer tools and infrastructure following the CoreWeave acquisition. You will define and ship infrastructure-native products, owning the vision for W&B Launch. The ideal candidate... 

    Weights & Biases

    Sunnyvale, CA
    4 days ago
  • $181k - $262k

     ...prioritize development with the greatest product impact. Assess requirements, facilitate...  ...risks and communicate status to upper management and project stakeholders, and energize project...  ..., qualifications, relevant education or training, and market conditions. The successful... 
    Training
    Local area

    Aurora

    Mountain View, CA
    4 days ago
  • $187k - $262k

    Product Mountain View, California Staff Product Program Manager Who we are Aurora’s mission is to deliver the benefits of self-driving technology safely, quickly...  ...experience, qualifications, relevant education or training, and market conditions. These ranges may be... 
    Training
    Work at office
    Local area
    Immediate start
    3 days per week

    Australian Competition and Consumer Commission

    Mountain View, CA
    2 days ago
  • $214k - $305k

     ...experience. 10 years of experience in product management or a related technical role. 5 years...  ...of experience with cloud data center infrastructure including power, network, cooling, zonal...  ..., and relevant education or training. Your recruiter can share more about... 
    Training
    Full time

    Google

    Sunnyvale, CA
    2 days ago
  • $230k - $292k

     ...Staff Technical Program Manager, Simulation Infrastructure Resource Management Waymo is an autonomous driving technology...  ...range of vehicle platforms and product use cases. The Waymo Driver has...  ...location, experience, relevant training and education, and skill level.... 
    Training
    Full time
    Temporary work
    Remote work

    Waymo

    Mountain View, CA
    2 days ago
  • $181k - $262k

     ...accessible for all. We're searching for a Technical Program Manager to join Aurora's Product & Program Management team, focusing on two primary...  ...skills, experience, qualifications, relevant education or training, and market conditions. These ranges may be modified in the... 
    Training
    Work at office
    Local area
    3 days per week

    Aurora Innovation

    Mountain View, CA
    3 days ago
  •  ...ambition in Vehicle Autonomy Product Management is to build a world-class...  ...senior engineering managers and Staff+ ICs across multiple AI...  ...MCP servers, and have built infrastructure that enables AI agents to...  ...developer adoption through training, newsletters, cross-teamprojects... 
    Training
    Work at office
    Local area
    Remote work
    Work from home
    Flexible hours

    General Motors

    Sunnyvale, CA
    5 days ago
  •  ...You will become a member of the da Vinci® Multiport Global Product Management team helping to lead and support multiple generations of platform...  ...job duties noted above Required Education and Training Minimum Bachelors degree Minimum 4 years in a medical... 
    Training
    Local area
    Worldwide
    Flexible hours

    Intuitive

    Sunnyvale, CA
    1 day ago
  • $109.2k - $205.9k

     ...Business Unit What the Role Entails Product Strategy & Roadmap: Own the full lifecycle management of AI infrastructure platforms (including large-scale clusters,...  ...native architectures, and large-scale model training/inference techniques) and industry trends to... 
    Training
    Relocation package

    Tencent

    Palo Alto, CA
    4 days ago
  • $211k - $254k

     ...Staff Product Manager, Software Mountain View, CA About the Role Muon Space is hiring a Staff Product Manager to own two of the...  ...decisions, the technical fluency to engage with platform and infrastructure leadership as a peer, and the judgment to operate across... 
    Permanent employment
    Full time
    Temporary work
    Remote work
    Flexible hours
    Shift work

    Muon Space

    Mountain View, CA
    5 days ago
  • A leading AI infrastructure company in Palo Alto is seeking a Product Manager to define and drive the product roadmap for its inference and training infrastructure. You will engage with early adopters, support Go-to-Market initiatives, and ensure alignment with engineering... 
    Training

    RadixArk

    Palo Alto, CA
    4 days ago
  •  ...Senior Product Manager - Downstream Software Product Managers play a critical role in Ion's success by empowering teams across technical...  ..., Product introduction timing, product phase in/phase out, Training Plan Communicate, coordinate, and track project status at... 
    Training
    Local area
    Flexible hours

    Intuitive

    Sunnyvale, CA
    1 day ago
  • $166.5k - $291.4k

     ...Staff Inbound Product Manager- SAM (Software Asset Management) It all started in sunny San Diego, California in 2004 when a visionary engineer, Fred Luddy, saw the potential to transform how we work. Fast forward to today — ServiceNow stands as a global market leader... 
    Work at office
    Remote work
    Flexible hours

    ServiceNow

    Santa Clara, CA
    3 days ago
  • $161k - $221k

     ...and exceed customer expectations. Position Summary The Product Manager in the Lab D&A team is responsible for the strategy, roadmap,...  ...notes. Lead readiness with GIS, AIx, and BU teams, including training plans and rollout communications for lab applications.... 
    Training
    Full time
    Relocation

    Applied Materials

    Santa Clara, CA
    2 days ago
  •  ...here. Job Description Essential Job Duties Product Strategy and Roadmap Development -...  ...Required Skills and Experience • Ability to manage cross-functional projects, stakeholders,...  ...travel up to 30% Required Education and Training • A Bachelors (BS/BSE) in computer science... 
    Training
    Local area
    Worldwide
    Flexible hours
    Shift work

    Intuitive

    Sunnyvale, CA
    2 days ago

Do you want to receive more vacancies?

Subscribe and receive similar vacancies to Staff Product Manager, Training Infrastructure - Weights & Biases. Be the first to apply!