AI Models PM: Inference & Open-Source Innovation

Cerebras

Cerebras is seeking an experienced Product Manager to join our team in Sunnyvale, California. You will oversee a portfolio of models, collaborating with top customers and model labs to ensure high-quality implementations and drive product marketing. Ideal candidates will have over 5 years of product management experience, with a strong knowledge of generative AI and open-source models. This role offers a hybrid work arrangement and the chance to work on groundbreaking AI advancements.

Vacancy posted 1 day ago
Similar jobs that could be interesting for you
Based on the AI Models PM: Inference & Open-Source Innovation in Sunnyvale, CA vacancy
  • $212.7k - $287.7k

     ...Computing (UC) provides product innovations, from foundational services...  ...Trainium. As the SDM for the LLM Inference Model Enablement team, you will lead a team of expert AI/ML engineers to onboard and optimize state-of-the-art open-source and customer LLMs, both dense... 
    Suggested
    Local area
    Flexible hours

    Amazon

    Cupertino, CA
    2 days ago
  • $165.2k - $223.6k

     ...enabling unparalleled ML inference and training...  ...running a wide range of models and supporting novel architecture...  ...infrastructure, innovate new methods and create...  ...of what's possible in AI acceleration. As part...  ...collaborates with open source ecosystems to provide... 
    Suggested
    Work experience placement
    Internship
    Local area
    Flexible hours

    Amazon

    Cupertino, CA
    1 day ago
  • $224k - $356.5k

     ...’s a unique legacy of innovation that’s fueled by great...  ...unlimited potential of AI to define the next era...  ...Learning Engineer — Model Evaluation & AI Systems...  ...NeMo Evaluator as an open-source platform, focusing on...  ...alongside model training, inference, and product divisions... 
    Suggested

    NVIDIA

    Santa Clara, CA
    1 day ago
  • $155.42k - $395.9k

     ...About the Team: The ML Inference Platform is part of...  ...platform that powers GM’s AI efforts. We’re proud...  ...customers. We enable rapid innovation and feature...  ...SOTA) machine learning models for experimental, online...  ...practices. Contribute to open source projects; represent GM... 
    Suggested
    Local area
    Remote work
    Relocation
    Relocation package
    Flexible hours

    Israelvcforum

    Mountain View, CA
    1 day ago
  • $272k - $431.25k

    NVIDIA Gruppe is looking for a Principal Software Engineer to advance open-source AI inference. This hands-on role emphasizes running high-performance inference on NVIDIA platforms and involves collaboration across various teams. Key responsibilities include optimizing... 
    Suggested

    NVIDIA Gruppe

    Santa Clara, CA
    1 day ago
  •  ...Engineering Services Pvt Ltd. is seeking a Product Manager for AI Models in Sunnyvale, California. The role involves owning the...  ...of product management experience and a strong understanding of open-source models and generative AI. The position offers a hybrid work environment... 

    Gravity Engineering Services Pvt Ltd.

    Sunnyvale, CA
    2 days ago
  • $197k - $291k

     ...Research, Foundation User Models Google...  ...or contributing to open-source projects related to RecSys...  ...to accelerate product innovations through ML for...  ...teams and products. The AI and Infrastructure team...  ...quality output with strict inference latency requirements... 
    Full time
    Immediate start
    Worldwide

    Google Inc.

    Mountain View, CA
    10 hours ago
  • $188k - $300k

     ...Staff AI Engineer, Data Analytics & Modeling - Office of the CTO Sunnyvale, CA At...  ...in accelerating software innovations that solve complex, real-...  ...modeling on large-scale, multi-source data to uncover insights...  ....g., A/B testing, causal inference). Preferred:... 
    Work at office
    Worldwide
    Flexible hours
    Shift work

    Sonatus

    Sunnyvale, CA
    2 days ago
  • $224k - $356.5k

     ...It’s an outstanding legacy of innovation that’s fueled by great...  ...into the unlimited potential of AI to define the next era of computing...  ...on the world. NVIDIA’s open-source benchmarking platform, AIPerf...  ...serving performance across various inference frameworks. Hyperscalers,... 
    Local area
    Worldwide

    NVIDIA

    Santa Clara, CA
    2 days ago
  • $246.5k

     ...Reinforcement Learning, AI, Control and...  ...Machine Learning and Inference Platform that powers the...  ...hardware, software, and models. We're looking for a strong...  ...to mentor engineers, innovate at scale, and shape the...  ...~ Contributions to open-source ML or systems projects... 
    Work at office
    Local area
    Remote work
    Monday to Thursday
    Flexible hours

    Roku

    San Jose, CA
    1 day ago
  • $224k - $356.5k

     ...It’s an outstanding legacy of innovation that’s fueled by great...  ...into the unlimited potential of AI to define the next era of computing...  ...on the world. NVIDIA’s open‑source benchmarking platform, AIPerf...  ...serving performance across various inference frameworks. Hyperscalers,... 
    Local area
    Worldwide

    NVIDIA Gruppe

    Santa Clara, CA
    3 days ago
  •  ...builds the world's largest AI chip, 56 times larger...  ...-leading training and inference speeds and empowers...  ...customers include top model labs, global enterprises...  ...decide which frontier and open-source models we support based...  ...above the level of Senior PM. 5+ years of total... 
    Work experience placement
    Work at office
    Remote work
    Shift work

    CEREBRAS SYSTEMS INC.

    Sunnyvale, CA
    4 days ago
  • $254k - $349.25k

     ...how people, data, and AI agents connect across email...  ...in how we dream and innovate Responsive to feedback...  ...deep expertise in model architecture, training,...  ...environments Optimize inference systems for low latency...  ...to AI/ML research, open-source, or security tooling... 
    Flexible hours

    Proofpoint

    Sunnyvale, CA
    4 days ago
  •  ...builds the world's largest AI chip, 56 times larger...  ...‑leading training and inference speeds and empowers...  ...launch the world’s top AI models on the world’s fastest...  ...Collaborate with model labs and open‑source model builders, to...  ...the level of Senior PM. 5+ years of total technical... 
    Work experience placement
    Work at office
    Remote work

    Cerebras

    Sunnyvale, CA
    10 hours ago
  • $254k - $349.25k

     ...how people, data, and AI agents connect across email...  ...in how we dream and innovate Responsive to feedback...  ...requires deep expertise in model architecture, training,...  ...environments Optimize inference systems for low latency...  ...to AI/ML research, open‑source, or security tooling Background... 
    Flexible hours

    Proofpoint

    Sunnyvale, CA
    1 day ago
  • $212.7k - $287.7k

     ...the forefront of AWS innovation. The Inferentia chip delivers...  ...best-in-class ML inference performance at the...  ...market. As deep learning models become more versatile,...  ...Neuron, TPU or other AI acceleration hardware...  ...- Interactions with open-source communities, in either... 
    Local area
    Work from home
    Relocation
    Flexible hours

    Amazon

    Cupertino, CA
    4 days ago
  • $148k - $235.75k

     ...business and pivotal in our inference marketing. You will be focused...  ...networking, CUDA libraries, model architectures and deployment...  ...showcase our leadership position in AI inference. Want to join a...  ...workflows using NVIDIA or open-source serving frameworks running on... 

    NVIDIA

    Santa Clara, CA
    2 days ago
  • $184k - $287.5k

     ...motivated software engineers to join us and build AI inference systems that serve large-scale models with extreme efficiency. You’ll architect and implement...  ...and production observability. Contributions to open-source projects and/or publications; please include links... 

    NVIDIA

    Santa Clara, CA
    1 day ago
  •  ...computing experiences-from AI and data centers, to...  ...in a culture of innovation and collaboration, we...  ...senior member of the LLM inference framework team, you will...  ...for large language models on AMD GPUs. You will...  ...will be upstreamed into open-source inference frameworks such... 

    Advanced Micro Devices, Inc.

    Santa Clara, CA
    4 days ago
  • $152k - $241.5k

     ...deep learning ignited modern AI — the next era of computing —...  ...AI Compiler Engineers to drive innovation within our world-class compiler...  ...problems for AI workloads (both inference and training) and successfully...  ...of Large Language Model (LLM) inference and its profound... 

    NVIDIA

    Santa Clara, CA
    1 day ago
  • $184k - $230k

     ...Powered by the relentless innovation of the open source community, Cloudera advances...  ...Engineer to join the Enterprise AI Platform team and help...  ...applications using foundation models with enterprise data at...  ...scalable, enterprise-quality AI inference services powered by machine... 
    Work from home
    Relocation
    Flexible hours

    Cloudera

    Alviso, CA
    2 days ago
  •  ...company at the forefront of innovation, integrating advanced AI and autonomous driving...  ...Vision‑Language‑Action (VLA) models and foundation models are...  ...deployment, and edge inference for real‑world autonomous...  ...Contributions to research projects, open‑source repositories, or relevant... 
    Internship

    XPENG & Volkswagen Group

    Santa Clara, CA
    3 days ago
  • $157.2k - $254.1k

     ...the intersection of innovation and impact, solving...  .... We weave AI into the fabric of...  ...it's needed. This model supports real-time...  ...deployment and real-time inference systems. System...  ...experience. We are open to both a Staff/Sr...  ...Experience with open-source AI projects. Prior... 
    Full time
    Work at office

    Palo Alto Networks

    Santa Clara, CA
    2 days ago
  •  ...Senior Staff AI/ML System Software Engineer At d-Matrix...  ...of software and hardware innovation, pushing the boundaries of...  ...processors. Experience with open-source ML compiler frameworks such...  ...,...). Experience with inference servers/model serving frameworks (such as... 
    Work experience placement
    3 days per week

    D-Matrix

    Santa Clara, CA
    4 days ago
  • $152k - $241.5k

     ...NVIDIA, we're at the forefront of innovation, driving advancements in AI and machine learning to solve some...  ...the industry-leading deep learning inference software for NVIDIA AI accelerators...  ...-of-the-art LLMs and Generative AI models. Collaborate with deep learning... 

    NVIDIA

    Santa Clara, CA
    5 days ago
  • $95 per hour

     ...Engineer - Data & ML Innovation Location:...  ...Join a cutting-edge AI/ML organization...  ...improving Foundation Models through large-...  ...from a variety of sources including licensed...  ...Support large-scale inference workflows using fine...  ...Leverage open-source and internal... 
    Long term contract

    Systems Integration Solutions

    Cupertino, CA
    2 days ago
  •  ...Powered by the relentless innovation of the open source community, Cloudera advances...  ...experience - enabling data and AI workloads to run anywhere,...  ...run and manage open-source models (Llama, Qwen, etc.) using K...  ...Lead the deployment of inference servers (vLLM, Triton) using... 
    Work from home
    Flexible hours

    Cloudera

    Alviso, CA
    2 days ago
  •  ...Systems builds the world's largest AI chip, 56 times larger than...  ...-leading training and inference speeds and empowers machine learning...  ...customers include top model labs, global enterprises, and...  ...of the GPU. # Publish and open source their cutting-edge AI research... 

    CEREBRAS SYSTEMS INC.

    Sunnyvale, CA
    3 days ago
  • $184k - $287.5k

     ...Develop state‑of‑the‑art model optimization...  ...optimization strategies for inference, such as automated model...  ...breakthrough innovations into shipping product...  ...robotic control, embodied AI, and autonomous decision...  ...Active contributions to open‑source inference and optimization... 

    NVIDIA Gruppe

    Santa Clara, CA
    1 day ago
  • $197k - $291k

    Staff AI Research Engineer, Large User Models Google Mountain View, CA, USA Advanced Experience owning outcomes...  ..., ICML, RecSys) or significant open-source contributions in RecSys, NLP, or...  ...force behind Google’s groundbreaking innovations, empowering the development of our... 
    Full time
    Worldwide

    NLP PEOPLE

    Mountain View, CA
    3 days ago
