Sign up to access all features of our service.
  • Job search
  • Favorites
  • Create a CV
    New
  • Salaries
  • Subscriptions

Applied ML Systems Scientist for Large-Scale LLMs

Dormont Manufacturing Co

Cerebras Systems builds the world’s largest AI chip, 56 times larger than GPUs. Our novel wafer-scale architecture provides the AI compute power of dozens of GPUs on a single chip, with the programming simplicity of a single device. This approach allows Cerebras to deliver industry-leading training and inference speeds and empowers machine learning users to effortlessly run large-scale ML applications, without the hassle of managing hundreds of GPUs or TPUs. Cerebras’ current customers include top model labs, global enterprises, and cutting-edge AI-native startups. OpenAI recently announced a multi-year partnership with Cerebras , to deploy 750 megawatts of scale, transforming key workloads with ultra high-speed inference. Thanks to the groundbreaking wafer-scale architecture, Cerebras Inference offers the fastest Generative AI inference solution in the world, over 10 times faster than GPU-based hyperscale cloud inference services. This order of magnitude increase in speed is transforming the user experience of AI applications, unlocking real-time iteration and increasing intelligence via additional agentic computation. About The Role As an Applied Machine LearningResearch ScientistatCerebras, you will play a key role in turning modern machine learning techniques into scalable, high-performance systems. This role sits at the intersection of modeling and systemsfocused not on publishing new algorithms, but on understanding how they work and making them run effectively at scale. Your work will directlyimpacthow large language models (LLMs) are trained,optimized, and deployed on one of the most advanced AI platforms in the world. You will work closely with researchers and senior engineers to implement and improve workflows for LLM pretraining, fine-tuning, and reinforcement learning-based post-training. This includes building training pipelines, debugging complex system behaviors, improving model quality, anditerating ondata and evaluation strategies. Your contributions will help translatecutting-edgeML ideas into reliable, production-ready systems that solve real-world problems. This role is ideal for candidates who enjoy hands-on engineering, want to build deep intuition for ML systems, and are excited about working on LLMs and reinforcement learning in practice,not just in theory. Responsibilities Apply post-training techniques (e.g. RLVR, RLHF, GRPO etc.) techniques to improve model performance. Build and maintain evaluation pipelines to measure model performance across tasks and domains. Debug issues across the ML stack, including data pipelines, training jobs, model outputs and mixed or lower precision computation. Collaborate with researchers to translate ML ideas into efficient, scalable implementation. Design, implement, and scale ML pipelines across all stages of LLM development (pretraining, fine-tuning, alignment). Work with large datasets, including dataset generation, filtering, and synthetic data approaches. Optimize training and inference workflows for performance, efficiency, and reliability. Contribute high-quality, maintainable code to shared ML infrastructure. Skills & Qualifications Bachelor’s or Master’s degree in Computer Science, Engineering, or a related field. 0 - 5 years of experience (including internships, research, or industry experience) working with machine learning systems; we are hiring multiple positions for various levels. Strong programming skills in Python. Experience with ML frameworks such as PyTorch. Solid understanding of machine learning fundamentals. Familiarity with deep learning architectures, particularly transformers. Ability to read and understand modern ML papers and implement key ideas. Preferred Skills & Qualifications Experience working with large language models (training, fine-tuning, and evaluation). Familiarity with reinforcement learning concepts. Experience with distributed training frameworks (e.g., FSDP, Megatron). Experience working with large-scale datasets and data pipelines. Experience debugging or optimizing ML systems for performance. • Contributions to meaningful codebases, projects, or open-source systems Why Join Cerebras People who are serious about software make their own hardware. At Cerebras we have built a breakthrough architecture that is unlocking new opportunities for the AI industry. With dozens of model releases and rapid growth, we’ve reached an inflection point in our business. Members of our team tell us there are five main reasons they joined Cerebras: Build a breakthrough AI platform beyond the constraints of the GPU. Publish and open source their cutting-edge AI research. Work on one of the fastest AI supercomputers in the world. Enjoy job stability with startup vitality. Our simple, non-corporate work culture that respects individual beliefs. Read our blog: Five Reasons to Join Cerebras in 2026. Apply today and become part of the forefront of groundbreaking advancements in AI! _Cerebras Systems is committed to creating an equal and diverse environment and is proud to be an equal opportunity employer._ We celebrate different backgrounds, perspectives, and skills. We believe inclusive teams build better products and companies. We try every day to build a work environment that empowers people to do their best work through continuous learning, growth and support of those around them. This website or its third-party tools process personal data. For more details, click here to review our CCPA disclosure notice. #J-18808-Ljbffr Dormont Manufacturing Co

Vacancy posted 5 days ago
Similar jobs that could be interesting for youBased on the Applied ML Systems Scientist for Large-Scale LLMs in Sunnyvale, CA vacancy
  • $139.9k - $274.8k

     ...Azure Machine Learning (ML), Cognitive Services...  ...Microsoft ships AI systems that are safe,...  ...research and planet-scale production systems,...  ...looking for a Principal Applied Scientist to join our team!...  ...and familiarity with large language models (LLMs). ~ Experience with... 
    Suggested
    Ongoing contract
    Work at office
    Local area
    Shift work

    Microsoft Corporation

    Mountain View, CA
    5 days ago
  • $176k - $253.5k

     ...'s Adaptive Behavioral Systems Department within our Human...  ...for an AI Research Scientist, or Senior Machine Learning...  ...with expertise in large-scale foundational model training...  ...or large-scale ML. Industry experience is...  ...who inquire about and/or apply to work for Toyota Research... 
    Suggested
    Temporary work
    Local area
    Shift work

    Toyota Research Institute

    Los Altos, CA
    4 days ago
  •  ...opportunity for individuals looking to apply their academic knowledge of...  ..., and relationships within large datasets. Assist in...  ...concise manner. Work with large-scale data sets using SQL, Python for...  ...AI and Large Language Models (LLMs) is a plus. Expected Base... 
    Suggested
    Hourly pay
    Permanent employment
    Internship

    Marvell

    Santa Clara, CA
    4 days ago
  • $120.7k - $238.6k

    Adobe Applied Science & Machine Learning (ASML) group is looking for applied research scientists and engineers to help us build the next...  ...that can work with large-scale data in production systems Develop and fine-tune...  ...on top of the latest ML research (e.g., diffusion... 
    Suggested
    Temporary work
    Local area
    Worldwide

    Dormont Manufacturing Co

    San Jose, CA
    5 days ago
  • $164.35k - $260k

     ...our company. As an Applied Machine Learning Scientist within our dynamic...  ...production-grade advanced ML solutions across...  ...recognition, recommendation systems, information...  ...Fine-tune and adapt LLMs/SLMs using PEFT (LoRA...  ...~ Experience with scaling LLM systems (caching,... 
    Suggested

    JPMorgan Chase Bank, N.A.

    Palo Alto, CA
    4 days ago
  • $192k - $304.75k

    Senior Quantum AI Research Scientist, Applied Research page is loaded## Senior...  ...of fault-tolerant quantum systems powered by machine learning....  ...*** Design and architect AI/ML models - including deep...  ...* Help create high-quality, large-scale datasets for quantum error correction... 

    NVIDIA Corporation

    Santa Clara, CA
    2 days ago
  • $286.4k - $326.8k

    Applied Researcher II, AI Foundations...  ...and reliable AI systems, changing banking...  ...of AI & ML are bringing humanity...  ...team of data scientists, software engineers...  ...building large deep learning models...  ...models at scale both in terms of...  ...of pre-trained LLMs, SSL techniques... 
    Full time
    Part time
    Local area
    Flexible hours

    Capital One

    San Jose, CA
    18 hours ago
  •  ...discipline to work on Agentic AI systems for mobility. What you'll...  ...solutions that advance AI/ML systems for mobility services Conducting applied research in Agentic AI,...  ...experience with Foundational (LLMs, VLMs, and other large-scale AI architectures), including... 

    Vantage Point Consulting Inc.

    Mountain View, CA
    3 days ago
  • $170k - $216k

     ...autonomous ride-hail service and can also be applied to a range of vehicle platforms and...  ...states. The Perception team builds the system which learns the spatial-temporal...  ...efficiently and continuously learning from large scale real-world data, to (2) develop models and... 
    Full time
    Remote work

    Waymo

    Mountain View, CA
    3 days ago
  • $204k - $259k

     ...-hail service and can also be applied to a range of vehicle platforms...  ...Perception team builds the system which learns the spatial-temporal...  ...continuously learning from large scale real-world data, to (2) develop...  ...will: Own tasks in the ML Driver, take responsibility for... 
    Full time
    Temporary work
    Remote work

    Waymo

    Mountain View, CA
    3 days ago
  • $170k - $216k

     ...-hail service and can also be applied to a range of vehicle platforms...  ...Perception team builds the system which learns the spatial-temporal...  ...continuously learning from large scale real-world data, to (2) develop...  ...with Python ~ Experience with ML frameworks like PyTorch or JAX... 
    Full time
    Remote work

    Waymo

    Mountain View, CA
    3 days ago
  • $192.2k - $260k

     ...motivated, passionate and resourceful Applied Scientist to bring diverse perspectives, ideas,...  ...solutions for various Prime Video Search systems using Deep learning, GenAI,...  ..., numpy, scipy etc. - Experience with large scale distributed systems such as Hadoop, Spark... 
    Local area
    Worldwide
    Flexible hours

    Amazon.com Inc

    Sunnyvale, CA
    2 days ago
  •  ...Machine Learning Scientist Recommender systems (e.g., image and content recommendations...  ...notifications. Conduct applied research to improve...  ...systems using traditional ML techniques, deep learning...  ...deep learning applied to large-scale recommendation problems.... 
    Work at office
    Remote work
    2 days per week

    Wayfair

    Mountain View, CA
    1 day ago
  • $60 per hour

     ...boundaries of recommendation systems? Do you dream of...  ...limits of deep learning and large-scale system design, we're...  ...the power of LLMs to understand and process...  ...Conduct original research on applying RL (e.g., bandit models...  ...Python and familiarity with ML frameworks such as... 
    Hourly pay
    Internship
    Local area

    Tik Tok

    San Jose, CA
    5 days ago
  • $60 per hour

     ...Machine Learning Scientist Intern (TikTok-Recommendation...  ...recommendation systems that elevate user...  ...understanding, LLMs, robustness, and...  ...the full-stack ML pipeline—from algorithm...  ...following fields: applied machine learning,...  ...infrastructure, large-scale recommendation system... 
    Hourly pay
    Internship
    Local area

    Tik Tok

    San Jose, CA
    3 days ago
  • $142.8k - $274.8k

     ...understanding and recommendation systems-spanning text, images,...  .... As a Principal Applied Scientist , you'll lead the...  ...stack, combining LLMs, multimodal models, and...  ...contentquality understanding at scale. Design and deploy...  ...principles into the ML system; create redteaming... 
    Ongoing contract
    Work at office
    Local area

    Microsoft Corporation

    Mountain View, CA
    1 day ago
  • $269.4k - $412.6k

     ...breakthrough hardware and battery systems to intuitive design,...  ...on a global scale. Role: As a Principal...  ...and deploying advanced ML models to reliably and safely...  ...to GM's mission. Apply techniques such as RL, guidance...  ...experience working with large-scale Foundation Models... 
    Local area
    Work from home
    Relocation package
    Flexible hours

    General Motors

    Mountain View, CA
    2 days ago
  • $140k - $195k

     ...generation of intelligent systems-integrating...  ...innovation at global scale. The Role :...  ...development of end-to-end AI/ML systems that solve...  ...-class team of scientists and engineers, and...  ...training large scale end-to-end multimodal...  ...for each role and apply for any positions that... 
    Work at office
    Local area
    Work from home
    Relocation package
    Flexible hours

    General Motors

    Mountain View, CA
    1 day ago
  • $173k

     ...Machine Learning Scientist The Senior...  ...Owns end-to-end ML and GenAI projects...  ...evaluation. Applies deep expertise in...  ...customer experience at scale? Would you like...  ...for medium-to-large projects: from...  ..., scalable ML systems (batch and/or streaming...  ...to integrate LLMs into production... 
    Local area
    Worldwide
    Flexible hours

    Expedia Group

    San Jose, CA
    1 day ago
  • $207k - $300k

    Staff AI Research Scientist, Applied AI, Google Cloud corporate_fare Google...  ...algorithms and tools, or Applied ML (e.g., LLM's, Generative AI,...  ...AI solutions to solve large-scale problems. Experience with the...  ...AI quality for production systems by defining technical metrics... 
    Full time

    Google Inc.

    Sunnyvale, CA
    2 days ago
  •  ...McLean, Virginia Applied Researcher I...  ...and reliable AI systems, changing banking...  ...of AI & ML are bringing humanity...  ...team of data scientists, software engineers...  ...building large deep learning models...  ...models at scale both in terms of...  ...of pre-trained LLMs, SSL techniques... 
    Full time
    Part time
    Local area
    Flexible hours

    Capital One National Association

    San Jose, CA
    16 hours ago
  •  ...future of banking through Agentic AI.The Applied Artificial Intelligence and Machine...  ...andfrontier models. As an Applied AI ML Researcher Director in the Applied AI...  ...edge theory and enterprise-grade,large-scale,deployable systems. Job Responsibilities: ~ Architect... 

    JPMorgan Chase Bank, N.A.

    Palo Alto, CA
    2 days ago
  •  ...Principal Research Scientist to define how we build, evaluate, and scale domain‑specific models...  ...agentic AI systems to compress strategic...  ...Experience: 10+ years in AI/ML research with an...  ...adapting models on large GPU clusters using...  ...approaches) applied to real alignment problems... 
    Shift work

    Articul8

    Palo Alto, CA
    5 days ago
  •  ...-hail service and can also be applied to a range of vehicle platforms...  ...Perception team builds the system which learns the spatial-temporal...  ...continuously learning from large scale real-world data, to (2) develop...  ...You will: Own tasks in the ML Driver, take responsibility for... 
    Full time
    Temporary work
    Remote work

    Somi AI

    Mountain View, CA
    2 days ago
  • $300k

     ...researchers, data scientists, and engineers, tackling...  ...Overview Build and scale distributed pre-...  ...stability. Apply kernel fusion, communication...  ...the future of large language models. Why...  ...breakthrough papers into real systems and publish your...  ...skills on large ML codebases.... 
    Visa sponsorship

    Institute of Foundation Models

    Sunnyvale, CA
    1 day ago
  • $183.83k - $275.98k

     ...clear path to AVs at commercial scale—empowering a safer, richer,...  ...advancements in the field of ML and large-scale learning to the AV...  ...end-to-end autonomous driving system. This role requires working...  ...inference speeds. You will use your applied research skills to think... 

    Icehouseventures

    Mountain View, CA
    2 days ago
  • $168k - $264.5k

    We are now looking for a Research Scientist New Graduate with a focus on Machine Learning Systems (MLSys). NVIDIA Research seeks exceptional systems researchers...  ...software, and infrastructure technology for ML systems of all scales. Advances in AI/ML rely on efficient,... 

    NVIDIA Gruppe

    Santa Clara, CA
    5 days ago
  • $168k - $264.5k

     ..., HCI, and HRI to create systems that perceive, learn, and...  ...working in a team of research scientists and engineers from varied...  ..., including working with large-scale, multi-modal foundation models (such as recent LLMs, VLMs). A track record of applying human behavior models to... 

    NVIDIA Gruppe

    Santa Clara, CA
    5 days ago
  •  ...Applied AI ML Researcher Director Our goal is to build the next generation of AI: autonomous agents that can reason, plan,...  ...the gap between leading edge theory and enterprise-grade, large-scale, deployable systems. Job Responsibilities: Architect and develop... 

    Chase

    Palo Alto, CA
    3 days ago
  • $193.93k - $291.15k

     ...ML Research Scientist, Prediction & Smart Agents Mountain View...  ...to AVs at commercial scale, empowering a safer, richer...  ...for our autonomous system, as they will be deployed...  ..., we encourage you to apply! About the Work...  ...decoder architectures. Large generative models and... 

    Nuro

    Mountain View, CA
    4 days ago

Do you want to receive more vacancies?

Subscribe and receive similar vacancies to Applied ML Systems Scientist for Large-Scale LLMs. Be the first to apply!