Sign up to access all features of our service.
  • Job search
  • Favorites
  • Create a CV
    New
  • Salaries
  • Subscriptions

Research Scientist - Data

$150k

Institute of Foundation Models

About the Institute of Foundation Models We are a dedicated research lab for building, understanding, using, and risk‑managing foundation models. Our mandate is to advance research, nurture the next generation of AI builders, and drive transformative contributions to a knowledge‑driven economy. As part of our team, you’ll have the opportunity to work on the core of cutting‑edge foundation model training, alongside world‑class researchers, data scientists, and engineers, tackling the most fundamental and impactful challenges in AI development. You will participate in the development of groundbreaking AI solutions that have the potential to reshape entire industries. Strategic and innovative problem‑solving skills will be instrumental in establishing MBZUAI as a global hub for high‑performance computing in deep learning, driving impactful discoveries that inspire the next generation of AI pioneers. The Role As a Research Scientist in the Data team, your primary responsibility is to curate high quality data at the web‑scale to fuel the development of next generation foundation models. You will work on exploring and consolidating data sources and collaborate with cross‑functional teams to conduct in‑depth data research, contributing to MBZUAI’s mission of driving impactful AI discoveries and positioning the institution as a leader in the global AI research community. Your expertise will be key in enhancing the performance of large‑scale machine learning models, while supporting the development of transformative AI tools that can influence industries worldwide. Key Responsibilities Pioneer web‑scale data collection and curation methodologies for LLMs and multi‑modal foundation models. Design and implement novel data synthesis pipelines for code, mathematics, and agentic reasoning datasets. Trace the impact of data from pre‑training to final model capabilities and create automated quality assessment frameworks for massive datasets. Design data recipes that maximize model capabilities across diverse domains. Optimize data‑model co‑design for improved training dynamics. Contribute to research papers and represent MBZUAI at industry conferences and events, showcasing the institution’s AI research and innovation. Academic Qualifications Minimum: Master’s in Computer Science, Data Science, or a related technical field, or equivalent practical experience required. Preferred: PhD or equivalent research experience in Machine Learning, NLP, or Data Science with a focus on LLMs and data is preferred. Professional Experience Experience working with large language models, including evaluation, fine‑tuning, and prompt engineering. Strong Python development skills with a focus on research‑grade code and scalable data pipelines. Familiarity with collecting and processing large‑scale datasets from open‑source and web resources. Demonstrated ability to work with ML infrastructure (e.g., model evaluation, optimization, debugging). Proactive mindset with the ability to identify impactful research questions and execute on them with minimal supervision. Effective communication and collaboration skills for working in cross‑functional teams. Preferred Prior research experience in areas such as web data curation and mixing, synthetizing complex datasets for training, LLM evaluation, post‑training data, efficient inference, LLM‑as‑a‑judge, tokenization. Strong publication record in leading AI conferences (e.g., NeurIPS, ICLR, ICML, EMNLP) and/or prior contributions to open‑source AI research or data tools. Hands‑on experience training language/mulitli-modal models from scratch. $150,000 - $450,000 a year Visa Sponsorship This position is eligible for visa sponsorship. Benefits Include Comprehensive medical, dental, and vision benefits Bonus 401K Plan Generous paid time off, sick leave and holidays Paid Parental Leave Employee Assistance Program Life insurance and disability #J-18808-Ljbffr Institute of Foundation Models

Vacancy posted 4 days ago
Similar jobs that could be interesting for youBased on the Research Scientist - Data in Sunnyvale, CA vacancy
  •  ...step of our exciting journey. The mission of the Waymo Research team is to develop machine learning solutions addressing open problems...  ...learning, etc) to these problems; scale them to Google-sized data pipelines; and streamline them to run in real-time on the cars.... 
    Data
    Internship
    Summer internship
    Local area

    Waymo

    Mountain View, CA
    23 hours ago
  •  ...computation. About The Role As an Applied Machine Learning Research Scientist at Cerebras, you will play a key role in turning modern...  ...system behaviors, improving model quality, and iterating on data and evaluation strategies. Your contributions will help translate... 
    Data
    Internship

    CEREBRAS SYSTEMS INC.

    Sunnyvale, CA
    2 days ago
  • $158k - $304k

    Decisive Point in Mountain View, CA is seeking a passionate Research Scientist to join its Research Team. You will engage in world-class research for autonomous systems, leveraging large-scale data and industry-leading tools to develop advanced models. The role requires... 
    Data

    Decisive Point

    Mountain View, CA
    1 day ago
  • $158k - $304k

    Decisive Point in Mountain View, CA is seeking a passionate Research Scientist to drive research for autonomous systems. You will analyze large-scale data and utilize industry-leading tools to innovate in autonomous driving and robotics. The ideal candidate will possess... 
    Data
    Full time

    Decisive Point

    Mountain View, CA
    23 hours ago
  • $184k - $299k

    Senior Research Scientist, Efficient Deep Learning NVIDIA is searching for an outstanding Senior Researcher working on efficient deep learning...  ...‑on experience with large‑scale model training including data preparation and model parallelization (tensor and pipeline) is... 
    Data

    NVIDIA Corporation

    Santa Clara, CA
    4 days ago
  • $184k - $299k

    Senior Research Scientist, Security and Privacy page is loaded## Senior Research Scientist, Security and Privacylocations: US, MA, Westford:...  ...safety concerns are increasingly limiting the access and use of data in AI as well as the use of AI in critical use cases. In... 
    Data
    Work experience placement

    NVIDIA Corporation

    Santa Clara, CA
    4 days ago
  • $168k - $264.5k

     ...across training pipelines and serving systems* Collaborating with researchers to translate cutting-edge ideas into production-ready...  ...Solid background in computer science fundamentals: algorithms, data structures, parallel/distributed computing, and systems programming... 
    Data

    NVIDIA Corporation

    Santa Clara, CA
    4 days ago
  •  ...optimization and integration into the Waymo Driver. We conduct our own research to address real-world problems and collaborate with research teams at Alphabet. We have access to millions of miles of driving data from a diverse set of sensors, enabling engineers like you to (1... 
    Data
    Full time
    Temporary work
    Remote work

    Somi AI

    Mountain View, CA
    23 hours ago
  • $204k - $259k

     ...initiate and foster collaborations with other research teams in Alphabet. AI Foundations areas...  ...role, you will report to a Principal Scientist. You will: Participate in Waymo’s Foundation...  ...and performant manner such as Data parallel, FSDP and other sharding approaches... 
    Data
    Temporary work
    Remote work

    Neura Market

    Mountain View, CA
    1 day ago
  • $183.83k - $275.98k

     ...Behavior team, leveraging the cutting edge of machine learning research to solve challenging real-world robotics problems. This role is...  ...quickly and efficiently, collaborating with other teams to determine data and infrastructure support needs, and working to improve model... 
    Data

    Icehouseventures

    Mountain View, CA
    23 hours ago
  • $150k

    About the Institute of Foundation Models We are a dedicated research lab for building, understanding, using, and risk-managing foundation...  ...model training, alongside world-class researchers, data scientists, and engineers, tackling the most fundamental and impactful challenges... 
    Data
    Visa sponsorship

    Institute of Foundation Models

    Sunnyvale, CA
    4 days ago
  • $147k - $211k

     ...Understanding of machine learning algorithms and research. About the job As an organization,...  ...specific types of work. As a Research Scientist, you will set up large‑scale tests and deploy...  ..., such as machine (and deep) learning, data mining, natural language processing,... 
    Data
    Full time

    Google Inc.

    Mountain View, CA
    23 hours ago
  • $147k - $211k

     ...Gemini Robotics On-Device (our Gemini model that runs without a data network). You will also develop reasoning and agentic systems for...  ...to unlock new robot capabilities. Write software to implement research ideas and iterate. Participate in research, including learning... 
    Data
    Full time

    Google Inc.

    Mountain View, CA
    23 hours ago
  • $150k

    About the Institute of Foundation Models We are a dedicated research lab for building, understanding, using, and risk-managing foundation...  ...model training, alongside world-class researchers, data scientists, and engineers, tackling the most fundamental and impactful challenges... 
    Data
    Visa sponsorship

    Institute of Foundation Models

    Sunnyvale, CA
    4 days ago
  • $174k - $252k

    Senior Research Scientist, Google Cloud AI Research Google Sunnyvale, CA, USA Required qualifications: PhD degree in Computer Science, a related...  ...of computer science, such as machine (and deep) learning, data mining, natural language processing, hardware and software... 
    Data
    Full time

    Google Inc.

    Sunnyvale, CA
    2 days ago
  • $150k

    About the Institute of Foundation Models We are a dedicated research lab for building, understanding, using, and risk-managing foundation...  ...model training, alongside world‑class researchers, data scientists, and engineers, tackling the most fundamental and impactful challenges... 
    Data
    Visa sponsorship
    Shift work

    Institute of Foundation Models

    Sunnyvale, CA
    4 days ago
  • $207k - $300k

    Research Scientist, Evaluations, Security and Privacy, DeepMind DeepMind Mountain View, CA, USA ; San Francisco, CA, USA Apply X Applicants...  ...breadth of computer science, such as machine (and deep) learning, data mining, natural language processing, hardware and software... 
    Data
    Full time

    Google Inc.

    Mountain View, CA
    23 hours ago
  • $158k - $304k

    About the role We are looking for a passionate Research Scientist to join the Research Team at Applied Intuition. This team conducts world‑class...  .... You will have access to millions of miles of large‑scale data and industry‑leading simulators and tools to develop cutting‑... 
    Data
    Full time
    For contractors
    For subcontractor
    Casual work
    Work at office
    Immediate start
    Remote work
    Flexible hours

    Decisive Point

    Mountain View, CA
    1 day ago
  • $158k - $304k

     ...exception.) About the role We are looking for a passionate Research Scientist to join the Research Team at Applied Intuition. This team conducts...  .... You will have access to millions of miles of large‑scale data and industry‑leading simulators and tools to develop cutting‑... 
    Data
    Full time
    For contractors
    For subcontractor
    Casual work
    Work at office
    Immediate start
    Remote work
    Day shift

    Decisive Point

    Mountain View, CA
    1 day ago
  •  ...energy are the foundation of what we do. We ingest large-scale data—weather, prices, load, and grid conditions—to build...  ...we work. The Role We are looking for a Power Systems Research Scientist to develop physics-based models of large-scale transmission systems... 
    Data

    Gridmatic

    Cupertino, CA
    15 days ago
  • $126k - $423k

     ...About the role and team We are looking for multiple passionate Research Scientists to join the Research Group at Applied Intuition. The mission...  ...tools and infra, researchers can access millions of miles of data from large fleets, and deploy methods they develop into... 
    Data
    Full time
    For contractors
    For subcontractor
    Casual work
    Work at office
    Immediate start
    Remote work
    Day shift

    Decisive Point

    Sunnyvale, CA
    23 hours ago
  • $300k

     ...About the Institute of Foundation Models We are a dedicated research lab for building, understanding, using, and risk-managing...  ...foundation model training, alongside world-class researchers, data scientists, and engineers, tackling the most fundamental and impactful challenges... 
    Data
    Visa sponsorship

    Institute of Foundation Models

    Sunnyvale, CA
    15 days ago
  •  ...complexity, and continuously evolve through curiosity, data, and craftsmanship. We’re seeking technologists...  ....   Position Overview: We are expanding our research team and looking for an experienced Research Scientist specializing in Conversational AI & Machine Learning... 
    Data
    Remote work

    ASAPP

    Mountain View, CA
    12 days ago
  • $150k - $300k

     ...deployed at scale. The Silicon Valley Research Lab focuses on developing novel...  ...learning (RL) , etc.   As a Research Scientist in the team, you will conduct research within...  ...you would like more information about how your data is processed, please contact us.... 
    Data
    Full time
    H1b
    Work at office
    3 days per week

    Horizon Robotics

    Cupertino, CA
    17 days ago
  • $147k - $211k

     ...coding experience. 1 year of experience owning and initiating research agendas. About the job Artificial intelligence will be one of humanity...  ...trends and best practices within the community. Define the data structure, framework, design, and evaluation metrics for... 
    Data
    Full time

    Google Inc.

    Mountain View, CA
    23 hours ago
  • $192k - $304.75k

    Senior Quantum AI Research Scientist, Applied Research page is loaded## Senior Quantum AI Research Scientist, Applied Researchlocations: US,...  ...characterization, including simulated and hardware-derived syndrome data, enabling the community to train and evaluate AI models at... 
    Data

    NVIDIA Corporation

    Santa Clara, CA
    23 hours ago
  • $150k

    About the Institute of Foundation Models We are a dedicated research lab for building, understanding, using, and risk‑managing foundation...  ...model training, alongside world‑class researchers, data scientists, and engineers, tackling the most fundamental and impactful challenges... 
    Data
    Visa sponsorship

    Institute of Foundation Models

    Sunnyvale, CA
    1 day ago
  • $165k - $195k

    Senior AI Research Scientist- Time-Series Foundational Models Full-time The Bosch Research and Technology Center North America with offices...  ...Processing, Computer Vision & Mixed Reality, Cloud Robotics, Big Data Visual Analytics, Explainable AI (XAI), Data Science, AI... 
    Data
    Full time
    Work experience placement
    Local area
    Worldwide

    Robert Bosch Group

    Sunnyvale, CA
    23 hours ago
  • $235.03k - $352.29k

     ...Staff ML Research Scientist Mountain View, California (HQ) Nuro is a self-driving technology company on a mission to make autonomy accessible...  ...system Build and leverage effective and efficient ML data pipelines Provide technical and people leadership to a... 
    Data
    Work experience placement

    Nuro

    Mountain View, CA
    23 hours ago
  • $190k - $250k

     ...realistic, physically consistent futures from real-world sensor data. This capability serves as the foundation for scalable closed...  ...that drive our autonomous trucks. We are looking for a research scientist to lead the design and development of world models capable of... 
    Data
    Temporary work
    Work at office
    Visa sponsorship
    Flexible hours

    Kodiak

    Mountain View, CA
    6 days ago

Do you want to receive more vacancies?

Subscribe and receive similar vacancies to Research Scientist - Data. Be the first to apply!