Research Scientist - Data
$150k
Institute of Foundation Models
About the Institute of Foundation Models

We are a dedicated research lab for building, understanding, using, and risk‑managing foundation models. Our mandate is to advance research, nurture the next generation of AI builders, and drive transformative contributions to a knowledge‑driven economy.

As part of our team, you’ll have the opportunity to work on the core of cutting‑edge foundation model training, alongside world‑class researchers, data scientists, and engineers, tackling the most fundamental and impactful challenges in AI development. You will participate in the development of groundbreaking AI solutions that have the potential to reshape entire industries. Strategic and innovative problem‑solving skills will be instrumental in establishing MBZUAI as a global hub for high‑performance computing in deep learning, driving impactful discoveries that inspire the next generation of AI pioneers.

The Role

As a Research Scientist in the Data team, your primary responsibility is to curate high‑quality, web‑scale data to fuel the development of next‑generation foundation models. You will explore and consolidate data sources and collaborate with cross‑functional teams to conduct in‑depth data research, contributing to MBZUAI’s mission of driving impactful AI discoveries and positioning the institution as a leader in the global AI research community. Your expertise will be key in enhancing the performance of large‑scale machine learning models, while supporting the development of transformative AI tools that can influence industries worldwide.

Key Responsibilities

- Pioneer web‑scale data collection and curation methodologies for LLMs and multi‑modal foundation models.
- Design and implement novel data synthesis pipelines for code, mathematics, and agentic reasoning datasets.
- Trace the impact of data from pre‑training to final model capabilities and create automated quality assessment frameworks for massive datasets.
- Design data recipes that maximize model capabilities across diverse domains.
- Optimize data‑model co‑design for improved training dynamics.
- Contribute to research papers and represent MBZUAI at industry conferences and events, showcasing the institution’s AI research and innovation.

Academic Qualifications

- Minimum: Master’s in Computer Science, Data Science, or a related technical field, or equivalent practical experience.
- Preferred: PhD or equivalent research experience in Machine Learning, NLP, or Data Science with a focus on LLMs and data.

Professional Experience

- Experience working with large language models, including evaluation, fine‑tuning, and prompt engineering.
- Strong Python development skills with a focus on research‑grade code and scalable data pipelines.
- Familiarity with collecting and processing large‑scale datasets from open‑source and web resources.
- Demonstrated ability to work with ML infrastructure (e.g., model evaluation, optimization, debugging).
- Proactive mindset with the ability to identify impactful research questions and execute on them with minimal supervision.
- Effective communication and collaboration skills for working in cross‑functional teams.

Preferred

- Prior research experience in areas such as web data curation and mixing, synthesizing complex datasets for training, LLM evaluation, post‑training data, efficient inference, LLM‑as‑a‑judge, and tokenization.
- Strong publication record in leading AI conferences (e.g., NeurIPS, ICLR, ICML, EMNLP) and/or prior contributions to open‑source AI research or data tools.
- Hands‑on experience training language/multi‑modal models from scratch.

$150,000 - $450,000 a year

Visa Sponsorship

This position is eligible for visa sponsorship.
Benefits Include

- Comprehensive medical, dental, and vision benefits
- Bonus
- 401K Plan
- Generous paid time off, sick leave and holidays
- Paid Parental Leave
- Employee Assistance Program
- Life insurance and disability


