Sign up to access all features of our service.
  • Job search
  • Favorites
  • Create a CV
    New
  • Salaries
  • Subscriptions

Research Scientist, Data

Full-time

Pika

About the Role At Pika, we are pioneering the next generation of creative infrastructure built around real-time, multimodal generation and intelligent agentic platforms. We are looking for a staff or lead-level Research Engineer, Data to architect and scale data engineering systems supporting model training for our advanced multimodal foundation models. This pivotal role will strengthen our research teams by building, optimizing, and owning large-scale data pipelines and robust ML data curation, ensuring our foundation models have access to the highest quality and most diverse datasets. If you are passionate about powerful data infrastructure and innovative research-engineering, join us to make an impact for millions of creators. What You’ll Do Take ownership of large-scale data pipeline architecture and implementation to support model training and research workflows for text, image, audio, and video datasets Partner with research and engineering teams to curate, clean, and manage diverse, sensory-rich datasets for pre-training and mid-training of multimodal models Develop strategies and tools for scalable data ingestion, labeling, filtering, augmentation, and storage Ensure data quality, reliability, and compliance, including managing privacy and ethical considerations throughout the data lifecycle Optimize data processing, transformation, and delivery for large-scale distributed training pipelines Prototype and productionize new methods for dataset creation, management, and continuous improvement in response to researcher needs Contribute to the integration of research-driven data advancements into production-ready systems Stay informed on emerging data engineering and ML data management developments, bringing best practices to our systems What We’re Looking For 5+ years of experience building and scaling data pipelines for machine learning applications at staff or lead engineer level, ideally in research or model training environments Strong background in data engineering and ML data curation for LLMs, VLMs, or other large-scale multimodal models Expertise in distributed data systems (e.g., Spark, Hadoop, Ray, or similar) and efficient large dataset processing/ETL workflows Proven ability to build robust, scalable, and production-grade data infrastructure for ML pipelines Experience developing tools for data labeling, filtering, deduplication, quality assurance, and dataset management Strong programming skills (Python, SQL, PySpark, or similar) and familiarity with cloud data platforms (AWS, GCP, Azure) Knowledge of privacy, compliance, ethics, and best practices in data collection and management Excellent cross-functional collaboration, problem-solving, and communication skills Passion for enabling cutting-edge generative AI and creative technology through data excellence What We Offer Competitive salary and substantial equity in a high-growth startup Full health benefits, 401k matching, and more Collaborative, mission-driven team environment with major growth opportunities Flexible on-site/remote hybrid (HQ in Palo Alto, CA) About Pika Pika empowers creators by building state-of-the-art agentic and multimedia platforms. Our vision is to break down technical barriers to creativity, making real-time generative and intelligent orchestration accessible to all. Join us and help shape the next evolution of creative technology! If you are a data-driven research engineer excited to lead and scale the data infrastructure powering real-time multimodal foundation models, we want to hear from you.

Vacancy posted 2 days ago
Similar jobs that could be interesting for youBased on the Research Scientist, Data in Palo Alto, CA vacancy
  •  ...every step of our exciting journey. The mission of the Waymo Research team is to develop machine learning solutions addressing open problems...  ...learning, etc) to these problems; scale them to Google-sized data pipelines; and streamline them to run in real-time on the cars.... 
    Data
    Internship
    Summer internship
    Local area

    DiversityJobs Inc

    Mountain View, CA
    2 days ago
  •  ...product development and improving conversational technologies. The ideal candidate has at least 10 years of experience in large scale data processing, a Master's or Ph.D. in relevant fields, and hands-on experience with transformer models. Join us in shaping the future... 
    Data

    Otter.ai

    Mountain View, CA
    6 days ago
  •  ...determined according to the order of listing. What you’ll do As a Research Scientist at Simular, you will: Shape the future of agentic AI by...  ...AI safety). Design and execute experiments end-to-end: from data collection and benchmarking, to model training and evaluation... 
    Data

    Simular Inc.

    Palo Alto, CA
    5 days ago
  •  ...model (LLM) for the healthcare industry. Our team comprised of ex-researchers from Microsoft, Meta, Nvidia, Apple, Stanford, John Hopkins and...  .... Responsibilities: Design, Develop, Evaluate and update data-driven models for Speech First applications. Participate in Research... 
    Data
    Work at office

    Dormont Manufacturing Co

    Palo Alto, CA
    2 days ago
  • $115k - $140k

     ...Alto, Subsense brings together leading scientists and engineers to redefine the future of...  ...interaction. The Opportunity We’re seeking a Research Scientist with strong expertise in...  ...Develop experimental protocols, maintain data integrity, and contribute to publications... 
    Data

    Subsense, Inc.

    Palo Alto, CA
    2 days ago
  •  ...optimization and integration into the Waymo Driver. We conduct our own research to address real-world problems and collaborate with research teams at Alphabet. We have access to millions of miles of driving data from a diverse set of sensors, enabling engineers like you to (1... 
    Data
    Full time
    Temporary work
    Remote work

    Somi AI

    Mountain View, CA
    2 days ago
  • $147k - $211k

     ...organization, Google maintains a portfolio of research projects driven by fundamental research,...  ...specific types of work. As a Research Scientist, you'll set up large-scale tests and...  ...science, such as machine (and deep) learning, data mining, natural language processing,... 
    Data
    Full time

    Google Inc.

    Mountain View, CA
    2 days ago
  • $204k - $259k

     ...initiate and foster collaborations with other research teams in Alphabet. AI Foundations areas...  ...role, you will report to a Principal Scientist. You will: Participate in Waymo’s Foundation...  ...and performant manner such as Data parallel, FSDP and other sharding approaches... 
    Data
    Temporary work
    Remote work

    Neura Market

    Mountain View, CA
    3 days ago
  • $174k - $252k

    Senior Research Scientist, Google Research Mountain View, CA, USA; New York, NY, USA; +2 more Apply X Applicants in San Francisco: Qualified...  ...of computer science, such as machine (and deep) learning, data mining, natural language processing, hardware and software performance... 
    Data
    Full time

    Google Inc.

    Mountain View, CA
    4 days ago
  • $252k - $400k

     ...of models will look fundamentally different. We’re assembling a research team dedicated to shaping that future. The Opportunity We’re creating...  ...community. Benefits of Research in Industry Rich real‑time data : access to large‑scale, diverse, and dynamic user interactions.... 
    Data
    Immediate start
    Worldwide

    AppLovin

    Palo Alto, CA
    3 days ago
  • $147k - $211k

     ...Gemini Robotics On-Device (our Gemini model that runs without a data network). You will also develop reasoning and agentic systems for...  ...to unlock new robot capabilities. Write software to implement research ideas and iterate. Participate in research, including learning... 
    Data
    Full time

    Google Inc.

    Mountain View, CA
    2 days ago
  • $174k - $253k

     ...organization, Google maintains a portfolio of research projects driven by fundamental research,...  ...specific types of work. As a Research Scientist, you will set up large‑scale tests and...  ...computer science, such as machine learning, data mining, natural language processing,... 
    Data
    Worldwide

    Google Inc.

    Mountain View, CA
    3 days ago
  • $176k - $253k

    At Toyota Research Institute (TRI), we’re on a mission to improve the quality of human life...  ...Opportunity We are looking for a Research Scientist to join us in building intelligent...  ...evaluate a wide range of architectural, data, and algorithmic choices, and help shape... 
    Data
    Work experience placement
    Internship
    Local area
    Shift work

    Toyota Research Institute

    Los Altos, CA
    6 days ago
  •  ...translation fluency under real-world disfluency. We’re looking for a Research Scientist who can define what "better" actually means across all of...  ...conditions optimize for. Feed evaluation insights back into data acquisition and model training priorities — identifying which... 
    Data

    Sanas

    Palo Alto, CA
    1 day ago
  • About Us GenMD is unlocking healthcare data at scale. Today, roughly 97% of healthcare data goes unused because of patient privacy...  ...that data—safely and ethically—for AI labs, pharma companies, and researchers. This isn’t a chatbot, or an AI agent replacing clinicians or... 
    Data
    Internship
    Night shift

    GenMD

    Palo Alto, CA
    3 days ago
  •  ...not months -automating the loop of evaluation, data synthesis, training, and repeat. Oumi also develops an open research stack and models in collaboration with...  ...experimentation, and adoption. Role Overview The Research Scientist will be an integral part of Oumi's research... 
    Data
    Worldwide
    Flexible hours

    Oumi

    Palo Alto, CA
    4 days ago
  •  ...models that leverage our large-scale, high-quality, real-world data collection system. At the same time, we’re building a new kind of...  ...more time on the things they value most. As a Machine Learning Research Engineer, you will work on the software and algorithms that enable... 
    Data

    Sunday Robotics

    Mountain View, CA
    2 days ago
  • $184k - $299k

     ...Senior Research Scientist, Efficient Deep Learning NVIDIA is searching for an outstanding Senior Researcher working on efficient deep learning...  ...‑on experience with large‑scale model training including data preparation and model parallelization (tensor and pipeline) is... 
    Data

    NVIDIA Corporation

    Santa Clara, CA
    3 days ago
  • $192k - $304.75k

     ...We are now looking for a Research Scientist with a focus in System Software and I/O! NVIDIA is seeking Research Scientists with a focus in System...  ...workloads such as recommender systems, graph analytics, and data frames. Your base salary will be determined based on... 
    Data
    Work experience placement

    NVIDIA Gruppe

    Santa Clara, CA
    3 days ago
  • $126k - $248k

     ...enable accurate, efficient unstructured data search and retrieval for RAG,...  ...more. It is backed by a strong team of AI researchers from Stanford, MIT, Berkeley, Princeton,...  ...OVERVIEW We are seeking a Senior Research Scientist to join our team and contribute to the... 
    Data
    Full time
    Local area
    Worldwide
    Flexible hours

    MongoDB

    Palo Alto, CA
    2 days ago
  •  ...Fortune 500 enterprises, we bring together research, engineering, product, and domain...  ...Articul8 AI is seeking a Principal Research Scientist to define how we build, evaluate, and scale...  ...full model development lifecycle: domain data strategy, continued pre‑training, supervised... 
    Data
    Shift work

    Articul8

    Palo Alto, CA
    5 days ago
  • ## Senior Staff Research Scientist, Agentic AI & RLApplylocations: East Palo Alto, CAtime type: Full timeposted on: Posted Todayjob requisition id: JR107333**About Centific**Centific is a frontier AI data foundry that curates diverse, high-quality data, using our purpose... 
    Data

    Centific Global Solutions, Inc.

    Palo Alto, CA
    2 days ago
  • $202.35k - $303.05k

     ...challenging problems in autonomous driving. You will be focusing on researching and developing state of the art generative models, with an...  ...Apply the model to various tasks such as planning, prediction, data generation, simulation, and so on. Research SoTA algorithms to... 
    Data

    Icehouseventures

    Mountain View, CA
    2 days ago
  • $151k - $297k

     ...enable accurate, efficient unstructured data search and retrieval for RAG, recommendation...  .... It is backed by a strong team of AI researchers from Stanford, MIT, Berkeley, Princeton,...  ...We are seeking a Staff Research Scientist to join our team and contribute to the development... 
    Data
    Work at office
    Local area
    Remote work
    Flexible hours

    United States Digital Space LLC

    Palo Alto, CA
    2 days ago
  • $197.8k - $296.6k

     ...Summary The Robot Intelligence Lab at Samsung Research America is a new facility dedicated to...  ...is looking for a Senior Staff Research Scientist with solid technical skills and rich academic...  ...(vision, tactile, audio, semantic) data fusion. Work with the team to design and... 
    Data
    Work at office
    Local area

    Dormont Manufacturing Co

    Mountain View, CA
    2 days ago
  • $225k - $275k

     ...an experienced consultant for a Principal Scientist position in our Polymers & Chemistry...  ...solving skills, with the ability to interpret data, identify trends, and make informed...  ...derived from government funding for academic research projects). Benefits you will enjoy Access... 
    Data
    Work at office

    Exponent

    Menlo Park, CA
    4 days ago
  •  ...healthcare industry. Our team comprised of ex‑researchers from Microsoft, Meta, Nvidia, Apple,...  ...‑AI interactions. Overview Applied Scientists at Hippocratic provide a dynamic opportunity...  ...TensorFlow. Experience with large‑scale data processing and distributed computing.... 
    Data
    Work at office

    Dormont Manufacturing Company

    Palo Alto, CA
    3 days ago
  • $190k - $250k

     ...realistic, physically consistent futures from real-world sensor data. This capability serves as the foundation for scalable...  ...models that drive our autonomous trucks. We are looking for a research scientist to lead the design and development of world models capable of... 
    Data
    Full time
    Temporary work
    Work at office
    Visa sponsorship
    Flexible hours

    Kodiak

    Mountain View, CA
    22 hours ago
  •  ...A biotechnology company is seeking a Clinical Scientist to design and support clinical trials from its Palo Alto office or remotely. The role involves analyzing clinical data, collaborating with cross-functional teams, and ensuring the integrity of clinical studies. Ideal... 
    Data
    Work at office
    Remote work

    Fortvita Biologics

    Palo Alto, CA
    1 day ago
  • $185k - $215k

     ...Company Description The Bosch Research and Technology Center North America with offices...  ...Valley focuses on Foundation Models, Big Data Visual Analytics, Explainable AI (XAI), Natural...  ...Job Description As a Senior Research Scientist- Vision-Language-Action (VLA) Models, you... 
    Data
    Part time
    Work experience placement
    Local area
    Immediate start
    Worldwide

    Bosch Group

    Sunnyvale, CA
    5 days ago

Do you want to receive more vacancies?

Subscribe and receive similar vacancies to Research Scientist, Data. Be the first to apply!