Sign up to access all features of our service.
  • Job search
  • Favorites
  • Create a CV
    New
  • Salaries
  • Subscriptions

Research Engineer, Multimodal Data

Eventual

About Eventual Every breakthrough Physical AI system — humanoid robots, autonomous vehicles, video generation models — is trained on petabytes of video, lidar, radar, and sensor data. But today's data platforms (Databricks, Snowflake) were built for spreadsheet‑like analytics, not the multimodal corpora that power AI. As a result, robotics and video‑AI teams iterate on model improvement about once a week. Most of that week isn't training — it's finding the right data: writing CV heuristics over raw footage, paying annotators for edge cases, hand‑curating clips before a cluster ever spins up. GPU bandwidth has grown 2‑3× per generation. Storage and pipelines haven't. The gap widens every year. Eventual was founded in 2022 to close it. Our open‑source engine, Daft, is the distributed data engine purpose‑built for multimodal AI — already running 2 PB/day at Amazon, 60‑100 PB at another FAANG company, and in production at Mobileye, TogetherAI, and CloudKitchens. We are building a video‑native index on top of our engine for Physical AI that collapses the data iteration loop. Describe the dataset you want, get a curated table in minutes, feed it to your GPUs at line rate. One iteration per day becomes the norm. We're building this in partnership with the top PhysicalAI labs and public AI infrastructure companies today. We have raised $30M from Felicis, CRV, Microsoft M12, Citi, Essence, Y Combinator, Caffeinated Capital, Array.vc, and angels from the co‑founders of Databricks and Perplexity. We've assembled a world‑class team from AWS, Render, Pinecone and Tesla. We have spent our careers powering the last generation of PhysicalAI in self‑driving, and are excited to now do this for the next. Join our small (but powerful!) team working together 4 days/week in our SF Mission district office. Your Role As a Research Engineer on the Visual Understanding team, you'll own the layer that makes petabytes of video queryable by content. Physical AI teams have raw footage, lidar, radar, and sim outputs scattered across object stores with no way to find what they need without weeks of human annotation. We change that economics: we run vision‑language models over every clip in a corpus along axes the customer cares about (gripper type, failure mode, object class, scene, motion density), so a researcher can ask "left-arm grasp failures on deformable objects" and get a curated dataset in minutes. You'll define the roadmap for our visual understanding capabilities, train and select the models that make corpus‑scale annotation tractable at single‑digit cents per hour of video, and build the rich datasets that go on to train customer models. This is a research engineering role — meaning you'll read papers and run experiments, but you ship to production and your work is judged by what it does for customer training runs. Key Responsibilities Own the visual understanding roadmap end‑to‑end: from picking the model family for a customer's taxonomy to landing it in production inference at corpus scale. Train, fine‑tune, and evaluate VLMs, VQA models, embedding models, and convolutional perception models against customer datasets and benchmarks. Drive down per‑clip annotation cost — model selection, distillation, batching, decode pipelining — so "annotate every clip in a 10K‑hour corpus" stays economical. Build the rich, queryable datasets that customers train on: design taxonomies with researchers, instrument quality, version the outputs. Partner with the dataloading and storage teams so visual understanding outputs flow into the index and on to the GPU without re‑engineering. Work directly with researchers at our partner labs — your shortest feedback loop is their next training iteration. What we look for Strong familiarity with modern vision and multimodal models — convolution nets, VLMs, VQA, embeddings — and a sense for the SOTA that's actually deployable today vs. on a leaderboard. Experience running these models at scale on real video and sensor data, ideally for perception tasks (detection, tracking, segmentation, retrieval, captioning). Background from a perception team at a self‑driving, robotics, or visual‑data company — or equivalent depth from a research lab. Comfortable with cloud infrastructure and large‑scale data processing — you don't need to be a distributed‑systems engineer, but you've shipped jobs that ran on thousands of GPU‑hours of video. Bias toward data and infrastructure: you reach for "annotate the whole corpus" before "fine‑tune another model." Nice to have Experience training vision or multimodal models from scratch (not just calling APIs). ML/AI research background — papers, citations, or a research org on your resume. Hands‑on time with big‑data frameworks like Spark, Ray, or Daft. Worked on embeddings, retrieval, or content‑aware search at scale. Experience designing labeling taxonomies or running annotation programs. Perks & Benefits In‑person, tight‑knit team — 4 days/week in our SF Mission office. Competitive comp and meaningful startup equity. Catered lunches and dinners for SF employees. Commuter benefit. Team‑building events and poker nights. Health, vision, and dental coverage. Flexible PTO. Latest Apple equipment. 401(k) plan with match. If you're excited about being on the team that turns petabytes of raw video into the training data for the next generation of Physical AI, we'd love to talk. #J-18808-Ljbffr Eventual

Vacancy posted 1 day ago
Similar jobs that could be interesting for youBased on the Research Engineer, Multimodal Data in San Francisco, CA vacancy
  •  ...model innovation and systems engineering paired with a design‑minded product...  ...models must be trained on data that reflects the world’s...  ...generative models (speech, text, or multimodal). Ability to help guide...  ...scalable systems that bridge research and production. What We... 
    Suggested
    Work at office
    Relocation package

    Cartesia

    San Francisco, CA
    3 days ago
  • A leading AI research organization is looking for a Research Engineer/Scientist to join their Future of Computing Research team in San Francisco. The role focuses...  ...on developing methodologies for personalized multimodal AI systems and includes responsibilities such as building... 
    Suggested
    Relocation package

    OpenAI

    San Francisco, CA
    16 hours ago
  •  ...A pioneering AI firm based in San Francisco is seeking a Research Engineer, Distributed Data Systems. In this role, you will design and maintain infrastructure for large-scale multimodal training, ensuring scalability and reliability of data systems. Candidates should... 
    Suggested
    Work at office
    Relocation package

    OpenAI

    San Francisco, CA
    4 days ago
  • Human Archive is looking for a Research Engineer in San Francisco to work on multimodal sensing systems. You will evaluate sensor technologies and design experiments impacting robotics performance. If you hold a Master's/PhD in relevant fields and have research experience... 
    Suggested

    Human Archive

    San Francisco, CA
    2 days ago
  • talentpluto is seeking a Research Engineer to enhance the quality assurance (QA) systems supporting training data for reinforcement learning. This position demands close collaboration with stakeholders to guarantee reliability and consistency in datasets. Key responsibilities... 
    Suggested

    talentpluto

    San Francisco, CA
    3 days ago
  • A leading AI Customer Support startup located in San Francisco is seeking a Machine Learning Research Engineer. The role involves developing cutting-edge AI models for customer support, focusing on enhancing performance beyond existing solutions. Ideal candidates will have... 

    DRH Search

    San Francisco, CA
    1 day ago
  •  ...Cartesia is looking for a Software Engineer to build the data infrastructure for its AI models in San Francisco. In this hands-on role, you will design and implement scalable data pipelines for multimodal data, particularly audio. Candidates should have experience with... 
    Work at office

    Cartesia, Inc.

    San Francisco, CA
    4 days ago
  • A leading AI startup is seeking senior ML research engineers for their office in San Francisco. The role involves developing models for customer support, experimenting to improve latency, and translating research ideas into actionable plans. Ideal candidates will have... 
    Work at office

    DRH Search

    San Francisco, CA
    1 day ago
  • $180.6k - $315k

     ...of AI applications. For 9 years, Scale has been the leading AI data foundry, helping fuel the most exciting advancements in AI, including...  ...agents in enterprises around the world. The Enterprise ML Research Lab works on the front lines of this AI revolution. We are working... 
    Full time

    Scale AI

    San Francisco, CA
    2 days ago
  •  ...great technology. The Liquid team is a community of world-class engineers, researchers, and builders creating the next generation of AI. Whether...  ...consolidating, gathering, and generating high-quality text data for pretraining, midtraining, SFT, and preference optimization... 

    Liquid AI

    San Francisco, CA
    5 days ago
  •  ...company based in San Francisco, California. The Role: Data Engineer - Multimodal Systems As a Data Engineer - Multimodal Systems, you will...  ...The ability to work well with others in a high-paced research setting Can rapidly learn new fields and are excited... 
    Work at office
    Relocation package

    Zyphra

    San Francisco, CA
    2 days ago
  •  ...Chronicle Bio is developing a data-driven healthcare platform...  ...conditions globally. We integrate multimodal data (clinical records,...  ...our next-generation discovery engine. This role is central to building...  ..., and clinical researchers to ensure data quality and biological... 

    ChronicleBio

    San Francisco, CA
    5 days ago
  •  ...Washington D.C., London and Amsterdam. The Data Foundation and AI team within Plaid’s...  .... Role and Responsibilities As a Senior Research Scientist on the Data Foundation and AI...  ..., model serving infrastructure, feature engineering, and monitoring. In addition, you will develop... 

    Plaid

    San Francisco, CA
    5 days ago
  •  ...ML Engineer - Data Scientist (Enterprise) Hilbert is building the ML systems that power demand intelligence for the world's largest consumer companies - recommendation engines, demand forecasting, customer lifecycle models, and activation systems that must work across... 
    Live in
    Flexible hours
    Shift work

    Hilbert\'s AI

    San Francisco, CA
    3 days ago
  • $185k - $235k

     ...Location Type On-site Department Science & Engineering Compensation The annual base salary...  ...internal equity, and alignment with market data. Why Join Stand At Stand, you’ll help...  ...labeling, self-supervised or HITL systems Multimodal ML or agentic workflows (LLMs + CV) Experience... 
    Full time
    Temporary work
    H1b
    Work at office
    Remote work
    Visa sponsorship
    Work visa
    Flexible hours

    Slope

    San Francisco, CA
    4 days ago
  • Inception is seeking experienced engineers and scientists to develop data pipelines that power our AI models. You will utilize your engineering expertise...  ...data for machine learning and contribute to innovative research in AI technologies. Join a team of world-class talent... 

    Inception

    San Francisco, CA
    4 days ago
  •  ...platform will ultimately become the perception engine for a company’s physical footprint,...  ...engineer responsible for turning sensor data pipelines into actionable insights for our...  ...training platform Driving the design behind a multimodal software user interface Qualifications:... 

    Specter

    San Francisco, CA
    3 days ago
  •  ...Hilbert's AI is seeking an ML Engineer in San Francisco to create ML systems that enhance demand intelligence for top B2C companies. In this role, you'll bridge the gap between complex data and actionable business strategies, working directly with clients to translate... 

    Hilbert's AI

    San Francisco, CA
    1 day ago
  •  ...exploration in the real world. We're looking for research engineers with strong foundations in reinforcement learning, multimodal representation learning, or large-scale model...  ...with Kubernetes You’ve done large-scale data processing with tools like Apache Spark You value... 

    Pantograph

    San Francisco, CA
    4 days ago
  • A generative modeling company in San Francisco is seeking a Research Engineer to lead pre-training and post-training of action-conditioned...  ...physical systems. The ideal candidate has experience with video, multimodal models, and a strong background in machine learning. The... 
    Work at office

    Hedra

    San Francisco, CA
    16 hours ago
  • About Human Archive Human Archive is a research lab backed by Y Combinator focused...  ..., and publish research. Today, our data is used for robotics and world modeling...  ...us. The Opportunity As a Research Engineer, you’ll work on multimodal sensing systems and sensor fusion... 
    Shift work

    Human Archive

    San Francisco, CA
    2 days ago
  •  ...Role As a SLAM / Multi‑View Geometry Engineer on the Robotics team, you will develop...  ...world in 3D from multi‑camera and multimodal sensor data. You will work on real‑time and offline...  ..., and enjoys working closely with AI researchers and engineers. This role is based in... 
    Work at office
    Relocation package

    OpenAI

    San Francisco, CA
    1 day ago
  •  ...breakthrough AI models at leading research labs and enterprises. Since 2018, we've been pioneering data-centric approaches that are...  ...As an Applied Research Engineer, you will be at the forefront...  ...as large language models and multimodal models—and the human data strategies... 
    Flexible hours

    HRB

    San Francisco, CA
    2 days ago
  • $315k

     ...a whole. Our team is a quickly growing group of committed researchers, engineers, policy experts, and business leaders working together to build...  ..., including: GPT-3, Circuit-Based Interpretability, Multimodal Neurons, Scaling Laws, AI & Compute, Concrete Problems in AI... 
    Contract work
    For contractors
    For subcontractor
    Work at office
    Remote work
    Relocation
    Visa sponsorship
    Work visa
    Flexible hours

    Menlo Ventures

    San Francisco, CA
    16 hours ago
  • $200k - $250k

    Research Engineer Location San Francisco (On-site) Compensation $200,000 - $250,000 + variable...  ...power Lotus. You will turn messy health data into accurate, cited, and actionable guidance...  ...use Experience building speech or multimodal pipelines for medical settings... 

    Lotus Health AI

    San Francisco, CA
    2 days ago
  •  ...collects large-scale neuroscientific data sets to train machine learning based...  ...diverse team, including product managers, researchers, and engineering departments, your role involves...  ...image and video transformer models and multimodal systems. Contribute to the creation... 

    Kubelt

    San Francisco, CA
    16 hours ago
  •  ...world industry and economy use cases. As a Research Engineer on our Physical AI team, you will lead...  ...learning, and large-scale multimodal learning Design and generate training...  ...FSDP, and DeepSpeed Work with multimodal data pipelines involving video, sensory inputs... 
    Work at office

    Hedra, Inc

    San Francisco, CA
    2 days ago
  • $320k

     ...a quickly growing group of committed researchers, engineers, policy experts, and business leaders...  ...models to interact with private user data. In this role, you\u2019ll design and...  ...GPT-3, Circuit-Based Interpretability, Multimodal Neurons, Scaling Laws, AI & Compute, Concrete... 
    Full time
    Work at office
    Visa sponsorship
    Flexible hours

    Menlo Ventures

    San Francisco, CA
    4 days ago
  • $320k - $405k

     ...a quickly growing group of committed researchers, engineers, policy experts, and business leaders...  ...and deployments scale, the volume of data we need to monitor and understand is exploding...  ...-3, Circuit-Based Interpretability, Multimodal Neurons, Scaling Laws, AI & Compute,... 
    Work at office
    Visa sponsorship
    Flexible hours

    Menlo Ventures

    San Francisco, CA
    16 hours ago
  • $350k

     ...a whole. Our team is a quickly growing group of committed researchers, engineers, policy experts, and business leaders working together to build...  ..., including: GPT-3, Circuit-Based Interpretability, Multimodal Neurons, Scaling Laws, AI & Compute, Concrete Problems in AI... 
    Work at office
    Visa sponsorship
    Flexible hours

    Menlo Ventures

    San Francisco, CA
    4 days ago

Do you want to receive more vacancies?

Subscribe and receive similar vacancies to Research Engineer, Multimodal Data. Be the first to apply!