Research Engineer, Multimodal Data
Eventual
About Eventual

Every breakthrough Physical AI system (humanoid robots, autonomous vehicles, video generation models) is trained on petabytes of video, lidar, radar, and sensor data. But today's data platforms (Databricks, Snowflake) were built for spreadsheet-like analytics, not the multimodal corpora that power AI. As a result, robotics and video-AI teams iterate on model improvement about once a week. Most of that week isn't training; it's finding the right data: writing CV heuristics over raw footage, paying annotators for edge cases, hand-curating clips before a cluster ever spins up. GPU bandwidth has grown 2-3× per generation. Storage and pipelines haven't. The gap widens every year.

Eventual was founded in 2022 to close it. Our open-source engine, Daft, is the distributed data engine purpose-built for multimodal AI, already running 2 PB/day at Amazon, 60-100 PB at another FAANG company, and in production at Mobileye, TogetherAI, and CloudKitchens.

We are building a video-native index on top of our engine for Physical AI that collapses the data iteration loop. Describe the dataset you want, get a curated table in minutes, feed it to your GPUs at line rate. One iteration per day becomes the norm. We're building this in partnership with the top Physical AI labs and public AI infrastructure companies today.

We have raised $30M from Felicis, CRV, Microsoft M12, Citi, Essence, Y Combinator, Caffeinated Capital, Array.vc, and angels including the co-founders of Databricks and Perplexity. We've assembled a world-class team from AWS, Render, Pinecone, and Tesla. We have spent our careers powering the last generation of Physical AI in self-driving, and are excited to now do this for the next.

Join our small (but powerful!) team working together 4 days/week in our SF Mission district office.

Your Role

As a Research Engineer on the Visual Understanding team, you'll own the layer that makes petabytes of video queryable by content.

Physical AI teams have raw footage, lidar, radar, and sim outputs scattered across object stores with no way to find what they need without weeks of human annotation. We change those economics: we run vision-language models over every clip in a corpus along axes the customer cares about (gripper type, failure mode, object class, scene, motion density), so a researcher can ask "left-arm grasp failures on deformable objects" and get a curated dataset in minutes.

You'll define the roadmap for our visual understanding capabilities, train and select the models that make corpus-scale annotation tractable at single-digit cents per hour of video, and build the rich datasets that go on to train customer models.

This is a research engineering role: you'll read papers and run experiments, but you ship to production, and your work is judged by what it does for customer training runs.

Key Responsibilities

- Own the visual understanding roadmap end-to-end: from picking the model family for a customer's taxonomy to landing it in production inference at corpus scale.
- Train, fine-tune, and evaluate VLMs, VQA models, embedding models, and convolutional perception models against customer datasets and benchmarks.
- Drive down per-clip annotation cost (model selection, distillation, batching, decode pipelining) so "annotate every clip in a 10K-hour corpus" stays economical.
- Build the rich, queryable datasets that customers train on: design taxonomies with researchers, instrument quality, version the outputs.
- Partner with the dataloading and storage teams so visual understanding outputs flow into the index and on to the GPU without re-engineering.
- Work directly with researchers at our partner labs: your shortest feedback loop is their next training iteration.

What we look for

- Strong familiarity with modern vision and multimodal models (convolutional nets, VLMs, VQA, embeddings) and a sense for the SOTA that's actually deployable today vs. on a leaderboard.
- Experience running these models at scale on real video and sensor data, ideally for perception tasks (detection, tracking, segmentation, retrieval, captioning).
- Background from a perception team at a self-driving, robotics, or visual-data company, or equivalent depth from a research lab.
- Comfortable with cloud infrastructure and large-scale data processing: you don't need to be a distributed-systems engineer, but you've shipped jobs that ran on thousands of GPU-hours of video.
- Bias toward data and infrastructure: you reach for "annotate the whole corpus" before "fine-tune another model."

Nice to have

- Experience training vision or multimodal models from scratch (not just calling APIs).
- ML/AI research background: papers, citations, or a research org on your resume.
- Hands-on time with big-data frameworks like Spark, Ray, or Daft.
- Worked on embeddings, retrieval, or content-aware search at scale.
- Experience designing labeling taxonomies or running annotation programs.

Perks & Benefits

- In-person, tight-knit team: 4 days/week in our SF Mission office.
- Competitive comp and meaningful startup equity.
- Catered lunches and dinners for SF employees.
- Commuter benefit.
- Team-building events and poker nights.
- Health, vision, and dental coverage.
- Flexible PTO.
- Latest Apple equipment.
- 401(k) plan with match.

If you're excited about being on the team that turns petabytes of raw video into the training data for the next generation of Physical AI, we'd love to talk.


