Robotics Data Pipeline Engineer - Multimodal Data

Persona AI

Data Pipeline Engineer

Persona AI is developing and commercializing rugged, multi-purpose humanoid robots that perform real work. Persona's founding team has a decades-long history in humanoid robotics, bionics, and product development delivering robust hardware that has touched the stars, worked miles below the surface of the ocean, and even roamed Disney Parks. Our mission is focused squarely on shipping beautiful, reliable products at massive scale, while building a customer-focused team to achieve these aims.

At Persona we require an unprecedented volume of high-quality, multimodal data. We are moving beyond basic teleoperation to leverage massive datasets of in-the-wild egocentric video combined with dense sensor streams (IMU, haptics, kinematics, and high-fidelity force profiles). We are seeking a highly skilled Data Pipeline Engineer to architect the systems that turn this raw, unstructured multimodal data—including critical force-aware data collections—into high-fidelity training assets for our robots.

The Role

As a Data Pipeline Engineer, you will architect and scale the data infrastructure that feeds our foundation models. Your primary mission is to extract, augment, and align human dexterous manipulation data from massive, complex multi-sensor and egocentric video datasets. Crucially, you will build advanced post-processing algorithms to perform deep force analysis and infer hidden states from raw data, such as processing direct force-torque outputs to quantify grasp dynamics, estimating contact forces from visual cues, extrapolating heavily occluded hand positions, or deriving 3D geometry from 2D frames. You will use spatial, temporal, and cross-modal data augmentation to multiply the value of every minute of data our teleoperation team collects.

Key Responsibilities

  • Multimodal Data Pipelines: Architect highly efficient, scalable pipelines to ingest, decode, and synchronously process thousands of hours of high-resolution egocentric video alongside rich sensor streams (IMUs, force-torque sensors, tactile pads, and joint proprioception).
  • Force Analysis & Hidden State Inference: Develop sophisticated post-processing algorithms to analyze force interactions and infer unobservable or missing states from raw data. This includes calibrating and cleaning direct force-aware data collections, estimating contact forces from object deformation, tracking occluded objects during complex manipulation, or applying inverse kinematics to fill in missing joint trajectories.
  • Kinematic Retargeting & Alignment: Develop algorithms to translate 3D human hand tracking, wrist motion, and pose estimation into the specific 6DoF/joint-space coordinates of our humanoid's end-effectors, relying on sensor fusion to ensure absolute precision.
  • Advanced Data Augmentation: Implement robust data augmentation strategies (spatial transformations, temporal scaling, synthetic viewpoints, and sensor noise injection) to expand expert trajectories and improve the robustness of our learning models.
  • Teleoperation Synchronization: Work closely with the Hardware Teleoperation Team (UMI & Console operators) to perfectly align human-robot play-data (haptics, force profiles, video, audio, telemetry) with large-scale pre-training datasets.
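
As a rough illustration of the synchronization work described in the responsibilities above (a minimal sketch, not Persona AI's actual pipeline), the snippet below pairs each video frame with the nearest sensor sample by timestamp. The function name `align_to_frames`, the `max_gap` tolerance, and the sample values are all hypothetical and chosen only for this example.

```python
from bisect import bisect_left


def align_to_frames(frame_ts, sensor_ts, sensor_vals, max_gap=0.02):
    """For each video frame timestamp, pick the nearest sensor sample.

    Timestamps are in seconds and sensor_ts must be sorted ascending.
    Returns one value per frame, or None when no sample lies within
    max_gap seconds (an assumed tolerance, not a recommended setting).
    """
    aligned = []
    for t in frame_ts:
        i = bisect_left(sensor_ts, t)
        # Candidates: the sample just before t and the one at or after t.
        candidates = [j for j in (i - 1, i) if 0 <= j < len(sensor_ts)]
        if not candidates:
            aligned.append(None)
            continue
        best = min(candidates, key=lambda j: abs(sensor_ts[j] - t))
        if abs(sensor_ts[best] - t) <= max_gap:
            aligned.append(sensor_vals[best])
        else:
            aligned.append(None)
    return aligned


# Hypothetical data: three video frames and a 100 Hz force reading.
frames = [0.0, 0.033, 0.5]
force_ts = [0.0, 0.01, 0.02, 0.03, 0.04]
force_n = [1.0, 1.1, 1.2, 1.3, 1.4]
print(align_to_frames(frames, force_ts, force_n))
```

Frames with no sample inside the tolerance are marked `None` instead of being filled silently, so dropped or late sensor data stays visible to downstream steps.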

Required Qualifications

  • Education: B.S., M.S., or Ph.D. in Computer Science, Data Engineering, Machine Learning, Robotics, or a related field.
  • Programming & ML Frameworks: Deep expertise in Python and extensive experience with PyTorch, specifically in handling custom dataloaders for multimodal datasets.
  • Force & Time-Series Data Processing: Experience analyzing and processing complex time-series data from force-torque (F/T) sensors, load cells, or tactile arrays, ensuring pristine alignment with visual frames.
  • Video Processing Expertise: Mastery of video processing pipelines and libraries (OpenCV, FFmpeg, Decord) and managing the I/O bottlenecks of terabyte-scale video datasets.
  • Computer Vision / Pose Estimation: Hands-on experience with 3D hand tracking, human pose estimation (e.g., MediaPipe), and spatial geometry calculations.
  • Embodied AI Familiarity: Strong understanding of modern imitation learning paradigms, VLA architectures, and frameworks focused on human-to-robot transfer (e.g., EgoScale, EgoMimic, or OpenVLA).
  • Data Augmentation: Proven ability to implement programmatic and generative data augmentation techniques for computer vision and time-series data.

Bonus Skills

  • Experience with NVIDIA's robotic software stack (Isaac, Cosmos, or components of the GR00T framework).
  • Familiarity with distributed data processing systems (Ray, Apache Spark) for cluster computing.
  • Background in generating or utilizing synthetic robotic data via simulation (Omniverse, MuJoCo).
  • Experience integrating spatial awareness or tactile data representations (e.g., Fourier encoding) into visual pipelines.

Why join Persona AI?

  • You'll shape technology that's redefining the possibilities of robotics and human interaction.
  • Work alongside passionate teammates who value diversity, creativity, and continuous learning.
  • Enjoy full access to advanced prototyping tools, labs, and the freedom to experiment and innovate.
  • We offer competitive compensation, excellent benefits, a flexible work environment, and equity opportunities.

Persona AI embraces diversity and equal opportunity in a serious way. We are committed to building a team that represents a variety of backgrounds, perspectives, and skills. We believe the more inclusive we are, the better our work will be.

Vacancy posted 1 day ago