Sign up to access all features of our service.
  • Job search
  • Favorites
  • Create a CV
    New
  • Salaries
  • Subscriptions

Data Ingestion Engineer for Scalable AI Training Pipelines

Reflection

Reflection, located in New York, is searching for a Data Engineer to build robust data ingestion systems essential for AI training. The ideal candidate will be skilled in web crawling and data acquisition, comfortable working with large datasets, and have excellent communication abilities. The role emphasizes collaboration with researchers and iterative processes based on measurable impact. Benefits include top-tier compensation, health insurance, paid parental leave, and opportunities for team engagement. #J-18808-Ljbffr Reflection

Vacancy posted 2 days ago
Similar jobs that could be interesting for youBased on the Data Ingestion Engineer for Scalable AI Training Pipelines in New York, NY vacancy
  • $120k - $180k

     ...most advanced AI-native platform...  ...our new Software Engineer III you'll be...  ...the GDI (Getting Data In) group under...  ...focus on data ingestion at petabyte-scale...  ...ingest pipeline, data enrichment...  ...architecture, high scalability and availability...  ...recruitment, selection, training, compensation,... 
    Pipeline
    Training
    Permanent employment
    Work experience placement
    Work at office
    Local area

    CrowdStrike Holdings, Inc.

    New York, NY
    2 days ago
  •  .... About The Role Data plays a crucial role...  ...the frontier of AI innovation. Many advances...  ...and operate the ingestion systems that turn...  ...corpora for training frontier models....  ...our pre‑training pipelines, working directly...  ...role is ideal for engineers who love building... 
    Pipeline
    Training
    Relocation package

    Reflection

    New York, NY
    4 days ago
  •  ...and Responsibility - Design, build, and maintain scalable and reliable data pipelines for dataset creation, transformation, and benchmarking...  ...processing and analysis Partner closely with ML engineers to enable model training, evaluation, and benchmarking pipelines Improve... 
    Pipeline
    Training

    New Groyp Talentoj

    New York, NY
    20 hours ago
  •  ...deep expertise in Data Science, Machine Learning and AI. We are the trusted...  ...experienced Data Engineer to join our data team...  ..., and maintaining scalable data pipelines, data integration...  ...preparation and ingestion for AI/ML and Generative...  ...for model training, inference, and GenAI... 
    Pipeline
    Training
    Local area

    Tiger Analytics

    Jersey City, NJ
    1 day ago
  •  ...Data Engineer, Gen AI New York, New York, United States About the Job...  ...data infrastructure and pipelines necessary to enable large-...  ...Responsibilities: Design and build scalable data pipelines to ingest, process, and store large volumes of training data for generative AI... 
    Pipeline
    Training

    Inizio Partners

    New York, NY
    4 days ago
  •  ...connect top LATAM engineering talent with...  ...manage leases through AI‑driven intelligence...  ...AI, structured data pipelines, and user‑centered...  ...accuracy and scalability. Key Responsibilities...  ...‑end — from raw ingestion to validated structured...  ...Work‑from‑home & training reimbursement... 
    Pipeline
    Training
    Remote work
    Work from home
    Worldwide

    South Geeks

    New York, NY
    4 days ago
  • $150k - $180k

     ...DriveWealth develops data products to...  ...a Senior Data Engineer to take...  ...automation of data ingestion, transformation...  ...requirements are met with scalable technical...  ...data pipelines (Databricks and...  ...We Think About AI We leverage...  ...expertise, education, training, and experience... 
    Pipeline
    Training
    Full time
    Work at office
    Worldwide

    DriveWealth

    New York, NY
    2 days ago
  • $65 - $68.26 per hour

     ...Data Engineer Location: Jersey City, New Jersey (hybrid...  ...and optimizing data ingestion, transformation, and...  ..., and optimize scalable ETL/ELT pipelines across AWS, GCP, and...  ...quality career resources, training, certifications,...  ...to receive calls, AI-generated calls, text... 
    Pipeline
    Training
    Hourly pay
    Contract work

    Apex Systems

    Jersey City, NJ
    3 days ago
  • $120k - $130k

    Data Engineer page is loaded## Data Engineerlocations: New...  ...build, and optimize scalable data pipelines and lake house...  ...developing end-to-end data ingestion, transformation, and...  ...analytics using Databricks AI/BI features and Genie...  ...recruitment, hiring, training, promotion,... 
    Pipeline
    Training
    Local area
    Remote work

    PPL

    New York, NY
    2 days ago
  • $150k

     ...a Microsoft Fabric Data Engineer with Agentic AI experience to design...  ...build, and maintain scalable data solutions within...  ...developing robust data pipelines, implementing...  ...governance, and usability Ingest, transform, and...  ...aligned with AI model training requirements.... 
    Pipeline
    Training

    Garan, Incorporated

    New York, NY
    20 hours ago
  •  ...company providing a data-driven...  ...required for trainings, meetings, and...  ...optimize data pipelines for analytics...  ...reliability, scalability, and performance...  ...Implement data ingestion from internal...  ...with product and engineering teams to translate...  ...intelligence (AI) tools to... 
    Pipeline
    Training
    Work at office
    Remote work
    Home office
    Flexible hours

    Traackr

    New York, NY
    20 hours ago
  • $110k - $190k

     ...Senior Data Management Professional - Data Engineering - Commodities Data...  ...directly into pipelines, systems and architecture...  ...high-impact, scalable solutions....  ...technologies including AI and machine...  ...improve data ingestion and enrichment...  ...conditions, education/training and skill level... 
    Pipeline
    Training
    Temporary work
    For contractors
    Work experience placement

    Bloomberg

    New York, NY
    5 days ago
  •  ...Senior Consultant, Data Engineer Work...  ...Sector: Data, AI & Analytics Position...  ...data models, and scalable analytics...  ...and maintain data ingestion, transformation,...  ...Exposure to ML pipelines, MLOps, or AI-adjacent...  ...Guilds, regular training, and peer learning... 
    Pipeline
    Training
    Full time
    Work at office

    Biz First

    New York, NY
    1 day ago
  • $135k - $145k

     ...SUMMARY The Data Engineer will be...  ...enterprise-wide AI and automation initiatives...  ...Informatica data pipelines, API integrations...  ...teams to deliver scalable, high-quality...  ...and reliable data ingestion and...  ...reporting, and AI model training, including ingestion... 
    Pipeline
    Training
    Work experience placement
    Summer work
    Work at office
    Remote work
    Flexible hours

    Empire State Realty Trust

    New York, NY
    4 days ago
  • $155k - $184k

     ...states. The Principal Data Engineer will lead the...  ...efforts to build scalable, reliable, and...  ...frameworks, and pipelines that support large...  ...and complex data ingestion, processing, transformations...  .../privacy, and AI assistant tools....  ..., education, training, merit, location,... 
    Pipeline
    Training
    Full time
    Local area
    Night shift

    Change.org, PBC

    New York, NY
    20 hours ago
  •  ...to help bring their data, AI, cloud and digital solutions...  ...: Paid certs, weekly training, guilds, hackathons,...  ...Consultant (Data Engineer) You’re the builder...  ...Design and build pipelines to ingest, transform and load data...  ...lakes and warehouses for scalable access Ensure... 
    Pipeline
    Training
    Weekly pay
    Work from home

    Vivanti Consulting

    New York, NY
    2 days ago
  • $180k - $250k

     ...Staff Analytics Engineer to join a centralized...  ...maintaining a scalable and robust...  ...implementation of complex data transformations...  ...strategies, and training for analytics...  ...to enhance data ingestion, processing pipelines, and integration...  ...utilizing AI Codegen to improve... 
    Pipeline
    Training
    Flexible hours

    Zocdoc

    New York, NY
    1 day ago
  •  ...): 09 Job Title: Data Engineer The Team: As a member...  ...Platforms & AI - Foundational Data...  ...scale data processing pipelines that power our...  ...maintain robust, scalable, and reliable data...  .../ELT processes to ingest, transform, and load...  ..., “pre‑employment training” or for equipment/... 
    Pipeline
    Training
    Live in
    Worldwide
    Flexible hours

    S&P Global

    New York, NY
    3 days ago
  • $225k - $275k

     ...AI Data Engineer New York, New York, United States Schonfeld...  ...robust data pipelines that power SchonAI, our...  ...Design and build scalable, reliable data pipelines to ingest, transform, and deliver...  ...and observability for AI training and inference pipelines... 
    Pipeline
    Training

    Schonfeld

    New York, NY
    2 days ago
  • $160k - $180k

     ...Data & AI Engineer The Data & AI Engineer sits within Carlyle...  ...and operating the pipelines, semantic layers, retrieval...  .... Implement ingestion, modeling, and consumption...  ...standards for scalability, performance, security...  ...prior experience and training; and licenses and/or... 
    Pipeline
    Training
    Work at office

    Carlyle Group

    New York, NY
    2 days ago
  • $120k - $195k

    Lead Data Engineering & Governance (Pharma) New York...  ...join its Pharma AI practice and serve...  ...- from pipeline modernization and...  ...and delivery of scalable data pipelines and...  ...governance, and ingestion patterns appropriate...  ...; experience and training; licensure and certifications... 
    Pipeline
    Training
    Hourly pay
    Full time
    Work at office
    Local area
    Day shift

    Fractal

    New York, NY
    20 hours ago
  •  ...provide a versatile AI Platform...  ...exciting environment. Data is at the core...  ..., and the Data Engineering team is a...  ...We leverage and ingest data from multiple...  ...in analytical pipelines. We develop and...  ...high-quality, scalable, and reliable data...  ...you Continuous training and access to... 
    Pipeline
    Training

    Optasia Group

    Brooklyn, NY
    20 hours ago
  •  ...nonprofit applied AI research...  ...About the Role Data Engineers on the Platform team...  ...trustworthy data pipelines with comprehensive...  ...documented datasets for training and evaluation,...  ...solutions from ingestion through...  ...performance, and scalability. Implement data... 
    Pipeline
    Training
    Full time
    Work at office

    Basis Research

    New York, NY
    2 days ago
  • $110k - $190k

    Senior Data Management Professional - Automation Engineer - Funds Location: New...  ...providers and ingesting and normalising...  ...(including AI/ML where appropriate...  ...and scalability Perform quarterly...  ...maintain data pipelines and tools to improve...  ..., education/training and skill... 
    Pipeline
    Training
    Temporary work
    For contractors
    Work experience placement

    Bloomberg L.P.

    New York, NY
    2 days ago
  • $176k - $207k

     ...Breathe Life Into Data At Komodo...  ...The Senior Data Engineer will be responsible...  ...systems and pipelines that power Komodo...  ...lifecycle—from ingestion and modeling to...  ...foundation for AI/ML and agentic...  ...pipeline performance, scalability, and system...  ...layers for training, inference, and... 
    Pipeline
    Training
    For contractors
    Work experience placement
    Work at office
    Local area
    Remote work
    Flexible hours

    Komodo Health

    New York, NY
    3 days ago
  •  ...A leading network technology company is seeking a Data Engineer to design, build, and maintain scalable data pipelines. The role involves collaboration with data scientists, ensuring the data infrastructure is reliable and cost-effective. Ideal candidates should have... 
    Pipeline
    Remote work

    Versa Networks

    New York, NY
    20 hours ago
  •  ...A technology solutions provider is seeking an experienced Data Engineer to design and build automated ingestion pipelines. This role involves processing large-scale datasets and ensuring data integrity. Candidates should have 6+ years of experience in Data Engineering,... 
    Pipeline

    New Combin S.R.L.

    New York, NY
    20 hours ago
  •  ...Vericence is a digital engineering and technology consulting...  ...enterprises build AI-driven platforms, modernize...  ...through cloud, data, and intelligent engineering...  ...performance tuning, and scalable data processing Experience...  ...understanding of data pipeline design, development,... 
    Pipeline

    Vericence

    New York, NY
    20 hours ago
  • About Mecka AI Mecka AI is building the data infrastructure layer for robotics...  ...datasets used to train and evaluate modern...  ...a Senior Data Engineer to own and evolve Mecka...  ...data infrastructure, pipelines, and internal analytics...  ...data pipelines ingesting structured and unstructured... 
    Pipeline
    Training

    Mecka AI

    New York, NY
    4 days ago
  • $180k - $280k

     ...Data Engineer Sunset is building the data layer for real-world AI training. We work with frontier labs to turn messy, multi-modal enterprise...  ...at Sunset, you'll own the pipeline that turns raw, chaotic...  ...canonical entity Hardening how we ingest, store, and process sensitive... 
    Pipeline
    Training
    Work at office
    Remote work

    Sunset

    New York, NY
    4 days ago

Do you want to receive more vacancies?

Subscribe and receive similar vacancies to Data Ingestion Engineer for Scalable AI Training Pipelines. Be the first to apply!