Data Ingestion Engineer for Scalable AI Training Pipelines
Reflection
Reflection, located in New York, is searching for a Data Engineer to build robust data ingestion systems essential for AI training. The ideal candidate will be skilled in web crawling and data acquisition, comfortable working with large datasets, and an excellent communicator. The role emphasizes collaboration with researchers and iteration based on measurable impact. Benefits include top-tier compensation, health insurance, paid parental leave, and opportunities for team engagement.
$120k - $180k
...most advanced AI-native platform... ...our new Software Engineer III you'll be... ...the GDI (Getting Data In) group under... ...focus on data ingestion at petabyte-scale... ...ingest pipeline, data enrichment... ...architecture, high scalability and availability... ...recruitment, selection, training, compensation,... Tags: Pipeline, Training, Permanent employment, Work experience placement, Work at office, Local area
- ... About The Role Data plays a crucial role... ...the frontier of AI innovation. Many advances... ...and operate the ingestion systems that turn... ...corpora for training frontier models.... ...our pre‑training pipelines, working directly... ...role is ideal for engineers who love building... Tags: Pipeline, Training, Relocation package
- ...and Responsibility - Design, build, and maintain scalable and reliable data pipelines for dataset creation, transformation, and benchmarking... ...processing and analysis Partner closely with ML engineers to enable model training, evaluation, and benchmarking pipelines Improve... Tags: Pipeline, Training
- ...deep expertise in Data Science, Machine Learning and AI. We are the trusted... ...experienced Data Engineer to join our data team... ..., and maintaining scalable data pipelines, data integration... ...preparation and ingestion for AI/ML and Generative... ...for model training, inference, and GenAI... Tags: Pipeline, Training, Local area
- ...Data Engineer, Gen AI New York, New York, United States About the Job... ...data infrastructure and pipelines necessary to enable large-... ...Responsibilities: Design and build scalable data pipelines to ingest, process, and store large volumes of training data for generative AI... Tags: Pipeline, Training
- ...connect top LATAM engineering talent with... ...manage leases through AI‑driven intelligence... ...AI, structured data pipelines, and user‑centered... ...accuracy and scalability. Key Responsibilities... ...‑end - from raw ingestion to validated structured... ...Work‑from‑home & training reimbursement... Tags: Pipeline, Training, Remote work, Work from home, Worldwide
$150k - $180k
...DriveWealth develops data products to... ...a Senior Data Engineer to take... ...automation of data ingestion, transformation... ...requirements are met with scalable technical... ...data pipelines (Databricks and... ...We Think About AI We leverage... ...expertise, education, training, and experience... Tags: Pipeline, Training, Full time, Work at office, Worldwide
$65 - $68.26 per hour
...Data Engineer Location: Jersey City, New Jersey (hybrid... ...and optimizing data ingestion, transformation, and... ..., and optimize scalable ETL/ELT pipelines across AWS, GCP, and... ...quality career resources, training, certifications,... ...to receive calls, AI-generated calls, text... Tags: Pipeline, Training, Hourly pay, Contract work
$120k - $130k
Data Engineer locations: New... ...build, and optimize scalable data pipelines and lake house... ...developing end-to-end data ingestion, transformation, and... ...analytics using Databricks AI/BI features and Genie... ...recruitment, hiring, training, promotion,... Tags: Pipeline, Training, Local area, Remote work
$150k
...a Microsoft Fabric Data Engineer with Agentic AI experience to design... ...build, and maintain scalable data solutions within... ...developing robust data pipelines, implementing... ...governance, and usability Ingest, transform, and... ...aligned with AI model training requirements.... Tags: Pipeline, Training
- ...company providing a data-driven... ...required for trainings, meetings, and... ...optimize data pipelines for analytics... ...reliability, scalability, and performance... ...Implement data ingestion from internal... ...with product and engineering teams to translate... ...intelligence (AI) tools to... Tags: Pipeline, Training, Work at office, Remote work, Home office, Flexible hours
$110k - $190k
...Senior Data Management Professional - Data Engineering - Commodities Data... ...directly into pipelines, systems and architecture... ...high-impact, scalable solutions.... ...technologies including AI and machine... ...improve data ingestion and enrichment... ...conditions, education/training and skill level... Tags: Pipeline, Training, Temporary work, For contractors, Work experience placement
- ...Senior Consultant, Data Engineer Work... ...Sector: Data, AI & Analytics Position... ...data models, and scalable analytics... ...and maintain data ingestion, transformation,... ...Exposure to ML pipelines, MLOps, or AI-adjacent... ...Guilds, regular training, and peer learning... Tags: Pipeline, Training, Full time, Work at office
$135k - $145k
...SUMMARY The Data Engineer will be... ...enterprise-wide AI and automation initiatives... ...Informatica data pipelines, API integrations... ...teams to deliver scalable, high-quality... ...and reliable data ingestion and... ...reporting, and AI model training, including ingestion... Tags: Pipeline, Training, Work experience placement, Summer work, Work at office, Remote work, Flexible hours
$155k - $184k
...states. The Principal Data Engineer will lead the... ...efforts to build scalable, reliable, and... ...frameworks, and pipelines that support large... ...and complex data ingestion, processing, transformations... .../privacy, and AI assistant tools.... ..., education, training, merit, location,... Tags: Pipeline, Training, Full time, Local area, Night shift
- ...to help bring their data, AI, cloud and digital solutions... ...: Paid certs, weekly training, guilds, hackathons,... ...Consultant (Data Engineer) You’re the builder... ...Design and build pipelines to ingest, transform and load data... ...lakes and warehouses for scalable access Ensure... Tags: Pipeline, Training, Weekly pay, Work from home
$180k - $250k
...Staff Analytics Engineer to join a centralized... ...maintaining a scalable and robust... ...implementation of complex data transformations... ...strategies, and training for analytics... ...to enhance data ingestion, processing pipelines, and integration... ...utilizing AI Codegen to improve... Tags: Pipeline, Training, Flexible hours
- ...): 09 Job Title: Data Engineer The Team: As a member... ...Platforms & AI - Foundational Data... ...scale data processing pipelines that power our... ...maintain robust, scalable, and reliable data... .../ELT processes to ingest, transform, and load... ..., “pre‑employment training” or for equipment/... Tags: Pipeline, Training, Live in, Worldwide, Flexible hours
$225k - $275k
...AI Data Engineer New York, New York, United States Schonfeld... ...robust data pipelines that power SchonAI, our... ...Design and build scalable, reliable data pipelines to ingest, transform, and deliver... ...and observability for AI training and inference pipelines... Tags: Pipeline, Training
$160k - $180k
...Data & AI Engineer The Data & AI Engineer sits within Carlyle... ...and operating the pipelines, semantic layers, retrieval... .... Implement ingestion, modeling, and consumption... ...standards for scalability, performance, security... ...prior experience and training; and licenses and/or... Tags: Pipeline, Training, Work at office
$120k - $195k
Lead Data Engineering & Governance (Pharma) New York... ...join its Pharma AI practice and serve... ...- from pipeline modernization and... ...and delivery of scalable data pipelines and... ...governance, and ingestion patterns appropriate... ...; experience and training; licensure and certifications... Tags: Pipeline, Training, Hourly pay, Full time, Work at office, Local area, Day shift
- ...provide a versatile AI Platform... ...exciting environment. Data is at the core... ..., and the Data Engineering team is a... ...We leverage and ingest data from multiple... ...in analytical pipelines. We develop and... ...high-quality, scalable, and reliable data... ...you Continuous training and access to... Tags: Pipeline, Training
- ...nonprofit applied AI research... ...About the Role Data Engineers on the Platform team... ...trustworthy data pipelines with comprehensive... ...documented datasets for training and evaluation,... ...solutions from ingestion through... ...performance, and scalability. Implement data... Tags: Pipeline, Training, Full time, Work at office
$110k - $190k
Senior Data Management Professional - Automation Engineer - Funds Location: New... ...providers and ingesting and normalising... ...(including AI/ML where appropriate... ...and scalability Perform quarterly... ...maintain data pipelines and tools to improve... ..., education/training and skill... Tags: Pipeline, Training, Temporary work, For contractors, Work experience placement
$176k - $207k
...Breathe Life Into Data At Komodo... ...The Senior Data Engineer will be responsible... ...systems and pipelines that power Komodo... ...lifecycle, from ingestion and modeling to... ...foundation for AI/ML and agentic... ...pipeline performance, scalability, and system... ...layers for training, inference, and... Tags: Pipeline, Training, For contractors, Work experience placement, Work at office, Local area, Remote work, Flexible hours
- ...A leading network technology company is seeking a Data Engineer to design, build, and maintain scalable data pipelines. The role involves collaboration with data scientists, ensuring the data infrastructure is reliable and cost-effective. Ideal candidates should have... Tags: Pipeline, Remote work
- ...A technology solutions provider is seeking an experienced Data Engineer to design and build automated ingestion pipelines. This role involves processing large-scale datasets and ensuring data integrity. Candidates should have 6+ years of experience in Data Engineering,... Tags: Pipeline
- ...Vericence is a digital engineering and technology consulting... ...enterprises build AI-driven platforms, modernize... ...through cloud, data, and intelligent engineering... ...performance tuning, and scalable data processing Experience... ...understanding of data pipeline design, development,... Tags: Pipeline
- About Mecka AI Mecka AI is building the data infrastructure layer for robotics... ...datasets used to train and evaluate modern... ...a Senior Data Engineer to own and evolve Mecka... ...data infrastructure, pipelines, and internal analytics... ...data pipelines ingesting structured and unstructured... Tags: Pipeline, Training
$180k - $280k
...Data Engineer Sunset is building the data layer for real-world AI training. We work with frontier labs to turn messy, multi-modal enterprise... ...at Sunset, you'll own the pipeline that turns raw, chaotic... ...canonical entity Hardening how we ingest, store, and process sensitive... Tags: Pipeline, Training, Work at office, Remote work

