Data Ingestion Engineer for Scalable AI Training Pipelines
Reflection
Reflection, located in New York, is searching for a Data Engineer to build robust data ingestion systems essential for AI training. The ideal candidate will be skilled in web crawling and data acquisition, comfortable working with large datasets, and have excellent communication abilities. The role emphasizes collaboration with researchers and iterative processes based on measurable impact. Benefits include top-tier compensation, health insurance, paid parental leave, and opportunities for team engagement. #J-18808-Ljbffr
- .... About The Role Data plays a crucial role... ...the frontier of AI innovation. Many advances... ...and operate the ingestion systems that turn... ...corpora for training frontier models.... ...our pre‑training pipelines, working directly... ...role is ideal for engineers who love building...PipelineTrainingRelocation package
- ...deep expertise in Data Science, Machine Learning and AI. We are the trusted... ...experienced Data Engineer to join our data team... ..., and maintaining scalable data pipelines, data integration... ...preparation and ingestion for AI/ML and Generative... ...for model training, inference, and GenAI...PipelineTrainingLocal area
- ...Senior Data Engineer We're seeking a Senior Data... ...stack of Anaplan AI applications. You... ...direction for how we ingest, transform, store,... ...and deployment of scalable Generative AI and... ...search, and embedding pipelines. Help design the... ...including experience training and deploying ML...PipelineTraining
- ...are seeking a skilled Data Engineer to support our... ...organization’s generative AI initiatives. In this... ...infrastructure and pipelines necessary to enable... ...Responsibilities Design and build scalable data pipelines to ingest, process, and store large volumes of training data for generative...PipelineTraining
- ...collection of executives, engineers, data scientists, and... ...is augmented by AI and machine learning... ...maintain the data pipelines that power our deep... ...You’ll work across ingestion, transformation, and... ...discoverable, and scalable for use by model training, analytics, and AI-...PipelineTrainingRemote workFlexible hours
- ...Intelligence (AI) and other emerging... ...founding engineering team of Cloudseed... ...an experienced Data Scientist to quantify... ...collect, ingest, and stage complex... ...ETL/ELT pipelines in Python and SQL... ...Production grade scalable data pipelines, ML model training at scale, and analytics...PipelineTrainingFull timeContract workLocal areaRemote work
$142.6k - $153.1k
...looking for a Sr. Data Engineer with strong data... ...of our emerging AI and ML platform.... ...bring high‑quality, scalable, and ethical AI... ...supports reliable data pipelines, scalable... ...independently for ingestion, transformation,... ...preparation, (2) training and tuning, (3) experimentation...PipelineTrainingHourly pay- ...We're building AI employees. Not chatbots... ...can do. The engineering problems are hard... ...'ll be the first Data Engineer on the Artisan... ..., and maintain scalable data pipelines that process and... ...data Manage ingestion from third-party... ...embeddings, or ML training pipelines ~ Bonus...PipelineTrainingRemote work
$110k - $190k
...Senior Data Management Professional - Data Engineering - Commodities Data... ...directly into pipelines, systems and architecture... ...high-impact, scalable solutions.... ...technologies including AI and machine... ...improve data ingestion and enrichment... ...conditions, education/training and skill level...PipelineTrainingTemporary workFor contractorsWork experience placement$150k - $180k
...DriveWealth develops data products to... ...a Senior Data Engineer to take... ...automation of data ingestion, transformation... ...requirements are met with scalable technical... ...data pipelines (Databricks and... ...We Think About AI We leverage... ...expertise, education, training, and experience...PipelineTrainingFull timeWork at officeWorldwide- ...Senior Consultant, Data Engineer Work... ...Sector: Data, AI & Analytics Position... ...data models, and scalable analytics... ...and maintain data ingestion, transformation,... ...Exposure to ML pipelines, MLOps, or AI-adjacent... ...Guilds, regular training, and peer learning...PipelineTrainingFull timeWork at office
$135k - $145k
...POSITION SUMMARY The Data Engineer will be... ...enterprise-wide AI and automation initiatives... ...Informatica data pipelines, API integrations... ...teams to deliver scalable, high‑quality... ...and reliable data ingestion and transformation... ...reporting, and AI model training, including...PipelineTrainingWork experience placementSummer workWork at officeRemote workFlexible hours$150k
...Microsoft Fabric Data Engineer We are seeking... ...Engineer with Agentic AI experience to... ...build, and maintain scalable data solutions within... ...robust data pipelines, implementing Medallion... ...and usability Ingest, transform, and integrate... ...with AI model training requirements....PipelineTraining- ...provide a versatile AI Platform... ...exciting environment. Data is at the core... ..., and the Data Engineering team is a... ...We leverage and ingest data from multiple... ...in analytical pipelines. We develop and... ...high-quality, scalable, and reliable data... ...you Continuous training and access to...PipelineTraining
- ...technologies like AI and blockchain... ...exceptional engineers to help us do it... ...hiring a Staff Data Engineer to be... ...architecture—from ingestion and scraping... ...to enrichment pipelines, data warehousing... ...with long‑term scalability, quality, and governance... ...clean training and inference datasets...PipelineTrainingImmediate startRemote work
$220k - $240k
...Principal Data Engineer New York, New York, United... ...: Data Ingestion - building reliable pipelines that bring data from... ...ecosystem, ensuring scalability, reliability, and security... ...of ML and AI data pipelines and... ...expertise, education, training, and experience. If...PipelineTrainingFull timeWork at officeWorldwide$155k - $184k
...states. The Principal Data Engineer will lead the... ...efforts to build scalable, reliable, and... ...frameworks, and pipelines that support large... ...and complex data ingestion, processing, transformations... .../privacy, and AI assistant tools.... ..., education, training, merit, location,...PipelineTrainingFull timeLocal areaNight shift- ...company providing a data-driven... ...required for trainings, meetings, and... ...optimize data pipelines for analytics... ...reliability, scalability, and performance... ...Implement data ingestion from internal... ...with product and engineering teams to translate... ...intelligence (AI) tools to...PipelineTrainingWork at officeRemote workHome officeFlexible hours
$180k - $250k
...Staff Analytics Engineer to join a centralized... ...maintaining a scalable and robust... ...implementation of complex data transformations... ...strategies, and training for analytics... ...to enhance data ingestion, processing pipelines, and integration... ...utilizing AI Codegen to improve...PipelineTrainingFlexible hours$110k - $190k
...Senior Data Management Professional – Automation Engineer – Funds Location: New... ...providers and ingesting and normalising... ...(including AI/ML where appropriate... ...and scalability Perform quarterly... ...maintain data pipelines and tools to improve... ..., education/training and skill...PipelineTrainingTemporary workFor contractorsWork experience placement$180k - $220k
...Senior Backend Engineer, Data Modeling and Ingestion Platform New York About the... ...research by building robust, scalable systems for linking,... ...workflows using JAX or multihost training is a plus, as the... ...Understanding of JAX-based ML pipelines, multihost training...PipelineTrainingWork experience placementWork at officeFlexible hours$160k - $180k
...The Data & AI Engineer sits within Carlyle’s Enterprise Technology... ...and operating the pipelines, semantic layers,... ...Snowflake. Implement ingestion, modeling, and consumption... ...standards for scalability, performance, security... ...prior experience and training; and licenses and/or...PipelineTrainingWork at office$225k - $275k
...seeking an experienced AI Data Engineer to join our Data Engineering... ...robust data pipelines that power SchonAI, our... ...Design and build scalable, reliable data pipelines to ingest, transform, and deliver... ...and observability for AI training and inference pipelines...PipelineTraining- ...nonprofit applied AI research... ...About the Role Data Engineers on the Platform team... ...trustworthy data pipelines with comprehensive... ...documented datasets for training and evaluation,... ...solutions from ingestion through... ...performance, and scalability. Implement data...PipelineTrainingFull timeWork at office
- ...TryApplyNow is seeking a Data & AI/ML Engineer in New York, NY. The role focuses on designing and maintaining scalable data pipelines and infrastructure for deploying ML models. Ideal candidates will have a Bachelor's or Master's degree and over 4 years of experience...Pipeline
$176.72k - $265.08k
...level role for a data architect or lead data engineer within a Data... ...scale Generative AI and Machine Learning... ...copy the data. Scalability and Performance:... ...for pre‑training large language models... ...Advanced AI Ops & Data Pipelines This is the... ...bus to ingest real‑time data from...PipelineTraining- ...Development team is seeking a data engineer to function as the... .... Responsibilities Pipeline Architecture: Design,... ...on rapid data ingestion and lightning-fast query... ...feature engineering and training to real-time model... ...Acquisition: Develop scalable frameworks to ingest...PipelineTraining
- A leading network technology company is seeking a Data Engineer to design, build, and maintain scalable data pipelines. The role involves collaboration with data scientists, ensuring the data infrastructure is reliable and cost-effective. Ideal candidates should have strong...PipelineRemote job
- ...experienced software engineer to lead the design... ...on building scalable, resilient, and high... ...trading and market data. • Develop Python-based AI and quantitative models... ...engineering, model training, evaluation). • Build... ...-time and batch pipelines. • Optimize analytics...PipelineTraining
$65 - $70 per hour
...Data Engineer -601/602 job at Pinnacle Group. Jersey City... ...are secure, stable, and scalable. You will develop,... ...maintain essential data pipelines and architectures across... ...Qualifications Formal training or certification in... ...proficiency in leveraging Gen AI models using APIs/SDKs...PipelineTrainingFull timeContract work
Do you want to receive more vacancies?
Subscribe and receive similar vacancies to Data Ingestion Engineer for Scalable AI Training Pipelines. Be the first to apply!
- staff data engineer New York, NY
- data engineering intern summer New York, NY
- senior data integration developer New York, NY
- data engineer graduate New York, NY
- data engineer contract New York, NY
- data science developer New York, NY
- senior data center engineer New York, NY
- software data engineer New York, NY
- hadoop big data developer New York, NY
- data developer New York, NY

