Sign up to access all features of our service.
  • Job search
  • Favorites
  • Create a CV
    New
  • Salaries
  • Subscriptions

Data Ingestion Engineer for Scalable AI Training Pipelines

Reflection

Reflection, located in New York, is searching for a Data Engineer to build robust data ingestion systems essential for AI training. The ideal candidate will be skilled in web crawling and data acquisition, comfortable working with large datasets, and have excellent communication abilities. The role emphasizes collaboration with researchers and iterative processes based on measurable impact. Benefits include top-tier compensation, health insurance, paid parental leave, and opportunities for team engagement. #J-18808-Ljbffr

Vacancy posted 4 days ago
Similar jobs that could be interesting for youBased on the Data Ingestion Engineer for Scalable AI Training Pipelines in New York, NY vacancy
  •  .... About The Role Data plays a crucial role...  ...the frontier of AI innovation. Many advances...  ...and operate the ingestion systems that turn...  ...corpora for training frontier models....  ...our pre‑training pipelines, working directly...  ...role is ideal for engineers who love building... 
    Pipeline
    Training
    Relocation package

    Reflection

    New York, NY
    1 day ago
  •  ...deep expertise in Data Science, Machine Learning and AI. We are the trusted...  ...experienced Data Engineer to join our data team...  ..., and maintaining scalable data pipelines, data integration...  ...preparation and ingestion for AI/ML and Generative...  ...for model training, inference, and GenAI... 
    Pipeline
    Training
    Local area

    Tiger Analytics

    Jersey City, NJ
    4 days ago
  •  ...Senior Data Engineer We're seeking a Senior Data...  ...stack of Anaplan AI applications. You...  ...direction for how we ingest, transform, store,...  ...and deployment of scalable Generative AI and...  ...search, and embedding pipelines. Help design the...  ...including experience training and deploying ML... 
    Pipeline
    Training

    Anaplan

    New York, NY
    3 days ago
  •  ...are seeking a skilled Data Engineer to support our...  ...organization’s generative AI initiatives. In this...  ...infrastructure and pipelines necessary to enable...  ...Responsibilities Design and build scalable data pipelines to ingest, process, and store large volumes of training data for generative... 
    Pipeline
    Training

    Inizio Partners Corp

    New York, NY
    4 days ago
  •  ...collection of executives, engineers, data scientists, and...  ...is augmented by AI and machine learning...  ...maintain the data pipelines that power our deep...  ...You’ll work across ingestion, transformation, and...  ...discoverable, and scalable for use by model training, analytics, and AI-... 
    Pipeline
    Training
    Remote work
    Flexible hours

    SumerSports LLC

    New York, NY
    2 days ago
  •  ...Intelligence (AI) and other emerging...  ...founding engineering team of Cloudseed...  ...an experienced Data Scientist to quantify...  ...collect, ingest, and stage complex...  ...ETL/ELT pipelines in Python and SQL...  ...Production grade scalable data pipelines, ML model training at scale, and analytics... 
    Pipeline
    Training
    Full time
    Contract work
    Local area
    Remote work

    CloudseedAi

    New York, NY
    2 days ago
  • $142.6k - $153.1k

     ...looking for a Sr. Data Engineer with strong data...  ...of our emerging AI and ML platform....  ...bring high‑quality, scalable, and ethical AI...  ...supports reliable data pipelines, scalable...  ...independently for ingestion, transformation,...  ...preparation, (2) training and tuning, (3) experimentation... 
    Pipeline
    Training
    Hourly pay

    Octave

    New York, NY
    12 hours ago
  •  ...We're building AI employees. Not chatbots...  ...can do. The engineering problems are hard...  ...'ll be the first Data Engineer on the Artisan...  ..., and maintain scalable data pipelines that process and...  ...data Manage ingestion from third-party...  ...embeddings, or ML training pipelines ~ Bonus... 
    Pipeline
    Training
    Remote work

    Artisan

    New York, NY
    3 days ago
  • $110k - $190k

     ...Senior Data Management Professional - Data Engineering - Commodities Data...  ...directly into pipelines, systems and architecture...  ...high-impact, scalable solutions....  ...technologies including AI and machine...  ...improve data ingestion and enrichment...  ...conditions, education/training and skill level... 
    Pipeline
    Training
    Temporary work
    For contractors
    Work experience placement

    Bloomberg

    New York, NY
    2 days ago
  • $150k - $180k

     ...DriveWealth develops data products to...  ...a Senior Data Engineer to take...  ...automation of data ingestion, transformation...  ...requirements are met with scalable technical...  ...data pipelines (Databricks and...  ...We Think About AI We leverage...  ...expertise, education, training, and experience... 
    Pipeline
    Training
    Full time
    Work at office
    Worldwide

    DriveWealth

    New York, NY
    4 days ago
  •  ...Senior Consultant, Data Engineer Work...  ...Sector: Data, AI & Analytics Position...  ...data models, and scalable analytics...  ...and maintain data ingestion, transformation,...  ...Exposure to ML pipelines, MLOps, or AI-adjacent...  ...Guilds, regular training, and peer learning... 
    Pipeline
    Training
    Full time
    Work at office

    Biz First

    New York, NY
    3 days ago
  • $135k - $145k

     ...POSITION SUMMARY The Data Engineer will be...  ...enterprise-wide AI and automation initiatives...  ...Informatica data pipelines, API integrations...  ...teams to deliver scalable, high‑quality...  ...and reliable data ingestion and transformation...  ...reporting, and AI model training, including... 
    Pipeline
    Training
    Work experience placement
    Summer work
    Work at office
    Remote work
    Flexible hours

    Empire State Realty Trust

    New York, NY
    4 days ago
  • $150k

     ...Microsoft Fabric Data Engineer We are seeking...  ...Engineer with Agentic AI experience to...  ...build, and maintain scalable data solutions within...  ...robust data pipelines, implementing Medallion...  ...and usability Ingest, transform, and integrate...  ...with AI model training requirements.... 
    Pipeline
    Training

    Garan, Incorporated

    New York, NY
    2 days ago
  •  ...provide a versatile AI Platform...  ...exciting environment. Data is at the core...  ..., and the Data Engineering team is a...  ...We leverage and ingest data from multiple...  ...in analytical pipelines. We develop and...  ...high-quality, scalable, and reliable data...  ...you Continuous training and access to... 
    Pipeline
    Training

    Optasia Group

    Brooklyn, NY
    2 days ago
  •  ...technologies like AI and blockchain...  ...exceptional engineers to help us do it...  ...hiring a Staff Data Engineer to be...  ...architecture—from ingestion and scraping...  ...to enrichment pipelines, data warehousing...  ...with long‑term scalability, quality, and governance...  ...clean training and inference datasets... 
    Pipeline
    Training
    Immediate start
    Remote work

    Wag Art

    New York, NY
    3 days ago
  • $220k - $240k

     ...Principal Data Engineer New York, New York, United...  ...: Data Ingestion - building reliable pipelines that bring data from...  ...ecosystem, ensuring scalability, reliability, and security...  ...of ML and AI data pipelines and...  ...expertise, education, training, and experience. If... 
    Pipeline
    Training
    Full time
    Work at office
    Worldwide

    DriveWealth

    New York, NY
    12 hours ago
  • $155k - $184k

     ...states. The Principal Data Engineer will lead the...  ...efforts to build scalable, reliable, and...  ...frameworks, and pipelines that support large...  ...and complex data ingestion, processing, transformations...  .../privacy, and AI assistant tools....  ..., education, training, merit, location,... 
    Pipeline
    Training
    Full time
    Local area
    Night shift

    Change.org, PBC

    New York, NY
    2 days ago
  •  ...company providing a data-driven...  ...required for trainings, meetings, and...  ...optimize data pipelines for analytics...  ...reliability, scalability, and performance...  ...Implement data ingestion from internal...  ...with product and engineering teams to translate...  ...intelligence (AI) tools to... 
    Pipeline
    Training
    Work at office
    Remote work
    Home office
    Flexible hours

    Traackr

    New York, NY
    2 days ago
  • $180k - $250k

     ...Staff Analytics Engineer to join a centralized...  ...maintaining a scalable and robust...  ...implementation of complex data transformations...  ...strategies, and training for analytics...  ...to enhance data ingestion, processing pipelines, and integration...  ...utilizing AI Codegen to improve... 
    Pipeline
    Training
    Flexible hours

    Zocdoc

    New York, NY
    3 days ago
  • $110k - $190k

     ...Senior Data Management Professional – Automation Engineer – Funds Location: New...  ...providers and ingesting and normalising...  ...(including AI/ML where appropriate...  ...and scalability Perform quarterly...  ...maintain data pipelines and tools to improve...  ..., education/training and skill... 
    Pipeline
    Training
    Temporary work
    For contractors
    Work experience placement

    Bloomberg

    New York, NY
    3 days ago
  • $180k - $220k

     ...Senior Backend Engineer, Data Modeling and Ingestion Platform New York About the...  ...research by building robust, scalable systems for linking,...  ...workflows using JAX or multihost training is a plus, as the...  ...Understanding of JAX-based ML pipelines, multihost training... 
    Pipeline
    Training
    Work experience placement
    Work at office
    Flexible hours

    Udio

    New York, NY
    1 day ago
  • $160k - $180k

     ...The Data & AI Engineer sits within Carlyle’s Enterprise Technology...  ...and operating the pipelines, semantic layers,...  ...Snowflake. Implement ingestion, modeling, and consumption...  ...standards for scalability, performance, security...  ...prior experience and training; and licenses and/or... 
    Pipeline
    Training
    Work at office

    The Carlyle Group

    New York, NY
    18 hours ago
  • $225k - $275k

     ...seeking an experienced AI Data Engineer to join our Data Engineering...  ...robust data pipelines that power SchonAI, our...  ...Design and build scalable, reliable data pipelines to ingest, transform, and deliver...  ...and observability for AI training and inference pipelines... 
    Pipeline
    Training

    Schonfeld

    New York, NY
    4 days ago
  •  ...nonprofit applied AI research...  ...About the Role Data Engineers on the Platform team...  ...trustworthy data pipelines with comprehensive...  ...documented datasets for training and evaluation,...  ...solutions from ingestion through...  ...performance, and scalability. Implement data... 
    Pipeline
    Training
    Full time
    Work at office

    Basis Research

    New York, NY
    4 days ago
  •  ...TryApplyNow is seeking a Data & AI/ML Engineer in New York, NY. The role focuses on designing and maintaining scalable data pipelines and infrastructure for deploying ML models. Ideal candidates will have a Bachelor's or Master's degree and over 4 years of experience... 
    Pipeline

    TryApplyNow

    New York, NY
    3 days ago
  • $176.72k - $265.08k

     ...level role for a data architect or lead data engineer within a Data...  ...scale Generative AI and Machine Learning...  ...copy the data. Scalability and Performance:...  ...for pre‑training large language models...  ...Advanced AI Ops & Data Pipelines This is the...  ...bus to ingest real‑time data from... 
    Pipeline
    Training

    Information Technology Senior Management Forum

    Jersey City, NJ
    12 hours ago
  •  ...Development team is seeking a data engineer to function as the...  .... Responsibilities Pipeline Architecture: Design,...  ...on rapid data ingestion and lightning-fast query...  ...feature engineering and training to real-time model...  ...Acquisition: Develop scalable frameworks to ingest... 
    Pipeline
    Training

    Med Review Inc

    New York, NY
    4 days ago
  • A leading network technology company is seeking a Data Engineer to design, build, and maintain scalable data pipelines. The role involves collaboration with data scientists, ensuring the data infrastructure is reliable and cost-effective. Ideal candidates should have strong... 
    Pipeline
    Remote job

    Versa Networks

    New York, NY
    2 days ago
  •  ...experienced software engineer to lead the design...  ...on building scalable, resilient, and high...  ...trading and market data. • Develop Python-based AI and quantitative models...  ...engineering, model training, evaluation). • Build...  ...-time and batch pipelines. • Optimize analytics... 
    Pipeline
    Training

    Compunnel

    Jersey City, NJ
    2 days ago
  • $65 - $70 per hour

     ...Data Engineer -601/602 job at Pinnacle Group. Jersey City...  ...are secure, stable, and scalable. You will develop,...  ...maintain essential data pipelines and architectures across...  ...Qualifications Formal training or certification in...  ...proficiency in leveraging Gen AI models using APIs/SDKs... 
    Pipeline
    Training
    Full time
    Contract work

    kozmetickesluzby.vecnakraska.sk - Jobboard

    Jersey City, NJ
    1 day ago

Do you want to receive more vacancies?

Subscribe and receive similar vacancies to Data Ingestion Engineer for Scalable AI Training Pipelines. Be the first to apply!