Sign up to access all features of our service.
  • Job search
  • Favorites
  • Create a CV
    New
  • Salaries
  • Subscriptions

Data Ingestion Engineer for Scalable AI Training Pipelines

Reflection

Reflection, located in New York, is searching for a Data Engineer to build robust data ingestion systems essential for AI training. The ideal candidate will be skilled in web crawling and data acquisition, comfortable working with large datasets, and have excellent communication abilities. The role emphasizes collaboration with researchers and iterative processes based on measurable impact. Benefits include top-tier compensation, health insurance, paid parental leave, and opportunities for team engagement. #J-18808-Ljbffr Reflection

Vacancy posted 3 days ago
Similar jobs that could be interesting for youBased on the Data Ingestion Engineer for Scalable AI Training Pipelines in New York, NY vacancy
  •  .... About The Role Data plays a crucial role...  ...the frontier of AI innovation. Many advances...  ...and operate the ingestion systems that turn...  ...corpora for training frontier models....  ...our pre‑training pipelines, working directly...  ...role is ideal for engineers who love building... 
    Pipeline
    Training
    Relocation package

    Reflection

    New York, NY
    5 days ago
  •  ...deep expertise in Data Science, Machine Learning and AI. We are the trusted...  ...experienced Data Engineer to join our data team...  ..., and maintaining scalable data pipelines, data integration...  ...preparation and ingestion for AI/ML and Generative...  ...for model training, inference, and GenAI... 
    Pipeline
    Training
    Local area

    Tiger Analytics

    Jersey City, NJ
    7 days ago
  •  ...Data Engineer, Gen AI New York, New York, United States About the Job...  ...data infrastructure and pipelines necessary to enable large-...  ...Responsibilities: Design and build scalable data pipelines to ingest, process, and store large volumes of training data for generative AI... 
    Pipeline
    Training

    Inizio Partners

    New York, NY
    5 days ago
  • $150k - $180k

     ...DriveWealth develops data products to...  ...a Senior Data Engineer to take...  ...automation of data ingestion, transformation...  ...requirements are met with scalable technical...  ...data pipelines (Databricks and...  ...We Think About AI We leverage...  ...expertise, education, training, and experience... 
    Pipeline
    Training
    Full time
    Work at office
    Worldwide

    DriveWealth

    New York, NY
    3 days ago
  •  ...We're building AI employees. Not chatbots...  ...can do. The engineering problems are hard...  ...'ll be the first Data Engineer on the Artisan...  ..., and maintain scalable data pipelines that process and...  ...data Manage ingestion from third-party...  ...embeddings, or ML training pipelines ~ Bonus... 
    Pipeline
    Training
    Remote work

    Artisan

    New York, NY
    2 days ago
  • $150k

     ...Microsoft Fabric Data Engineer We are seeking...  ...Engineer with Agentic AI experience to...  ...build, and maintain scalable data solutions within...  ...robust data pipelines, implementing Medallion...  ...and usability Ingest, transform, and integrate...  ...with AI model training requirements.... 
    Pipeline
    Training

    Garan, Incorporated

    New York, NY
    1 day ago
  • $110k - $190k

     ...Senior Data Management Professional - Data Engineering - Commodities Data...  ...directly into pipelines, systems and architecture...  ...high-impact, scalable solutions....  ...technologies including AI and machine...  ...improve data ingestion and enrichment...  ...conditions, education/training and skill level... 
    Pipeline
    Training
    Temporary work
    For contractors
    Work experience placement

    Bloomberg

    New York, NY
    6 days ago
  • Senior Data Engineer We're seeking a Senior Data Engineer...  ...stack of Anaplan AI applications. You...  ...for how we ingest, transform, store,...  ...and deployment of scalable Generative AI and...  ...search, and embedding pipelines. Help design the...  ...including experience training and deploying ML... 
    Pipeline
    Training

    Anaplan Inc

    New York, NY
    2 days ago
  •  ...Senior Consultant, Data Engineer Work...  ...Sector: Data, AI & Analytics Position...  ...data models, and scalable analytics...  ...and maintain data ingestion, transformation,...  ...Exposure to ML pipelines, MLOps, or AI-adjacent...  ...Guilds, regular training, and peer learning... 
    Pipeline
    Training
    Full time
    Work at office

    Biz First

    New York, NY
    2 days ago
  •  ...collection of executives, engineers, data scientists, and...  ...is augmented by AI and machine learning...  ...maintain the data pipelines that power our deep...  ...You’ll work across ingestion, transformation, and...  ...discoverable, and scalable for use by model training, analytics, and AI-... 
    Pipeline
    Training
    Remote work
    Flexible hours

    SumerSports LLC

    New York, NY
    1 day ago
  •  ...provide a versatile AI Platform...  ...exciting environment. Data is at the core...  ..., and the Data Engineering team is a...  ...We leverage and ingest data from multiple...  ...in analytical pipelines. We develop and...  ...high-quality, scalable, and reliable data...  ...you Continuous training and access to... 
    Pipeline
    Training

    Optasia Group

    Brooklyn, NY
    1 day ago
  • $142.6k - $153.1k

     ...looking for a Sr. Data Engineer with strong data...  ...of our emerging AI and ML platform....  ...bring high‑quality, scalable, and ethical AI...  ...supports reliable data pipelines, scalable...  ...independently for ingestion, transformation,...  ...preparation, (2) training and tuning, (3) experimentation... 
    Pipeline
    Training
    Hourly pay

    Octave

    New York, NY
    4 days ago
  • $220k - $240k

     ...Principal Data Engineer New York, New York, United...  ...: Data Ingestion - building reliable pipelines that bring data from...  ...ecosystem, ensuring scalability, reliability, and security...  ...of ML and AI data pipelines and...  ...expertise, education, training, and experience. If... 
    Pipeline
    Training
    Full time
    Work at office
    Worldwide

    DriveWealth

    New York, NY
    3 days ago
  •  ...company providing a data-driven...  ...required for trainings, meetings, and...  ...optimize data pipelines for analytics...  ...reliability, scalability, and performance...  ...Implement data ingestion from internal...  ...with product and engineering teams to translate...  ...intelligence (AI) tools to... 
    Pipeline
    Training
    Work at office
    Remote work
    Home office
    Flexible hours

    Traackr

    New York, NY
    1 day ago
  • $135k - $145k

     ...POSITION SUMMARY The Data Engineer will be...  ...enterprise-wide AI and automation initiatives...  ...Informatica data pipelines, API integrations...  ...teams to deliver scalable, high‑quality...  ...and reliable data ingestion and transformation...  ...reporting, and AI model training, including... 
    Pipeline
    Training
    Work experience placement
    Summer work
    Work at office
    Remote work
    Flexible hours

    Empire State Realty Trust

    New York, NY
    2 days ago
  • $180k - $250k

     ...Staff Analytics Engineer to join a centralized...  ...maintaining a scalable and robust...  ...implementation of complex data transformations...  ...strategies, and training for analytics...  ...to enhance data ingestion, processing pipelines, and integration...  ...utilizing AI Codegen to improve... 
    Pipeline
    Training
    Flexible hours

    Zocdoc

    New York, NY
    2 days ago
  • $180k - $220k

     ...Senior Backend Engineer, Data Modeling and Ingestion Platform New York About the...  ...research by building robust, scalable systems for linking,...  ...workflows using JAX or multihost training is a plus, as the...  ...Understanding of JAX-based ML pipelines, multihost training... 
    Pipeline
    Training
    Work experience placement
    Work at office
    Flexible hours

    Udio

    New York, NY
    5 days ago
  • $110k - $190k

     ...Senior Data Management Professional - Automation Engineer - Funds Location...  ...providers and ingesting and normalising...  ...technologies (including AI/ML where...  ...and scalability Perform quarterly...  ...maintain data pipelines and tools to improve...  ..., education/training and skill... 
    Pipeline
    Training
    Temporary work
    For contractors
    Work experience placement

    Bloomberg

    New York, NY
    6 days ago
  • $225k - $275k

     ...seeking an experienced AI Data Engineer to join our Data Engineering...  ...robust data pipelines that power SchonAI, our...  ...Design and build scalable, reliable data pipelines to ingest, transform, and deliver...  ...and observability for AI training and inference pipelines... 
    Pipeline
    Training

    Schonfeld

    New York, NY
    3 days ago
  • $160k - $180k

     ...Job Name: Data & AI Engineer Location:...  ...building and operating the pipelines, semantic layers,...  ...Snowflake. Implement ingestion, modeling, and consumption...  ...standards for scalability, performance, security...  ...prior experience and training; and licenses and/or... 
    Pipeline
    Training
    Work at office

    Carlyle

    New York, NY
    1 day ago
  •  ...nonprofit applied AI research...  ...About the Role Data Engineers on the Platform team...  ...trustworthy data pipelines with comprehensive...  ...documented datasets for training and evaluation,...  ...solutions from ingestion through...  ...performance, and scalability. Implement data... 
    Pipeline
    Training
    Full time
    Work at office

    Basis Research

    New York, NY
    3 days ago
  •  ...experienced software engineer to lead the design...  ...on building scalable, resilient, and high...  ...trading and market data. • Develop Python-based AI and quantitative models...  ...engineering, model training, evaluation). • Build...  ...-time and batch pipelines. • Optimize analytics... 
    Pipeline
    Training

    Compunnel

    Jersey City, NJ
    1 day ago
  • A leading AI-native revenue platform is seeking its first Data Engineer to design and implement a scalable data infrastructure. This role involves building reliable data pipelines, maintaining data quality, and translating business metrics into actionable models. Ideal... 
    Pipeline

    Tabs

    New York, NY
    2 days ago
  • TryApplyNow is seeking a Data & AI/ML Engineer in New York, NY. The role focuses on designing and maintaining scalable data pipelines and infrastructure for deploying ML models. Ideal candidates will have a Bachelor's or Master's degree and over 4 years of experience in... 
    Pipeline

    TryApplyNow

    New York, NY
    2 days ago
  • $100k - $131k

     ...the business. Fractal is a strategic AI partner to Fortune 500 companies with a...  ...Design, develop, enhance, and maintain scalable data pipelines across heterogeneous datasets in...  ...limited to skill sets; experience and training; licensure and certifications; and other... 
    Pipeline
    Training
    Hourly pay
    Full time
    Local area

    Fractal, Inc.

    New York, NY
    1 day ago
  •  ...looking for a Senior Data Engineer to help design, build...  ...data lifecycle, from ingestion and transformation to reliability, scalability, and ML enablement....  ...scalable, reliable data pipelines and datasets that power...  ...stores that support model training, validation, and... 
    Pipeline
    Training
    Work at office
    Relocation package

    Nelo Mobile

    New York, NY
    1 day ago
  •  ...Data Engineer Do you want to be a part of the team that...  ...space of life science AI. At Tellic, we value...  ...maintain optimal data pipeline architecture Assist...  ...code Architect scalable solutions that can handle...  ...will support ongoing GCP training and certifications)... 
    Pipeline
    Training
    Visa sponsorship

    tellic

    New York, NY
    3 days ago
  •  ...seeking a highly skilled Data Engineer with strong AI/ML and MLOps experience to...  ...and cloud-native deployment pipelines. The ideal candidate will...  ...‑on experience building scalable data platforms, developing...  ...including data preparation, model training, evaluation, deployment,... 
    Pipeline
    Training

    Intellectt Inc

    New York, NY
    1 day ago
  • $99k - $149k

     ...about companies and AI-driven...  ...responsibility is to integrate data from a variety of...  ...and building scalable, efficient processes...  ...and efficient data pipelines. In this role, you...  ...provide documentation, training, and consultation...  ...in software engineering fundamentals and coding... 
    Pipeline
    Training
    Work experience placement
    Local area

    Indeed Inc.

    New York, NY
    14 days ago
  •  ...Data Engineer As a Data Engineer you will get to play a key...  ...storage, streaming and pipeline architectures. Works...  ...to develop and automate scalable ETL/ELT processes for ingesting, transforming, and loading...  ...documentation. Provides training sessions, tutorials, and... 
    Pipeline
    Training
    Flexible hours

    Building Service 32BJ Benefit Funds

    New York, NY
    5 days ago

Do you want to receive more vacancies?

Subscribe and receive similar vacancies to Data Ingestion Engineer for Scalable AI Training Pipelines. Be the first to apply!