Sign up to access all features of our service.
  • Job search
  • Favorites
  • Create a CV
    New
  • Salaries
  • Subscriptions

Senior Data Engineer (AI Ingestion Platform)

SoftwareMind Americas

We are Software Mind, an awesome team of engineers who are ready to ramp up any top‑notch company’s projects! Our aim? To always be one step ahead. Become part of a multicultural company in constant growth with an excellent work environment certified by Great Place To Work. Job Description About the Project Software Mind is building a private, tenant‑isolated AI assistant for the real estate title and settlement industry. The platform is a retrieval‑first (RAG) system that ingests historical email, documents, and structured metadata into a per‑tenant vector index, and serves grounded, cited, expert‑weighted answers through a chat‑style Q&A interface with single sign‑on and full audit logging. The platform is AWS‑native with a Python/FastAPI backend, Vue.js frontend, OpenSearch/Pinecone vector store, and OpenAI/Anthropic/Bedrock as LLM provider. You will join a senior, cross‑functional LATAM‑based team where hands‑on AI delivery experience, not just familiarity, is the baseline expectation. You own the ingestion and processing backbone of the platform: the pipelines that transform raw email and document corpora into clean, PII‑minimised, chunked, and indexed data in the per‑tenant vector store. This is the foundational layer the AI extraction gateway depends on; quality here directly determines system accuracy. Your Responsibilities Build and own the historical email ingestion pipeline via Microsoft Graph API. Implement SharePoint / OneDrive document ingestion pipeline with scoped folder access. Design and implement the PII minimisation pre‑processing layer. Build the vector store indexing workflow (OpenSearch/Pinecone) with per‑tenant data isolation. Define and implement the data processing schema; produce and maintain schema documentation. Build the OCR routing orchestrator and integrate OCR service for scanned documents. Implement the raw text / content extraction layer for all supported document types. Define and prototype push vs. pull ingestion strategy, from one‑time PoC through to incremental nightly pipeline. Ensure data lineage and audit traceability are built into pipeline outputs from the outset. Tech Stack: Python, Microsoft Graph API, AWS (S3, DynamoDB, Lambda), OpenSearch, Pinecone, OCR Tooling, PII Libraries, NER Libraries, Docker, Jira, Confluence. Qualifications Must‑Have Skills & Experience 6+ years in data engineering; strong pipeline and ETL/ELT experience required. Proficiency in Python for data pipeline development. Experience with Microsoft Graph API or similar enterprise email/document APIs (M365, Exchange Online). AWS data services: S3, DynamoDB, Glue, and/or Lambda‑based event‑driven processing. Familiarity with PII detection and data minimisation techniques (regex‑based, NER‑based, or purpose‑built libraries). Experience with vector store indexing or semantic search pipeline construction. Nice‑to‑Have Prior experience building ingestion pipelines specifically for AI/ML, NLP, or LLM‑based platforms. OCR tooling experience: AWS Textract, Tesseract, or commercial OCR services. Understanding of per‑tenant data isolation patterns, tenant‑scoped encryption, and row‑level security. Familiarity with LangChain document loaders, embedding pipelines, or vector index management. We are accepting applications from LATAM countries. #J-18808-Ljbffr

Vacancy posted 10 hours ago
Similar jobs that could be interesting for youBased on the Senior Data Engineer (AI Ingestion Platform) in New York, NY vacancy
  •  ...Software Mind Americas is seeking a Senior Data Engineer to enhance our data processing capabilities. You will design and build ingestion pipelines for AI projects, focusing on document and email data using Microsoft Graph API and AWS services. This role demands a strong... 
    Platform
    Senior

    SoftwareMind Americas

    New York, NY
    1 day ago
  •  ...New York, NY, is seeking a Senior Data Acquisition Engineer to lead the development of a...  ...scalable web data acquisition platform. This hybrid role focuses on enhancing data ingestion processes and system...  ...opportunity to contribute to AI-driven technologies and their... 
    Platform
    Senior

    Rachel Paul Recruiting

    New York, NY
    3 days ago
  • $160k - $180k

     ...company building critical data infrastructure in the...  ...space. We are seeking a Lead/Senior Data Engineer to own the data platform from the ground up. You’...  ...architecture on AWS, including ingestion, transformation, and...  .... Active user of AI‑assisted coding tools (Claude... 
    Platform
    Senior

    Metrics Recruitment

    New York, NY
    4 days ago
  •  ...Snowflake is looking for a Principal Data Engineering Solutions Specialist to drive sales execution for ingestion workloads. Responsibilities...  ...optimize usage of Snowflake's platform. Candidates should possess 10+...  ...and making an impact in the AI-driven enterprise landscape.... 
    Platform
    Senior

    Snowflake Computing

    New York, NY
    2 days ago
  •  ...years of experience in data engineering, backend engineering,...  ...about ambiguous external platform behavior and turning...  ...making We use AI-augmented development...  ...the job involves As a Senior Data Engineer, you will...  ...pipelines and services that ingest, transform, and reconcile... 
    Platform
    Senior

    DoubleVerify

    New York, NY
    4 days ago
  •  ...bring deep expertise in Data Science, Machine Learning and AI. We are the trusted analytics...  ...an experienced Data Engineer to join our data team. In...  ...Support data preparation and ingestion for AI/ML and Generative...  ...for data workflows and platform automation Collaborate with... 
    Platform
    Senior
    Local area

    Tiger Analytics

    Jersey City, NJ
    1 day ago
  •  ...Senior Data Engineer We're seeking a Senior Data Engineer to work across the full stack of Anaplan AI applications. You will build transformative AI capabilities...  ...direction for how we ingest, transform, store, serve,...  ...native ML infrastructure platforms. Knowledge of vector... 
    Platform
    Senior

    Anaplan

    New York, NY
    3 days ago
  •  ...Quantiphi is an award-winning, AI-First digital engineering and consulting company...  ...expertise, disciplined cloud and data engineering practices, and...  ...to leading cloud and AI platforms such as NVIDIA, Google...  ...based solutions at scale by ingesting data from sources like DB2.... 
    Platform
    Senior
    Full time

    Quantiphi

    Jersey City, NJ
    4 days ago
  • $140k - $180k

     ...PTO + Healthcare Insurance Are you a Senior Data Engineer with experience designing and building reliable data platforms, looking to join a high-growth AI start-up operating in the E-commerce...  ...and warehouse systems that ingest, transform, and serve critical business... 
    Platform
    Senior
    Remote work

    Rise Technical

    New York, NY
    2 days ago
  •  ...| Contract We are seeking a Senior Data Engineer to join our team, focusing on...  ...the overall Customer Data Platform strategy in a dynamic, cloud...  ...warehouse to facilitate data ingestion into Braze CRM. Defines and...  ...Sciences & Analytics, Modern AI/ML Model Monetization, Cloud... 
    Platform
    Senior
    Contract work
    Remote work

    Delphi-US, LLC - Peacemakers in the Talent War

    New York, NY
    4 days ago
  • $225k - $285k

     ...is building the first AI-powered system of action...  ...top-performing growth engine by making go-to-market...  ...generation of go-to-market. As Senior Data Engineer for Enrichment...  ...the pipelines that ingest data from multiple...  ...Build the enrichment platform - Design and scale pipelines... 
    Platform
    Senior
    Remote work

    Unify

    New York, NY
    3 days ago
  • $120.8k - $217.4k

     ...Acuity Brands is seeking a Senior Data Engineer to join its Atrius Analytics...  ...scalable, intelligent data platform that ingests and normalizes...  ...data engineering pipelines for ingesting, transforming, and storing IoT...  ...to design, test, and deploy AI/ML solutions, collaborating... 
    Platform
    Senior

    6AM City

    New York, NY
    3 days ago
  •  ...Startup in the Marketing Technology AI space, with a platform that will transform how brands...  ...represented in AI. About the Role As a Senior Data Acquisition Engineer , you will own and evolve the...  ...production-grade scraping and ingestion infrastructure that enables multiple... 
    Platform
    Senior
    Work at office

    Rachel Paul Recruiting

    New York, NY
    3 days ago
  •  ...our business systems into a single, unified data platform to power analytics and AI initiatives. As a Senior Data Engineer , you will play a critical role in designing,...  ...architecture principles. Develop and manage data ingestion and ETL/ELT pipelines from core business... 
    Platform
    Senior
    Full time

    Level Access

    New York, NY
    1 day ago
  • $200k - $250k

     ...buried in spreadsheets, manual data pulls, and disconnected...  ...'re building the agentic AI platform designed exclusively for...  ...About the role We’re hiring a Senior Forward Deployed Engineer with a data engineering...  ...business logic. Optimize data ingestion and transformation... 
    Platform
    Senior
    Work at office

    Translucent AI

    New York, NY
    1 day ago
  •  ...financial technology platform covering scoring,...  ...a versatile AI Platform powering...  ...exciting environment. Data is at the core of...  ...plan, and the Data Engineering team is a significant...  .... We leverage and ingest data from multiple...  ...an experienced Senior Data Engineer – Team... 
    Platform
    Senior

    Optasia Group

    Brooklyn, NY
    3 days ago
  •  ...Senior Data Engineer Cobalt ID is building the business identity infrastructure...  ...synthetic ones. With AI accelerating fraud rings and...  ...possible. You'll build the ingestion pipelines, entity resolution...  ...powers leading social media platforms, search engines, and data fusion... 
    Platform
    Senior
    Full time

    Cobalt Identity Systems

    New York, NY
    3 days ago
  • $220k - $280k

     ...Formation Bio is a tech and AI driven pharma company...  ...., has built technology platforms, processes, and...  ...shape the future of our Data Platform. This role is ideal...  ...the intersection of data engineering and applied AI. You’ll lead efforts to ingest and transform unstructured... 
    Platform
    Senior
    Relocation

    Initial Therapeutics, Inc.

    New York, NY
    4 days ago
  • $120k - $127k

     ...services. The Opportunity As a Senior Data Engineer at Orijin, you will be a...  ...modernizing the company’s data platform. Your primary focus will be...  ...practices (event‑driven ingestion and streaming where appropriate...  ...machine learning or AI workflows (e.g., feature engineering... 
    Platform
    Senior
    Permanent employment
    Work experience placement
    Work at office
    Remote work

    Orijin, a public benefit corporation

    New York, NY
    3 days ago
  • $140k - $160k

     ...Job Description The Data Engineering team is seeking a Senior Data Engineer to help design,...  ...and scale the modern data platform that powers analytics, data...  ...batch and real‑time ingestion pipelines leveraging Databricks...  ...). Comfortable using AI‑assisted development tools... 
    Platform
    Senior
    Work at office
    3 days per week

    Jobr

    New York, NY
    2 days ago
  • $110k - $190k

     ...Overview Senior Data Management Professional - Data Engineering - Commodities Data. Location: New York...  ...proactive monitoring. Apply AI and machine learning...  ...detection to improve data ingestion and enrichment. Identify...  ...Engineering to align on platform evolution, scalability... 
    Platform
    Senior
    Work experience placement

    Bloomberg

    New York, NY
    3 days ago
  • $110k - $130k

     ...industry-leading proprietary data, technologies and...  ...areas. Curinos is hiring a Senior Data Engineer I to join our Data Platform team . This team owns the...  ...diverse team of engineers, AI and ML scientists, and product...  ...AI platforms, including ingestion, storage, validation,... 
    Platform
    Senior
    Part time
    Remote work
    Work from home
    Flexible hours

    Curinos

    New York, NY
    3 days ago
  •  ...highly skilled and experienced Senior Data Engineer to join our growing data...  ...optimize ETL/ELT processes to ingest, transform, and load data...  ...performance bottlenecks in the data platform. Drive continuous...  ...e.g., Git). Experience with AI tools (Claude, Vertex AI, Gemini... 
    Platform
    Senior
    Remote work

    Comfrt

    New York, NY
    2 days ago
  •  ...A leading cybersecurity firm is looking for a Senior Engineer II to join their GDI team. This role focuses on enabling petabyte-scale data ingestion and seamless onboarding experience using AI technologies. The ideal candidate will have over 10 years of experience in... 
    Senior

    CrowdStrike

    New York, NY
    3 days ago
  • $180k - $220k

     ...fully configurable and content-rich, AI-powered platform along with best-in-class expertise....  ...and drive down healthcare costs. As a Senior Data Engineer , you’ll be at the heart of...  ...lifecycle of core pipelines — from file ingestion to validated, queryable datasets — ensuring... 
    Platform
    Senior
    Flexible hours

    Machinify, Inc.

    New York, NY
    2 days ago
  • $176k - $207k

     ...Senior Data Engineer, Data Foundations & AI Platform United States (Remote) We Breathe Life Into Data At Komodo Health, our mission is to reduce the global...  ...You’ll own the end‑to‑end data processing lifecycle—ingestion, modeling, and serving—at scale, using cloud... 
    Platform
    Senior
    Local area
    Remote work
    Flexible hours

    Komodo Health

    New York, NY
    20 hours ago
  • $155k - $185k

     ...Overview We’re looking for a Senior Data Engineer who can take loosely defined...  ...Hands‑on experience ingesting data from diverse sources, including...  ...; experience on cloud platforms such as Snowflake, Redshift,...  ...understanding of how to leverage AI and automation in data engineering... 
    Platform
    Senior
    Work at office
    2 days per week
    3 days per week

    Gusto

    New York, NY
    3 days ago
  • $150k - $190k

     ...Architect and operate the platform – 24x7 reliability, IaC‑driven...  ...streaming pipelines that ingest billions of market‑data events each day Architect high...  ...consumed by quants and AI agents Optimize workflows to...  ...every data asset Mentor junior engineers and enforce best‑in‑class... 
    Platform
    Senior

    Schonfeld Strategic Advisors LLC Defunct

    New York, NY
    3 days ago
  •  ...team, we build multi‑agent AI systems that automate complex business workflows across platforms such as SAP, Salesforce, Workday...  ...Summary We are seeking a Senior Data Engineer to design and build scalable...  ...scalable ETL/ELT pipelines to ingest and transform data from... 
    Platform
    Senior
    Remote work

    Tessera Labs

    New York, NY
    3 days ago
  •  ...Anchorage Digital is seeking a mid-senior level engineer for their Asset Data Team to develop advanced data import systems for crypto asset management...  ..., contributing to a growing and dynamic digital asset platform. This fully remote position allows for collaboration across... 
    Platform
    Senior
    Remote work

    Anchorage Digital

    New York, NY
    2 days ago

Do you want to receive more vacancies?

Subscribe and receive similar vacancies to Senior Data Engineer (AI Ingestion Platform). Be the first to apply!