Senior Data Engineer (AI Ingestion Platform)
SoftwareMind Americas
We are Software Mind, an awesome team of engineers who are ready to ramp up any top‑notch company’s projects! Our aim? To always be one step ahead. Become part of a multicultural company in constant growth with an excellent work environment certified by Great Place To Work. Job Description About the Project Software Mind is building a private, tenant‑isolated AI assistant for the real estate title and settlement industry. The platform is a retrieval‑first (RAG) system that ingests historical email, documents, and structured metadata into a per‑tenant vector index, and serves grounded, cited, expert‑weighted answers through a chat‑style Q&A interface with single sign‑on and full audit logging. The platform is AWS‑native with a Python/FastAPI backend, Vue.js frontend, OpenSearch/Pinecone vector store, and OpenAI/Anthropic/Bedrock as LLM provider. You will join a senior, cross‑functional LATAM‑based team where hands‑on AI delivery experience, not just familiarity, is the baseline expectation. You own the ingestion and processing backbone of the platform: the pipelines that transform raw email and document corpora into clean, PII‑minimised, chunked, and indexed data in the per‑tenant vector store. This is the foundational layer the AI extraction gateway depends on; quality here directly determines system accuracy. Your Responsibilities Build and own the historical email ingestion pipeline via Microsoft Graph API. Implement SharePoint / OneDrive document ingestion pipeline with scoped folder access. Design and implement the PII minimisation pre‑processing layer. Build the vector store indexing workflow (OpenSearch/Pinecone) with per‑tenant data isolation. Define and implement the data processing schema; produce and maintain schema documentation. Build the OCR routing orchestrator and integrate OCR service for scanned documents. Implement the raw text / content extraction layer for all supported document types. Define and prototype push vs. pull ingestion strategy, from one‑time PoC through to incremental nightly pipeline. Ensure data lineage and audit traceability are built into pipeline outputs from the outset. Tech Stack: Python, Microsoft Graph API, AWS (S3, DynamoDB, Lambda), OpenSearch, Pinecone, OCR Tooling, PII Libraries, NER Libraries, Docker, Jira, Confluence. Qualifications Must‑Have Skills & Experience 6+ years in data engineering; strong pipeline and ETL/ELT experience required. Proficiency in Python for data pipeline development. Experience with Microsoft Graph API or similar enterprise email/document APIs (M365, Exchange Online). AWS data services: S3, DynamoDB, Glue, and/or Lambda‑based event‑driven processing. Familiarity with PII detection and data minimisation techniques (regex‑based, NER‑based, or purpose‑built libraries). Experience with vector store indexing or semantic search pipeline construction. Nice‑to‑Have Prior experience building ingestion pipelines specifically for AI/ML, NLP, or LLM‑based platforms. OCR tooling experience: AWS Textract, Tesseract, or commercial OCR services. Understanding of per‑tenant data isolation patterns, tenant‑scoped encryption, and row‑level security. Familiarity with LangChain document loaders, embedding pipelines, or vector index management. We are accepting applications from LATAM countries. #J-18808-Ljbffr
- ...Software Mind Americas is seeking a Senior Data Engineer to enhance our data processing capabilities. You will design and build ingestion pipelines for AI projects, focusing on document and email data using Microsoft Graph API and AWS services. This role demands a strong...PlatformSenior
- ...New York, NY, is seeking a Senior Data Acquisition Engineer to lead the development of a... ...scalable web data acquisition platform. This hybrid role focuses on enhancing data ingestion processes and system... ...opportunity to contribute to AI-driven technologies and their...PlatformSenior
$160k - $180k
...company building critical data infrastructure in the... ...space. We are seeking a Lead/Senior Data Engineer to own the data platform from the ground up. You’... ...architecture on AWS, including ingestion, transformation, and... .... Active user of AI‑assisted coding tools (Claude...PlatformSenior- ...Snowflake is looking for a Principal Data Engineering Solutions Specialist to drive sales execution for ingestion workloads. Responsibilities... ...optimize usage of Snowflake's platform. Candidates should possess 10+... ...and making an impact in the AI-driven enterprise landscape....PlatformSenior
- ...years of experience in data engineering, backend engineering,... ...about ambiguous external platform behavior and turning... ...making We use AI-augmented development... ...the job involves As a Senior Data Engineer, you will... ...pipelines and services that ingest, transform, and reconcile...PlatformSenior
- ...bring deep expertise in Data Science, Machine Learning and AI. We are the trusted analytics... ...an experienced Data Engineer to join our data team. In... ...Support data preparation and ingestion for AI/ML and Generative... ...for data workflows and platform automation Collaborate with...PlatformSeniorLocal area
- ...Senior Data Engineer We're seeking a Senior Data Engineer to work across the full stack of Anaplan AI applications. You will build transformative AI capabilities... ...direction for how we ingest, transform, store, serve,... ...native ML infrastructure platforms. Knowledge of vector...PlatformSenior
- ...Quantiphi is an award-winning, AI-First digital engineering and consulting company... ...expertise, disciplined cloud and data engineering practices, and... ...to leading cloud and AI platforms such as NVIDIA, Google... ...based solutions at scale by ingesting data from sources like DB2....PlatformSeniorFull time
$140k - $180k
...PTO + Healthcare Insurance Are you a Senior Data Engineer with experience designing and building reliable data platforms, looking to join a high-growth AI start-up operating in the E-commerce... ...and warehouse systems that ingest, transform, and serve critical business...PlatformSeniorRemote work- ...| Contract We are seeking a Senior Data Engineer to join our team, focusing on... ...the overall Customer Data Platform strategy in a dynamic, cloud... ...warehouse to facilitate data ingestion into Braze CRM. Defines and... ...Sciences & Analytics, Modern AI/ML Model Monetization, Cloud...PlatformSeniorContract workRemote work
$225k - $285k
...is building the first AI-powered system of action... ...top-performing growth engine by making go-to-market... ...generation of go-to-market. As Senior Data Engineer for Enrichment... ...the pipelines that ingest data from multiple... ...Build the enrichment platform - Design and scale pipelines...PlatformSeniorRemote work$120.8k - $217.4k
...Acuity Brands is seeking a Senior Data Engineer to join its Atrius Analytics... ...scalable, intelligent data platform that ingests and normalizes... ...data engineering pipelines for ingesting, transforming, and storing IoT... ...to design, test, and deploy AI/ML solutions, collaborating...PlatformSenior- ...Startup in the Marketing Technology AI space, with a platform that will transform how brands... ...represented in AI. About the Role As a Senior Data Acquisition Engineer , you will own and evolve the... ...production-grade scraping and ingestion infrastructure that enables multiple...PlatformSeniorWork at office
- ...our business systems into a single, unified data platform to power analytics and AI initiatives. As a Senior Data Engineer , you will play a critical role in designing,... ...architecture principles. Develop and manage data ingestion and ETL/ELT pipelines from core business...PlatformSeniorFull time
$200k - $250k
...buried in spreadsheets, manual data pulls, and disconnected... ...'re building the agentic AI platform designed exclusively for... ...About the role We’re hiring a Senior Forward Deployed Engineer with a data engineering... ...business logic. Optimize data ingestion and transformation...PlatformSeniorWork at office- ...financial technology platform covering scoring,... ...a versatile AI Platform powering... ...exciting environment. Data is at the core of... ...plan, and the Data Engineering team is a significant... .... We leverage and ingest data from multiple... ...an experienced Senior Data Engineer – Team...PlatformSenior
- ...Senior Data Engineer Cobalt ID is building the business identity infrastructure... ...synthetic ones. With AI accelerating fraud rings and... ...possible. You'll build the ingestion pipelines, entity resolution... ...powers leading social media platforms, search engines, and data fusion...PlatformSeniorFull time
$220k - $280k
...Formation Bio is a tech and AI driven pharma company... ...., has built technology platforms, processes, and... ...shape the future of our Data Platform. This role is ideal... ...the intersection of data engineering and applied AI. You’ll lead efforts to ingest and transform unstructured...PlatformSeniorRelocation$120k - $127k
...services. The Opportunity As a Senior Data Engineer at Orijin, you will be a... ...modernizing the company’s data platform. Your primary focus will be... ...practices (event‑driven ingestion and streaming where appropriate... ...machine learning or AI workflows (e.g., feature engineering...PlatformSeniorPermanent employmentWork experience placementWork at officeRemote work$140k - $160k
...Job Description The Data Engineering team is seeking a Senior Data Engineer to help design,... ...and scale the modern data platform that powers analytics, data... ...batch and real‑time ingestion pipelines leveraging Databricks... ...). Comfortable using AI‑assisted development tools...PlatformSeniorWork at office3 days per week$110k - $190k
...Overview Senior Data Management Professional - Data Engineering - Commodities Data. Location: New York... ...proactive monitoring. Apply AI and machine learning... ...detection to improve data ingestion and enrichment. Identify... ...Engineering to align on platform evolution, scalability...PlatformSeniorWork experience placement$110k - $130k
...industry-leading proprietary data, technologies and... ...areas. Curinos is hiring a Senior Data Engineer I to join our Data Platform team . This team owns the... ...diverse team of engineers, AI and ML scientists, and product... ...AI platforms, including ingestion, storage, validation,...PlatformSeniorPart timeRemote workWork from homeFlexible hours- ...highly skilled and experienced Senior Data Engineer to join our growing data... ...optimize ETL/ELT processes to ingest, transform, and load data... ...performance bottlenecks in the data platform. Drive continuous... ...e.g., Git). Experience with AI tools (Claude, Vertex AI, Gemini...PlatformSeniorRemote work
- ...A leading cybersecurity firm is looking for a Senior Engineer II to join their GDI team. This role focuses on enabling petabyte-scale data ingestion and seamless onboarding experience using AI technologies. The ideal candidate will have over 10 years of experience in...Senior
$180k - $220k
...fully configurable and content-rich, AI-powered platform along with best-in-class expertise.... ...and drive down healthcare costs. As a Senior Data Engineer , you’ll be at the heart of... ...lifecycle of core pipelines — from file ingestion to validated, queryable datasets — ensuring...PlatformSeniorFlexible hours$176k - $207k
...Senior Data Engineer, Data Foundations & AI Platform United States (Remote) We Breathe Life Into Data At Komodo Health, our mission is to reduce the global... ...You’ll own the end‑to‑end data processing lifecycle—ingestion, modeling, and serving—at scale, using cloud...PlatformSeniorLocal areaRemote workFlexible hours$155k - $185k
...Overview We’re looking for a Senior Data Engineer who can take loosely defined... ...Hands‑on experience ingesting data from diverse sources, including... ...; experience on cloud platforms such as Snowflake, Redshift,... ...understanding of how to leverage AI and automation in data engineering...PlatformSeniorWork at office2 days per week3 days per week$150k - $190k
...Architect and operate the platform – 24x7 reliability, IaC‑driven... ...streaming pipelines that ingest billions of market‑data events each day Architect high... ...consumed by quants and AI agents Optimize workflows to... ...every data asset Mentor junior engineers and enforce best‑in‑class...PlatformSenior- ...team, we build multi‑agent AI systems that automate complex business workflows across platforms such as SAP, Salesforce, Workday... ...Summary We are seeking a Senior Data Engineer to design and build scalable... ...scalable ETL/ELT pipelines to ingest and transform data from...PlatformSeniorRemote work
- ...Anchorage Digital is seeking a mid-senior level engineer for their Asset Data Team to develop advanced data import systems for crypto asset management... ..., contributing to a growing and dynamic digital asset platform. This fully remote position allows for collaboration across...PlatformSeniorRemote work
Do you want to receive more vacancies?
Subscribe and receive similar vacancies to Senior Data Engineer (AI Ingestion Platform). Be the first to apply!
- staff data engineer New York, NY
- data engineering intern summer New York, NY
- senior data integration developer New York, NY
- data engineer graduate New York, NY
- data engineer contract New York, NY
- data science developer New York, NY
- senior data center engineer New York, NY
- software data engineer New York, NY
- hadoop big data developer New York, NY
- data developer New York, NY

