AI Data Engineer
IntegriChain
IntegriChain is the data and application backbone for market access departments of Life Sciences manufacturers. We deliver the data, the applications, and the business process infrastructure for patient access and therapy commercialization. More than 250 manufacturers rely on our ICyte Platform to orchestrate their commercial and government payer contracting, patient services, and distribution channels. ICyte is the first and only platform that unites the financial, operational, and commercial data sets required to support therapy access in the era of specialty and precision medicine. With ICyte, Life Sciences innovators can digitalize their market access operations, freeing up resources to focus on more data-driven decision support. With ICyte, Life Sciences innovators are digitalizing labor-intensive processes – freeing up their best talent to identify and resolve coverage and availability hurdles and to manage pricing and forecasting complexity. We are headquartered in Philadelphia, PA (USA), with offices in: Ambler, PA (USA); Pune, India; and Medellín, Colombia. For more information, visit or follow us on Twitter @IntegriChain and LinkedIn. This role offers flexibility, but candidates must reside in Pennsylvania, New Jersey, or New York and be within a reasonable travel distance of our Philadelphia office, as regular in-person collaboration is required. Mission Join the Data Science team as an AI Data Engineer responsible for building the data foundations that make enterprise AI products accurate, explainable, and scalable. This role will design and implement Snowflake and dbt pipelines from raw source data to curated gold-layer datasets, create semantic models that LLM tools can use reliably, and partner with data science, product, and engineering teams to convert data dictionaries and business definitions into AI-ready data products. The ideal candidate is a strong data engineer with deep Snowflake/dbt experience and a practical understanding of how semantic layers, ER relationships, denormalized models, and metadata quality influence LLM and agent performance. Position Overview Snowflake and dbt engineering: Design, build, optimize, and operate Snowflake pipelines and dbt models across raw, curated, and gold-layer datasets. AI-ready semantic modeling: Create semantic models, relationships, metrics, dimensions, and curated views that allow LLM tools and agents to answer questions accurately. Data dictionary-driven delivery: Translate team-defined data dictionaries, business definitions, and source mappings into tested, governed, and reusable data products. Agent consumption focus: Design datasets for AI agents, natural-language analytics, Snowflake Cortex Analyst, and other LLM-powered tools. Enterprise data modeling: Balance normalized source models, ER relationships, dimensional models, denormalized consumption layers, and semantic-layer needs. Key Responsibilities Snowflake, dbt, and Data Pipeline Development Build reliable data pipelines from raw source data through curated silver layers and business-ready gold layers using Snowflake and dbt. Develop modular dbt models, tests, documentation, exposures, and lineage-friendly transformation patterns. Implement incremental processing, snapshots, audit columns, reconciliation, data quality checks, and restartable pipeline patterns. Optimize Snowflake SQL and dbt workloads for performance, scalability, cost, and maintainability. Work with orchestration and DevOps/SRE teams to support CI/CD, environment promotion, pipeline monitoring, and operational runbooks. Semantic Models and AI-Ready Data Products Create Snowflake semantic models and curated views that support accurate natural-language querying through Snowflake Cortex Analyst and related LLM tools. Translate approved data dictionaries into semantic model dimensions, facts, metrics, synonyms, descriptions, relationships, and business rules. Design ER relationships and join paths that are explicit, accurate, and easy for semantic-layer tools and AI agents to use. Create denormalized or consumption-optimized models where appropriate to reduce ambiguity and improve LLM answer quality. Partner with AI developers to understand tool schema needs, agent workflows, and how data model design affects LLM tool performance. Data Modeling, Integration, and Consolidation Design logical and physical models that support enterprise data consolidation, analytical reporting, AI workflows, and business operations. Work across source systems, files, APIs, cloud storage, operational systems, and analytical platforms to integrate data into Snowflake. Create reusable patterns for source-to-target mapping, schema evolution, master/reference data alignment, and data product publishing. Collaborate with business and technical stakeholders to validate data definitions, grain, relationships, hierarchies, and measures. Support data consolidation across Integrichain by rationalizing overlapping datasets and aligning enterprise definitions. Snowflake Cortex and AI Platform Enablement Understand Snowflake Cortex capabilities, including Cortex Analyst, Cortex Complete, semantic views/models, and metadata-driven AI workflows. Prepare data models and semantic layers for accurate LLM usage, including clear naming, descriptions, relationships, metrics, and governance metadata. Support AI Explorer and similar applications by ensuring curated datasets are reliable, performant, explainable, and governed. Partner with AI and application teams to troubleshoot semantic model issues, poor AI answers, ambiguous joins, missing metadata, or incorrect measures. Contribute to standards for AI-ready data design, semantic model review, data dictionary alignment, and LLM-friendly data modeling. Qualifications 6+ years of experience in data engineering, analytics engineering, database engineering, or data platform development in production environments. Strong hands-on experience with Snowflake, including SQL development, performance tuning, security-aware design, cost optimization, and large-volume processing. Strong hands-on experience with dbt or comparable ELT tooling, including models, tests, documentation, lineage, and environment promotion. Experience building raw-to-curated-to-gold data pipelines and business-ready datasets. Strong SQL and Snowflake development skills, including complex transformations, views, stored procedures/Snowflake Scripting, and query optimization. Experience creating semantic layers, semantic models, metrics, dimensions, relationships, and curated analytical views. Good understanding of ER modeling, dimensional modeling, denormalized consumption models, and data grain management. Experience translating data dictionaries and business definitions into physical models, dbt models, and semantic-layer definitions. Understanding of Snowflake Cortex capabilities such as Cortex Analyst, Cortex Complete, and semantic-model-driven natural-language querying. Ability to partner with data science, product, engineering, and business teams to deliver AI-ready data products. Preferred Experience Experience in life sciences, healthcare, pharma commercialization, MDM, patient data, channel data, or commercial data platforms. Experience with Snowflake semantic views, Cortex Analyst, Cortex Search, or other AI/LLM data platform capabilities. Experience with data quality frameworks, metadata management, data observability, and lineage tooling. Experience with orchestration tools such as dbt Cloud jobs, Airflow, Dagster, cloud-native schedulers, or similar platforms. Experience with Python for data automation, metadata processing, testing, or API integrations. Experience designing governed data products for BI, AI/ML, natural-language analytics, or agentic applications. Snowflake SnowPro, dbt certification, or equivalent data engineering credentials. What does IntegriChain have to offer? Mission driven: Work with the purpose of helping to improve patients' lives! Excellent and affordable medical benefits + non-medical perks including Student Loan Reimbursement, Flexible Paid Time Off and Paid Parental Leave 401(k) Plan with a Company Match to prepare for your future Robust Learning & Development opportunities including over 700+ development courses free to all employees IntegriChain is committed to equal treatment and opportunity in all aspects of recruitment, selection, and employment without regard to race, color, religion, national origin, ethnicity, age, sex, marital status, physical or mental disability, gender identity, sexual orientation, veteran or military status, or any other category protected under the law. IntegriChain is an equal opportunity employer; committed to creating a community of inclusion, and an environment free from discrimination, harassment, and retaliation. Our policy on visa sponsorship for US based positions: Applicants for employment in the US must have valid work authorization that does not now and/or will not in the future require sponsorship of a visa for employment authorization in the US by IntegriChain. #J-18808-Ljbffr IntegriChain
- ...AriesView, a dynamic Real Estate AI startup in Boston, is looking for a Data & AI Software Engineer intern to support research into AI technologies. The intern will engage in implementing AI-driven workflows and contribute to the company's technology roadmap. The role...SuggestedInternshipRemote work
$160k - $180k
...The Data & AI Engineer sits within Carlyle’s Enterprise Technology & Data organization and supports firm‑wide data and AI initiatives spanning investment platforms, portfolio operations, investor relations, and corporate functions. The role operates within a federated...SuggestedWork at office$130k - $150k
...Senior AI Data Engineer (Agentic Systems) Location: USA / Europe / Israel - with a 5-hour overlap with EST hours Compensation: $130K - $150K We are hiring on behalf of our client who builds the technology that powers safer, more accessible financial markets. Our risk...Suggested- ...LATAM) Language Requirements: Advanced English (required), Intermediate Spanish (desired) About the Role We are looking for a AI Data Engineer to support several initiatives within our Data Management team , focused on Artificial Intelligence, data processing, and cloud...SuggestedRemote work
- A dynamic tech company in New York is seeking a Senior Engineer, Data & AI to develop and scale impactful features. The ideal candidate will have extensive experience in software engineering and AI, particularly in Python, SQL, and cloud infrastructure. Responsibilities...Suggested
- Uma plataforma de recrutamento está contratando para um papel de engenharia que envolve tarefas de avaliação em NLP/ML, incluindo rotulagem de dados e QA. Os candidatos devem ter experiência em funções técnicas com forte capacidade analítica e atenção aos detalhes. A vaga...Remote job
- A technology consulting firm in New York is seeking a skilled Data Engineer to support generative AI initiatives. You will design and maintain data infrastructure and pipelines for AI model deployment. The position requires a bachelor’s or master’s degree with 3+ years...
$175k - $225k
Cervin in New York is seeking a Senior Engineer, Data & AI to architect and implement AI-powered features for their platform. The ideal candidate will have over five years of relevant experience, strong skills in Python and SQL, and competency in building AI/ML-driven products...Remote jobFull time- Job Summary AriesView is an emerging Real Estate AI startup transforming how investors analyze and manage commercial real estate portfolios. They are seeking a Data & AI Software Engineer intern to support early‑stage research into AI technologies and contribute to the...Remote jobInternship
- A technology analytics firm based in New York is seeking an experienced data engineer to enhance data solutions. The ideal candidate will have at least 6 years in data engineering, proficient in SQL, and have experience in object-oriented programming languages like Python...
- We have partnered with a leading technology research organization to hire an AI Data Engineer. In this role, you will build scalable data pipelines, partner closely with Data Scientists and ML Engineers, and ensure the organization’s AI/ML models are fueled by high-quality...Contract workRemote work
- NucleusTeq in New York is looking for a Data Engineer + Gen AI for a 12+ month hybrid role. You will design and build scalable data platforms while leading a team of skilled engineers. Ideal candidates should have 6+ years in software development, expertise in Google Cloud...
- A leading financial services company is seeking a Data Engineer to build scalable data infrastructure and AI-enabled workflows. You will design, build, and optimize data pipelines while collaborating closely with teams to innovate solutions. Required skills include extensive...
$241k - $338k
Biohub is seeking a Data Engineer to design and implement data pipelines for genomic and imaging data at scale. This role requires strong software engineering skills and experience with distributed computing frameworks. The ideal candidate will have 8+ years of experience...- A CI está em busca de um Senior AI Data Engineer para transformar dados brutos em produtos confiáveis, utilizando inteligência artificial. O papel inclui desenvolver transformações com DBT e garantir a qualidade dos dados, colaborando com diversas equipes para criar solu...
$101.49k - $147k
Position Summary We have an exciting opportunity to join our team as a Sr. Engineer I, AI. NYU Langone Health seeks an experienced Data Science & AI Engineer to design, build, deploy, and govern enterprise‑grade AI and machine learning capabilities that support clinical...- Happy Scribe is seeking a Data Engineer in New York City, responsible for building and managing the data foundation of Glidepath. The ideal... ...engineering, focusing on creating scalable data pipelines and powering AI-driven insights. This position offers a unique chance to work...Local area
- Role: AI & Data Engineer Location: Remote US, light travel may be required Employment Type: Contract-to-Hire (6 Months) Top Skills: RAG, Snowflake, SQL/Oracle, JSON/APIs, Python/Java, AWS S3, CI/CD (Git/GitHub) Preferred Skills: Openpages, enterprise security and governance...Contract workRemote work
- ...seeks a Scientific Reasoning & Discovery Engineer to design high-quality datasets enhancing... ...creating tasks that require comprehensive data analysis and collaboration for ensuring scientific... ...to work fully remote on cutting-edge AI projects. #J-18808-Ljbffr Codefeast...Remote job
$179.4k - $224.25k
Scale AI, Inc. is looking for a Forward Deployed Engineer based in New York City to deliver critical data infrastructure for advanced AI models. This position involves working with leading AI companies and government agencies to solve complex data-related problems, and...- ProfitSolvis a SaaS business services provider for the legal and accounting industry. We are looking for an AI Data Engineer to join our growing team! We are seeking a seasoned AI Data Engineer to support the building of a centralized data platform on AWS to unify data...Remote jobWork from homeDay shift
- A leading technology research organization is seeking an AI Data Engineer to build scalable data pipelines and collaborate with data scientists on impactful AI initiatives. This is a fully remote role ideal for someone with at least 4 years of Data Engineering experience...Remote job
- Wipro is seeking AI/ML specialists to design and optimize models for data analysis and NLP applications. Ideal candidates should be IIT graduates with strong skills in Python, TensorFlow, and experience with NLP and generative AI models. Responsibilities include building...
- A tech solutions company is seeking an experienced AI and Data Engineer to design and deploy AI-driven solutions and enterprise data integrations. The role requires experience in machine learning, building data pipelines, and integrating systems while ensuring compliance...Remote jobContract work
- The Carlyle Group is seeking a Senior Data & AI Engineer to build and operate AI-ready data pipelines and core AI products. This role requires in-depth expertise in data engineering and applied AI engineering, collaborating with data science teams to deliver enterprise...
- US-based Software Development firm seeking a Data & AI Engineer to join our growing remote team! In this role, you will support our ongoing software projects and help plan and execute new projects. Summary RevStar is a dynamic team of engineers, UX designers, and product...Remote work
- A leading healthtech company in the United States is seeking a Senior Data and AI Engineer. This role involves building and optimizing data pipelines, along with BI deliverables and applied AI in healthcare contexts. The ideal candidate has 5+ years of data engineering...
- JOB DESCRIPTION: Required Qualifications 6+ years of hands-on experience in data engineering and analytics roles. Strong proficiency in SQL and experience with relational databases (MySQL, DB2, etc) Experience in Software development in Object oriented programming...Permanent employmentContract workLocal area
$120k - $180k
Mizuho Financial Group Inc. is looking for a Derivatives Analytics & AI Solutions Engineer in New York, NY. The role involves designing and delivering complex hedging and financing solutions utilizing AI-driven workflows. Ideal candidates will possess strong analytics and...- Cohere in New York is seeking a Software Engineer for Data Infrastructure. You will work on high-performance storage solutions for demanding AI workloads. Ideal candidates have 4+ years in data storage systems, strong Python skills, and Kubernetes experience. The role...Remote work
Do you want to receive more vacancies?
Subscribe and receive similar vacancies to AI Data Engineer. Be the first to apply!
- ai research engineer New York, NY
- ai developer New York, NY
- ai prompt engineer New York, NY
- ai engineer New York, NY
- senior ai engineer New York, NY
- ai ml engineer New York, NY
- ai engineer remote New York, NY
- machine learning ai engineer New York, NY
- remote data engineer New York, NY
- data engineer intern New York, NY

