Sign up to access all features of our service.
  • Job search
  • Favorites
  • Create a CV
    New
  • Salaries
  • Subscriptions

AI Data Engineer

IntegriChain

IntegriChain is the data and application backbone for market access departments of Life Sciences manufacturers. We deliver the data, the applications, and the business process infrastructure for patient access and therapy commercialization. More than 250 manufacturers rely on our ICyte Platform to orchestrate their commercial and government payer contracting, patient services, and distribution channels. ICyte is the first and only platform that unites the financial, operational, and commercial data sets required to support therapy access in the era of specialty and precision medicine. With ICyte, Life Sciences innovators can digitalize their market access operations, freeing up resources to focus on more data-driven decision support. With ICyte, Life Sciences innovators are digitalizing labor-intensive processes – freeing up their best talent to identify and resolve coverage and availability hurdles and to manage pricing and forecasting complexity. We are headquartered in Philadelphia, PA (USA), with offices in: Ambler, PA (USA); Pune, India; and Medellín, Colombia. For more information, visit or follow us on Twitter @IntegriChain and LinkedIn. This role offers flexibility, but candidates must reside in Pennsylvania, New Jersey, or New York and be within a reasonable travel distance of our Philadelphia office, as regular in-person collaboration is required. Mission Join the Data Science team as an AI Data Engineer responsible for building the data foundations that make enterprise AI products accurate, explainable, and scalable. This role will design and implement Snowflake and dbt pipelines from raw source data to curated gold-layer datasets, create semantic models that LLM tools can use reliably, and partner with data science, product, and engineering teams to convert data dictionaries and business definitions into AI-ready data products. The ideal candidate is a strong data engineer with deep Snowflake/dbt experience and a practical understanding of how semantic layers, ER relationships, denormalized models, and metadata quality influence LLM and agent performance. Position Overview Snowflake and dbt engineering: Design, build, optimize, and operate Snowflake pipelines and dbt models across raw, curated, and gold-layer datasets. AI-ready semantic modeling: Create semantic models, relationships, metrics, dimensions, and curated views that allow LLM tools and agents to answer questions accurately. Data dictionary-driven delivery: Translate team-defined data dictionaries, business definitions, and source mappings into tested, governed, and reusable data products. Agent consumption focus: Design datasets for AI agents, natural-language analytics, Snowflake Cortex Analyst, and other LLM-powered tools. Enterprise data modeling: Balance normalized source models, ER relationships, dimensional models, denormalized consumption layers, and semantic-layer needs. Key Responsibilities Snowflake, dbt, and Data Pipeline Development Build reliable data pipelines from raw source data through curated silver layers and business-ready gold layers using Snowflake and dbt. Develop modular dbt models, tests, documentation, exposures, and lineage-friendly transformation patterns. Implement incremental processing, snapshots, audit columns, reconciliation, data quality checks, and restartable pipeline patterns. Optimize Snowflake SQL and dbt workloads for performance, scalability, cost, and maintainability. Work with orchestration and DevOps/SRE teams to support CI/CD, environment promotion, pipeline monitoring, and operational runbooks. Semantic Models and AI-Ready Data Products Create Snowflake semantic models and curated views that support accurate natural-language querying through Snowflake Cortex Analyst and related LLM tools. Translate approved data dictionaries into semantic model dimensions, facts, metrics, synonyms, descriptions, relationships, and business rules. Design ER relationships and join paths that are explicit, accurate, and easy for semantic-layer tools and AI agents to use. Create denormalized or consumption-optimized models where appropriate to reduce ambiguity and improve LLM answer quality. Partner with AI developers to understand tool schema needs, agent workflows, and how data model design affects LLM tool performance. Data Modeling, Integration, and Consolidation Design logical and physical models that support enterprise data consolidation, analytical reporting, AI workflows, and business operations. Work across source systems, files, APIs, cloud storage, operational systems, and analytical platforms to integrate data into Snowflake. Create reusable patterns for source-to-target mapping, schema evolution, master/reference data alignment, and data product publishing. Collaborate with business and technical stakeholders to validate data definitions, grain, relationships, hierarchies, and measures. Support data consolidation across Integrichain by rationalizing overlapping datasets and aligning enterprise definitions. Snowflake Cortex and AI Platform Enablement Understand Snowflake Cortex capabilities, including Cortex Analyst, Cortex Complete, semantic views/models, and metadata-driven AI workflows. Prepare data models and semantic layers for accurate LLM usage, including clear naming, descriptions, relationships, metrics, and governance metadata. Support AI Explorer and similar applications by ensuring curated datasets are reliable, performant, explainable, and governed. Partner with AI and application teams to troubleshoot semantic model issues, poor AI answers, ambiguous joins, missing metadata, or incorrect measures. Contribute to standards for AI-ready data design, semantic model review, data dictionary alignment, and LLM-friendly data modeling. Qualifications 6+ years of experience in data engineering, analytics engineering, database engineering, or data platform development in production environments. Strong hands-on experience with Snowflake, including SQL development, performance tuning, security-aware design, cost optimization, and large-volume processing. Strong hands-on experience with dbt or comparable ELT tooling, including models, tests, documentation, lineage, and environment promotion. Experience building raw-to-curated-to-gold data pipelines and business-ready datasets. Strong SQL and Snowflake development skills, including complex transformations, views, stored procedures/Snowflake Scripting, and query optimization. Experience creating semantic layers, semantic models, metrics, dimensions, relationships, and curated analytical views. Good understanding of ER modeling, dimensional modeling, denormalized consumption models, and data grain management. Experience translating data dictionaries and business definitions into physical models, dbt models, and semantic-layer definitions. Understanding of Snowflake Cortex capabilities such as Cortex Analyst, Cortex Complete, and semantic-model-driven natural-language querying. Ability to partner with data science, product, engineering, and business teams to deliver AI-ready data products. Preferred Experience Experience in life sciences, healthcare, pharma commercialization, MDM, patient data, channel data, or commercial data platforms. Experience with Snowflake semantic views, Cortex Analyst, Cortex Search, or other AI/LLM data platform capabilities. Experience with data quality frameworks, metadata management, data observability, and lineage tooling. Experience with orchestration tools such as dbt Cloud jobs, Airflow, Dagster, cloud-native schedulers, or similar platforms. Experience with Python for data automation, metadata processing, testing, or API integrations. Experience designing governed data products for BI, AI/ML, natural-language analytics, or agentic applications. Snowflake SnowPro, dbt certification, or equivalent data engineering credentials. What does IntegriChain have to offer? Mission driven: Work with the purpose of helping to improve patients' lives! Excellent and affordable medical benefits + non-medical perks including Student Loan Reimbursement, Flexible Paid Time Off and Paid Parental Leave 401(k) Plan with a Company Match to prepare for your future Robust Learning & Development opportunities including over 700+ development courses free to all employees IntegriChain is committed to equal treatment and opportunity in all aspects of recruitment, selection, and employment without regard to race, color, religion, national origin, ethnicity, age, sex, marital status, physical or mental disability, gender identity, sexual orientation, veteran or military status, or any other category protected under the law. IntegriChain is an equal opportunity employer; committed to creating a community of inclusion, and an environment free from discrimination, harassment, and retaliation. Our policy on visa sponsorship for US based positions: Applicants for employment in the US must have valid work authorization that does not now and/or will not in the future require sponsorship of a visa for employment authorization in the US by IntegriChain. #J-18808-Ljbffr IntegriChain

Vacancy posted 4 days ago
Similar jobs that could be interesting for youBased on the AI Data Engineer in New York, NY vacancy
  •  ...AriesView, a dynamic Real Estate AI startup in Boston, is looking for a Data & AI Software Engineer intern to support research into AI technologies. The intern will engage in implementing AI-driven workflows and contribute to the company's technology roadmap. The role... 
    Suggested
    Internship
    Remote work

    Feedinkoo

    New York, NY
    2 days ago
  • $160k - $180k

     ...The Data & AI Engineer sits within Carlyle’s Enterprise Technology & Data organization and supports firm‑wide data and AI initiatives spanning investment platforms, portfolio operations, investor relations, and corporate functions. The role operates within a federated... 
    Suggested
    Work at office

    The Carlyle Group

    New York, NY
    2 days ago
  • $130k - $150k

     ...Senior AI Data Engineer (Agentic Systems) Location: USA / Europe / Israel - with a 5-hour overlap with EST hours Compensation: $130K - $150K We are hiring on behalf of our client who builds the technology that powers safer, more accessible financial markets. Our risk... 
    Suggested

    MLabs

    New York, NY
    3 days ago
  •  ...LATAM) Language Requirements: Advanced English (required), Intermediate Spanish (desired) About the Role We are looking for a AI Data Engineer to support several initiatives within our Data Management team , focused on Artificial Intelligence, data processing, and cloud... 
    Suggested
    Remote work

    Inclusion Cloud

    New York, NY
    4 days ago
  • A dynamic tech company in New York is seeking a Senior Engineer, Data & AI to develop and scale impactful features. The ideal candidate will have extensive experience in software engineering and AI, particularly in Python, SQL, and cloud infrastructure. Responsibilities... 
    Suggested

    Vendelux LLC.

    New York, NY
    5 days ago
  • Uma plataforma de recrutamento está contratando para um papel de engenharia que envolve tarefas de avaliação em NLP/ML, incluindo rotulagem de dados e QA. Os candidatos devem ter experiência em funções técnicas com forte capacidade analítica e atenção aos detalhes. A vaga...
    Remote job

    Rex.zone

    New York, NY
    3 days ago
  • A technology consulting firm in New York is seeking a skilled Data Engineer to support generative AI initiatives. You will design and maintain data infrastructure and pipelines for AI model deployment. The position requires a bachelor’s or master’s degree with 3+ years... 

    Inizio Partners Corp

    New York, NY
    3 days ago
  • $175k - $225k

    Cervin in New York is seeking a Senior Engineer, Data & AI to architect and implement AI-powered features for their platform. The ideal candidate will have over five years of relevant experience, strong skills in Python and SQL, and competency in building AI/ML-driven products... 
    Remote job
    Full time

    Cervin

    New York, NY
    3 days ago
  • Job Summary AriesView is an emerging Real Estate AI startup transforming how investors analyze and manage commercial real estate portfolios. They are seeking a Data & AI Software Engineer intern to support early‑stage research into AI technologies and contribute to the... 
    Remote job
    Internship

    Feedinkoo

    New York, NY
    2 days ago
  • A technology analytics firm based in New York is seeking an experienced data engineer to enhance data solutions. The ideal candidate will have at least 6 years in data engineering, proficient in SQL, and have experience in object-oriented programming languages like Python... 

    Cloud Analytics Technologies, LLC

    New York, NY
    1 day ago
  • We have partnered with a leading technology research organization to hire an AI Data Engineer. In this role, you will build scalable data pipelines, partner closely with Data Scientists and ML Engineers, and ensure the organization’s AI/ML models are fueled by high-quality... 
    Contract work
    Remote work

    ProSearch

    New York, NY
    3 days ago
  • NucleusTeq in New York is looking for a Data Engineer + Gen AI for a 12+ month hybrid role. You will design and build scalable data platforms while leading a team of skilled engineers. Ideal candidates should have 6+ years in software development, expertise in Google Cloud... 

    NucleusTeq

    New York, NY
    4 days ago
  • A leading financial services company is seeking a Data Engineer to build scalable data infrastructure and AI-enabled workflows. You will design, build, and optimize data pipelines while collaborating closely with teams to innovate solutions. Required skills include extensive... 

    Interactive Brokers Group, Inc.

    New York, NY
    4 days ago
  • $241k - $338k

    Biohub is seeking a Data Engineer to design and implement data pipelines for genomic and imaging data at scale. This role requires strong software engineering skills and experience with distributed computing frameworks. The ideal candidate will have 8+ years of experience... 

    Biohub

    New York, NY
    2 days ago
  • A CI está em busca de um Senior AI Data Engineer para transformar dados brutos em produtos confiáveis, utilizando inteligência artificial. O papel inclui desenvolver transformações com DBT e garantir a qualidade dos dados, colaborando com diversas equipes para criar solu... 

    CI

    New York, NY
    3 days ago
  • $101.49k - $147k

    Position Summary We have an exciting opportunity to join our team as a Sr. Engineer I, AI. NYU Langone Health seeks an experienced Data Science & AI Engineer to design, build, deploy, and govern enterprise‑grade AI and machine learning capabilities that support clinical... 

    NYU Langone Hospitals

    New York, NY
    3 days ago
  • Happy Scribe is seeking a Data Engineer in New York City, responsible for building and managing the data foundation of Glidepath. The ideal...  ...engineering, focusing on creating scalable data pipelines and powering AI-driven insights. This position offers a unique chance to work... 
    Local area

    Happy Scribe

    New York, NY
    5 days ago
  • Role: AI & Data Engineer Location: Remote US, light travel may be required Employment Type: Contract-to-Hire (6 Months) Top Skills: RAG, Snowflake, SQL/Oracle, JSON/APIs, Python/Java, AWS S3, CI/CD (Git/GitHub) Preferred Skills: Openpages, enterprise security and governance... 
    Contract work
    Remote work

    Snowrelic Inc

    New York, NY
    3 days ago
  •  ...seeks a Scientific Reasoning & Discovery Engineer to design high-quality datasets enhancing...  ...creating tasks that require comprehensive data analysis and collaboration for ensuring scientific...  ...to work fully remote on cutting-edge AI projects. #J-18808-Ljbffr Codefeast... 
    Remote job

    Codefeast Enterprises

    New York, NY
    2 days ago
  • $179.4k - $224.25k

    Scale AI, Inc. is looking for a Forward Deployed Engineer based in New York City to deliver critical data infrastructure for advanced AI models. This position involves working with leading AI companies and government agencies to solve complex data-related problems, and... 

    Scale AI, Inc.

    New York, NY
    3 days ago
  • ProfitSolvis a SaaS business services provider for the legal and accounting industry. We are looking for an AI Data Engineer to join our growing team! We are seeking a seasoned AI Data Engineer to support the building of a centralized data platform on AWS to unify data... 
    Remote job
    Work from home
    Day shift

    ProfitSolv

    New York, NY
    2 days ago
  • A leading technology research organization is seeking an AI Data Engineer to build scalable data pipelines and collaborate with data scientists on impactful AI initiatives. This is a fully remote role ideal for someone with at least 4 years of Data Engineering experience... 
    Remote job

    ProSearch

    New York, NY
    3 days ago
  • Wipro is seeking AI/ML specialists to design and optimize models for data analysis and NLP applications. Ideal candidates should be IIT graduates with strong skills in Python, TensorFlow, and experience with NLP and generative AI models. Responsibilities include building... 

    Wipro

    New York, NY
    3 days ago
  • A tech solutions company is seeking an experienced AI and Data Engineer to design and deploy AI-driven solutions and enterprise data integrations. The role requires experience in machine learning, building data pipelines, and integrating systems while ensuring compliance... 
    Remote job
    Contract work

    Snowrelic Inc

    New York, NY
    3 days ago
  • The Carlyle Group is seeking a Senior Data & AI Engineer to build and operate AI-ready data pipelines and core AI products. This role requires in-depth expertise in data engineering and applied AI engineering, collaborating with data science teams to deliver enterprise... 

    The Carlyle Group

    New York, NY
    2 days ago
  • US-based Software Development firm seeking a Data & AI Engineer to join our growing remote team! In this role, you will support our ongoing software projects and help plan and execute new projects. Summary RevStar is a dynamic team of engineers, UX designers, and product... 
    Remote work

    RevStar Consulting Inc

    New York, NY
    3 days ago
  • A leading healthtech company in the United States is seeking a Senior Data and AI Engineer. This role involves building and optimizing data pipelines, along with BI deliverables and applied AI in healthcare contexts. The ideal candidate has 5+ years of data engineering... 

    Plume

    New York, NY
    3 days ago
  • JOB DESCRIPTION: Required Qualifications 6+ years of hands-on experience in data engineering and analytics roles. Strong proficiency in SQL and experience with relational databases (MySQL, DB2, etc) Experience in Software development in Object oriented programming... 
    Permanent employment
    Contract work
    Local area

    Robotics Technologies LLC

    New York, NY
    3 days ago
  • $120k - $180k

    Mizuho Financial Group Inc. is looking for a Derivatives Analytics & AI Solutions Engineer in New York, NY. The role involves designing and delivering complex hedging and financing solutions utilizing AI-driven workflows. Ideal candidates will possess strong analytics and... 

    Mizuho Financial Group Inc.

    New York, NY
    3 days ago
  • Cohere in New York is seeking a Software Engineer for Data Infrastructure. You will work on high-performance storage solutions for demanding AI workloads. Ideal candidates have 4+ years in data storage systems, strong Python skills, and Kubernetes experience. The role... 
    Remote work

    Cohere

    New York, NY
    5 days ago

Do you want to receive more vacancies?

Subscribe and receive similar vacancies to AI Data Engineer. Be the first to apply!