Sign up to access all features of our service.
  • Job search
  • Favorites
  • Create a CV
    New
  • Salaries
  • Subscriptions

AI Data Engineer

C the Signs

Position Summary

The Data Engineer will play a crucial role in developing and fine-tuning data specifically for our LLMs and machine learning models. This individual will be responsible for the entire data lifecycle, including gathering, cleaning, structuring, and optimizing large, diverse healthcare datasets. The ideal candidate will have a strong background in data engineering principles, experience with big data technologies, and a keen understanding of the unique challenges and requirements of healthcare data.

You will design, build, and maintain scalable data pipelines that source, preprocess, and deliver high-quality, high-volume datasets to our machine learning engineers. This role requires a deep understanding of data engineering best practices coupled with specific knowledge of the data requirements for LLM training and refinement

Key Responsibilities
  • Collaborate with data scientists and machine learning engineers to understand data requirements for LLM and machine learning model fine-tuning.
  • Design, build, and maintain scalable data pipelines to ingest, process, and store massive and diverse healthcare datasets.
  • Implement robust data validation and monitoring to ensure the integrity, accuracy, and consistency of all training datasets.
  • Implement robust data cleaning, validation, and transformation processes to ensure data quality and integrity.
  • Develop and optimize data structures and schemas for efficient access and utilization by LLMs and machine learning models.
  • Work with the team to identify and acquire new data sources, ensuring compliance with relevant healthcare regulations (e.g., HIPAA).
  • Monitor data pipeline performance, troubleshoot issues, and implement optimizations to improve efficiency and reliability.
  • Document data engineering processes, data models, and data dictionaries.
  • Stay up-to-date with the latest advancements in data engineering, big data technologies, and machine learning.

Requirements

Required
  • Bachelor's degree in Computer Science, Engineering, or a related field.
  • Proven experience as a Data Engineer, with a focus on big data technologies.
  • Strong proficiency in programming languages such as Python, Scala, or Java.
  • Extensive experience with data warehousing, ETL processes, and data modeling.
  • Experience with major cloud providers (e.g., AWS, GCP, Azure) and their data storage and processing services.
  • Hands-on experience with big data frameworks like Apache Spark for distributed processing.
  • Excellent problem-solving skills and the ability to work independently and as part of a team.
  • Strong communication and interpersonal skills.
Preferred
  • Master's degree in a related field.
  • Experience with healthcare data and a good understanding of healthcare data standards (e.g., FHIR, HL7).
  • Familiarity with machine learning concepts and LLM fine-tuning processes.
  • Experience with data orchestration tools (e.g., Apache Airflow).
Work Authorization:
  • Must be a US Citizen, Green Card holder, or currently in the US have valid H1B visa

Benefits

Why Join Us?

Joining  C the Signs  is not just about building AI; it’s about shaping the future of healthcare. If you are a technical leader with an unshakable belief in the power of AI to save lives and the ability to make it happen at scale, this is your opportunity to create a tangible, global impact.

Benefits:

  • Competitive salary and benefits package.
  • Flexible working arrangements (remote or hybrid options available).
  • The opportunity to work on life-changing AI technology that directly impacts patient outcomes.
  • Join a team that combines cutting-edge innovation with a mission to save lives and improve health equity.
  • Continuous learning opportunities with access to the latest tools and advancements in AI and healthcare.
Vacancy posted 8 days ago
Similar jobs that could be interesting for youBased on the AI Data Engineer in United States vacancy
  • $167.2k - $209k

    A pioneering cloud service provider in Seattle seeks a Senior Engineer 2 for its AI Inference Data Plane team. This role requires designing and delivering high-scale, resilient data services. Responsibilities include technical leadership, system design, performance optimization... 
    Suggested
    Remote work

    DigitalOcean

    Seattle, WA
    3 days ago
  • $167.2k - $209k

    A leading cloud service provider is seeking a Senior Engineer 2 for their AI Inference Data Plane team. This remote role focuses on designing and developing high-scale, resilient data plane services that enhance AI-driven applications. The ideal candidate will have strong... 
    Suggested
    Remote work

    DigitalOcean

    San Francisco, CA
    3 days ago
  • $106k - $176k

     ...Job Family : Data Science Consulting Travel Required : Up to 25% Clearance Required : Active Secret Guidehouse is seeking an AI Engineer to support the design, development, validation, and deployment of production-grade AI-enabled applications... 
    Suggested
    Full time
    Temporary work
    Flexible hours

    Guidehouse

    Chicago, IL
    7 days ago
  • $125k - $150k

     ...processing physiological and environmental data to enable real-time insights and decision...  ...: ~ Bachelor’s Degree in relevant engineering or science discipline required...  ...~3+ years of experience in data science, AI/ML, or analytics ~ Experience with Python... 
    Suggested
    Contract work
    Temporary work

    KIHOMAC

    Aberdeen Proving Ground, MD
    6 days ago
  •  ...Preferred) or Remote (U.S./LATAM) Type: Contract-to-Hire Level: Mid-Senior About the Role We're hiring an AI & Data Platform Engineer to build AI-powered systems that automate RFP discovery, document analysis, and qualification workflows. You'll work... 
    Suggested
    Contract work
    Remote work

    G2i Inc.

    Huston, ID
    5 days ago
  • $123.4k - $176.3k

     ...outcomes. Specialty Networks' PPS Analytics platform analyzes data from electronic medical records (EMR), practice management,...  ...business hours (9a-5p EST). Job Summary The Senior Data/AI Engineer is a senior individual contributor on the Data & AI Engineering... 
    Temporary work
    Local area
    Immediate start
    Remote work
    Flexible hours

    Cardinal Health

    Lincoln, NE
    5 days ago
  •  ...Title: Data AI Engineer with Vector Databases Location: Plano, TX Job Description: Key Responsibilities: # Design and build ETL/ELT pipelines and data processing workflows # Develop batch and real-time data pipelines using modern frameworks... 

    Apex Informatics

    Plano, TX
    3 days ago
  •  ...Summary: We are seeking a highly skilled and motivated I Data Engineer with hands-on experience in Langchain , large data sets ,...  ...design, develop, and maintain scalable data pipelines, enable AI model integration, and manage large-scale datasets in cloud environments... 
    Hourly pay

    Macpower Digital Assets Edge

    Bettendorf, IA
    3 days ago
  • $149k - $223k

     ...Tampa, Florida, or at one of the Firm's offices in D.C, Dallas, or Atlanta. General Description: We are seeking an AI and Data Governance Engineer to join our team. The AI and Data Governance (AIDG) Engineer is responsible for implementing, configuring, and... 
    Temporary work
    Work at office

    Holland & Knight

    Washington DC
    3 days ago
  • Insight Global is seeking a Senior AI Engineer in Chicago, Illinois. This role involves working on Snowflake LLM observability and developing...  ..., proficiency in Python and SQL, and familiarity with cloud data engineering on AWS. We value diversity and encourage an inclusive... 

    Insight Global

    Chicago, IL
    3 days ago
  • $95k - $159k

     ...Job Description Summary The Senior Data Engineer designs and builds the AWS-native data foundation behind our enterprise AI applications - knowledge graphs, semantic layers, retrieval corpora, and the pipelines that keep them trustworthy. This role leads both the design... 
    Permanent employment
    Contract work
    Remote work
    Visa sponsorship
    Work visa
    Relocation package

    GE Aerospace

    San Ramon, CA
    6 days ago
  • $105k - $110k

     ...Sr. AI Data Engineer (Image Generation Data) iSoftStone, Inc. is seeking a Sr. AI Data Engineer (Image Generation Data) to join our team! This is a contract onsite opportunity in Menlo Park, CA. This is a one-year contract role, and candidates must have permanent authorization... 
    Permanent employment
    Contract work
    Temporary work
    For contractors
    Remote work

    iSoftStone

    Menlo Park, CA
    5 days ago
  • $108.8k - $191.82k

     ...Description: We are seeking a Senior AI Data Engineer to support mission-critical, classified programs. This role involves designing and maintaining secure, scalable data infrastructure that enables advanced AI/ML capabilities in highly controlled environments. The ideal... 
    Full time
    Temporary work
    Work experience placement
    Work at office
    Remote work
    Relocation
    Flexible hours
    Shift work
    3 days per week

    Lockheed Martin Corporation

    Fort Worth, TX
    4 days ago
  •  ...Data & AI Engineer We are looking for a candidate who has - Minimum 6 years of hands-on experience in data engineering roles, preferably in eCommerce, digital marketing, or consumer-facing applications - Strong knowledge in data modelling and propose best practices... 

    Intellisoft Technologies

    Bellevue, WA
    2 days ago
  •  ...Data & AI Senior Engineer Location: Washington, DC, Chicago, IL, Wilmington, DE, Philadelphia, PA Company: Amtrak Your success is a train ride away! As we move America's workforce toward the future, Amtrak connects businesses and communities across the country. We... 
    Hourly pay
    Temporary work
    Work experience placement
    Remote work
    Flexible hours

    Amtrak

    Philadelphia, PA
    2 days ago
  •  ...Role: AI & Data Engineer Location: Remote US, light travel may be required Employment Type: Contract-to-Hire (6 Months) Top Skills: RAG, Snowflake, SQL/Oracle, JSON/APIs, Python/Java, AWS S3, CI/CD (Git/GitHub) Preferred Skills: Openpages, enterprise security and governance... 
    Contract work
    Remote work

    Snowrelic Inc

    New York, NY
    1 day ago
  • $300k

     ...Data Engineer III Mindbank Consulting Group is seeking a Top Secret-cleared Data Engineer III to build and operationalize secure, mission...  ...initiatives. This role focuses on enabling real-time analytics and AI-driven decision-making in classified and DDIL environments.... 
    Work at office

    Navstar

    Monterey, CA
    4 days ago
  • $126.5k - $208.7k

     ...Imagine loving what you do and where you do it. Job Category Data Analytics, Data Science, Technology Compensation Overview...  ...3 What Is the Opportunity? Travelers is seeking a (Gen AI) Data Engineer II to join our dynamic team. In this role, you will be... 
    Work experience placement
    H1b
    Local area

    Travelers Insurance

    Atlanta, GA
    5 days ago
  •  ...We are seeking a highly skilled and experienced AI Big Data Engineer to design, develop, and optimize large-scale data processing systems. Position requires 3-days/week onsite in Rockville, MD or Tysons Corner, VA In this role, you will work closely with cross... 
    Work experience placement
    3 days per week

    Experis/Manpower Group

    Rockville, MD
    2 days ago
  • $95k - $154k

     ...roles employers repeatedly hire for: Java full stack, software programming, Python/Java development, DevOps, data analyst, data engineer, data scientist, and ML/AI engineer. In other words, the program builds candidates across Java/Full Stack/DevOps and Data Analytics/... 
    Full time
    H1b

    SynergisticIT

    Seattle, WA
    2 days ago
  • $125k - $175k

     ...Summary Join Expedient's AI CTRL product team as a Data Engineer, where you'll transform complex, unstructured datasets into clean, AI-ready data for enterprise clients. Working on-site in Cleveland, you'll play a critical role in ensuring client data drives accurate... 
    Full time
    Work at office

    Expedient

    Cleveland, OH
    5 days ago
  • $160k - $200k

     ...ID, NV, AZ, CO, KS, AR, LA, AL, GA, FL, SC, TN, VA, MD, NJ, DE, IL, WI, MI, OH, MA, PA, NH, CT Cortica is looking for a Senior AI Data Engineer to join its growing team! The Senior AI Data Engineer will serve as both architect and builder of our data ecosystem. Every... 
    Remote work

    Cortica

    San Diego, CA
    1 day ago
  •  ...Senior Data & AI Engineer We are seeking a highly experienced Senior Data & AI Engineer to design, build, and scale modern data platforms and AI-driven solutions. This role sits at the intersection of data engineering, machine learning, and generative AI, with a strong... 

    Think Consulting

    Richmond, VA
    1 day ago
  • $74.74 - $83.04 per hour

     ...communities. You will play a pivotal role in shaping the future of data-driven decision-making, directly impacting how our interactive...  ...adaptability to evolving needs. * Leverage modern data engineering practices and frameworks with an object-oriented approach to architect... 
    Hourly pay
    Temporary work
    Work experience placement
    Worldwide
    Flexible hours

    Aquent

    Redmond, WA
    1 day ago
  •  ...Job Details: Job Role: AI Data Engineer Location: New York, NY 10010 Duration: Full-Time job Must Have Technical/Functional Skills 10+ years of experience building large-scale distributed systems + strong experience with LLM systems, agentic workflows... 
    Full time

    The Judge Group, LLC

    New York, NY
    4 days ago
  • $140k - $180k

     ...Workers' Comp experience. Our culture is the engine that drives this mission forward—a culture...  ...speeds up return-to-work and generative AI for claims management, we're redefining...  ...teams to design, build, and maintain our data ecosystem including ETL pipelines, data lakes... 
    Full time
    Remote work

    KINETIC

    New York, NY
    3 days ago
  •  ...Job Posting Work alongside of our Data Scientist to deploy automated model pipelines for validation and deployment. Deliver automated...  ...production. Support the lifecycle management of deployed AI solutions Collaborate with stakeholders and communicate changes... 

    Omni Inclusive

    Tampa, FL
    3 days ago
  •  ...AI Engineer-Machine Learning We are seeking a highly skilled Agentic AI Data Engineer to design, build, and optimize intelligent, autonomous data systems that power next-generation AI applications. This role blends data engineering, machine learning infrastructure,... 

    EXL

    Jersey City, NJ
    2 days ago
  •  ...Description SAIC is seeking an AI Data Engineer to join our team at Fort Belvoir, Virginia. The AI Data Engineer will design, develop, and maintain data pipelines and architectures to support AI/ML workloads for the Army Intelligence & Security Enterprise (AISE... 

    SAIC

    Fort Belvoir, VA
    1 day ago
  •  ...Senior Data Engineer Travelers Data Engineering team constructs pipelines that contextualize and provide easy access to data by the entire...  ...designing and deploying production-grade, multi-agent AI applications using agent orchestration frameworks and runtimes... 
    Work experience placement
    Local area

    Travelers

    Hartford, CT
    5 days ago

Do you want to receive more vacancies?

Subscribe and receive similar vacancies to AI Data Engineer. Be the first to apply!