AI Data Engineer
C the Signs
Position Summary

The Data Engineer will play a crucial role in developing and refining datasets specifically for our LLMs and machine learning models. This individual will be responsible for the entire data lifecycle, including gathering, cleaning, structuring, and optimizing large, diverse healthcare datasets. The ideal candidate will have a strong background in data engineering principles, experience with big data technologies, and a keen understanding of the unique challenges and requirements of healthcare data. You will design, build, and maintain scalable data pipelines that source, preprocess, and deliver high-quality, high-volume datasets to our machine learning engineers. This role requires a deep understanding of data engineering best practices coupled with specific knowledge of the data requirements for LLM training and refinement.

Key Responsibilities

- Collaborate with data scientists and machine learning engineers to understand data requirements for LLM and machine learning model fine-tuning.
- Design, build, and maintain scalable data pipelines to ingest, process, and store massive and diverse healthcare datasets.
- Implement robust data validation and monitoring to ensure the integrity, accuracy, and consistency of all training datasets.
- Implement robust data cleaning, validation, and transformation processes to ensure data quality and integrity.
- Develop and optimize data structures and schemas for efficient access and utilization by LLMs and machine learning models.
- Work with the team to identify and acquire new data sources, ensuring compliance with relevant healthcare regulations (e.g., HIPAA).
- Monitor data pipeline performance, troubleshoot issues, and implement optimizations to improve efficiency and reliability.
- Document data engineering processes, data models, and data dictionaries.
- Stay up to date with the latest advancements in data engineering, big data technologies, and machine learning.
Requirements

Required

- Bachelor's degree in Computer Science, Engineering, or a related field.
- Proven experience as a Data Engineer, with a focus on big data technologies.
- Strong proficiency in programming languages such as Python, Scala, or Java.
- Extensive experience with data warehousing, ETL processes, and data modeling.
- Experience with major cloud providers (e.g., AWS, GCP, Azure) and their data storage and processing services.
- Hands-on experience with big data frameworks like Apache Spark for distributed processing.
- Excellent problem-solving skills and the ability to work independently and as part of a team.
- Strong communication and interpersonal skills.

Preferred

- Master's degree in a related field.
- Experience with healthcare data and a good understanding of healthcare data standards (e.g., FHIR, HL7).
- Familiarity with machine learning concepts and LLM fine-tuning processes.
- Experience with data orchestration tools (e.g., Apache Airflow).

Work Authorization

Must be a US citizen, a Green Card holder, or currently in the US on a valid H-1B visa.

Why Join Us?

Joining C the Signs is not just about building AI; it’s about shaping the future of healthcare. If you are a technical leader with an unshakable belief in the power of AI to save lives and the ability to make it happen at scale, this is your opportunity to create a tangible, global impact.

Benefits

- Competitive salary and benefits package.
- Flexible working arrangements (remote or hybrid options available).
- The opportunity to work on life-changing AI technology that directly impacts patient outcomes.
- Join a team that combines cutting-edge innovation with a mission to save lives and improve health equity.
- Continuous learning opportunities with access to the latest tools and advancements in AI and healthcare.


