Senior Platform Data Engineer
Geisinger Health System
Senior Platform Data Engineer
The Senior Platform Data Engineer owns roadmap, priorities, platform standards, and architecture reviews; provides formal input on performance reviews. This position makes clinical data ready for AI at scale: owning the shared data products, retrieval infrastructure, and platform administration that the entire AI portfolio depends on. Owns Real-time data feeds. Reusable clinical data models and feature pipelines. RAG retrieval infrastructure (ingestion, chunking, embeddings, vector DB, retrieval pipelines). Databricks platform administration.
Job Duties
- Streams data from Epic SDE, ADT feeds, lab results, and other clinical sources into Databricks for downstream model consumption.
- Curates shared clinical feature tables (patient demographics, labs, vitals, diagnoses, utilization history, imaging metadata) in Databricks/Unity Catalog that multiple AI programs consume for model training, validation, and monitoring.
- Owns RAG Infrastructure, the shared retrieval-augmented generation platform that agentic and generative AI programs use to ground LLM outputs in organizational knowledge.
- Designs and operates document ingestion pipelines: normalizing clinical documents, policies, guidelines, and unstructured data sources into formats ready for embedding and retrieval.
- Implements and optimizes chunking strategies tailored to healthcare content (e.g., preserving clinical note structure, section-aware chunking for guidelines and protocols).
- Manages the embedding pipeline: selecting, tuning, and versioning embedding models (domain-specific clinical models where they outperform general-purpose).
- Administers the vector database: schema design, indexing, metadata management, access controls, and performance tuning.
- Builds and maintains retrieval pipelines: hybrid search (vector + keyword/BM25), reranking, and relevance filtering to maximize retrieval precision for downstream agents and LLM applications.
- Establishes data quality gates for RAG: automated profiling, completeness checks, and accuracy scoring before content enters the vector store.
- Monitors retrieval quality metrics (View email address on click.appcast.io, View email address on click.appcast.io, MRR) and continuously optimize retrieval performance.
- Databricks workspace configuration and Unity Catalog governance.
- Cluster policies, compute management, and cost monitoring.
- Manages user/group management and access control.
- Administrator for Feature Store.
Work is typically performed in an office environment. Accountable for satisfying all job specific obligations and complying with all organization policies and procedures. The specific statements in this profile are not intended to be all-inclusive. They represent typical elements considered necessary to successfully perform the job.
*Relevant experience may be a combination of related work experience and degree obtained (Master's Degree = 2 years).
Key Technologies
- Databricks (Delta Live Tables, Feature Store, PySpark, Unity Catalog)
- Epic SDE / epic-ws for real-time clinical data extraction
- Vector databases (Pinecone, Weaviate, Qdrant, or Databricks Vector Search)
- Embedding models and pipelines (clinical domain-specific and general-purpose)
- SQL, pandas
- Streaming and batch ingestion patterns
- CDIS Data Warehouse (source system for batch clinical data)
Required Skills & Qualifications
- 5+ years in data engineering, with strong experience building both batch and streaming data pipelines
- Expert-level Databricks skills: Delta Live Tables, PySpark, Unity Catalog, Feature Store
- Hands-on experience with real-time data ingestion (Kafka, Spark Structured Streaming, or comparable frameworks)
- Strong SQL and Python (pandas, PySpark) skills for data transformation and feature engineering
- Experience administering Databricks workspaces: cluster policies, compute management, access controls, cost monitoring
- Familiarity with clinical data models and healthcare data sources (EHR extracts, ADT feeds, lab results, claims data) strongly preferred
- Experience with Epic data extraction methods (SDE, FHIR, epic-ws) a significant plus
- Understanding of data governance principles: lineage, quality monitoring, access controls
Education
Bachelor's Degree-Related Field of Study (Required), Master's Degree-Related Field of Study (Preferred)
Experience
Minimum of 5 years-Relevant experience* (Required)
Certification(s) and License(s):
Skills:
OUR PURPOSE & VALUES: Everything we do is about caring for our patients, our members, our students, our Geisinger family and our communities.
- KINDNESS: We strive to treat everyone as we would hope to be treated ourselves.
- EXCELLENCE: We treasure colleagues who humbly strive for excellence.
- LEARNING: We share our knowledge with the best and brightest to better prepare the caregivers for tomorrow.
- INNOVATION : We constantly seek new and better ways to care for our patients, our members, our community, and the nation.
- SAFETY: We provide a safe environment for our patients and members and the Geisinger family.
We offer healthcare benefits for full time and part time positions from day one, including vision, dental and domestic partners. Perhaps just as important, we encourage an atmosphere of collaboration, cooperation and collegiality.
We know that a diverse workforce with unique experiences and backgrounds makes our team stronger. Our patients, members and community come from a wide variety of backgrounds, and it takes a diverse workforce to make better health easier for all. We are proud to be an affirmative action, equal opportunity employer and all qualified applicants will receive consideration for employment regardless to race, color, religion, sex, sexual orientation, gender identity, national origin, disability or status as a protected veteran.
- ...Affirm is looking for an experienced Data Platform Architect to build and manage scalable analytics infrastructure. You'll work on architecting... ...performance. With over 10 years of experience in data engineering, you'll collaborate with product and engineering teams, mentoring...SeniorRemote workFlexible hours
- A leading technology services company is seeking a Senior Data Engineer to join their client’s Data Platform team. This fully remote role requires strong expertise in data engineering, with at least 7 years of experience in building production-grade data services. Candidates...SeniorRemote work
- ...Affirm in San Jose, CA is seeking a Senior Analytics Engineer to architect and evolve its lakehouse analytics platform. This role involves designing secure data access protocols, leading analytics engineering practices, and collaborating with cross-functional teams to...SeniorRemote work
- ...Portland, Oregon, is seeking an experienced Data Architect to lead the development of their lakehouse analytics platform. This role requires deep expertise in Snowflake... ...will have over 10 years of software or data engineering experience, focusing on creating scalable solutions...SeniorRemote work
- ...A leading real estate platform in the United States seeks a Senior Software Engineer, Big Data to shape their real-time data platform. The role involves designing scalable streaming infrastructure, collaborating across teams, and leading modernization initiatives. Candidates...SeniorRemote work
- ...Affirm is seeking a data engineer with 10+ years of experience to architect its lakehouse analytics platform. The ideal candidate will possess deep Snowflake expertise and a strong background in data governance and analytics engineering. This role involves driving technical...SeniorRemote workFlexible hours
$143k - $196.9k
...A cloud security company is seeking a Software Engineer to design and develop features for their data platform. Ideal candidates will have over 5 years of experience and strong leadership capabilities in large-scale data processing systems. The role includes mentoring...SeniorFull timeRemote work- ...A cryptocurrency data company is seeking a Data Engineer to design and maintain scalable data pipelines. This remote role involves ensuring data integrity and managing data infrastructure. The ideal candidate should have 5-8 years of experience in data engineering and...SeniorRemote work
- ...sustainable technology startup focused on tackling waste is looking for a remote software engineer. The role involves building and scaling a robust image capture and analysis platform. Candidates should have over 5 years of programming experience, proficiency in Python...SeniorRemote work
$125k - $200k
A leading data platform company is seeking an experienced Software Engineer (Data Platform) to build and scale data systems and APIs. This role entails developing and optimizing data pipelines with technologies such as Python, PySpark, and Databricks. Ideal candidates will...SeniorFull timeRemote work- ...ZEPZ TECHNOLOGY SERVICES LIMITED is seeking a Senior Data Engineer to lead the development of our data platform. This is a pivotal position focused on data governance and management for our cross-border remittances business. Key responsibilities include managing data modeling...SeniorRemote work
$260k - $310k
...Affirm in Los Angeles is looking for a Senior Data Engineer to architect and evolve their lakehouse analytics platform. Candidates should have over 10 years of experience in data engineering, particularly with Snowflake, Apache Iceberg, and Spark. The role includes guiding...SeniorRemote workFlexible hours- ...Affirm is seeking an experienced Data Engineer to lead the design and implementation of their lakehouse analytics platform. The role involves significant responsibilities in architecting Snowflake's analytical infrastructure, mentoring teams, and driving data governance...SeniorRemote work
- ...A healthcare technology company is seeking a Senior Data Infrastructure Engineer to design, build, and optimize data platforms within a cloud environment. Key responsibilities include architecting cloud-native infrastructure and ensuring high availability and reliability...Senior
$145k - $165k
...A leading financial services company is looking for a Staff Data Engineer to design and build robust data platforms supporting analytics and reporting. The ideal candidate will have over 10 years of experience in data engineering, strong SQL and programming skills, and...SeniorRemote work- 1Password is seeking a Senior Data Platform Engineer to help build a next-generation data platform. In this role, you will design, implement, and maintain a streaming-first data system, ensuring data availability for analytics and business use. The ideal candidate will...SeniorRemote work
- A healthcare technology company is seeking a Sr. Data Platform Engineer to drive cloud-native data solutions. This fully remote role involves designing data pipelines, leading migrations from SQL Server to PostgreSQL, and enhancing data quality frameworks. The ideal candidate...SeniorRemote work
- ...AI Chopping Block, Inc. is looking for a skilled data engineer in San Carlos, California, to design, build, and maintain large-scale data pipelines for training and evaluation of robotics foundation models. The ideal candidate will possess excellent software engineering...Senior
$232k - $282k
Affirm is seeking a seasoned data engineer to architect their lakehouse analytics platform in Las Vegas, NV. You will be responsible for Snowflake implementation, drive data governance, and ensure secure data access. With over 10 years experience required in data engineering...SeniorRemote work- ...Veeam is seeking a Senior Customer Success Engineer to enhance customer success across the Veeam Data Platform. In this role, you will configure accounts, identify migration opportunities, and conduct recovery simulations. With over 5 years in customer-facing engineering...Senior
- ...Sidecar Health is looking for a Software Engineer, Data to build large-scale data pipelines and optimize systems. This remote position, based in the United States, offers an opportunity to utilize your engineering skills to transform healthcare. The ideal candidate will...SeniorRemote work
- ...A healthcare technology company is seeking a Senior Software Engineer, Data Platform, to enhance and operate their cloud data platform. Responsibilities include building BigQuery data warehouses, enhancing real-time streaming capabilities, and collaborating with teams...SeniorRemote workFlexible hours
- ...Luxoft is seeking a talented Palantir Data Platform Engineer in the Philippines to design and maintain scalable data pipelines. The ideal candidate will possess over 5 years of Python programming experience and deep knowledge in data engineering practices, including CI...Senior
- ...Vistrada LLC is looking for a Senior Software Engineer/Consultant to join their team remotely. The successful candidate will collaborate with various teams to develop a secure data platform supporting an enterprise-scale answer engine product. Applicants should have a...SeniorRemote work
- ...PlayOn! Sports is seeking a Senior Software Engineer to design, build, and operate data services and APIs that power their brands. The role requires strong experience in Python and SQL, focusing on creating reliable data services with clear ownership. The ideal candidate...Senior
- ...CERTIFID is seeking a Data Engineer to design and maintain their data platform to prevent wire fraud in real estate transactions. You will build core data infrastructure, ensure the accuracy of business metrics, and implement automated systems. Ideal candidates will have...SeniorRemote workFlexible hours
- A leading digital marketing platform in Utah is seeking an experienced engineer to manage data for product and system development. This role involves collaboration across teams supporting a SaaS platform and requires strong T-SQL, SQL Server, and cloud knowledge. Enthusiastic...SeniorRemote work
$131.6k - $249.78k
...A leading data solutions firm is seeking a Remote Staff Data Engineer responsible for overseeing the Data Platform. The role calls for significant expertise in Databricks and AWS, along with strong leadership and data governance skills. With a focus on project management...SeniorRemote work- ...Senior Sap Hana Cloud Data Platform Engineer This role is responsible for the core enterprise data platform supporting financial, operational, and executive reporting across the organization. This is a full-time, on-site position based in Morgan Hill, CA. Anritsu Company...SeniorFull time
$190k - $220k
...A leading blockchain intelligence company is seeking a skilled data engineer to build reliable data services and develop complex data pipelines. The ideal candidate has over 5 years of experience with distributed system architecture and proficiency in Python and SQL. This...SeniorRemote work
Do you want to receive more vacancies?
Subscribe and receive similar vacancies to Senior Platform Data Engineer. Be the first to apply!
- platform engineering manager United States
- platform engineer United States
- client platform engineer United States
- platform developer United States
- data platform engineer United States
- senior platform engineer United States
- bi data engineer United States
- staff data engineer United States
- data visualization developer United States
- data science developer United States

