Data Engineer
InterSources Inc
Location: On-Site, Washington, DC
Interview Type: Phone Screening followed by In-Person
Duration: 12+ months
The Data Engineer designs, builds, and operates batch and streaming data pipelines and curated data products on the Enterprise Data Platform (EDP) using Databricks and Apache Spark. This role is hands-on in Python and R, enabling scalable engineering workflows while supporting analytics and research use cases. The engineer partners with product, architecture, governance, and mission teams to deliver secure, performant, observable pipelines and trusted datasets.
Requirements
The candidate shall possess the knowledge and skills set forth in the Technical Services BOA, Section 3.6.4.2 for labor category Information Data Engineer. The candidate shall also demonstrate the below knowledge and experience:
- Strong proficiency in Python and R for data engineering and analytical workflows.
- Hands-on experience with Databricks and Apache Spark, including Structured Streaming (watermarking, stateful processing concepts, checkpointing, exactly-once/at-least-once tradeoffs).
- Strong SQL skills for transformation and validation.
- Experience building production-grade pipelines: idempotency, incremental loads, backfills, schema evolution, and error handling.
- Experience implementing data quality checks and validation for both batch and event streams (late arrivals, deduplication, event-time vs. processing-time).
- Observability skills: logging/metrics/alerting, troubleshooting, and performance tuning (partitions, joins/shuffles, caching, file sizing).
- Proficiency with Git and CI/CD concepts for data pipelines, Databricks asset bundling, Databricks application deployments, and the Databricks CLI.
- Experience with lakehouse table formats and patterns (e.g., Delta tables) including compaction/optimization and lifecycle management.
- Familiarity with orchestration patterns (Databricks Workflows/Jobs) and dependency management.
- Experience with governance controls (catalog permissions, secure data access patterns, metadata/lineage expectations).
- Knowledge of message/event platforms and streaming ingestion patterns (e.g., Kafka/Kinesis equivalents) and sink patterns for serving layers.
- Experience collaborating with research/analytics stakeholders and translating analytical needs into engineered data products.
- Strong problem-solving and debugging across ingestion → transformation → serving.
- Clear technical communication and documentation discipline.
- Ability to work across product/architecture/governance teams in a regulated environment.
- Deep Delta Lake expertise including time travel, Change Data Feed (CDF), MERGE operations, CLONE, table constraints, and optimization techniques; understanding of liquid clustering and table maintenance best practices.
- Experience with Lakeflow/Delta Live Tables (DLT) including expectations framework, materialized vs. streaming table patterns, and declarative pipeline design.
- Proficiency with testing frameworks (pytest, Great Expectations, Deequ) and test-driven development practices for production data pipelines.
- Data modeling skills including dimensional modeling (star/snowflake schemas), medallion architecture implementation, and slowly changing dimension (SCD) pattern implementation.
- AWS data services experience including S3 optimization, IAM role configuration for data access, and CloudWatch integration; understanding of cost optimization patterns.
Education/Experience/Certifications/Accreditations:
- Bachelor's degree in a related field or equivalent experience.
- 10+ years of data engineering experience, including production Spark-based batch pipelines and streaming implementations.
- Desirable Certifications: Databricks Certified Apache Spark Developer Associate; Databricks Certified Data Engineer Associate or Professional; AWS Certified Developer Associate; AWS Certified Data Engineer Associate; AWS Certified Solutions Architect Associate.
The Contractor shall deliver, including but not limited to, the following:
- Build and maintain end-to-end pipelines in Databricks using Spark (PySpark) for ingestion, transformation, and publication of curated datasets.
- Implement streaming / near-real-time patterns using Spark Structured Streaming (or equivalent), including state management, checkpointing, and recovery.
- Design incremental processing, partitioning strategies, and data layout/file sizing approaches to optimize performance and cost.
- Develop reusable pipeline components (common libraries, parameterized jobs, standardized patterns) to accelerate delivery across domains.
- Develop and operationalize workflows in Python and R for data preparation, analysis support, and research-ready extracts.
- Package code for repeatable execution (dependency management, environment reproducibility, job configuration).
- Implement data quality controls for batch and streaming (schema enforcement, completeness/validity checks, late/duplicate event handling, reconciliation).
- Build pipeline observability: logging, metrics, alerting, and dashboards; support on-call/incident response and root-cause analysis.
- Create runbooks and operational procedures for critical pipelines and streaming services.
- Ensure secure handling of sensitive data and apply least-privilege principles in pipeline design and execution.
- Contribute lineage notes, dataset definitions, and operational documentation to support reuse and auditability.
- Use version control and CI/CD practices for notebooks/code (code reviews, automated testing where feasible, deployment/promotion across environments).
- Collaborate with stakeholders to refine requirements, define SLAs, and deliver incrementally with measurable outcomes.
- Implement Lakeflow/Delta Live Tables (DLT) pipelines with data quality expectations, materialized views, and streaming tables; design pipeline DAGs and maintain declarative ETL workflows.
- Design and implement medallion architecture patterns (Bronze/Silver/Gold) with appropriate data quality gates, schema evolution strategies, and layer-specific optimization techniques (OPTIMIZE, VACUUM, Z-ordering/liquid clustering).
- Develop and maintain comprehensive testing strategies including unit tests for transformation logic, integration tests for end-to-end pipelines, and data quality validation using frameworks like Great Expectations or Deequ.
- Perform data modeling and schema design for dimensional models, slowly changing dimensions (SCD), and analytical structures; collaborate on entity definitions and grain decisions.
- Contribute to Unity Catalog governance by registering datasets with metadata/descriptions/tags, implementing row/column-level security where required, and maintaining accurate lineage information.

