Data Engineer

InterSources Inc

Location: On-Site, Washington, DC

Interview Type: Phone Screening followed by In-Person

Duration: 12+ months

The Data Engineer designs, builds, and operates batch and streaming data pipelines and curated data products on the Enterprise Data Platform (EDP) using Databricks and Apache Spark. This role is hands-on in Python and R, enabling scalable engineering workflows while supporting analytics and research use cases. The engineer partners with product, architecture, governance, and mission teams to deliver secure, performant, observable pipelines and trusted datasets.

Requirements

The candidate shall possess the knowledge and skills set forth in the Technical Services BOA, Section 3.6.4.2 for labor category Information Data Engineer. The candidate shall also demonstrate the below knowledge and experience:

  • Strong proficiency in Python and R for data engineering and analytical workflows.
  • Hands-on experience with Databricks and Apache Spark, including Structured Streaming (watermarking, stateful processing concepts, checkpointing, exactly-once/at-least-once tradeoffs).
  • Strong SQL skills for transformation and validation.
  • Experience building production-grade pipelines: idempotency, incremental loads, backfills, schema evolution, and error handling.
  • Experience implementing data quality checks and validation for both batch and event streams (late arrivals, deduplication, event-time vs processing-time).
  • Observability skills: logging/metrics/alerting, troubleshooting, and performance tuning (partitions, joins/shuffles, caching, file sizing).
  • Proficiency with Git and CI/CD concepts for data pipelines, Databricks asset bundling, Databricks application deployments, and proficiency using Databricks CLI.
  • Experience with lakehouse table formats and patterns (e.g., Delta tables) including compaction/optimization and lifecycle management.
  • Familiarity with orchestration patterns (Databricks Workflows/Jobs) and dependency management.
  • Experience with governance controls (catalog permissions, secure data access patterns, metadata/lineage expectations).
  • Knowledge of message/event platforms and streaming ingestion patterns (e.g., Kafka/Kinesis equivalents) and sink patterns for serving layers.
  • Experience collaborating with research/analytics stakeholders and translating analytical needs into engineered data products.
  • Strong problem-solving and debugging across ingestion → transformation → serving.
  • Clear technical communication and documentation discipline.
  • Ability to work across product/architecture/governance teams in a regulated environment.
  • Deep Delta Lake expertise including time travel, Change Data Feed (CDF), MERGE operations, CLONE, table constraints, and optimization techniques; understanding of liquid clustering and table maintenance best practices.
  • Experience with Lakeflow/Delta Live Tables (DLT) including expectations framework, materialized vs. streaming table patterns, and declarative pipeline design.
  • Proficiency with testing frameworks (pytest, Great Expectations, deequ) and test-driven development practices for production data pipelines.
  • Data modeling skills including dimensional modeling (star/snowflake schemas), medallion architecture implementation, and slowly changing dimension (SCD) pattern implementation.
  • AWS data services experience including S3 optimization, IAM role configuration for data access, and CloudWatch integration; understanding of cost optimization patterns.

Education / Experience / Certifications / Accreditations:

  • Bachelor's degree in a related field or equivalent experience.
  • 10+ years of data engineering experience, including production Spark-based batch pipelines and streaming implementations.
  • Desirable Certifications: Databricks Certified Apache Spark Developer Associate; Databricks Certified Data Engineer Associate or Professional; AWS Certified Developer Associate; AWS Certified Data Engineer Associate; AWS Certified Solutions Architect Associate.

The Contractor shall deliver, but is not limited to, the following:

  • Build and maintain end-to-end pipelines in Databricks using Spark (PySpark) for ingestion, transformation, and publication of curated datasets.
  • Implement streaming / near-real-time patterns using Spark Structured Streaming (or equivalent), including state management, checkpointing, and recovery.
  • Design incremental processing, partitioning strategies, and data layout/file sizing approaches to optimize performance and cost.
  • Develop reusable pipeline components (common libraries, parameterized jobs, standardized patterns) to accelerate delivery across domains.
  • Develop and operationalize workflows in Python and R for data preparation, analysis support, and research-ready extracts.
  • Package code for repeatable execution (dependency management, environment reproducibility, job configuration).
  • Implement data quality controls for batch and streaming (schema enforcement, completeness/validity checks, late/duplicate event handling, reconciliation).
  • Build pipeline observability: logging, metrics, alerting, and dashboards; support on-call/incident response and root-cause analysis.
  • Create runbooks and operational procedures for critical pipelines and streaming services.
  • Ensure secure handling of sensitive data and apply least-privilege principles in pipeline design and execution.
  • Contribute lineage notes, dataset definitions, and operational documentation to support reuse and auditability.
  • Use version control and CI/CD practices for notebooks/code (code reviews, automated testing where feasible, deployment/promotion across environments).
  • Collaborate with stakeholders to refine requirements, define SLAs, and deliver incrementally with measurable outcomes.
  • Implement Lakeflow/Delta Live Tables (DLT) pipelines with data quality expectations, materialized views, and streaming tables; design pipeline DAGs and maintain declarative ETL workflows.
  • Design and implement medallion architecture patterns (Bronze/Silver/Gold) with appropriate data quality gates, schema evolution strategies, and layer-specific optimization techniques (OPTIMIZE, VACUUM, Z-ordering/liquid clustering).
  • Develop and maintain comprehensive testing strategies including unit tests for transformation logic, integration tests for end-to-end pipelines, and data quality validation using frameworks like Great Expectations or deequ.
  • Perform data modeling and schema design for dimensional models, slowly changing dimensions (SCD), and analytical structures; collaborate on entity definitions and grain decisions.
  • Contribute to Unity Catalog governance by registering datasets with metadata/descriptions/tags, implementing row/column-level security where required, and maintaining accurate lineage information.
Vacancy posted 3 days ago