Sign up to access all features of our service.
  • Job search
  • Favorites
  • Create a CV
    New
  • Salaries
  • Subscriptions

Cloud Data Engineer — Lakehouse & ETL Pipelines

Creative Information Technology India

If you are unable to complete this application due to a disability, contact this employer to ask for an accommodation or an alternative application process. Data Engineer – Baltimore City, MD Contract Falls Church, VA, US 16 days ago Requisition ID: 1802 Data Engineer – Baltimore City, MD About us Creative Information Technology Inc (CITI) is an esteemed IT enterprise renowned for its exceptional customer service and innovation. We serve both government and commercial sectors, offering a range of solutions such as Healthcare IT, Human Services, Identity Credentialing, Cloud Computing, and Big Data Analytics. With clients in the US and abroad, we hold key contract vehicles including GSA IT Schedule 70, NIH CIO-SP3, GSA Alliant, and DHS-Eagle II. Join us in driving growth and seizing new business opportunities. Background Client is seeking a hands-on Data Engineer to design, develop, and optimize large-scale data pipelines in support of our Enterprise Data Warehouse (EDW) and Data Lake solutions. This role requires deep technical expertise in coding, pipeline orchestration, and cloud-native data engineering on AWS. The Data Engineer will be directly responsible for implementing ingestion, transformation, and integration workflows — ensuring data is high-quality, compliant, and analytics-ready. This role may support other projects or teams within MDH as needed. Role and Responsibilities Responsible for designing, building, and maintaining data pipelines and infrastructure to support data-driven decisions and analytics. The individual is responsible for the following tasks: Design, develop and maintain data pipelines, and extract, transform, load (ETL) processes to collect, process and store structured and unstructured data Build data architecture and storage solutions, including data lakehouses, data lakes, data warehouse, and data marts to support analytics and reporting Develop data reliability, efficiency, and qualify checks and processes Monitor and optimize data architecture and data processing systems Collaboration with multiple teams to understand requirements and objectives Administer testing and troubleshooting related to performance, reliability, and scalability H. Create and update documentation Design, code, and deploy ETL/ELT pipelines across bronze, silver, and gold layers of the Data Lakehouse. Build ingestion pipelines for structured (SQL), semi-structured (JSON, XML), and unstructured data using PySpark/Python programming language using AWS Glue or EMR. Implement incremental loads, deduplication, error handling, and data validation. Actively troubleshoot, debug, and optimize pipelines for scalability and cost efficiency. EDW & Data Lake Implementation Develop dimensional data models (Star Schema, Snowflake Schema) for analytics and reporting. Build and maintain tables in Iceberg, Delta Lake, or equivalent OTF formats. Optimize partitioning, indexing, and metadata for fast query performance. Build ingestion and transformation pipelines for EDI X12 transactions (837, 835, 278, etc.). Implement mapping and transformation of EDI data with FHIR and HL7 frameworks. Work hands-on with AWS Health Lake (or equivalent) to store and query healthcare data. Data Quality, Security & Compliance Develop automated validation scripts to enforce data quality and integrity. Implement IAM roles, encryption, and auditing to meet HIPAA and CMS compliance standards. Maintain lineage and governance documentation for all pipelines. Work closely with the Lead Data Engineer, analysts, and data scientists to deliver pipelines that support enterprise-wide analytics. Actively contribute to CI/CD pipelines, Infrastructure-as-Code (IaC), and automation. Continuously improve pipelines and adopt new technologies where appropriate. Minimum Qualifications The candidate should have experience as data engineer or similar role with a strong understanding of data architecture and ETL processes. The candidate should be proficient in programming languages for data processing and knowledgeable of distributed computing and parallel processing. This position requires a bachelor’s or master’s degree from an accredited college or university with a major in computer science, statistics, mathematics, economics, or a related field. Three (3) years of equivalent experience in a related field may be substituted for the Bachelor’s degree. 3+ years hands-on experience in building, deploying, and maintaining data pipelines on AWS or equivalent cloud platforms. Strong coding skills in Python and SQL (Scala or Java a plus). Proven experience with Apache Spark (PySpark) for large-scale processing. Hands-on experience with AWS Glue, S3, Redshift, Athena, EMR, Lake Formation. Strong debugging and performance optimization skills in distributed systems. Hands-on experience with Iceberg, Delta Lake, or other OTF table formats. Experience with Airflow or other pipeline orchestration frameworks. Practical experience in CI/CD and Infrastructure-as-Code (Terraform, CloudFormation). Practical experience with EDI X12, HL7, or FHIR data formats. Strong understanding of Medallion Architecture for data lake houses. Hands-on experience building dimensional models and data warehouses. Working knowledge of HIPAA and CMS interoperability requirements. #J-18808-Ljbffr Creative Information Technology India

Vacancy posted 3 days ago
Similar jobs that could be interesting for youBased on the Cloud Data Engineer — Lakehouse & ETL Pipelines in Falls Church, VA vacancy
  • CACI International Inc is seeking a Sr. Data Engineer in Arlington, Virginia. You will build and optimize data pipelines for ETL, integrate complex datasets, and develop scalable data warehousing solutions. The role requires a DoD Secret clearance, a BA/BS in Computer... 
    Pipeline
    Flexible hours

    CACI International Inc

    Arlington, VA
    21 hours ago
  • Cybermedia Technologies is seeking a Data Engineer in McLean, VA. This role involves designing ETL pipelines, optimizing data processing in Azure Databricks, and supporting...  ...and strong skills in SQL, Python, and cloud-based data platforms. The position offers benefits... 
    Pipeline

    Cybermedia Technologies

    Mc Lean, VA
    3 days ago
  •  ...landscape, an organization’s data has never been a more...  ...for seasoned Data Engineer to work with our team and...  ..., services, and pipelines. We are looking for a more...  ...Assess and understand the ETL jobs, workflows, BI tools...  ...warehouse solutions in cloud (Preferably AWS.... 
    Pipeline

    Steampunk, Inc.

    Mc Lean, VA
    3 days ago
  • A leading consulting firm in Arlington is seeking a Data Engineer to build, optimize, and maintain data pipelines and cloud solutions. You will work with cross-functional teams to deliver high-quality data solutions and ensure data integrity. Requirements include a Bachelor... 
    Pipeline

    Guidehouse

    Arlington, VA
    2 days ago
  •  ...IICS / AWS in McLean, VA. This role involves supporting IICS and the development of the AWS platform, including building pipelines and developing ETL solutions. The ideal candidate should have over 5 years of experience with Informatica PowerCenter & IICS, strong SQL... 
    Pipeline

    TEKsystems

    Mc Lean, VA
    3 days ago
  • $176.99k

    A global corporate data provider is seeking a Senior Data Engineer. You will build and maintain ETL pipelines, collaborate on requirements, and improve data processes. Requires a Bachelor’s degree and 3 years of experience in a similar role, with skills in Python, Scala... 
    Pipeline
    Remote job

    Sayari

    Washington DC
    1 day ago
  • A tech company in Washington, DC is seeking an Azure Data Engineer Developer with extensive experience in Microsoft Azure data stack,...  ...Factory and SQL Database. The role involves building complex ETL pipelines and providing expertise in data modeling and integration. Ideal... 
    Pipeline

    TechDigital Group

    Washington DC
    1 day ago
  • $62k - $141k

    Booz Allen Hamilton is seeking a Data Engineer in Washington, DC. The role involves building data pipelines, mentoring a team, and...  ...experience in database architecture, ETL workflows, and proficiency in...  ...with tools like Spark and cloud platforms such as AWS is required... 
    Pipeline

    Booz Allen Hamilton

    Washington DC
    4 days ago
  • TMCI is seeking a Junior Data Engineer in McLean, VA. This role involves designing, developing, and maintaining data pipelines and databases to enhance patient care in the healthcare industry...  ...with data modeling techniques and ETL tools is essential. Strong problem-solving... 
    Pipeline

    TMCI

    Mc Lean, VA
    21 hours ago
  •  ...ETL Data Engineer Tysons Corner, VA Type: Contract Category: Data Industry: Financial...  ...to build and maintain large-scale data pipelines and to design agentic AI systems that...  ...SQL engines such as Hive or Trino and cloud data platforms including AWS S3, EMR,... 
    Pipeline
    Hourly pay
    Contract work
    Work experience placement
    Local area
    Remote work

    Eliassen Group

    Vienna, VA
    2 days ago
  • $115k - $130k

    Pantheon Data is looking for a Data Engineer in Falls Church, Virginia, to support the Defense Health Agency. This role requires five years of experience, expertise in SQL, and the ability to design ETL pipelines. The ideal candidate will ensure data integration, work with... 
    Pipeline
    Remote job
    Flexible hours

    Pantheon Data

    Falls Church, VA
    4 days ago
  • ProSidian Consulting, LLC is seeking a Data Engineer in Alexandria, VA to support Workforce...  ...engineering experience, with a focus on building ETL pipelines and managing data architecture. Ideal...  ...should possess skills in ETL, SQL, and cloud platforms. The position supports a... 
    Pipeline

    ProSidian Consulting, LLC

    Alexandria, VA
    21 hours ago
  • Strategio Inc. is seeking a Senior Data Engineer in McLean, VA (Hybrid) to design and optimize data pipelines and support cloud data migration initiatives. The ideal candidate brings...  .... Responsibilities include building ETL pipelines and collaborating with cross-functional... 
    Pipeline

    Strategio Inc.

    Mc Lean, VA
    4 days ago
  •  .../JD Azure Databricks and Pyspark skills: Data ETL / Data Engineering Key technology: ADF, Databricks, dbt, Azure Cloud Services Supporting technology: MS SQL...  ...development experience in building complex data pipeline for lakehouse/data warehouses using Agile methodology.... 
    Pipeline
    Work experience placement

    Omni Inclusive

    Washington DC
    3 days ago
  • $135k - $165k

    Macalogic is seeking a Data Engineer to provide support to the Department of Commerce in Washington, DC. You will design, maintain, and optimize data pipelines, ETL workflows, and cloud data services to support AI initiatives. This hybrid role requires strong data engineering... 
    Pipeline

    Macalogic

    Washington DC
    2 days ago
  • Data Engineer/ETL — McLean, VA Overview Fuel Consulting is seeking a data engineer/ETL subject...  ...makes significant enhancements to existing pipelines. Resolves complex hardware/software...  ...Distributed Computing, Blade Centers, and cloud infrastructure Strong problem solving... 
    Pipeline

    Fuel Consulting, LLC

    Mc Lean, VA
    2 days ago
  • A leading consulting firm is seeking a Data Engineer at the Senior Analyst level in Arlington,...  ...involves building and maintaining data pipelines and collaborating with senior engineers....  ...skills in Python and SQL. Familiarity with cloud environments and data engineering... 
    Pipeline

    Accenture Federal Services

    Arlington, VA
    1 day ago
  • Infinitive seeks a highly skilled Data Engineer in McLean, Virginia. You will...  ...involves data architecture, ETL development, and integration...  ...proven experience in data pipeline management. Strong skills in...  ...essential. Familiarity with cloud platforms is preferred. #J-1... 
    Pipeline

    Infinitive

    Mc Lean, VA
    3 days ago
  •  ...Data Platform Engineer MPR Associates, Inc. (MPR), a thriving multi-discipline, specialty...  ...processing, working with modern cloud platforms and visualization tools...  ...data processing. Design Data Pipelines: Create and maintain robust ETL/ELT pipelines that process large... 
    Pipeline

    MPR Services

    Alexandria, VA
    2 days ago
  • A leading engineering firm is looking for a Data & Software Engineer to develop complex data flows for custom applications....  ...principles. Responsibilities include building data pipelines, using orchestration tools, and working with cloud environments like AWS. A Bachelor's degree... 
    Pipeline

    Vosper Thornycroft Group

    Mc Lean, VA
    21 hours ago
  • Redhorse Corporation is seeking a skilled Data Engineer based in Arlington, Virginia, to transform data utilization within the Department of Defense. This role will enable significant advancements in data-driven decision-making impacting national security initiatives. Candidates... 
    Pipeline

    Redhorse Corporation

    Arlington, VA
    3 days ago
  • $140k - $180k

    Steampunk, Inc. is seeking a seasoned Senior Data Engineer to create enterprise-grade data platforms and pipelines in Databricks. You will architect data migrations, assess ETL jobs, and work closely on data architecture in a collaborative team environment. The ideal candidate... 
    Pipeline

    Steampunk, Inc.

    Mc Lean, VA
    3 days ago
  •  ...Azure Cloud Data Engineer With Data Bricks Job Location: Vienna, VA Job Type: Contract Qualifications...  ..., Data Architecture, Apache Spark, ETL tools and techniques Proficient with...  ...with warehousing, data cleaning, data pipelines and other analytical techniques... 
    Pipeline
    Contract work

    InterSources

    Vienna, VA
    2 days ago
  • AEM Corporation is seeking a Data Engineer in Washington, DC to build trusted data products for Federal clients. This hands-on role requires designing scalable data pipelines using Azure technologies like Synapse and ADLS Gen2, collaborating within a Scrum team to meet... 
    Pipeline

    AEM Corporation

    Washington DC
    3 days ago
  • $110k - $130k

    Steampunk is seeking a skilled ETL Engineer located in McLean, Virginia, to develop enterprise-grade data pipelines. This role requires strong communication skills and a passion for data. You will assess ETL workflows, create reusable pipelines, and support analytics and... 
    Pipeline

    Steampunk

    Mc Lean, VA
    3 days ago
  • MPR Associates, Inc. is looking for a Data Platform Engineer in Alexandria, Virginia. This role involves building robust data processing...  ...engineering experience. Responsibilities include developing ETL/ELT pipelines and deploying data solutions on AWS. MPR offers a... 
    Pipeline

    MPR Associates, Inc.

    Alexandria, VA
    21 hours ago
  •  ...Azure Data Architect/Engineer The intend of this work is to guide the design...  .... ~ Minimum 8+ years of Cloud data architecture with at...  ...in developing complex ETL Data Pipelines using Databricks with PySpark...  ...& maintain Data Lake & Lakehouse (CDC & SCD). ~ Excellent... 
    Pipeline

    Mindlance

    Washington DC
    2 days ago
  • $180.37k - $212.2k

     ...team bridges the gap between data engineering, data science, and business...  ...internal audit - ensuring our pipelines, data models, and certified...  ...maintaining, and optimizing ETL/ELT pipelines, using modern...  ...synchronization patterns between lakehouse and warehouse environments.... 
    Pipeline
    Work at office
    Local area

    Coinbase

    Washington DC
    21 hours ago
  • Contract (6-9 months, potential conversion) Job Description The Data / Integration Engineer is responsible for building the data standardization layer, ETL pipelines, and ESI Learning Service that support the CLEARSITE™ agent decision-making loop. This role ensures schema... 
    Pipeline
    Contract work
    Work visa

    Digital Global Systems Inc

    Mc Lean, VA
    21 hours ago
  • Cydecor, Inc. in Arlington, Virginia seeks a Data Engineer to design and develop robust, scalable data pipelines using ETL/ELT methodologies. The ideal candidate should have...  ...proficiency in Python and SQL, alongside cloud environments like AWS or Azure. This role involves... 
    Pipeline

    Cydecor, Inc.

    Arlington, VA
    3 days ago

Do you want to receive more vacancies?

Subscribe and receive similar vacancies to Cloud Data Engineer — Lakehouse & ETL Pipelines. Be the first to apply!