Location: 100% Remote Years' Experience: 10+ years Education: Bachelor's in IT related field Work Authorization: Must show that applicant is legally permitted to work in the United States. Clearance: Applicants must be able to meet the requirements to obtain an Public Trust security clearance. NOTE: United States Citizenship is required to be eligible to obtain this security clearance. Key Skills:
- 10+ years of IT experience focusing on enterprise data architecture and management
- Experience with Databricks required
- 8+ years experience in Conceptual/Logical/Physical Data Modeling & expertise in Relational and Dimensional Data Modeling
- Experience with Great Expectations or other data quality validation frameworks
- Experience with ETL and ELT tools such as SSIS, Pentaho, and/or Data Migration Services
- Advanced level SQL experience (Joins, Aggregation, Windowing functions, Common Table Expressions, RDBMS schema design, Postgres performance optimization)
- Experience with AWS environment, CI/CD pipelines, and Python (Python 3)
- Plan, create, and maintain data architectures, ensuring alignment with business requirements
- Obtain data, formulate dataset processes, and store optimized data
- Identify problems and inefficiencies and apply solutions
- Determine tasks where manual participation can be eliminated with automation.
- Identify and optimize data bottlenecks, leveraging automation where possible
- Create and manage data lifecycle policies (retention, backups/restore, etc)
- In-depth knowledge for creating, maintaining, and managing ETL/ELT pipelines
- Create, maintain, and manage data transformations
- Maintain/update documentation
- Create, maintain, and manage data pipeline schedules
- Monitor data pipelines
- Create, maintain, and manage data quality gates (Great Expectations) to ensure high data quality
- Support AI/ML teams with optimizing feature engineering code
- Expertise in Spark/Python/Databricks, Data Lake and SQL
- Create, maintain, and manage Spark Structured Steaming jobs, including using the newer Delta Live Tables and/or DBT
- Research existing data in the data lake to determine best sources for data
- Create, manage, and maintain ksqlDB and Kafka Streams queries/code
- Data driven testing for data quality
- Maintain and update Python-based data processing scripts executed on AWS Lambdas
- Unit tests for all the Spark, Python data processing and Lambda codes
- Maintain PCIS Reporting Database data lake with optimizations and maintenance (performance tuning, etc)
- Streamlining data processing experience including formalizing concepts of how to handle lake data, defining windows, and how window definitions impact data freshness.
- 10+ years of IT experience focusing on enterprise data architecture and management
- Experience in Conceptual/Logical/Physical Data Modeling & expertise in Relational and Dimensional Data Modeling
- Experience with Databricks, Structured Streaming, Delta Lake concepts, and Delta Live Tables required
- Additional experience with Spark, Spark SQL, Spark DataFrames and DataSets, and PySpark
- Data Lake concepts such as time travel and schema evolution and optimization
- Structured Streaming and Delta Live Tables with Databricks a bonus
- Experience leading and architecting enterprise-wide initiatives specifically system integration, data migration, transformation, data warehouse build, data mart build, and data lakes implementation / support
- Advanced level understanding of streaming data pipelines and how they differ from batch systems
- Formalize concepts of how to handle late data, defining windows, and data freshness
- Advanced understanding of ETL and ELT and ETL/ELT tools such as SSIS, Pentaho, Data Migration Service etc
- Understanding of concepts and implementation strategies for different incremental data loads such as tumbling window, sliding window, high watermark, etc.
- Familiarity and/or expertise with Great Expectations or other data quality/data validation frameworks a bonus
- Understanding of streaming data pipelines and batch systems
- Familiarity with concepts such as late data, defining windows, and how window definitions impact data freshness
- Advanced level SQL experience (Joins, Aggregation, Windowing functions, Common Table Expressions, RDBMS schema design, Postgres performance optimization)
- Indexing and partitioning strategy experience
- Debug, troubleshoot, design and implement solutions to complex technical issues
- Experience with large-scale, high-performance enterprise big data application deployment and solution
- Understanding how to create DAGs to define workflows
- Familiarity with CI/CD pipelines, containerization, and pipeline orchestration tools such as Airflow, Prefect, etc a bonus but not required
- Architecture experience in AWS environment a bonus
- Familiarity working with Kinesis and/or Lambda specifically with how to push and pull data, how to use AWS tools to view data in Kinesis streams, and for processing massive data at scale a bonus
- Experience with Docker, Jenkins, and CloudWatch
- Ability to write and maintain Jenkinsfiles for supporting CI/CD pipelines
- Experience working with AWS Lambdas for configuration and optimization
- Experience working with DynamoDB to query and write data
- Experience with S3
- Knowledge of Python (Python 3 desired) for CI/CD pipelines a bonus
- Familiarity with Pytest and Unittest a bonus
- Experience working with JSON and defining JSON Schemas a bonus
- Experience setting up and management Confluent/Kafka topics and ensuring performance using Kafka a bonus
- Familiarity with Schema Registry, message formats such as Avro, ORC, etc.
- Understanding how to manage ksqlDB SQL files and migrations and Kafka Streams
- Ability to thrive in a team-based environment
- Experience briefing the benefits and constraints of technology solutions to technology partners, stakeholders, team members, and senior level of management
Vacancy posted 3 days ago
Similar jobs that could be interesting for youBased on the Senior Data Engineer in Washington DC vacancy
- ...clients across the Federal government, from senior level policy makers to program managers,... ...offering actionable insights by applying data-driven and analytics-based approaches in... ..., approaches and techniques. Our data engineers have the knowledge, skills, and initiative...Senior
- ...opportunities in an environment that embraces your unique skills and experience. Your Role and Responsibilities As an Senior Data Engineer at IBM you will harness the power of data to unveil captivating stories and intricate patterns. You'll contribute to data gathering...SeniorHoliday workFull timeTemporary workPart timeWork experience placementLocal areaRelocation
- Job Description Position Summary DPR is seeking an experienced Data Engineer to join our Data Engineering team. This role is part of the Data Engineering and AI team and will ensure DPR is moving towards data-driven decisions based on modern data engineering techniques...SeniorInternship
- Data Engineer - Senior This position requires an active TS/Sensitive Compartmental Information (SCI) clearance with the ability to obtain a Counterintelligence polygraph. Cherokee Analytics is seeking a Data Engineer to support Defense analytical requirements with enhanced...SeniorFull time
- ...seeking a Databricks-certified skilled consultant with experience delivering and leading strategic Databricks and data engineering client engagements. This Senior Manager will oversee the success of and play the Engagement Manager role on a portfolio of client engagements...SeniorLocal areaRemote job
- Senior Data Engineer Category: Software Development/ Engineering Main location: United States, Virginia, Arlington Position ID:J0924-0596 Employment Type: Full Time Position Description: Looking for a data passionate data engineer to join our team at CGI. Learn and...SeniorHoliday workFull timeContract workLocal area
- ...Inc is seeking qualified candidates to work on our efforts with a Prime for their end customer, a federal agency. Position : Senior Data Engineer - ( 50% REMOTE and 50% ONSITE) Location : Washington, DC or Crystal City, Arlington, VA Shift time: 8 am to 5 pm JOB...SeniorRemote jobHoliday workFull timeImmediate startShift work
- ...Senior Data Security Engineer Job Description Overview: CoStar Group (NASDAQ: CSGP) is a leading global provider of commercial and residential real estate information, analytics, and online marketplaces. Included in the S&P 500 Index and the NASDAQ 100, CoStar Group...SeniorFull timeRemote job
- Job Description Insight Global is seeking a Secret cleared Data Engineer to support a federal client of ours remotely. The main task at hand... ...while minimizing downtime and disruptions - Work with senior developers to design new cloud-based data architectures and storage...SeniorRemote job
- Title: Senior Data Engineer - Pentaho Duration: 6 Months - Long Term Location: Washington, DC 20005 Hybrid Onsite: 2/3 days onsite per week from Day1. Candidate should be comfortable to pick official laptop from client location (Washington, DC 20005) after the onboarding...Senior2 days per week3 days per week
- Senior Data Engineer (Homes.com) Job Description Overview CoStar Group (NASDAQ: CSGP) is a leading global provider of commercial and residential real estate information, analytics, and online marketplaces. Included in the S&P 500 Index and the NASDAQ 100, CoStar Group...SeniorFull timeRemote job3 days per week
- ...Senior Manager Data Engineering Locations: Washington - Seattle Campus Time Type: Full Time Posted On: Posted 3 Days Ago Job Requisition ID: R-90545 If you need assistance during the recruiting process due to a disability, please reach out to our Recruiting...SeniorFull time
- Overview As a Data Engineer you will help develop and deploy technical solutions to solve our customers’ hardest problems, using various platforms to integrate data, transform insights, and build first-class applications for operational decisions. You will leverage everything...SeniorWork experience placement
- ...Search Machine Learning (ML) team powers the ML aspects of search engines for Disney+ and Hulu platforms, in a highly collaborative... ...In this role, you will be partnering closely with ML engineers/data scientists to help automate and manage their data needs for regular...SeniorWorldwide
- still need people with heavy data, Adobe Analytics and Biopharmaceutical experience for these roles. Client is Client, project is for... ...client information online. Must be Client Level & Job Title : Senior Associate Data Analytics (depending on how this goes, team is open...SeniorContract workWork experience placementImmediate startRemote job3 days per week
- ...the world. We are igniting business growth by connecting people, data and applications – quickly, securely, and effortlessly. Together... ...Enrolled at a 4-year accredited college or university, rising senior level education status at the start of the internship Graduating...SeniorRemote jobHourly payTemporary workSummer workInternshipSummer internshipWork from home
$60k - $72k
...process and decision to hire new personnel if requested by SW Manager/Senior Manager. Works with the administrative support staff to... ...your information being transmitted by Jooble to the Employer, as data controller, through the Employer’s data processor SonicJobs....SeniorFull timeLocal areaImmediate startRelocation- Data Engineer, Data Services Company Overview Over the next ten years, there will be at least 4.6 million hospitalizations from the misuse... ...is transforming the pharmacy experience for medically complex seniors while also helping payers achieve their quality improvement...SeniorRemote job
- ...Senior Cache Developer with MUMPS Experience (Senior Healthcare Data Migration Engineer) Location : Remote US Citizenship/Green Card Holder Experience Level : 10+ Years Job Description : We are seeking a Senior Health Data Migration Engineer...SeniorRemote job
- ...reading! Leidos is looking to fill a Storage Engineer position to support the National Media... ...administering enterprise-level data storage systems (Dell Storage Center, Dell... ...performance metrics Experience coordinating with senior management and customers Candidate must...SeniorPermanent employment
- ...Company Description RightEye™ uses the power of science, data, and artificial intelligence to illuminate and understand... ...Description RightEye™ is seeking a skilled and experienced senior data science engineer familiar with cloud technologies to join our data science...SeniorFlexible hours
- Marathon TS Data Engineer – TS/SCI Bethesda, MD or Arlington, VA Marathon TS is seeking experienced Data Engineer Data Engineer supporting... ...minds in the GovCon industry. Responsibilities: As a senior member of the team, you bring deep expertise in data...SeniorPermanent employmentContract workFlexible hours
- ...Center 1 (19052), United States of America, McLean, VirginiaSenior Data Engineer (Remote-Eligible)Do you love building and pioneering in the technology space? Do you enjoy solving complex business problems in a fast-paced, collaborative, inclusive, and iterative delivery...SeniorRemote jobInternshipLocal area
- The Senior Data Center Engineer is a key player in server engineering, responsible for leading the planning of maintenance and the development of strategies across various platforms, with a primary focus on Linux systems. This role demands expertise in Red Hat Enterprise...SeniorWork experience placementRemote job
- Our Senior Java Data Engineer is a key member of the engineering staff working across the organization to provide a friction-less experience to our customers and maintain the highest standards of protection and availability. Our team thrives and succeeds in delivering...SeniorHourly payWork experience placementLocal areaShift work
- ...Senior Data Engineer Location: Washington: DC metro area Required: US Citizen We are seeking a Senior Data Engineer to join our team and support our client . Essential Duties and Responsibilities: Responsible for designing and building systems...SeniorHoliday workFull timeWork experience placement
- ...Define and implement the technology strategy across Analytics, Big Data, and Cloud platforms. Identify, research, evaluate current... ...monitoring, debugging, benchmarking and performance tuning of Database Engines. Experience with high-scale or distributed RDBMS a plus....SeniorLong distance
- A conservative political strategist company looking to fill their Senior Data Engineering position. They provide political strategies / advertising needs for their stakeholders who work on the local, state, and federal level, and specifically with conservative / right leaning...SeniorHoliday workLocal areaRemote jobH1b
- ...Job Opportunity: Our team is seeking a talented Top Secret cleared Data Engineer to work in our NW Washington DC location This role supports DHS's Office of Intelligence and Analysis (I&A). I&A is responsible for developing DHS-wide intelligence through managing the...SeniorLocal area
- ...you are doing well, Please find the job description given below and let me know your interest. Position: Senior Cloud Infrastructure Data Engineer Location: (100% Remote) Duration: 6+ months Job Description: MUST HAVES: #6+ yrs platform...SeniorRemote job
Do you want to receive more vacancies?
Subscribe and receive similar vacancies to Senior Data Engineer. Be the first to apply!
Related searches
- senior data quality engineer Washington DC
- remote data architect Washington DC
- data pipeline engineer Washington DC
- hadoop big data architect Washington DC
- data systems engineer Washington DC
- data engineer intern Washington DC
- big data devops engineer Washington DC
- senior data center engineer Washington DC
- etl data engineer Washington DC
- big data cloud engineer Washington DC