Sign up to access all features of our service.
  • Job search
  • Favorites
  • Create a CV
    New
  • Salaries
  • Subscriptions

Data Engineer (Python/PySpark/Apache Spark)

Infinitive

About Infinitive

Infinitive is a data and AI consultancy that enables its clients to modernize, monetize and operationalize their data to create lasting and substantial value. We possess deep industry and technology expertise to drive and sustain adoption of new capabilities. We match our people and personalities to our clients' culture while bringing the right mix of talent and skills to enable high return on investment.

Infinitive has been named “Best Small Firms to Work For” by Consulting Magazine 8 times, most recently in 2025. Infinitive has also been named a Washington Post “Top Workplace”, Washington Business Journal “Best Places to Work”, and Virginia Business “Best Places to Work.”

Role Overview

We are seeking a highly skilled and motivated Data Engineer to join our dynamic team. As a Data Engineer, you will play a crucial role in designing, developing, and maintaining our clients' data infrastructure. Your expertise in Apache Spark, Python, PySpark, ETL processes, CI/CD (Jenkins or GitHub), and experience with both streaming and batch workflows will be essential in ensuring the efficient flow and processing of data to support our clients.

Responsibilities

  • Data Architecture and Design: Collaborate with cross-functional teams to understand data requirements and design robust data architecture solutions. Develop data models and schema designs to optimize data storage and retrieval.

  • ETL Development: Implement robust ETL processes to extract, transform, and load data from various sources. Ensure data quality, integrity, and consistency throughout the ETL pipeline.

  • Distributed Computing & Spark Development: Utilize your expertise in Apache Spark, Python, and PySpark to develop efficient, large-scale data processing and analysis scripts. Optimize code for performance, memory management, and scalability, keeping up-to-date with the latest industry best practices.

  • Data Integration: Integrate data from different systems and sources to provide a unified view for analytical purposes. Collaborate with data scientists and analysts to implement solutions that meet their data integration needs.

  • Streaming and Batch Workflows: Design and implement streaming workflows using PySpark Streaming or other relevant technologies. Develop batch processing workflows for large-scale data processing and analysis.

  • CI/CD Implementation: Implement and maintain continuous integration and continuous deployment (CI/CD) pipelines using Jenkins or GitHub Actions. Automate testing, code deployment, and monitoring processes to ensure the reliability of data pipelines.

Qualifications

  • Bachelor's or Master's degree in Computer Science, Information Technology, or a related field.

  • Proven experience as a Data Engineer or similar role.

  • Strong programming skills in Python and deep expertise in Apache Spark and PySpark for both batch and streaming data processing.

  • Hands-on experience developing, tuning, and troubleshooting distributed data pipelines.

  • Solid understanding of ETL tools, data modeling, database design, and data warehousing concepts.

  • Familiarity with CI/CD tools such as Jenkins or GitHub Actions.

  • Excellent problem-solving, analytical, communication, and collaboration skills.

Preferred Skills

  • Experience with Ab Initio (e.g., GDE, Co-Operating System, EME) or a strong background in enterprise ETL modernization.

  • Knowledge of cloud platforms such as AWS, Azure, or Google Cloud.

  • Experience with version control systems (e.g., Git).

  • Familiarity with containerization and orchestration tools (e.g., Docker, Kubernetes).

  • Understanding of data security and privacy best practices.

Infinitive is an Equal Opportunity Employer. All qualified applicants will receive consideration for employment without regard to race, color, religion, sex, sexual orientation, gender identity, national origin, disability, protected veteran status, or any other characteristic protected by applicable federal, state, or local law.

Powered by JazzHR

Vacancy posted 2 days ago
Similar jobs that could be interesting for youBased on the Data Engineer (Python/PySpark/Apache Spark) in Ashburn, VA vacancy
  •  ...Description About Infinitive Infinitive is a data and AI consultancy that enables its...  ...a highly skilled and motivated Data Engineer to join our dynamic team. As a Data...  ...infrastructure. Your expertise in Apache Spark, Python, PySpark, ETL processes, CI/CD (Jenkins or... 
    Suggested
    Local area

    Infinitive Inc

    Ashburn, VA
    28 days ago
  •  ...community of innovators, engineers, analysts and business...  ..., Defense, AI/ML, and Data Science fields. As we...  .../ TS/SCI-cleared Apache Spark Developer to support...  ...Processing Apache Spark (PySpark, Scala) Delta Lake...  ...) Languages Python (PySpark) Scala (... 
    Suggested
    Flexible hours

    Absolute Business Solutions Corp

    Herndon, VA
    4 days ago
  •  ...Data Engineer Chantilly, Virginia, United States What Impact You'll Have GRVTY...  ...team is rather huge and includes Python (Pandas, numpy, scipy, scikit-learn,...  ...Object Detection, etc.), Linux, AWS/C2S, Apache NiFi, Spark, pySpark, Hadoop, Kafka, ElasticSearch, Solr,... 
    Suggested
    Local area
    Immediate start
    Remote work
    Flexible hours

    GRVTY

    Chantilly, Loudoun County, VA
    4 days ago
  • $3,000 per month

     ...Overview Acuity, Inc. seeks a  Data Engineer  to design, develop, and maintain scalable...  ...with strong emphasis on  Databricks, Apache Spark, and AWS .  The Data Engineer...  ...platform services ~ Experience with  Python, PySpark, SQL , or similar data engineering... 
    Suggested
    Work from home

    Acuity

    Reston, VA
    3 days ago
  •  ...The Data & Software Engineer works with a small team to build complex data flows...  ...will have advanced Python programming skills, familiarity...  ...configuring and updating Spark Jobs Containerizing and...  ...years' experience with:  Apache Spark & PySpark Advanced Python skills... 
    Suggested
    Temporary work

    Avalore, LLC

    Chantilly, Loudoun County, VA
    3 days ago
  • $99.2k - $154.3k

     ...Senior Python Developer with Spark Category: Software Development/ Engineering Main location: United States, Virginia, Reston...  ...Google/YouTube processing your data and using cookies - Learn more...  .... Deep expertise in Apache Spark, including: Performance... 
    Full time
    Local area

    CGI

    Reston, VA
    2 days ago
  •  ...Data Engineering Testing Specialist Key Responsibilities Define the end-to-end testing scope...  ...DMS (Data Migration Service) AWS Glue PySpark Deequ Event Bridge Data Lakes Python-based data pipelines Apache Airflow dbt (data build tool)... 

    Talent Software Services

    Reston, VA
    4 days ago
  •  ...Wolf constructs and deploys data management and analytics...  ...to boast a world-class engineering team that thrives on rolling...  ...building data products in Apache Avro and/or Parquet Python AWS Experience with...  ...Iceberg Presto/Trino/Spark Kubernetes Experience... 

    Dark Wolf Solutions

    Herndon, VA
    4 days ago
  •  ...description Are you a highly experienced Data Engineer passionate about transforming diverse...  ...using cutting-edge tools such as Spark, Apache Iceberg, Trino, OpenSearch, EMR cloud services...  ...storage and processing solutions. Python, SQL, Spark, and other data engineering... 

    SpatialGIS

    Chantilly, Loudoun County, VA
    4 days ago
  •  ...Altus Consulting is seeking a skilled Data Engineer to join our dynamic team. You will be responsible...  ...standard tools and technologies (e.g., Apache Spark, Kafka, Airflow). Extract, transform...  ...Skills: Strong programming skills (Python, Java, Scala). Experience with Big... 
    Contract work

    Altus Consulting Corp

    Herndon, VA
    4 days ago
  • $60 - $100 per hour

     ...JOB TITLE: Data Engineer LOCATION: Remote DURATION: 6+ month contract RATE RANGE: $60...  ...knowledge of big data technologies such as Hadoop, Spark, or Anacond Strong programming skills in Python, Java, Scala, and/or SQL Experience in client... 
    Hourly pay
    Full time
    Contract work
    Remote work

    Ursus Inc

    Ashburn, VA
    3 days ago
  •  ...Overview We are seeking a skilled Data Engineer with at least 5 years of...  ...processing frameworks (e.g., Spark, Hadoop) ~ Hands-on...  ...pipelines using tools such as Apache Airflow, Informatica, or similar...  ...programming languages such as Python, Scala, or Java ~ Experience... 

    VTG Defense

    Herndon, VA
    3 days ago
  •  ...Senior Data Engineer - Cybersecurity Your work days are brighter here...  ...science workloads (e.g., Ray, Spark/EMR GPU instances)....  ...Libraries: Strong experience using Apache Spark / AWS EMR, Ray, or Dask...  .../CD: Advanced proficiency in Python (including ML ecosystems like... 

    Workday

    Reston, VA
    2 days ago
  •  ...: Contract Job #3714 Title: Data Engineer Location: Chantilly, VA...  ...optimize Data Pipelines using tools such as Spark, Apache Iceberg, Trino, OpenSearch, EMR cloud services...  ...data storage and processing solutions Python, SQL, Spark and other data engineering... 
    Contract work
    Local area

    Cornerstone Defense

    Chantilly, Loudoun County, VA
    1 day ago
  • $120k - $150k

     ...community of innovators, engineers, analysts and business...  ..., Defense, AI/ML, and Data Science fields. As we continue...  ...pipelines, optimize Spark workloads, and deliver...  ..., reusable code in Python and Spark to support distributed...  ...Desired Skills Apache Spark Background... 
    Remote work

    Absolute Business Solutions Corp

    Herndon, VA
    4 days ago
  •  ...advanced full-spectrum cyber, data operations, systems...  ...markets. Job Title: Data Engineer Location: Sterling, VA...  ...Programming Languages: Proficiency in Python or Java for data manipulation...  ...with technologies like Apache Hadoop, Spark, or Kafka. Data Visualization... 
    Contract work

    Nightwing

    Hamilton, VA
    25 days ago
  •  ...Founded by ex-Googlers with engineers from Google, Amazon,...  ...particularly in AI, data engineering, blockchain...  ...programming skills in Python, Rust, or Java ~ Experience...  ...., Kafka, Flink, Beam, Spark) and orchestration...  ...pipelines using Dataflow (Apache Beam), Pub/Sub,... 
    Work at office

    SZNS Solutions LLC

    Reston, VA
    4 days ago
  • $90.7k - $141.78k

     ...Responsibilities Noblis is seeking Data Engineers at multiple levels to join...  ...have a deep background in Python-based data workflows and...  ...Python), and ETL tools (e.g., Apache NiFi, Pentaho, Kafka)....  ...Experience with Hadoop and Spark Experience with Elasticsearch... 
    Full time
    Contract work
    Part time
    Local area
    Remote work

    Noblis

    Chantilly, Loudoun County, VA
    11 hours ago
  • $3,000 per month

     ...Acuity Inc. is seeking a highly skilled Data Engineer to join our Engineering Team, helping drive...  ...knowledge and/or experience with Spark, Delta Lake, and distributed data pipelines...  ...Responsibilities Build and maintain scalable PySpark-based data pipelines in Databricks... 
    Remote work
    Work from home

    Acuity

    Reston, VA
    11 hours ago
  •  ...Description SOSi is seeking a highly skilled Senior Data Engineer to support a US government customer in Chantilly, VA....  ...mathematics, etc.). ~ Experience with services including Apache Kafka, Apache Spark, and Prefect ~ Experience containerizing applications using... 
    Work at office
    Worldwide

    SOSi

    Chantilly, Loudoun County, VA
    2 days ago
  •  ...About Infinitive: Infinitive is a data and AI consultancy that enables its clients...  ...technical guidance and mentorship to data engineering teams.  Stay updated with the latest industry...  ...the ability to build and optimize Spark applications.  Extensive experience with... 
    Local area

    Infinitive

    Ashburn, VA
    1 day ago
  •  ...tuning Solr/Elastic for performance and query optimization 2. Minimum of 3 years of experience in a big data environment using tools such as Hadoop, Pyspark, Spark, and Hbase (prefer at least two of the list) 3. Minimum of 5 years of hands-on, back-end Java... 

    Leading Path Consulting

    Chantilly, Loudoun County, VA
    3 days ago
  •  ...passionate about harnessing data to solve some of the nation’s...  ...success. We're seeking a Data Engineer with a rare mix of curiosity...  ...data pipelines using Apache Spark , Apache Hudi , AWS EMR...  ...resilient full-stack solutions with Python , Java , or Scala Required... 

    Equilibrium Technologies LLC

    Chantilly, Loudoun County, VA
    1 day ago
  •  ...Job #3772 Position Chief Data Engineer Work Location McLean, VA...  ...processing platforms such as Hadoop, Spark, and cloud services (AWS, Azure, GCP...  ...Proficiency in programming languages such as Python, SQL, and tools like Apache NiFi, Talend, or similar ETL tools.... 
    Contract work

    Cornerstone Defense

    Chantilly, Loudoun County, VA
    2 days ago
  •  ...motivated, career and customer-oriented Data Engineer to join our team in Chantilly, VA....  ...programming language experience such as Python or Java. Significant experience with...  ...RESTful APIs, data pipelining systems like Apache Airflow, and performing ETL tasks in a Linux... 
    Full time
    Work at office

    MANTECH

    Chantilly, Loudoun County, VA
    11 hours ago
  • $150k - $265k

     ...diverse perspectives to every project. We are seeking engineers who wish to grow their careers and want to become part...  ...Description We are seeking a Palantir Data Engineer who leverages advanced Python skills within a small team to develop and implement data... 
    Hourly pay
    Extra income
    Temporary work
    Local area
    Immediate start
    Flexible hours

    Erias Ventures

    Chantilly, Loudoun County, VA
    4 days ago
  • $3,000 per month

     ...Acuity is looking for a Data Scientist to help shape next...  ...Collaborate with data engineers, analysts, and business leaders...  ...more programming languages (Python, R, PySpark, or SQL) and demonstrated experience...  ...such as Databricks and Apache Spark. ~ Prior experience... 
    Work from home

    Acuity

    Reston, VA
    2 days ago
  •  ...Our client is looking for Data Engineer/Platform Engineer who is based out of Reston...  ...Skilled in programming languages like Python, Unix Shell scripting Experience...  ..., and Lambda Experience with Spark and Amazon EMR/Hadoop, PySpark and AWS Glue is a plus Experience... 
    Contract work
    Work at office

    Hallmark Global Solutions Ltd

    Reston, VA
    2 days ago
  •  ...organization in Reston, VA is seeking a new Data Engineer/Platform Engineer to design, deploy,...  ...using Terraform, GitLab, Ansible, and Python Build, manage, and optimize AWS...  ...Desired Skills: Experience with Spark, PySpark, AWS Glue, or EMR/Hadoop Experience... 
    Monday to Friday
    Shift work
    Day shift
    3 days per week

    Tandym Group

    Reston, VA
    2 days ago
  •  ...FSP required at time of application We’re looking for a Data Engineer who is passionate about building modern, scalable solutions for...  ...experience. REQUIRED KNOWLEDGE/SKILLS  Knowledge of NodeJS or Python  Strong understanding of APIs, microservices, and... 

    Leading Path Consulting LLC

    Chantilly, Loudoun County, VA
    2 days ago

Do you want to receive more vacancies?

Subscribe and receive similar vacancies to Data Engineer (Python/PySpark/Apache Spark). Be the first to apply!