
Data Engineer (Python/PySpark/Apache Spark)

Infinitive

About Infinitive

Infinitive is a data and AI consultancy that enables its clients to modernize, monetize, and operationalize their data to create lasting and substantial value. We possess deep industry and technology expertise to drive and sustain adoption of new capabilities. We match our people and personalities to our clients' culture while bringing the right mix of talent and skills to enable high return on investment.

Role Overview

We are seeking a highly skilled and motivated Data Engineer to join our dynamic team. As a Data Engineer, you will play a crucial role in designing, developing, and maintaining our clients' data infrastructure. Your expertise in Apache Spark, Python, PySpark, ETL processes, CI/CD (Jenkins or GitHub), and experience with both streaming and batch workflows will be essential in ensuring the efficient flow and processing of data to support our clients.

Responsibilities

  • Data Architecture and Design: Collaborate with cross-functional teams to understand data requirements and design robust data architecture solutions. Develop data models and schema designs to optimize data storage and retrieval.
  • ETL Development: Implement robust ETL processes to extract, transform, and load data from various sources. Ensure data quality, integrity, and consistency throughout the ETL pipeline.
  • Distributed Computing & Spark Development: Utilize your expertise in Apache Spark, Python, and PySpark to develop efficient, large-scale data processing and analysis scripts. Optimize code for performance, memory management, and scalability, keeping up-to-date with the latest industry best practices.
  • Data Integration: Integrate data from different systems and sources to provide a unified view for analytical purposes. Collaborate with data scientists and analysts to implement solutions that meet their data integration needs.
  • Streaming and Batch Workflows: Design and implement streaming workflows using PySpark Streaming or other relevant technologies. Develop batch processing workflows for large-scale data processing and analysis.
  • CI/CD Implementation: Implement and maintain continuous integration and continuous deployment (CI/CD) pipelines using Jenkins or GitHub Actions. Automate testing, code deployment, and monitoring processes to ensure the reliability of data pipelines.

Qualifications

  • Bachelor's or Master's degree in Computer Science, Information Technology, or a related field.
  • Proven experience as a Data Engineer or similar role.
  • Strong programming skills in Python and deep expertise in Apache Spark and PySpark for both batch and streaming data processing.
  • Hands-on experience developing, tuning, and troubleshooting distributed data pipelines.
  • Solid understanding of ETL tools, data modeling, database design, and data warehousing concepts.
  • Familiarity with CI/CD tools such as Jenkins or GitHub Actions.
  • Excellent problem-solving, analytical, communication, and collaboration skills.

Preferred Skills

  • Experience with Ab Initio (e.g., GDE, Co-Operating System, EME) or a strong background in enterprise ETL modernization.
  • Knowledge of cloud platforms such as AWS, Azure, or Google Cloud.
  • Experience with version control systems (e.g., Git).
  • Familiarity with containerization and orchestration tools (e.g., Docker, Kubernetes).
  • Understanding of data security and privacy best practices.

Infinitive is an Equal Opportunity Employer. All qualified applicants will receive consideration for employment without regard to race, color, religion, sex, sexual orientation, gender identity, national origin, disability, protected veteran status, or any other characteristic protected by applicable federal, state, or local law.

Vacancy posted 11 hours ago
