Data Engineer (Python/PySpark/Apache Spark)
Infinitive
About Infinitive

Infinitive is a data and AI consultancy that enables its clients to modernize, monetize, and operationalize their data to create lasting and substantial value. We possess deep industry and technology expertise to drive and sustain adoption of new capabilities. We match our people and personalities to our clients' culture while bringing the right mix of talent and skills to enable high return on investment.

Role Overview

We are seeking a highly skilled and motivated Data Engineer to join our dynamic team. As a Data Engineer, you will play a crucial role in designing, developing, and maintaining our clients' data infrastructure. Your expertise in Apache Spark, Python, PySpark, ETL processes, and CI/CD (Jenkins or GitHub Actions), together with your experience in both streaming and batch workflows, will be essential in ensuring the efficient flow and processing of data to support our clients.

Responsibilities

- Data Architecture and Design: Collaborate with cross-functional teams to understand data requirements and design robust data architecture solutions. Develop data models and schema designs to optimize data storage and retrieval.
- ETL Development: Implement robust ETL processes to extract, transform, and load data from various sources. Ensure data quality, integrity, and consistency throughout the ETL pipeline.
- Distributed Computing & Spark Development: Use your expertise in Apache Spark, Python, and PySpark to develop efficient, large-scale data processing and analysis scripts. Optimize code for performance, memory management, and scalability, keeping up to date with the latest industry best practices.
- Data Integration: Integrate data from different systems and sources to provide a unified view for analytical purposes. Collaborate with data scientists and analysts to implement solutions that meet their data integration needs.
- Streaming and Batch Workflows: Design and implement streaming workflows using PySpark Streaming or other relevant technologies. Develop batch processing workflows for large-scale data processing and analysis.
- CI/CD Implementation: Implement and maintain continuous integration and continuous deployment (CI/CD) pipelines using Jenkins or GitHub Actions. Automate testing, code deployment, and monitoring processes to ensure the reliability of data pipelines.

Qualifications

- Bachelor's or Master's degree in Computer Science, Information Technology, or a related field.
- Proven experience as a Data Engineer or in a similar role.
- Strong programming skills in Python and deep expertise in Apache Spark and PySpark for both batch and streaming data processing.
- Hands-on experience developing, tuning, and troubleshooting distributed data pipelines.
- Solid understanding of ETL tools, data modeling, database design, and data warehousing concepts.
- Familiarity with CI/CD tools such as Jenkins or GitHub Actions.
- Excellent problem-solving, analytical, communication, and collaboration skills.

Preferred Skills

- Experience with Ab Initio (e.g., GDE, Co-Operating System, EME) or a strong background in enterprise ETL modernization.
- Knowledge of cloud platforms such as AWS, Azure, or Google Cloud.
- Experience with version control systems (e.g., Git).
- Familiarity with containerization and orchestration tools (e.g., Docker, Kubernetes).
- Understanding of data security and privacy best practices.

Infinitive is an Equal Opportunity Employer. All qualified applicants will receive consideration for employment without regard to race, color, religion, sex, sexual orientation, gender identity, national origin, disability, protected veteran status, or any other characteristic protected by applicable federal, state, or local law.

