SPARK Data Onboarding Engineer- NJ

Photon

PySpark Data Engineer

We are seeking a skilled PySpark Data Engineer to join our team and drive the development of robust data processing and transformation solutions within our data platform. You will be responsible for designing, implementing, and maintaining PySpark-based applications to handle complex data processing tasks, ensure data quality, and integrate with diverse data sources. The ideal candidate possesses strong PySpark development skills, experience with big data technologies, and the ability to work in a fast-paced, data-driven environment.

Key Responsibilities:
  • Design, develop, and test PySpark-based applications to process, transform, and analyze large-scale datasets from various sources, including relational databases, NoSQL databases, batch files, and real-time data streams.
  • Implement efficient data transformation and aggregation using PySpark and relevant big data frameworks.
  • Develop robust error handling and exception management mechanisms to ensure data integrity and system resilience within Spark jobs.
  • Optimize PySpark jobs for performance, including partitioning, caching, and tuning of Spark configurations.

Data Analysis and Transformation:
  • Collaborate with data analysts, data scientists, and data architects to understand data processing requirements and deliver high-quality data solutions.
  • Analyze and interpret data structures, formats, and relationships to implement effective data transformations using PySpark.
  • Work with distributed datasets in Spark, ensuring optimal performance for large-scale data processing and analytics.

Data Integration and ETL:
  • Design and implement ETL (Extract, Transform, Load) processes to ingest and integrate data from various sources, ensuring consistency, accuracy, and performance.
  • Integrate PySpark applications with data sources such as SQL databases, NoSQL databases, data lakes, and streaming platforms.

Qualifications and Skills:
  • Bachelor's degree in Computer Science, Information Technology, or a related field.
  • 5+ years of hands-on experience in big data development, preferably with exposure to data-intensive applications.
  • Strong understanding of data processing principles, techniques, and best practices in a big data environment.
  • Proficiency in PySpark, Apache Spark, and related big data technologies for data processing, analysis, and integration.
  • Experience with ETL development and data pipeline orchestration tools (e.g., Apache Airflow, Luigi).
  • Strong analytical and problem-solving skills, with the ability to translate business requirements into technical solutions.
  • Excellent communication and collaboration skills to work effectively with data analysts, data architects, and other team members.

Vacancy posted 1 day ago
