Senior PySpark Data Engineer
$125k - $140kTata Consultancy Services
Roles & Responsibilities
Job Title: Data Engineer
• Data Pipeline Development & Maintenance: Design, build, and maintain highly scalable and efficient ETL/ELT data pipelines utilizing PySpark and Spark SQL for complex data transformations.
• Cloud Data Infrastructure Management: Deploy, manage, and scale critical data infrastructure components on leading cloud platforms such as Amazon Web Services (AWS) (e.g., EMR, Glue), Microsoft Azure (e.g., Databricks, Synapse), or Google Cloud Platform (GCP).
• Data Warehousing & Storage Optimization: Strategically manage data layout, partitioning, and indexing within Apache Hive and various cloud data lake solutions to optimize performance and accessibility.
• Performance Tuning & Optimization: Proactively identify and resolve performance bottlenecks in Spark jobs, leveraging Spark UI for in-depth analysis, effectively managing data skewness, and optimizing memory utilization.
• Diverse Data Integration: Develop robust solutions for ingesting high-volume and diverse datasets from both structured relational databases and unstructured flat files into our data ecosystem.
• Automated Workflow Orchestration: Implement and manage automated data workflows using industry-standard scheduling tools like Apache Airflow or platform-native schedulers, ensuring timely and reliable data delivery.
• Strategic Collaboration: Partner closely with data scientists, business analysts, and cross-functional enterprise teams to translate complex business requirements into technically sound and efficient data solutions. Qualifications:
• Big Data Frameworks Expertise: Demonstrated high proficiency in Apache Spark architecture, including a deep understanding of drivers, executors, and Directed Acyclic Graphs (DAGs).
• Advanced Programming: Exceptional coding skills in Python and extensive experience with the PySpark API for developing intricate data transformations and processing logic.
• Querying & Schema Management: Strong command of HiveQL and ANSI SQL, coupled with expertise in data partitioning techniques and effective schema definition.
• Optimized Storage Formats: In-depth understanding and practical experience with optimized big data storage file formats such as Parquet, ORC, and Avro.
• Cloud Ecosystem Development: Hands-on development experience utilizing cloud-native big data utilities (e.g., AWS EMR, Azure Databricks) with in major cloud platforms.
• Data Warehousing Fundamentals: Solid foundation in Dimensional Data Modeling, including Star and Snowflake schemas, and practical experience with Data Lakes concepts and implementation.
Preferred Qualifications
• CI/CD & DevOps Automation: Experience with Continuous Integration/Continuous Deployment (CI/CD) practices and automation tools like Git, Jenkins, or Ansible.
• NoSQL Database Integration: Exposure to and experience with NoSQL databases such as HBase, Cassandra, or MongoDB.
• Professional Cloud Certifications: Relevant professional cloud certifications (e.g., AWS Certified Data Engineer, Microsoft Certified: Azure Data Engineer Associate) are highly valued Salary Range: $125,000 to $140,000 per year
Vacancy posted 5 hours ago
Similar jobs that could be interesting for youBased on the Senior PySpark Data Engineer in Irving, TX vacancy
- ...Covetus, located in Irving, Texas, is seeking a Data Engineer who will be responsible for designing, developing, and maintaining data solutions in a Big Data environment using predominantly PySpark/Python. The role involves creating data pipelines, ensuring data quality...Senior
- ...Tata Consultancy Services is seeking a Senior PySpark Data Engineer in Irving, TX. In this role, you will design and optimize data pipelines, manage cloud infrastructure, and collaborate with teams to deliver robust data solutions. The ideal candidate will have 8+ years...Senior
- ...must hold AWS Certifications. Responsibilities include designing and deploying scalable data solutions on AWS, as well as building and maintaining ETL/ELT data pipelines with PySpark. The role requires expertise in data warehousing solutions, focusing on performance and...SeniorPermanent employmentLocal area
- ...As a Data Engineer, you will be responsible for designing, developing, and maintaining data solutions for data generation, collection, and processing in Big Data environment using predominantly PySpark/Python. Your typical day will involve creating data pipelines, ensuring...SeniorWork experience placement
- ...Role: Sr. PySpark Data Engineer - Fulltime Location: Irving, TX Job Description: We are seeking a skilled PySpark Data Engineer to join our team. The ideal candidate will have expertise in big data processing, ETL pipeline development, and...SeniorFull time
- ...is seeking an experienced developer with over 12 years in the field. The ideal candidate must possess strong skills in Python and PySpark, with experience in GCP-GCS and Big Query. Excellent communication abilities are required to connect with both internal teams and external...Senior
- ...Techaxis, Inc is looking for a highly skilled AWS Certified Engineer to design and optimize data solutions in the AWS ecosystem. You will build and... ...robust data pipelines, focusing on big data processing using PySpark and modern data warehousing concepts. The ideal...
$125k - $140k
...Tata Consultancy Services is searching for an AWS Certified Data Engineer in Irving, Texas. The ideal candidate will design and optimize data solutions in the AWS ecosystem, primarily using PySpark for big data processing. This role involves creating and managing efficient...$157k - $184.4k
...Positions Available COMPANY: McKesson Corporation POSITION: Senior Data Engineer LOCATION: 6555 State Highway 161, Irving, TX 75039 JOB... ...data workflows and integrations across systems. PySpark: Distributed Data Processing framework for transforming and...SeniorRemote work- ...NTT DATA North America is seeking a Data Engineer to join their team in Irving, Texas. The role involves designing and building robust data pipelines and... ...of experience and proficiency in technologies such as PySpark, SQL, and familiarity with cloud environments. The position...SeniorHourly pay
- ...Citibank (Switzerland) AG is looking for a Data Engineer in Irving, Texas. The role involves developing and maintaining scalable ETL/ELT pipelines using technologies such as PySpark, Spark SQL, and Databricks. Candidates should have 4–7 years of experience in data engineering...Senior
- ...We are seeking a Senior Data Engineer to join a multidisciplinary team focused on building scalable data platforms and solutions that enhance... ...years in data modeling. Strong proficiency in SQL, Python, and PySpark. Experience with cloud platforms such as AWS or Azure. Deep...Senior
- ...Data Engineer We are seeking a skilled and detail-oriented Data Engineer to join our growing data team. The ideal candidate will be responsible... .... Strong programming skills in languages such as Python, PySpark, SQL etc. Experience in Build and optimize ETL workflows...SeniorContract work
$65 - $75 per hour
...A client of Innova Solutions is looking for a Sr Data Engineer. Title: Senior Data Engineer Position type: Fulltime Contract Duration:... ...Apache airflow Experience in Cloudera Expertise in PySpark Experience in Google Cloud Platform Experience with...SeniorHourly payFull timeContract workTemporary workWork experience placementImmediate startWorldwideFlexible hours- ...tomorrow\'s health today, we want to hear from you. Job title: Senior/Lead Data Engineer Current Need: The Senior / Lead Data Engineer will be part... ...experience with Snowflake, Databricks, Azure Data Factory, PySpark, Analytical SQL, Splunk, Alation, R, Python; SAP and IBM...SeniorWork at office
$102.4k - $179k
...and enhance a modern, AI-ready data enablement platform that... ...data products, and reusable engineering patterns across the enterprise... ...platform use cases. As a hands‑on senior individual contributor, you... ...solutions using Azure Databricks, PySpark, SQL, Delta Lake, Azure Data...Senior$100k - $125k
...development of Quantexa-based solutions. Key responsibilities include hands-on development in Scala and PySpark, configuring Quantexa Compound Keys, and designing data pipelines. A Bachelor's degree in Computer Science is required, along with strong expertise in data...- ...LTM is seeking an Azure Data Engineer for a full-time role located in Texas. The ideal candidate will have 6 to 10 years of experience and will be responsible for designing and maintaining scalable data solutions using Python and Azure Data Factory. Responsibilities include...SeniorFull timeRelocation
- ...REALIGN LLC is seeking an experienced PySpark Developer in Irving, TX. The ideal candidate will have 5-10 years in data engineering, with strong skills in the Apache Spark framework and PySpark. Responsibilities include designing and developing scalable data pipelines,...Full time
- ...Infosys is looking for a candidate to join their Data and Analytics (DNA) unit, transforming data into actionable insights. You will contribute to software solutions, facilitate discussions, and enhance applications by integrating new features. The ideal applicant will...
- ...Healthcare IT Leaders is seeking a Data Center Engineer in Irving, TX for a 100% on-site position. This role involves leading complex infrastructure initiatives and providing technical oversight within the Data Center Services organization. The ideal candidate will have...Senior
- ...GM Financial is looking for a Data Engineer to handle cloud-based data engineering tasks focusing on large-scale data set processing. Your role will include coding, testing, and deploying automation, while collaborating with various teams to enhance data solutions. We...Senior
- ...Role description Job Title: Big DATA Developer with Python Work Location : Tampa,FL Job Description: Seeking a Senior Python Data Engineer with 3 to 5 years of experience skilled in Apache Spark and Python to design and implement scalable data processing...Senior
- ...VeeRteq Solutions Inc. is seeking a Senior Data Engineer in Irving, TX, for a hybrid role requiring local candidates. The ideal candidate will have over 10 years of experience, with strong expertise in Databricks, Python, and SQL. Responsibilities include building and...SeniorLocal area
- ...Peyton Resource Group in Irving, Texas is seeking a Senior Data Engineer to design and maintain scalable data infrastructure that supports analytics and decision-making. The ideal candidate should possess a Bachelor’s or Master’s degree in Computer Science and 5+ years...Senior
- ...email the resume to ****@*****.*** Role: Senior Data Engineer Location: Dallas, TX or Mountain View, CA Duration:... ...tools (Airflow, Composer, Oozie) with deep expertise in SQL, PySpark, or scala. ~5+ years leading data engineering...SeniorHourly payLong term contract
- ...We are seeking a Lead Snowflake Engineer to design, build, and operate robust data pipeline infrastructure that ingests from diverse source systems, normalizes data into consumption-ready models in Snowflake, and delivers clean, reliable datasets for Power BI reporting...Senior
- ...Senior Data Ops Engineer Location: Charlotte, NC; Irving, TX; Chandler, AZ; Columbus, OH; Des Moines, IA; or Minneapolis, MN. Rate: DOE Term: 12+ Months This role is an opportunity to be part of a high-performing team passionate about data, focusing on building...Senior
- ...Overview We are seeking a highly skilled Data Engineer to design, build, and optimize... ...efficiency. Required Qualifications Experienced Senior Data Engineer with over 3 years of Databricks... ...with Databricks, including: Spark (PySpark or Spark SQL) Delta Lake Data Lakehouse...SeniorFull timeTemporary workShift workDay shift
- ...Role: Senior Data Engineer Remote Work: INDIA Location: Hyderabad / Noida, INDIA *Only Consultants local to INDIA are eligible. *No visa Sponsorship... ...develop, and maintain scalable data pipelines using Python, PySpark, and other modern programming languages to support both...SeniorLocal areaRemote workVisa sponsorship
Do you want to receive more vacancies?
Subscribe and receive similar vacancies to Senior PySpark Data Engineer. Be the first to apply!
Related searches
- remote data engineer Irving, TX
- entry level big data engineer Irving, TX
- big data devops engineer Irving, TX
- data engineer Irving, TX
- data engineer contract Irving, TX
- software data engineer Irving, TX
- big data cloud engineer Irving, TX
- junior big data engineer Irving, TX
- sr information security engineer Irving, TX
- hadoop big data developer Irving, TX


