Data Engineer

PGH Career Connector

Description

We are seeking a Data Engineer to support our client with data ingestion, data de-duplication, and data tagging for the migration of a large-scale data environment into Databricks. The ideal candidate will also bring hands-on expertise in end-to-end data pipeline management, including data ingestion from diverse sources, de-duplication of large-scale datasets, and data tagging to support downstream analytics, governance, and machine learning workflows.

Roles and Responsibilities (including but not limited to)
  • Design, develop, and maintain scalable data ingestion pipelines to onboard structured, semi-structured, and unstructured data from batch and streaming sources (e.g., APIs, databases, flat files, message queues) into the Azure/Databricks environment.
  • Implement de-duplication strategies across large-scale datasets using deterministic and probabilistic matching techniques to ensure data integrity and reduce redundancy within the Data Lake.
  • Develop and enforce data tagging frameworks to classify, label, and annotate datasets with appropriate metadata (e.g., sensitivity, source, domain, lineage) to support data governance, discoverability, and compliance requirements.
  • Assist with operationalizing deployments and supporting cloud services for ETL operations. This includes standardizing and automating processes and workflows, creating documentation and knowledge articles, and assisting Operations staff who have limited cloud experience.
  • Deliver written and oral presentations to high-level CIO management on the status of current efforts.
  • Possess skills and experience related to business management, systems engineering, operations research, and management engineering, typically with a specialization in a particular technology or business application.
  • Keep abreast of technological developments and industry trends.
  • Assist with deployment, configuration, and management of the Azure Cloud environment.
  • Assist with migration of existing ETL jobs into the Azure/Databricks cloud environment.
  • Share optimizations and efficiencies with the larger team and management.
  • Automate solutions to repetitive problems and tasks.

Basic Qualifications
  • Bachelor's degree and 13 years of experience. A degree from an accredited college or university in the applicable field of services is preferred. Four additional years of relevant experience are required in lieu of a college degree. If the degree is not in the applicable field, four additional years of related experience are required.
  • 3+ years of demonstrated experience designing and implementing data ingestion pipelines using tools such as Azure Data Factory, Apache Kafka, Apache NiFi, Spark Structured Streaming, or equivalent technologies.
  • 3+ years of experience applying de-duplication techniques at scale, including record linkage, fuzzy matching, and entity resolution across structured and unstructured datasets.
  • 3+ years of hands-on experience with data tagging and metadata management, including the use of tagging schemas, data catalogs (e.g., Azure Purview, Apache Atlas), and automated classification tools to support data governance and lineage tracking.
  • 3+ years of demonstrated experience working with unstructured data.
  • 2+ years of experience using Databricks or other Spark-based platforms.
  • Fluency in at least one scripting language (Python, Perl, Ruby, or equivalent).

Desired Skills
  • Integration of Git in continuous deployment and experience with DevOps monitoring tools.
  • Experience with one or more of the following products and technologies: SAS, Python, C++, Hadoop, SQL Database/Coding, Teradata, Oracle, Amazon S3, Apache Spark, Machine Learning, Natural Language Processing, and visualization tools such as Tableau and Qlik.
  • Strong skills and experience in Cloud Operations support in Azure.

Additional Provisions
  • Pass a client-mandated clearance process that includes drug screening, a criminal history check, and a credit check.
  • Candidates must be a US citizen or a permanent resident (Green Card holder).
  • No more than 6 months of travel outside the United States within the last five years. Military service is excluded from this limit; the exception does not extend to military family members.

Vacancy posted 3 days ago
Similar jobs that could be interesting for you
Based on the Data Engineer in Pittsburgh, PA vacancy
  •  ...the IT Transformation team, you will help build and scale the data foundation that powers our logistics operations and AI initiatives...  ...initiatives What We're Looking For: • 5+ years of data engineering experience • Strong SQL and Python skills • Experience... 
    Suggested

    Bridgeway

    Coraopolis, PA
    3 days ago
  •  ...RxARE future in healthcare! Location: Pittsburgh, PA (Hybrid) Classification: Exempt Status: Full-Time Reports to: Data Engineering Manager Purpose The Data Engineer creates, operates, and extends data pipelines and/or orchestration solutions built in... 
    Suggested
    Full time
    Temporary work
    Part time
    Work at office
    Local area
    Remote work
    Flexible hours

    PANTHERx Rare Pharmacy

    Pittsburgh, PA
    1 day ago
  • $75k - $85k

     ...Data Engineer Department: Data Employment Type: Full Time Location: Pittsburgh, PA Compensation: $75,000 - $85,000 / year Description About Wolfe Recognized among Pittsburgh's 2024 Top Workplaces and Fastest-Growing Companies, Wolfe has been a leader... 
    Suggested
    Full time
    Temporary work

    Wolfe

    Pittsburgh, PA
    4 days ago
  •  ...Technology Solutions. We have an opportunity for Performance Engineer for one of my clients. Here I am sharing the details below....  ...performance logs using ELK/Splunk • Strong analyzing/data mining skills using SQL. • Experience on Kafka/messaging performance... 
    Suggested
    Contract work
    Remote work

    Texas State Library and Archives Commission

    Millvale, PA
    4 days ago
  •  ...Data Engineer Location || Onsite - Warrendale, PA / Pittsburgh, PA (U.S.) This role requires core experience and expertise on - Databricks (advanced, hands-on), Python, ETL/ELT pipeline development, Spark (SQL/PySpark). Job purpose ~ The... 
    Suggested
    Temporary work

    SysMind Tech

    Pittsburgh, PA
    3 days ago
  •  ...Data Engineer Founded in 2019 and headquartered in Pittsburgh, PA, Free Market Health supports forward-thinking payers and specialty pharmacies of all sizes who need to operate in a complex and opaque market. We empower all stakeholders to optimize resources and maximize... 
    Temporary work

    Free Market Health

    Pittsburgh, PA
    3 days ago
  •  ...We're looking for an experienced Data Engineer to help design, build, and optimize modern data pipelines that power analytics, BI, and machine learning across NEP Group. In this role, you'll develop scalable ETL/ELT workflows feeding Snowflake and Databricks, collaborate... 
    Remote work
    Flexible hours

    NEP Group

    Pittsburgh, PA
    2 days ago
  •  ...Data Engineer/ETL Location: Pittsburgh, PA Contract for 12+ months Description: Data Engineer - ETL with Python, Hadoop & Snowflake Responsibilities • Organize business needs into ETL/ELT logical models and ensure data structures are designed... 
    Contract work

    Apex Informatics

    Pittsburgh, PA
    11 hours ago
  • $88k - $166.3k

     ...Location: Monroeville, Pennsylvania Job Title: Experienced Data Engineer Status: Full-time Professional Annual Salary Range: $88,000 - $166,300 *Salary commensurate with education and experience. Job Summary As a Data Engineer at BPMI, you will work... 
    Full time
    For contractors
    Work at office

    Bechtel Plant Machinery

    Monroeville, PA
    2 days ago
  •  ...Senior Data Engineer Location - Pittsburgh or Boston Why Confluence? At Confluence, we've always been driven by a commitment to innovation, precision, and partnership in the investment data space. Our global footprint now spans multiple countries, giving our employees... 
    Flexible hours

    Confluence Technologies

    Pittsburgh, PA
    3 days ago
  • $140k - $160k

     ...in the efforts to design, develop, and maintain databases and data integration (ETL) systems to support business applications and...  ...Proficiency in programming languages commonly used in data engineering, such as Python or Java Our Company: Carrington Mortgage... 
    Work experience placement
    Remote work
    Work from home

    Carrington

    Coraopolis, PA
    3 days ago
  •  ...Job Description: Senior Data Engineer (Full-Time) Location: Pittsburgh, PA (Hybrid - 3 days onsite per week) Prequel Solutions is seeking a Senior Data Engineer for a full-time, salaried opportunity with a leading financial services organization in the Pittsburgh... 
    Full time
    Contract work
    3 days per week

    Prequel Solutions

    Pittsburgh, PA
    11 hours ago
  • $55 - $60 per hour

    Genesis10 is currently seeking a Senior Data / Feature Engineer - Onsite position with a Major Financial Institution located in Pittsburgh, PA. This is a contract to hire opportunity. W2 rate: $70-75/hour We are hiring a Senior Data / Feature Engineer to build and... 
    Hourly pay
    Permanent employment
    Contract work

    Genesis10

    Pittsburgh, PA
    3 days ago
  •  ...Role: Data Engineer Location: Warrendale, PA / Pittsburgh, PA (Onsite) Job Type: Contract Role Overview We are seeking an experienced Data Engineer with strong expertise in Databricks, Python, and Spark to design and build scalable data pipelines... 
    Contract work

    SysMind Tech

    Pittsburgh, PA
    11 hours ago
  •  ...Data Engineer Contractor 5 Days Onsite – Pittsburgh, PA / Farmers Branch, TX / Miamisburg, OH / Houston, TX Key Responsibilities Gather business and technical requirements from multiple lines of business and source systems Design, develop, and support... 
    For contractors

    System One Holdings, LLC

    Pittsburgh, PA
    2 days ago
  •  ...Machine Translation Data Engineer Onsite 3 days per week in any of these locations: Seattle, NYC, Raleigh NC, Pittsburgh PA, Boston MA We are looking for a data engineer passionate about designing and developing solutions to create scalable and high-quality data... 
    3 days per week

    Infotree Global Solutions

    Pittsburgh, PA
    1 day ago
  •  ...Data Engineer - Research and Development The Pittsburgh Pirates are a storied franchise in Major League Baseball who are reinventing themselves on every level. Boldly and relentlessly pursuing excellence by: purposefully developing a player and people-centered... 

    MLB - Pittsburgh Pirates

    Pittsburgh, PA
    1 day ago
  • $106.9k - $176.5k

     ...wherever you want it to go.  Join EY and help to build a better working world. We are seeking a highly skilled Senior Consultant Data Engineer with expertise in cloud data engineering, specifically Databricks. The ideal candidate will have strong client management and... 
    Summer holiday
    Flexible hours

    EY

    Pittsburgh, PA
    11 hours ago
  •  ...Job Title: Data Engineer (Must Be US Citizen Or Green Card Holder...no OPT) Location: Pittsburgh, PA (onsite) Employment Type: Full-time, Direct Hire Pay: Commensurate with experience About the role (Must Be US Citizen Or Green Card Holder...no OPT)... 
    Full time

    Enkompas

    Pittsburgh, PA
    2 days ago
  •  ...If you are an Engineer who is passionate about renewable energy, sustainability, and being an instrumental part of the transition to clean...  ..., Emerson has an exciting opportunity for you! As a Proposals Data Support Engineer , you will be responsible for gathering, recording... 
    Temporary work
    Work at office
    Remote work
    Flexible hours

    Emerson

    Pittsburgh, PA
    5 days ago
  • $102.5k - $187.9k

     ...analytics solutions that drive customer insights and enhance marketing strategies. The ideal candidate will have a strong background in data analytics, experience with Adobe CJA, and a passion for delivering actionable insights to clients. Working in diverse,... 
    Summer holiday
    Flexible hours

    EY

    Pittsburgh, PA
    4 days ago
  • Bridgeway, located in Coraopolis, is expanding its data engineering team. As part of the IT Transformation team, you will design and build data pipelines critical for operations and AI initiatives. The ideal candidate will have over 5 years of data engineering experience... 

    Bridgeway

    Coraopolis, PA
    3 days ago
  • $77k - $202k

     ...Specialty/Competency: Data, Analytics & AI Industry/Sector: Not Applicable Time Type: Full time Travel Requirements: Up to 60% At PwC, our people in data and analytics engineering focus on leveraging advanced technologies and techniques to design and develop... 
    Full time
    H1b

    PwC

    Pittsburgh, PA
    1 day ago
  •  ...Data Communications Engineer Date: Jun 19, 2026 Location: Pittsburgh, PA, US Company: RailWorks Benefits Offering RailWorks is committed to helping our employees live better lives. We offer comprehensive benefits packages to eligible employees, including competitive... 
    Work at office

    RailWorks

    Pittsburgh, PA
    11 hours ago
  • $99k - $232k

     ...Specialty/Competency: Data, Analytics & AI Industry/Sector: Not Applicable Time Type: Full time Travel Requirements: Up to 60% At PwC, our people in data and analytics engineering focus on leveraging advanced technologies and techniques to design and develop... 
    Full time
    H1b

    PwC

    Pittsburgh, PA
    20 days ago
  • $124k - $280k

     ...Specialty/Competency: Data, Analytics & AI Industry/Sector: Not Applicable Time Type: Full time Travel Requirements: Up to 60% At PwC, our people in data and analytics engineering focus on leveraging advanced technologies and techniques to design and develop... 
    Full time
    H1b

    PwC

    Pittsburgh, PA
    11 hours ago
  • $125.5k - $230.2k

     ...want it to go.  Join EY and help to build a better working world. We are looking for a dynamic and experienced Manager of Data Engineering to lead our team in designing and implementing complex cloud analytics solutions with a strong focus on Databricks. The ideal... 
    Summer holiday
    Flexible hours

    EY

    Pittsburgh, PA
    4 days ago
  •  ...Senior Data Platform Engineer Pittsburgh, Pennsylvania, United States Company Description Govini transforms Defense Acquisition from an outdated manual process to a software-driven strategic advantage for the United States. Our flagship product, Ark, supports... 
    Full time
    Work at office

    Govini

    Pittsburgh, PA
    4 days ago
  •  ...improvements. This position manages the installation and maintenance of mechanical systems at Data Centers and operations of specialized cooling systems. Acts as an Engineering resource for the complete H5 portfolio of mission critical facilities. Plans and monitors... 
    Work at office
    Immediate start

    H5 Data Centers

    Pittsburgh, PA
    5 days ago
  •  ...DICK'S Sporting Goods is seeking a Lead Analytics Engineer in Coraopolis, PA. In this role, you will act as an SME in Analytics Engineering...  ...needs, and mentoring team members. You will drive the design of data models and BI applications, participate in Agile team activities... 

    DICK'S Sporting Goods

    Coraopolis, PA
    3 days ago
