Sign up to access all features of our service.
  • Job search
  • Favorites
  • Create a CV
    New
  • Salaries
  • Subscriptions

Data Engineer - Databricks SME.

PLANIT Group

Senior Data Engineer

We are seeking a Senior Data Engineer to support our client with data ingestion, data deduplication and data tagging for migration of a large-scale data environment into Databricks. The ideal candidate will also bring hands-on expertise in end-to-end data pipeline management, including data ingestion from diverse sources, de-duplication of large-scale datasets, and data tagging to support downstream analytics, governance, and machine learning workflows.

Roles and Responsibilities (including but not limited to):

  • Design, develop, and maintain scalable data ingestion pipelines to onboard structured, semi-structured, and unstructured data from batch and streaming sources (e.g., APIs, databases, flat files, message queues) into the Azure/Databricks environment.
  • Implement de-duplication strategies across large-scale datasets using deterministic and probabilistic matching techniques to ensure data integrity and reduce redundancy within the Data Lake.
  • Develop and enforce data tagging frameworks to classify, label, and annotate datasets with appropriate metadata (e.g., sensitivity, source, domain, lineage) to support data governance, discoverability, and compliance requirements.
  • Assist with Operationalizing deployments and support of Cloud services for ETL Operations. This will include standardizing and automating processes and workflows, creating documentation/knowledge articles, and overall assisting Operations staff who have limited experience in Cloud.
  • Written and oral presentations to high-level CIO management on status of current efforts.
  • Possesses skills and experience related to business management, systems engineering, operations research, and management engineering. Typically has specialization in a particular technology or business application. Keeps abreast of technological developments and industry trends.
  • Assist with deployment, configuration, and management of Azure Cloud environment.
  • Assist with migration efforts of existing ETL jobs into Azure/Databricks cloud environment.
  • Ability to share optimization and efficiencies with the larger team and management.
  • Ability to automate solutions to repetitive problems/tasks.

Basic Qualifications

  • Must be eligible for a Position of Public Trust, including U.S. citizenship or permanent residency, five years of U.S. residency, and no more than six months of international travel in the past five years (excluding travel for U.S.-based work).
  • Bachelor's degree and 13 years of experience. A degree from an accredited College/University in the applicable field of services is preferred. Four additional years of relevant experience in lieu of a college degree is required. If Degree is not in the applicable field, then four additional years of related experience is required.
  • 5+ years demonstrated experience designing and implementing data ingestion pipelines using tools such as Azure Data Factory, Apache Kafka, Apache NiFi, Spark Structured Streaming, or equivalent technologies.
  • 5+ years of experience applying de-duplication techniques at scale, including record linkage, fuzzy matching, and entity resolution across structured and unstructured datasets.
  • 5+ Hands-on experience with data tagging and metadata management, including the use of tagging schemas, data catalogs (e.g., Azure Purview, Apache Atlas), and automated classification tools to support data governance and lineage tracking.
  • 5 + Demonstrated experience working with unstructured data.
  • 2+ years of experience in using Databricks or other Spark-based platforms.
  • Fluency in at least one scripting language (Python, Perl, Ruby, or equivalent).

Desired Skills:

  • Integration of Git in continuous deployment and experience with DevOps monitoring tools.
  • Experience with one or more of the following products and technologies: SAS, Python, C++, Hadoop, SQL Database/Coding, Teradata, Oracle, Amazon S3, Apache Spark, Machine Learning, Natural Language Processing, and visualization tools such as Tableau, Strategy and QLIK.
  • Strong skills and experience in Cloud Operations support in Azure.
Vacancy posted 2 days ago
Similar jobs that could be interesting for youBased on the Data Engineer - Databricks SME. in Raleigh, NC vacancy
  •  ...Designs, develops, and optimizes data integration processes, ETL...  ...rotation and providing SME assistance as needed. Leads the...  ...Factory, Fabric, Snowflake, or Databricks) to lead and optimize data integration...  ...commonly used in data engineering, such as Python or Java. What... 
    Databricks
    Work experience placement

    Carrington

    Raleigh, NC
    1 day ago
  •  ...in Raleigh, NC is seeking an experienced Cloud Architect (Data Lake/Data Bricks) SME. The role, hybrid with 4 days onsite and 1 day remote, requires...  ...IT architecture, specifically in Azure environments and Databricks. Responsibilities include overseeing implementations,... 
    Databricks
    Remote work

    Robotics Prcocess Automation, LLC

    Raleigh, NC
    1 day ago
  •  ...Job Title: Sr. Data Engineer Skills: Azure Data Factory, Databricks, Snowflake, Azure Devops Experience: 10+ years Location: Raleigh, NC (Hybrid) Duration: Fulltime We at Coforge are hiring a Sr. Data Engineer with following skillset : Solid 10... 
    Databricks
    Full time

    Coforge

    Raleigh, NC
    4 days ago
  • Job Title Cloud Architect (Data Lake/Data Bricks) SME Location Raleigh, NC / Hybrid (4 Days Onsite...  ...related to business management, systems engineering, operations research, and management...  ...of existing ETL jobs into Azure/Databricks cloud environment Assist ETL staff... 
    Databricks
    Hourly pay
    Permanent employment
    Contract work
    Local area
    Remote work

    Robotics Prcocess Automation, LLC

    Raleigh, NC
    1 day ago
  • A leading technology company is seeking a Senior Data Engineer to join their remote team. This role involves designing and maintaining scalable ETL/ELT pipelines with Databricks on AWS, requiring 7+ years of Data Engineering experience and proficiency in Python and SQL... 
    Databricks
    Full time
    Remote work
    Flexible hours

    Lumenalta

    Raleigh, NC
    10 days ago
  • $100k - $140k

    Tata Consultancy Services is hiring a hands-on Databricks Data Engineer in Raleigh, North Carolina. The role involves designing, developing, and optimizing scalable ETL/ELT data pipelines using Databricks, Apache Spark, and SQL. Candidates should have 10+ years of data... 
    Databricks

    Tata Consultancy Services

    Raleigh, NC
    4 days ago
  • $60k - $110k

     ...Join to apply for the Data Engineer - Databricks (Remote) role at LumenaltaBase pay range$60,000 - $110,000 per yearAbout the RoleWe work with global enterprises to design and build data platforms that power digital products used by millions of users. Our projects involve... 
    Databricks
    Full time
    Remote work
    Monday to Friday

    Lumenalta

    Raleigh, NC
    8 hours ago
  • A regional banking institution is looking for a Data Engineer to design and maintain ETL pipelines on a Databricks platform. The ideal candidate should have a Bachelor’s degree in Computer Science and over 5 years of experience in data engineering. Key responsibilities... 
    Databricks

    TowneBank

    Raleigh, NC
    4 days ago
  • 慨正橡扯 in Raleigh, NC is seeking a Data Engineer III to design and optimize data platforms, focusing on scalable solutions. This role involves managing Databricks, Spark, and Redshift while ensuring data quality and efficiency. Ideal candidates will have strong experience... 
    Databricks
    Flexible hours

    慨正橡扯

    Raleigh, NC
    3 days ago
  •  ...Senior Data Engineer Fully Remote-United States Job Type Full-time Description Overview Tanaq Technical Services (TTS...  ...Redshift, Data Factory ETL/ELT frameworks: Airflow, Spark, Databricks Data modeling, schema design, and optimization for AI workloads... 
    Databricks
    Full time
    Contract work
    Local area
    Remote work

    St. George Tanaq Corporation

    Raleigh, NC
    1 day ago
  •  ...global enterprises to design and scale data platforms that power products used by millions...  ...Role as a Tech LeadAs a Tech Lead, Data Engineering, you will own the technical direction of...  ..., production-grade data pipelines using Databricks, PySpark, AWS, and SQLLead by example... 
    Databricks
    Full time
    Remote work
    Flexible hours

    Lumenalta

    Raleigh, NC
    8 hours ago
  • $142.3k - $195.7k

    Humana Inc in Raleigh, North Carolina is seeking a Lead Data Engineer to drive analytics and improve data accessibility. The position requires expertise in Databricks, SQL, and building scalable data architectures. The successful candidate will enhance operational efficiency... 
    Databricks
    Remote job

    Humana Inc

    Raleigh, NC
    3 days ago
  • $150k - $190k

     ...for our internal users, customers and partners. The Senior Data Engineer will be responsible for the analysis, design, and development...  ...or equivalent experience Hands-on experience with Databricks Hands-on experience with PowerBI report development Hands... 
    Databricks
    Work experience placement
    Worldwide
    Flexible hours
    Shift work

    SHI GmbH

    Raleigh, NC
    3 days ago
  • $125k - $155k

     ...TEKsystems c/o Allegis Group is seeking a Data Platform Engineer to support and evolve our modern data environment as we transition into Microsoft...  ...in data engineering. Candidates with experience in Databricks, Snowflake, or similar platforms are encouraged to apply. Responsibilities... 
    Databricks
    Remote work

    TEKsystems c/o Allegis Group

    Raleigh, NC
    3 days ago
  • $175.5k - $180k

    Tech & AI Senior Data Engineer I - QuantumBlack, AI by McKinsey Job ID: 109245 Atlanta Boston Chicago New Jersey New...  ...workflows using modern frameworks (Spark, LangChain, Databricks, Dask, Airflow, Dagster, Kedro, etc.) ~ Ability to lead the... 
    Databricks
    Apprenticeship
    Work at office
    Local area
    Easy work

    McKinsey & Company

    Raleigh, NC
    1 day ago
  •  ...Overview We are seeking a Senior Analytics Engineer to join a growing Analytics Delivery...  ...technical, business-facing role centered on data modeling, transformation, and business...  ...Nice-to-Have Experience Snowflake Databricks or Spark Looker or LookML Semantic layer... 
    Databricks

    Harnham

    Raleigh, NC
    4 days ago
  • $77k - $202k

     ...Specialty/Competency: Data, Analytics & AI Industry/Sector: Not Applicable Time...  ...PwC, our people in data and analytics engineering focus on leveraging advanced technologies...  ...Associate, Snowflake Core, Snowflake Databricks Data Engineer Associate] is a plus - Designing... 
    Databricks
    Full time
    H1b

    PwC

    Raleigh, NC
    1 day ago
  • hackajob, partnering with LexisNexis, is seeking a Data Engineer III in Raleigh, NC. This role involves designing and maintaining large-...  ...systems. The ideal candidate will have extensive experience with Databricks and AWS services, and will play a critical role in shaping... 
    Databricks

    hackajob

    Raleigh, NC
    4 days ago
  • $185k

    The Role I'm Scott Roberts, Senior Manager, Engineering at Teamworks. I lead the Data Platform team, and we're building the foundation that brings together...  ..., or Hudi) and modern processing engines (Spark, Databricks, Trino, or Snowflake) Deep AWS experience (S3, IAM, Glue... 
    Databricks
    Worldwide

    Teamworks

    Raleigh, NC
    4 days ago
  •  ...Azure Data Engineer Location: Raleigh, NC (5 Days Work from office) Duration: 12+ months Rate: DOE Expert level skills writing and...  ...secure data processing pipelines using Azure Data Factory, Azure Databricks, and other Azure services Managing and optimizing data... 
    Databricks
    Work at office

    Georgia IT Inc

    Raleigh, NC
    2 days ago
  • $65k

    Position Overview We are seeking a motivated Data Engineer to join our Data Engineering team. The ideal candidate will have exposure in...  ...data lake and data warehouse architectures (e.g., Snowflake, Databricks, Big query etc.) Knowledge of containerization and CI/CD pipelines... 
    Databricks
    Hourly pay
    Temporary work
    Work at office
    Relocation

    Cognizant

    Raleigh, NC
    4 days ago
  • VA1‑3 Commercial Pl #1500‑Norf, 3 Commercial Pl, 15th Floor, Norfolk, VA 23510, USA The Data Engineer will design, build, and maintain batch ETL pipelines on a modern Databricks Lakehouse platform, delivering high-quality data solutions that support critical banking functions... 
    Databricks
    Work experience placement

    TowneBank

    Raleigh, NC
    5 days ago
  • $91.7k - $163.7k

     ...people with the care, pharmacy benefits, data and resources they need to feel their...  ...consumers. Optum Insight Technology and Engineering is a critical function in Optum Insight...  ...Engineer to build data products on Azure and Databricks. We are looking for someone who can take... 
    Databricks
    Minimum wage
    Full time
    Work experience placement
    Work at office
    Local area
    Remote work

    Optum

    Raleigh, NC
    2 days ago
  • $117.6k - $161.7k

     ...community This role leads the architecture, engineering, and operationalization of the centralized Finance reporting data platform supporting CenterWell Finance...  ...processes. We are looking for deep expertise in Databricks lakehouse engineering, medallion architecture... 
    Databricks
    Bi-weekly pay
    Full time
    Temporary work
    Apprenticeship
    Work at office
    Remote work
    Work from home
    Home office

    Humana

    Raleigh, NC
    2 days ago
  • $71.6k - $119.4k

     ...prosperity in society. About the Role We are looking for a Data Engineer III to join our Data Engineering team at LexisNexis. This role...  ...In this role, you will provide technical leadership across Databricks development, Redshift administration, engineering analytics,... 
    Databricks
    Temporary work
    Local area
    Immediate start
    Flexible hours

    慨正橡扯

    Raleigh, NC
    3 days ago
  • Idexcel is looking for a Senior Data Engineer in Raleigh, North Carolina, to support data ingestion, de-duplication, and tagging for a large-scale data migration into Databricks. The ideal candidate should have 5+ years of relevant experience in data pipeline management... 
    Databricks

    Idexcel

    Raleigh, NC
    4 days ago
  • $124k - $280k

     ...Specialty/Competency: Data, Analytics & AI Industry/Sector: Not Applicable Time...  ...PwC, our people in data and analytics engineering focus on leveraging advanced technologies...  ..., Snowflake Core, Snowflake Architect, Databricks Data Engineer Associate] is a plus - Designing... 
    Databricks
    Full time
    H1b

    PwC

    Raleigh, NC
    4 days ago
  • $125.5k - $230.2k

     ...help to build a better working world. Technology – Data and Decision Science – Data Engineering – Manager We are looking for a dynamic and...  ...complex cloud analytics solutions with a strong focus on Databricks. The ideal candidate will possess deep technical expertise... 
    Databricks
    Summer holiday
    Flexible hours

    EY

    Raleigh, NC
    3 days ago
  •  ...technology solutions provider is seeking a Principal Software Engineer - Data Platform Engineering in Raleigh, North Carolina. This role...  ...and developing micro-services using Python, Apache Spark, and Databricks. The ideal candidate will have at least 10 years of software... 
    Databricks

    Compunnel, Inc.

    Raleigh, NC
    2 days ago
  • $125.5k - $230.2k

    A global consulting firm is seeking a Manager of Data Engineering to lead the design and implementation of cloud analytics solutions. The ideal candidate will have experience in Databricks, strong data architecture skills, and a proven ability to engage clients effectively... 
    Databricks

    Ernst & Young Oman

    Raleigh, NC
    4 days ago

Do you want to receive more vacancies?

Subscribe and receive similar vacancies to Data Engineer - Databricks SME.. Be the first to apply!