Data Engineer - Databricks SME
PLANIT Group
Senior Data Engineer
We are seeking a Senior Data Engineer to support our client with data ingestion, data de-duplication, and data tagging for the migration of a large-scale data environment into Databricks. The ideal candidate will bring hands-on expertise in end-to-end data pipeline management, including data ingestion from diverse sources, de-duplication of large-scale datasets, and data tagging to support downstream analytics, governance, and machine learning workflows.
Roles and Responsibilities (including but not limited to):
- Design, develop, and maintain scalable data ingestion pipelines to onboard structured, semi-structured, and unstructured data from batch and streaming sources (e.g., APIs, databases, flat files, message queues) into the Azure/Databricks environment.
- Implement de-duplication strategies across large-scale datasets using deterministic and probabilistic matching techniques to ensure data integrity and reduce redundancy within the Data Lake.
- Develop and enforce data tagging frameworks to classify, label, and annotate datasets with appropriate metadata (e.g., sensitivity, source, domain, lineage) to support data governance, discoverability, and compliance requirements.
- Assist with operationalizing the deployment and support of cloud services for ETL operations. This includes standardizing and automating processes and workflows, creating documentation and knowledge articles, and assisting operations staff who have limited cloud experience.
- Deliver written and oral presentations to high-level CIO management on the status of current efforts.
- Possesses skills and experience related to business management, systems engineering, operations research, and management engineering. Typically has specialization in a particular technology or business application. Keeps abreast of technological developments and industry trends.
- Assist with deployment, configuration, and management of the Azure cloud environment.
- Assist with migrating existing ETL jobs into the Azure/Databricks cloud environment.
- Share optimizations and efficiencies with the larger team and management.
- Automate solutions to repetitive problems and tasks.
Basic Qualifications
- Must be eligible for a Position of Public Trust, including U.S. citizenship or permanent residency, five years of U.S. residency, and no more than six months of international travel in the past five years (excluding travel for U.S.-based work).
- Bachelor's degree and 13 years of experience. A degree from an accredited college or university in the applicable field of services is preferred. Four additional years of relevant experience are required in lieu of a college degree. If the degree is not in the applicable field, four additional years of related experience are required.
- 5+ years of demonstrated experience designing and implementing data ingestion pipelines using tools such as Azure Data Factory, Apache Kafka, Apache NiFi, Spark Structured Streaming, or equivalent technologies.
- 5+ years of experience applying de-duplication techniques at scale, including record linkage, fuzzy matching, and entity resolution across structured and unstructured datasets.
- 5+ years of hands-on experience with data tagging and metadata management, including the use of tagging schemas, data catalogs (e.g., Azure Purview, Apache Atlas), and automated classification tools to support data governance and lineage tracking.
- 5+ years of demonstrated experience working with unstructured data.
- 2+ years of experience in using Databricks or other Spark-based platforms.
- Fluency in at least one scripting language (Python, Perl, Ruby, or equivalent).
Desired Skills:
- Experience integrating Git into continuous deployment and working with DevOps monitoring tools.
- Experience with one or more of the following products and technologies: SAS, Python, C++, Hadoop, SQL Database/Coding, Teradata, Oracle, Amazon S3, Apache Spark, Machine Learning, Natural Language Processing, and visualization tools such as Tableau, Strategy and QLIK.
- Strong skills and experience in Cloud Operations support in Azure.


