Sign up to access all features of our service.
  • Job search
  • Favorites
  • Create a CV
    New
  • Salaries
  • Subscriptions

Data Engineer

Sparibis

Location: 100% Remote

Years’ Experience: 5+ years Professional Experience

Education: Bachelor’s Degree in IT related field

Clearance: Applicants must be able to obtain and maintain a secret security clearance. United States Citizenship is required as part of the eligibility criteria to be able to obtain this type of security clearance.

Required Certifications:

  • CompTIA Security +

Key Skills:

  • 5+ years of IT experience focusing on enterprise data architecture and management to include data flow charts, diagrams, and other technical documentation.
  • Experience with Databricks, Structured Streaming, Delta Lake concepts, and Delta Live Tables required.
  • Python development experience required.
  • Experience with ETL and ELT tools such as SSIS, Pentaho, and/or Data Migration Services, and the ability to incorporate Python as required.
  • Advanced level SQL experience (Joins, Aggregation, Windowing functions, Common Table Expressions, RDBMS schema design, Postgres performance optimization).
  • Proficiency using Git for version control, including repository management, branching, merging, and pull requests.
  • Active CompTIA Security+ certification preferred. If selected, must be able to obtain a CompTIA Security+ certification prior to beginning supporting the program.

Responsibilities

  • Plan, create, and maintain data architectures, ensuring alignment with business requirements.
  • Obtain data, formulate dataset processes, and store optimized data.
  • Identify problems and inefficiencies and apply solutions.
  • Determine tasks where manual participation can be eliminated with automation.
  • Identify and optimize data bottlenecks, leveraging automation where possible.
  • Create and manage data lifecycle policies (retention, backups/restore, etc).
  • In-depth knowledge for creating, maintaining, and managing ETL/ELT pipelines.
  • Create, maintain, and manage data transformations.
  • Maintain/update documentation.
  • Create, maintain, and manage data pipeline schedules.
  • Monitor data pipelines.
  • Create, maintain, and manage data quality gates (Great Expectations) to ensure high data quality.
  • Support AI/ML teams with optimizing feature engineering code.
  • Expertise in Spark/Python/Databricks, Data Lake and SQL.
  • Create, maintain, and manage Spark Structured Steaming jobs, including using the newer Delta Live Tables and/or DBT.
  • Research existing data in the data lake to determine best sources for data.
  • Create, manage, and maintain ksqlDB and Kafka Streams queries/code
  • Data driven testing for data quality.
  • Maintain and update Python-based data processing scripts executed on AWS Lambdas.
  • Unit tests for all the Spark, Python data processing and Lambda codes.
  • Maintain PCIS Reporting Database data lake with optimizations and maintenance (performance tuning, etc).
  • Streamlining data processing experience including formalizing concepts of how to handle lake data, defining windows, and how window definitions impact data freshness.

Qualifications

  • 5+ years of IT experience focusing on enterprise data architecture and management.
  • Must have an active Secret security clearance.
  • Bachelor degree required.
  • CompTIA Security+ certification preferred. If selected, must be able to obtain a CompTIA Security+ certification prior to begin supporting the program.
  • Experience in Conceptual/Logical/Physical Data Modeling & expertise in Relational and Dimensional Data Modeling.
  • Experience with Databricks and Python Development, Structured Streaming, Delta Lake concepts, and Delta Live Tables required.
    • Additional experience with Spark, Spark SQL, Spark DataFrames and DataSets, and PySpark.
    • Data Lake concepts such as time travel and schema evolution and optimization.
    • Structured Streaming and Delta Live Tables with Databricks a bonus.
  • Knowledge of Python (Python 3.X) for CI/CD pipelines required.
    • Familiarity with Pytest and Unittest a bonus.
  • Experience leading and architecting enterprise-wide initiatives specifically system integration, data migration, transformation, data warehouse build, data mart build, and data lakes implementation / support.
    • Advanced level understanding of streaming data pipelines and how they differ from batch systems.
    • Formalize concepts of how to handle late data, defining windows, and data freshness.
    • Advanced understanding of ETL and ELT and ETL/ELT tools such as SSIS, Pentaho, Data Migration Service etc.
    • Understanding of concepts and implementation strategies for different incremental data loads such as tumbling window, sliding window, high watermark, etc.
    • Familiarity and/or expertise with Great Expectations or other data quality/data validation frameworks a bonus.
    • Understanding of streaming data pipelines and batch systems.
    • Familiarity with concepts such as late data, defining windows, and how window definitions impact data freshness.
  • Advanced level SQL experience (Joins, Aggregation, Windowing functions, Common Table Expressions, RDBMS schema design, Postgres performance optimization).
  • Indexing and partitioning strategy experience.
  • Debug, troubleshoot, design and implement solutions to complex technical issues.
  • Experience with large-scale, high-performance enterprise big data application deployment and solution.
  • Understanding how to create DAGs to define workflows.
  • Familiarity with CI/CD pipelines, containerization, and pipeline orchestration tools such as Airflow, Prefect, etc a bonus but not required.
  • Architecture experience in AWS environment a bonus.
    • Familiarity working with Kinesis and/or Lambda specifically with how to push and pull data, how to use AWS tools to view data in Kinesis streams, and for processing massive data at scale a bonus.
    • Experience with Docker, Jenkins, and CloudWatch.
    • Ability to write and maintain Jenkinsfiles for supporting CI/CD pipelines.
    • Experience working with AWS Lambdas for configuration and optimization.
    • Experience working with DynamoDB to query and write data.
    • Experience with S3.
  • Experience working with JSON and defining JSON Schemas a bonus.
  • Experience setting up and management Confluent/Kafka topics and ensuring performance using Kafka a bonus.
    • Familiarity with Schema Registry, message formats such as Avro, ORC, etc.
    • Understanding how to manage ksqlDB SQL files and migrations and Kafka Streams.
  • Ability to thrive in a team-based environment.
  • Experience briefing the benefits and constraints of technology solutions to technology partners, stakeholders, team members, and senior level of management.
  • Proficiency using Git for version control, including repository management, branching, merging, and pull requests.
    • Repository setup and management.
    • Branching strategies (feature, develop, main).
    • Merging and resolving conflicts.
    • Creating and reviewing pull requests.
    • Commit best practices (clear messages, atomic commits).
    • Tagging and release management.

About Sparibis

Sparibis LLC is a professional solution firm that Clients rely on to access the best talent to drive their business success.

Sparibis is an equal opportunity employer that values diversity at all levels. All individuals, regardless of personal characteristics, are encouraged to apply.

Vacancy posted 21 days ago
Similar jobs that could be interesting for youBased on the Data Engineer in United States vacancy
  •  ...MANTECH seeks a motivated, career and customer-oriented Data Engineer to join our team in Herndon VA. The Data Engineer will leverage their expertise with Python to support the customer’s data pipelines and related applications, from collection to ingestion, and ensure... 
    Suggested

    MANTECH

    Herndon, VA
    13 hours ago
  •  ...MANTECH seeks a motivated, career and customer-oriented Data Engineer to join our team in Chantilly, VA.   The Data Engineer will leverage their development skills and experience to support the successful designing, ingesting, cleansing, transformation, loading... 
    Suggested
    Full time
    Work at office

    MANTECH

    Chantilly, Loudoun County, VA
    13 hours ago
  •  ...MANTECH seeks a motivated, career and customer-oriented Senior Data Engineer to join our team in Chantilly, VA.   The Senior Data Engineer will leverage their strong technical background and knowledge to support the Sponsor’s data initiatives, to include designing... 
    Suggested
    Full time
    Work at office

    MANTECH

    Chantilly, Loudoun County, VA
    13 hours ago
  • $130k - $150k

     ...Job Description Job Description Apply now: Lead Data Engineer / Delivery Lead, location is in LA. The start date is ASAP for this contract position. Job Title: Lead Data Engineer / Delivery Lead Location-Type: LA Start Date Is: ASAP Duration: Contract... 
    Suggested
    Contract work
    Immediate start

    Mondo

    Los Angeles, CA
    16 days ago
  •  ...Job Description Job Description Lead Data Engineer (Los Angeles, CA) Experience: 9–12 years Strong data engineering background Databricks is a must, along with experience in data modeling, building pipelines, orchestration, and supporting reporting teams... 
    Suggested

    Inizio Partners Corp

    Los Angeles, CA
    12 days ago
  • $130k - $150k

     ...position is listed on behalf of a partner company, who manages all applications and next steps. Our partner is looking for a Senior Data Engineer based in the United States. This role plays a key part in shaping and scaling modern enterprise data platforms that power... 
    Remote job
    Full time
    Flexible hours

    jobgether

    United States
    3 days ago
  • $131k - $220k

     ...collects, stores, processes and builds applications within our big data platform. Responsible for integrating these applications with...  ...a valid H1 visa. Primary Accountabilities *Leads an engineering team to meet project deadlines and priorities. *Supervises assigned... 
    Full time
    Local area
    Relocation package

    American Family Insurance

    Madison, WI
    12 days ago
  •  ...Req ID: 247432        General Purpose This role oversees and manages the work of Data Engineer who design and develop ETL pipelines and processes in our Cloud Infrastructure. They are responsible for the overall data integrity and quality, pipeline performance,... 
    Local area

    Coca-Cola Southwest Beverages

    Dallas, TX
    4 days ago
  • $173.1k - $276.8k

     ...do work that matters - to you, to your community, and to the world. Progress starts with you. Job Description The Lead Data Engineer is a senior technical leader responsible for guiding the design, development, and optimization of Visa's large-scale data platforms... 
    Work at office
    Local area

    Visa

    Bellevue, WA
    1 day ago
  • $127.31k - $167.1k

     ...both companies to advance the promise of an immunotherapy in the treatment of multiple myeloma. Legend Biotech is seeking a Lead Data Engineer as part of the Information Technology team based in Bridgewater, NJ. Role Overview We are seeking a Lead Data Engineer to... 
    Permanent employment
    Full time
    Temporary work
    For contractors
    Local area
    Worldwide
    Flexible hours

    Legend Biotech US

    Bridgewater, NJ
    1 day ago
  • $125k - $143.71k

     ...Job Posting Title: Lead Data Engineer ---- Hiring Department: Enterprise Technology - Data to Insights (D2I) ---- Position Open To: All Applicants ---- Weekly Scheduled Hours: 40 ---- FLSA Status: Exempt ---- Earliest Start Date... 
    Full time
    For contractors
    Fixed term contract
    Work at office
    Immediate start
    Remote work
    Monday to Friday
    Flexible hours
    Shift work
    Night shift

    The University of Texas at Austin

    Austin, TX
    13 hours ago
  •  ...For a deeper look at how Envestnet is shaping the future of financial advice, visit     The Team You’ll Join The Lead Data Engineer will play a critical implementation role on the Data Engineering and Data Services team and be responsible for data pipeline solutions... 
    Work experience placement
    Remote work

    Envestnet

    Raleigh, NC
    13 hours ago
  • $86.62k - $101.9k

     ...Job Title Lead Data Engineer Job Description Summary Job Description The Lead Data Engineer helps architect and lead the development of enterprise-scale data platforms and advanced analytics solutions across multiple business units and subject areas. In addition... 
    Minimum wage
    Flexible hours

    Cushman & Wakefield

    United States
    22 days ago
  • $130k - $176k

     ...assume the sponsorship of an employment visa at this time". Selective Insurance is seeking an energetic and collaborative Data Engineer Team lead to work on data and analytics projects supporting the Claims team within the Information Management group. This group... 
    Work experience placement

    Selective Insurance Company of America

    Charlotte, NC
    2 days ago
  • TBD Gen is proud to be an equal-opportunity employer, committed to diversity and inclusivity. We base employment decisions on merit, experience, and business needs, without considering race, color, national origin, age, religion, sex, pregnancy, genetic information,...

    Gen Digital Inc

    New York, NY
    2 days ago
  •  ...Role- Lead Data Enginee r Location - Irvine, CA ( need local or open to relocate candidates , as this is 3-4 days' work from office ). * Engineering Lead Job Description: Engineering leader, with strong hands-on Python/ PySpark/ Platform background... 
    Work at office
    Local area
    Relocation

    Concord IT Systems

    Irvine, CA
    2 days ago
  • $132k - $160k

     ...Data Engineering Lead Creative Artists Agency (CAA) is the leading entertainment and sports agency, with global expertise in filmed and live entertainment, digital media, publishing, sponsorship sales and endorsements, media finance, consumer investing, fashion, trademark... 
    Work at office

    Creative Artists Agency

    Nashville, TN
    3 days ago
  •  ...end solution delivery. • Lead and mentor a team of full-stack developers, ensuring high-quality delivery and adherence to best engineering practices. • Architect, design, and build scalable backend services using Java, microservices, and REST APIs. • Develop... 

    Diamondpick

    Laurel, MD
    7 days ago
  • $121.7k - $162.2k

     ...Resorts, we seek individuals like YOU to create unique and show-stopping experiences for our guests. THE JOB: The Senior Data Engineer provides technical leadership in the architecture, development, and operational management of enterprise data platforms and... 

    MGM Resorts International

    Las Vegas, NV
    2 days ago
  •  ...Role: Lead Data Engineer (Priority 1) Location: San Diego( Onsite only ) Job Type Contrct/Full Time Lead Data Engineer JD: Role Summary We are seeking a highly skilled Lead Data Engineer with deep, hands-on experience in building... 
    Full time

    VBeyond

    Nacogdoches, TX
    4 days ago
  •  ...About the job Lead Data Engineer Job Title: Lead Data Engineer No of Positions:2 Location: Jersey City/ Bedford, NJ (Hybrid) (3 days from office per week) Experience: 10+ years specifically Key Skills: Snowflake, SQL, Python, Spark, AWS-... 
    Work experience placement
    Work at office
    3 days per week

    Inizio Partners

    Jersey City, NJ
    1 day ago
  • $122k - $207k

     ...Lead Data Engineer Position Title: Lead Data Engineer Location: Austin, TX Mastercard is a global technology company in the payments industry. Our mission is to connect and power an inclusive, digital economy that benefits everyone, everywhere by making transactions... 

    Dynamic Yield

    Austin, TX
    4 days ago
  •  ...Lead Data Engineer Date: May 30, 2026 Location: Columbus, OH, US, 43219 Company: FlightSafety International About FlightSafety International FlightSafety International is the world's premier professional aviation training company and supplier of flight simulators... 
    Permanent employment
    Work at office

    Frasca International

    Columbus, OH
    4 days ago
  • $215.2k - $245.6k

     ...Lead Data Engineer Do you love building and pioneering in the technology space? Do you enjoy solving complex business problems in a fast-paced, collaborative, inclusive, and iterative delivery environment? At Capital One, you'll be part of a big group of makers, breakers... 
    Full time
    Part time
    Internship
    H1b
    Local area

    Capital One Financial Corp

    San Francisco, CA
    4 days ago
  •  ...Lead Snowflake Data Engineer Our client is seeking a Lead Snowflake Data Engineer to design, own, and deliver end-to-end data engineering solutions in modern cloud environments. This role requires full lifecycle ownership across Snowflake pipelines, data modeling, and... 

    TheStaffed

    New York, NY
    4 days ago
  • $140k - $180k

     ...Lead Data Engineer Join our team to help create and develop the future of live entertainment and sports in Orange County! Mission: To enrich the lives in our community through shared experiences, welcoming spaces, and responsible actions. Vision: We will be the... 
    Local area

    OCVIBE

    Anaheim, CA
    17 days ago
  • $160k - $220k

     ...Lead Data Engineer Deliberate AI | Hybrid (NYC or Boston) | Full-Time About Deliberate AI: We're a venture-backed company at the frontier of precision mental health. In partnerships with some of the world's top ranked medical schools and psychiatric hospitals, we... 
    Full time
    Worldwide
    Relocation
    Flexible hours
    Shift work
    Night shift
    Day shift

    Deliberate AI

    Boston, MA
    4 days ago
  • $160k - $220k

     ...Lead Data Engineer Location: Santa Clara, CA, United States Location Type: On-site Salary Range: 160000 - 220000 USD Annually We are seeking a Lead Data Engineer to architect, build, and lead the development of scalable, cloud-based data platforms that support enterprise... 

    Q-Cells

    Santa Clara, CA
    13 hours ago
  • $110k - $204.2k

     ...Lead Data Engineer We are seeking a highly skilled and strategic Lead Data Engineer to join our team. In this advanced individual contributor role, you will spearhead the development of sophisticated, insightful, and visually compelling reporting solutions using tools... 
    Work at office
    Local area
    Flexible hours
    2 days per week
    3 days per week

    Thomson Reuters

    Eagan, MN
    13 hours ago
  •  ...Lead Data Engineer Are you ready to accelerate your potential and make a real difference within life sciences, diagnostics, and biotechnology? At SCIEX, one of Danaher's 15+ operating companies, our work saves lives—and we're all united by a shared commitment to... 
    Remote work

    Danaher Corporation

    United States
    13 hours ago

Do you want to receive more vacancies?

Subscribe and receive similar vacancies to Data Engineer. Be the first to apply!