Senior Data Engineer Job Description

Senior Data Engineer Job Description Template

Our company is looking for a Senior Data Engineer to join our team.

Responsibilities:

  • Focus on enabling developers and analysts through self-service and automated tooling, rather than manual requests and acting as a gatekeeper;
  • Display sense of ownership over assigned work, requiring minimal direction and driving to completion in a sometimes fuzzy and uncharted environment;
  • Designing and building new cloud-based data pipelines, optimized for processing terabytes of spatiotemporal data;
  • Lead a team to develop Cloud enabled Data and Analytics solutions;
  • Work with data engineering related groups in the support of deployment of processing jobs;
  • Design, build and support data processing pipelines and APIs;
  • Build ETLs/ELTs to take data from various operational systems and craft a unified dimensional or star schema data model for analytics and reporting;
  • Continually improve ongoing reporting and analysis processes, automating or simplifying self-service support for business constituents;
  • Architect and build data pipelines to optimize for performance, data quality, scalability, ease of future development, and cost;
  • Work with data science and feature teams to develop and maintain ETL pipelines;
  • Work closely with the Data Architect and Data Scientists to develop plans to enhance the Data Science practice;
  • Build tools and datamarts to enable analytics at a deeper level;
  • Gather requirements, assess gaps and build roadmaps and architectures to help the analytics driven organization achieve its goals;
  • Develop data-related Proof of Concepts in order to demonstrate feasibility and value to SambaSafety’s customers;
  • Architect, design, develop and engineering end-to-end data pipelines across multiple data sources and systems of record.

Requirements:

  • Experience in mentoring other team members in development best practice, and methodologies;
  • You are passionate about data quality control and know how and where to anticipate potential errors;
  • Ability to work in a team environment that promotes collaboration;
  • 3+ years of Software Engineering experience (preferably with Java);
  • Curiosity, open-mindedness, drive, and a proven ability to work independently or as part of a team;
  • AWS cloud services: EC2, EMR, RDS, Redshift, Athena, Lambda;
  • Subject to Public Trust Suitability;
  • 2+ years’ with any scripting language (Python, etc.);
  • Knowledge of the software development lifecycle, agile methodologies, and structured software development methodologies;
  • Having planned and/or participated in terabyte-scale data migrations;
  • Deep experience designing and implementing highly scalable, distributed application systems;
  • Experience performing analysis with large datasets in a cloud-based environment, preferably with an understanding of Google’s Cloud Platform;
  • 2+ years of experience with Scala, Golang, Python, Java, Ruby or a similar programming language;
  • 1-3 years of experience working with GIS datasets;
  • You have solid experience in building highly scalable, distributed services, capable of handling large amounts of data.