Data Pipeline Engineer - School of Computer Science - MLD

Carnegie Mellon University

Carnegie Mellon University is a private, global research university that stands among the world's most renowned education institutions. With ground-breaking brain science, path-breaking performances, creative start-ups, big data, big ambitions, hands-on learning, and a whole lot of robots, CMU doesn't imagine the future, we invent it. If you're passionate about joining a community that challenges the curious to deliver work that matters, your journey starts here!

The Machine Learning Department (MLD) at Carnegie Mellon University is a leading hub for research and education in artificial intelligence and machine learning. It focuses on developing innovative algorithms and models to address complex problems in diverse fields such as robotics, healthcare, and finance. The department offers a range of undergraduate and graduate programs, fostering a collaborative environment that bridges theoretical research and practical applications. Faculty and students frequently collaborate with industry and other academic disciplines to push the boundaries of what is possible with machine learning.

We are seeking a Data Pipeline Engineer to join the team! As a Data Pipeline Engineer, you will play a vital role in ensuring the integrity and reliability of our data pipelines. The position covers monitoring, troubleshooting, and root cause analysis of data quality issues within our pipelines; as a part-time team member, you will consult and assist rather than lead in these areas. Your contributions are crucial to maintaining the high standard of our epidemiological tracking and forecasting tools. This role will report directly to the Delphi Engineering Manager.

Core Responsibilities

  • Monitor and maintain the health and efficiency of data pipelines.

  • Troubleshoot and perform root cause analysis for data discrepancies and pipeline issues.

  • Communicate with data providers to understand data discrepancies and manage changes in data delivery.

  • Implement fixes and enhancements to improve data quality and pipeline performance.

  • Collaborate with data scientists and analysts to understand data needs and implement effective data solutions.

  • Develop strategies for data validation and quality assurance.

  • Optimize data flow and collection to improve system efficiency.

  • Document and manage data pipeline architectures, including maintenance and update protocols.

  • Use tools such as SQL, version control and CI/CD, containerization, task schedulers, Python frameworks, and cloud services for data pipeline management.

  • Ensure compliance with data governance and security standards.

Adaptability, excellence, and passion are vital qualities within Carnegie Mellon University. We are in search of a team member who can effectively interact with a varied population of internal and external partners at a high level of integrity. We are looking for someone who shares our values and who will support the mission of the university through their work.

Qualifications:

  • Bachelor's Degree required.

  • Minimum one year of research computing experience required.

  • Basic Linux use and administration: system layout, file permissions, shell, utilities (syslog, cron), diagnostic tools (ps, htop, grep, lsof)

  • Experience in Apache Airflow, preferably version 3.0

  • Basic database use, especially in Postgres

  • Rough script programming (Python, bash)

  • Team software development (git/GitHub, Jira, code reviews, agile methodologies)

  • Data analysis: diagnosing and fixing runtime errors and logic bugs; performing basic growth projections to predict future problems; communicating results

  • Required technologies: Python, MySQL/Postgres, Linux, git & GitHub, Apache Airflow

  • A combination of education and proven experience from which comparable knowledge is demonstrated may be considered.

Preferred Technologies and Languages:

  • Linux, Ubuntu, Bash, Make

  • Apache Airflow

  • Python, pandas, Flask, PyPI publishing

  • SQL, Postgres

  • git, GitHub, GitHub Actions, GitHub Issues

  • Docker, Docker Compose

  • Elastic, Kibana, FileBeat

  • G Suite (Calendar, Mail, Docs, Sheets, Slides, Forms, AppsScript, Groups)

  • Jira Software

Requirements:

  • Successful completion of a pre-employment background check

Joining the CMU team opens the door to an array of exceptional benefits.

Benefits-eligible employees enjoy a wide array of benefits including comprehensive medical, prescription, dental, and vision insurance, as well as a generous retirement savings program with employer contributions. Unlock your potential with tuition benefits, take well-deserved breaks with ample paid time off and observed holidays, and rest easy with life and accidental death and disability insurance.

Additional perks include a free Pittsburgh Regional Transit bus pass, access to our Family Concierge Team to help navigate childcare needs, fitness center access, and much more!

For a comprehensive overview of the benefits available, explore our Benefits page.

At Carnegie Mellon, we value the whole package when extending offers of employment. Beyond credentials, we evaluate the role and responsibilities, your valuable work experience, and the knowledge gained through education and training. We appreciate your unique skills and the perspective you bring. Your journey with us is about more than just a job; it's about finding the perfect fit for your professional growth and personal aspirations.

Are you interested in an exciting opportunity with an exceptional organization?! Apply today!

Location

Remote

Job Function

Software/Applications Development/Engineering

Position Type

Staff - Fixed Term (Fixed Term)

Full Time/Part time

Part time

Pay Basis

Hourly

More Information:

  • Please visit "Why Carnegie Mellon" to learn more about becoming part of an institution inspiring innovations that change the world.

  • A listing of employee benefits is available on the Benefits page.

  • Carnegie Mellon University is an Equal Opportunity Employer/Disability/Veteran.

  • Statement of Assurance

Interested in a career with Carnegie Mellon University but not finding anything that currently aligns with your interests, background, or experience? Learn how to sign up for Job Alerts through your candidate profile.

If your heart is in your work, come work with us. Carnegie Mellon University isn't just one of the world's most renowned educational institutions - it's also a hotspot for some of the most talented doers, dreamers, and difference-makers on the planet. When you join our staff, you'll become an important part of our mission to create a healthier, safer, and more just life for all. No matter what your role or location, you'll connect and collaborate with dedicated, passionate colleagues - and you'll have the satisfaction of delivering work that truly matters.

We cultivate a vibrant, welcoming environment where everyone is valued and encouraged to contribute and achieve. In addition to competitive benefits and a robust support network, you'll have access to many tools and resources to sharpen your abilities and professional skills, as well as opportunities to engage and share perspectives with a dynamic and inspiring community of uniquely talented staff, faculty, students, and alumni.

The future is awaiting your expertise and intellect. Come join the architects of what's next. Apply now.

Learn more about Student Employment.

Please see Faculty Careers.

For technical assistance, contact HR Services by email or phone.

If you are an individual with a disability and you require assistance with the job application process, please contact Equal Opportunity Services by email or phone.

Prospective Employee Disclosures

Vacancy posted 1 day ago