Data Pipeline Engineer - School of Computer Science - MLD
Carnegie-Mellon University
Carnegie Mellon University is a private, global research university that stands among the world's most renowned education institutions. With ground-breaking brain science, path-breaking performances, creative start-ups, big data, big ambitions, hands-on learning, and a whole lot of robots, CMU doesn't imagine the future, we invent it. If you're passionate about joining a community that challenges the curious to deliver work that matters, your journey starts here!
The Machine Learning Department (MLD) at Carnegie Mellon University is a leading hub for research and education in artificial intelligence and machine learning. It focuses on developing innovative algorithms and models to address complex problems in diverse fields such as robotics, healthcare, and finance. The department offers a range of undergraduate and graduate programs, fostering a collaborative environment that bridges theoretical research and practical applications. Faculty and students frequently collaborate with industry and other academic disciplines to push the boundaries of what is possible with machine learning.
We are seeking a Data Pipeline Engineer to join the team! As a Data Pipeline Engineer, your role is vital in ensuring the integrity and reliability of our data pipelines. This position is responsible for monitoring, troubleshooting, and conducting root cause analysis of data quality issues within our pipelines, but as a part time team member, you will consult and assist rather than lead in these areas. Your contributions are crucial to maintaining the high standard of our epidemiological tracking and forecasting tools. This role will report directly to the Delphi Engineering Manager.
Core Responsibilities
Monitor and maintain the health and efficiency of data pipelines.
Troubleshoot and perform root cause analysis for data discrepancies and pipeline issues.
Communicate with data providers to understand data discrepancies and manage changes in data delivery.
Implement fixes and enhancements to improve data quality and pipeline performance.
Collaborate with data scientists and analysts to understand data needs and implement effective data solutions.
Develop strategies for data validation and quality assurance.
Optimize data flow and collection to improve system efficiency.
Document and manage data pipeline architectures, including maintenance and update protocols.
Use tools such as SQL, version control and CI/CD, containerization, task schedulers, python frameworks, and cloud services for data pipeline management.
Ensure compliance with data governance and security standards.
Adaptability, excellence, and passion are vital qualities within Carnegie Mellon University. We are in search of a team member who can effectively interact with a varied population of internal and external partners at a high level of integrity. We are looking for someone who shares our values and who will support the mission of the university through their work.
Qualifications:
Bachelor's Degree required.
Minimum one year of research computing experience required.
Basic Linux use and administration: system layout, file permissions, shell, utilities (syslog, cron), diagnostic tools (ps, htop, grep, lsof)
Experience in Apache Airflow, preferably version 3.0
Basic database use, especially in Postgres
Rough script programming (Python, bash)
Team software development (git/GitHub, Jira, code reviews, agile methodologies)
Data analysis: diagnosing and fixing runtime errors and logic bugs; performing basic growth projections to predict future problems; communicating results
Required technologies: Python, MySQL/Postgres, Linux, git & GitHub, Apache Airflow
A combination of education and proven experience from which comparable knowledge is demonstrated may be considered.
Preferred Technologies and Languages:
Linux, Ubuntu, Bash, Make
Apache Airflow
Python, pandas, Flask, PyPI publishing
SQL, Postgres
git, GitHub, GitHub Actions, GitHub Issues
Docker, Docker Compose
Elastic, Kibana, FileBeat
G Suite (Calendar, Mail, Docs, Sheets, Slides, Forms, AppsScript, Groups)
Jira Software
Requirements :
- Successful completion of a pre-employment background check
Joining the CMU team opens the door to an array of exceptional benefits.
Benefits eligible ( employees enjoy a wide array of benefits including comprehensive medical, prescription, dental, and vision insurance ( as well as a generous retirement savings program ( with employer contributions. Unlock your potential with tuition benefits ( , take well-deserved breaks with ample paid time off ( and observed holidays ( , and rest easy with life and accidental death and disability insurance.
Additional perks include a free Pittsburgh Regional Transit bus pass, access to our Family Concierge Team ( to help navigate childcare needs, fitness center access ( , and much more!
For a comprehensive overview of the benefits available, explore our Benefits page ( .
At Carnegie Mellon, we value the whole package when extending offers of employment. Beyond credentials, we evaluate the role and responsibilities, your valuable work experience, and the knowledge gained through education and training. We appreciate your unique skills and the perspective you bring. Your journey with us is about more than just a job; it's about finding the perfect fit for your professional growth and personal aspirations.
Are you interested in an exciting opportunity with an exceptional organization?! Apply today!
Location
Remote
Job Function
Software/Applications Development/Engineering
Position Type
Staff - Fixed Term (Fixed Term)
Full Time/Part time
Part time
Pay Basis
Hourly
More Information:
Please visit "Why Carnegie Mellon ( " to learn more about becoming part of an institution inspiring innovations that change the world.
Click here ( to view a listing of employee benefits
Carnegie Mellon University is an Equal Opportunity Employer/Disability/Veteran .
Statement of Assurance (
Interested in a career with Carnegie Mellon University but not finding anything that currently aligns with your interests, background, or experience? Learn how to sign up for Job Alerts ( through your candidate profile.
If your heart is in your work, come work with us. Carnegie Mellon University isn't just one of the world's most renowned educational institutions - it's also a hotspot for some of the most talented doers, dreamers, and difference-makers on the planet. When you join our staff, you'll become an important part of our mission to create a healthier, safer, and more just life for all. No matter what your role or location, you'll connect and collaborate with dedicated, passionate colleagues - and you'll have the satisfaction of delivering work that truly matters.
We cultivate a vibrant, welcoming environment where everyone is valued and encouraged to contribute and achieve. In addition to competitive benefits and a robust support network, you'll have access to many tools and resources to sharpen your abilities and professional skills, as well as opportunities to engage and share perspectives with a dynamic and inspiring community of uniquely talented staff, faculty, students, and alumni.
The future is awaiting your expertise and intellect. Come join the architects of what's next. Apply now.
Learn more about Student Employment ( .
Please see Faculty Careers. (
For technical assistance, email HR Services (View email address on click.appcast.io) or call View phone number on click.appcast.io.
If you are an individual with a disability and you require assistance with the job application process, please email Equal Opportunity Services (View email address on click.appcast.io) or call View phone number on click.appcast.io.
Prospective Employee Disclosures (
- ...Job Title: Data Engineer (Must Be US Citizen Or Green Card Holder...no OPT) Location: Pittsburgh, PA (onsite) Employment Type:... ...for helping to implement, grow, and optimize our data and data pipeline architecture. As part of a growing team in a face-paced start...SuggestedFull time
- ...steps. Pitt Digital is seeking a Senior Data Engineer to support the Center for Excellence in... ...design and implement cloud-based data pipelines that connect enrollment CRM systems,... ...Generally sedentary desk work with computer-keyboard video phone other than traveling...SuggestedFull timeRemote workRelocationVisa sponsorshipFree visaAfternoon shift
- ...contribute to the company's success. As a Data Engineer - ETL| Informatica | SQL PL/SQL | RDBMS... ...Writing and optimizing code for data pipelines to ensure systems are highly efficient,... ..., Data Analytics, Data Mining, Data Science, Informatics, Machine Learning (ML), PL...SuggestedFull timeTemporary workPart timeWork experience placementWork at office
$88k - $166.3k
...Job Title: Experienced Data Engineer Status: Full-time Professional... ...the adoption of data science and analytics into standard... ...designing and maintaining data pipelines, lakes, warehouses, and reports... ...Requirements: Bachelor's degree in Computer Science, Data Science,...SuggestedFull timeFor contractorsWork at office- ...Status: Full-Time Reports to: Data Engineering Manager Purpose The Data... ...creates, operates, and extends data pipelines and/or orchestration solutions built... ...Qualifications Bachelor's degree in computer science, Mathematics, Statistics, or a...SuggestedFull timeTemporary workPart timeWork at officeLocal areaRemote workFlexible hours
- ...JD: Role name: Databricks Senior Data Engineer Work site: Pittsburgh, PA (Onsite) Contract Job Description: 10+years of expereince as Datawarehouse Data Engineer. • Data Architecture Design: Develop/Modify the architecture for the EDW, including data models...Contract work
$106.9k - $176.5k
...better working world. Technology – Data and Decision Science – Data Engineering – Senior We are seeking a... ...objectives. Lead end-to-end data pipeline development, including data... ...technologies in data engineering and cloud computing. This role offers the...Summer holidayFlexible hours- ...Machine Translation Data Engineer Onsite 3 days per week in any of these locations: Seattle... ...data extraction, annotation, auditing pipeline to improve MT data annotation... ...data processing pipeline in distributed computing environments Document development for...3 days per week
- ...Kforce has a client seeking a Data Engineer on a contract basis in Pittsburgh, PA.Summary:The Data Engineer will be tasked with translating... ...architecture, and applications. Bachelor's degree in Computer Science, Information Systems, or related field., or equivalent work...Hourly payContract workWork experience placement
- ...Data Engineer Contractor 5 Days Onsite – Pittsburgh, PA / Farmers Branch, TX / Miamisburg, OH / Houston, TX Key Responsibilities... ...-paced environment Education ~ Bachelor’s degree in Computer Science, Information Systems, or related field preferred #M1...For contractors
$49.5 - $62.5 per hour
...Overview: We are seeking an experienced Data Engineer to join our AI & Digital Team. This... ...will design, build, and optimize data pipelines and infrastructure, enabling advanced... ...Requirements Bachelor’s degree in computer science or related fields with 3-5 years’ experience...Hourly payContract work- ...Job Description • Design and maintain data pipelines using Fabric Data Factory, Dataflow Gen2, and Notebooks (Python, Spark, SQL) •... ...with CI/CD pipeline experience • Bachelor's in CS, IS, Data Engineering ○ Or 10 equivalent yrs experience • PowerBI - dashboarding...
$65k - $171.93k
...have an opportunity to contribute to the company's success. As a Data Engineer Senior within PNC's Data Product Organization, you will be... ...Thinking, Competitive Advantages, Data Analytics, Data Mining, Data Science, Machine Learning (ML) Competencies Application Delivery...Full timeTemporary workPart timeWork experience placementWork at office- ...Technology Solutions. We have an opportunity for Performance Engineer for one of my clients. Here I am sharing the details below.... ...performance logs using ELK/Splunk • Strong analyzing/data mining skills using SQL. • Experience on Kafka/messaging performance...Contract workRemote work
- ...Data Engineer Location || Onsite - Warrendale, PA / Pittsburgh, PA (U.S.) This role requires core experience and expertise on - Databricks (advanced, hands-on), Python, ETL/ELT pipeline development, Spark (SQL/PySpark). Job purpose ~ The...Temporary work
$70.8k - $156.7k
...Data Engineer - GenAI Category: Software Development/ Engineering... ...prompt engineering, and cloud computing within the Azure ecosystem... ...Develop robust and scalable pipelines for data preprocessing, model... ...Bachelor's degree in computer science, computer engineering, or...Full timeWork experience placementLocal area- ...We're looking for an experienced Data Engineer to help design, build, and optimize modern data pipelines that power analytics, BI, and machine learning across NEP Group. In this role, you'll develop scalable ETL/ELT workflows feeding Snowflake and Databricks, collaborate...Remote workFlexible hours
- ...team, you will help build and scale the data foundation that powers our logistics operations... ...responsible for designing reliable data pipelines, integrating data from multiple... ...'re Looking For: • 5+ years of data engineering experience • Strong SQL and Python skills...
- ...Job Description: Senior Data Engineer (Full-Time) Location: Pittsburgh, PA (Hybrid - 3 days onsite per week) Prequel Solutions... ...environment. This role focuses on designing high-performance data pipelines, improving data quality, and supporting enterprise analytics...Full timeContract work3 days per week
- ...Day-to-Day Responsibilities / Project Details: Support Data Engineering initiatives across two Data Engineering Teams: Prospect... ...Hands-on data engineering experience (batch and real-time pipelines, data warehousing, data modeling) # Airflow experience with...Contract work
- ...Sr Data Engineer Location: Pittsburgh PA (Onsite) Duration: 12 Months DESCRIPTION Design, construct, and maintain scalable... ...to ensure continuous service delivery. Create dataprocessing pipelines utilizing Databricks Notebooks, Spark SQL, Python and other...Flexible hours
$55 - $60 per hour
...Senior Data / Feature Engineer Genesis10 is currently seeking a Senior Data / Feature Engineer - Onsite position with a Major Financial Institution... ...to build and scale production-grade machine learning pipelines and feature engineering systems. This is strictly an...Hourly payContract work$140k - $160k
...and maintain databases and data integration (ETL) systems to... ...effectively across teams. High school diploma or equivalent work... ..., bachelor's degree in computer sciences preferred Four (4) or more... ...languages is commonly used in data engineering, such as Python or Java...Work experience placementRemote workWork from home- ...Data Engineer(DE) Location: NY, Lake Mary, Pittsburgh (Hybrid) Duration: Fulltime Job Description:... ...interview Primary skills: Python, Spark, Distributed computing Secondary Skills: Machine Learning, Data Science, Cloud computing Experience (min): 7 years+...Full time
$42k - $172.25k
...an opportunity to contribute to the company’s success. As a Data Engineer within PNC's Lending Tech organization, you will be based... ...supports volume growth sustainably. Education BS in Computer Science, Engineering, or equivalent practical experience. PNC is...Full timeTemporary workPart timeWork experience placementWork at office- ...Data Engineer - Research and Development The Pittsburgh Pirates are a storied franchise... ...# Build and maintain scalable data pipelines to support baseball operations. # Collect... ...States. # Bachelor's degree in Computer Science, Data Engineering, or a related field...
- ...As a Data Engineer, you will work independently or as part of a team to deliver cloud-based technology solutions across products, projects... ...Databricks, ADLS, SQL DB, etc.). • Bachelor's degree in Computer Science, Computer Engineering, or a STEM discipline (Science,...
- ...Role: Data Engineer Location: Warrendale, PA / Pittsburgh, PA (Onsite) Job Type: Contract Role Overview We are... ...Databricks, Python, and Spark to design and build scalable data pipelines. The ideal candidate will have hands-on experience with ETL/...Contract work
- ...Ark, supports Supply Chain, Science and Technology, Production, Sustainment... ...and best-in-class data to more rapidly imagine, develop... ...and experienced data engineer who shares our passion and obsession... ...required Bachelor's degree in Computer Science, Mathematics or a...Full timeWork at office
$600 per week
...Dev10 is your opportunity to upskill and launch a career in Data Engineering. Dev10 provides a pathway for motivated learners... ...clearly listed on your resume: A recent STEM degree (e.g., Computer Science, Information Technology, Engineering, or a related...Hourly payWork experience placementImmediate startRelocationVisa sponsorshipRelocation package
Do you want to receive more vacancies?
Subscribe and receive similar vacancies to Data Pipeline Engineer - School of Computer Science - MLD. Be the first to apply!
- junior big data engineer Pittsburgh, PA
- senior data engineer Pittsburgh, PA
- sr information security engineer Pittsburgh, PA
- senior data integration developer Pittsburgh, PA
- data developer Pittsburgh, PA
- data engineer Pittsburgh, PA
- entry level big data engineer Pittsburgh, PA
- data engineer analytics Pittsburgh, PA
- big data engineer Pittsburgh, PA
- junior data engineer remote Pittsburgh, PA

