Data Engineer - Data Architecture for Data Science & Machine Learning
$109.3k - $219.6kPenn State University
Senior Data Engineer
We are seeking a Senior Data Engineer with deep expertise in database design, optimization, and data access strategies to support our growing data science and machine learning initiatives within the Visualization and Decision Support Division of Penn State ARL. In this role, you will architect and optimize data systems that empower our data scientists to efficiently research, train, and deploy both traditional and ML-based algorithms and applications.
The ideal candidate is a PostgreSQL expert who also brings hands-on experience with other modern data storage technologies—such as NoSQL, graph, and time-series databases—and can guide the organization in choosing the right tools and structures for each data use case.
Located in either State College, PA or Reston, VA
ARL is an authorized DoD SkillBridge partner and welcomes all transitioning military members to apply.
You will:
- Design and maintain scalable, high-performance database solutions to support data science workflows and ML experimentation
- Partner with data scientists to understand data access patterns and develop storage strategies that accelerate analysis and model training
- Serve as the internal subject matter expert on PostgreSQL—including schema design, indexing, partitioning, and query optimization
- Evaluate and integrate alternative database technologies (e.g., MongoDB, Neo4j, Redis, Cassandra) where they provide clear advantages
- Lead efforts to optimize data pipelines for both structured and unstructured data used in algorithm development
- Ensure data integrity, security, and governance across storage systems
- Implement monitoring, automation, and performance-tuning tools for all database environments
- Advise on data lifecycle management—balancing accessibility for R&D with efficiency and compliance requirements
Required skills/experience includes:
- 5+ years of experience in data engineering, database architecture, or related technical roles
- Expert-level proficiency in PostgreSQL (query tuning, schema design, indexing, partitioning, replication)
- Strong understanding of data modeling, normalization vs. denormalization tradeoffs, and query optimization
- Experience with non-relational databases (e.g., MongoDB, Cassandra, Neo4j, Redis, or DynamoDB)
- Familiarity with machine learning workflows and how data is consumed for training, evaluation, and deployment
- Experience with cloud database services (AWS RDS/Aurora, GCP Cloud SQL, Azure Database)
- Proficiency in SQL and one or more scripting languages (Python preferred)
- Excellent communication and collaboration skills—comfortable working closely with data scientists, ML engineers, and software developers
Preferred skills/experience includes:
- Experience architecting hybrid data ecosystems spanning relational, NoSQL, and analytical databases.
- Knowledge of data lake, warehouse, and feature store architectures (e.g., Snowflake, Redshift, BigQuery, Feast).
- Familiarity with ETL/ELT frameworks and data orchestration tools (e.g., Airflow, dbt).
- Bachelor's or Master's degree in Computer Science, Data Engineering, or a related field.
Work location can be State College, PA or Reston, VA
MINIMUM EDUCATION, WORK EXPERIENCE & REQUIRED CERTIFICATIONS
If filled as Research and Development Engineer - Principal Professional, this position requires:
Bachelor's Degree - Engineering or Science
19+ years of relevant experience
Required Certifications: None
If filled as Research and Development Engineer - Advanced Professional, this position requires:
Bachelor's Degree - Engineering or Science
5+ years of relevant experience
Required Certifications: None
If filled as Research and Development Engineer - Senior Professional, this position requires:
Bachelor's Degree - Engineering or Science
14+ years of relevant experience
Required Certifications: None
ARL's purpose is to research and develop innovative solutions to challenging scientific, engineering, and technology problems in support of the Navy, the Department of Defense (DoD), and the Intel Community (IC).
FOR FURTHER INFORMATION on ARL, visit our web site at
BACKGROUND CHECKS/CLEARANCES
Employment with the University will require successful completion of background check(s) in accordance with University policies. All positions at ARL require candidates to possess the ability to obtain a government security clearance; you will be notified during the interview process if this position is subject to a government background investigation. You must be a U.S. citizen to apply. Employment with the ARL will require successful completion of a pre-employment drug screen.
SALARY & BENEFITS
The salary range for this position, including all possible grades, is $109,300.00 - $219,600.00.**THE PROPOSED SALARY RANGE MAY BE IMPACTED BY GEOGRAPHIC DIFFERENTIAL**
Salary Structure - Information on Penn State's salary structure
Penn State provides a competitive benefits package for full-time employees designed to support both personal and professional well-being. In addition to comprehensive medical, dental, and vision coverage, employees enjoy robust retirement plans and substantial paid time off which includes holidays, vacation and sick time. One of the standout benefits is the generous 75% tuition discount, available to employees as well as eligible spouses and children. For more detailed information, please visit our Benefits Page.
$95k - $154k
...Why do Tech Companies not Hire recent Computer Science Graduates | SynergisticIT Technical Skills or Experience... ...We Focus on Java /Full stack/Devops and Data Science /Data Engineers/Data analysts/BI Analysts/ Machine learning/AI candidates Ideal Candidates: Recent...SuggestedFull timeH1b$76.7k - $129.5k
...and forward-thinking Data Scientist to advance... ...that supports teaching, learning, and research through... ...artificial intelligence, machine learning, and digital... ..., and the College of Engineering.CBDR advances what it... ...agendas that apply data science and digital humanities...SuggestedFull timePart timeFor contractorsWork experience placementRemote workFlexible hours$76.7k - $129.5k
...experienced, and highly motivated Data Scientists to assist in... ...and deliver algorithmic, machine learning, and artificial... ...a Bachelor's degree in an Engineering or Science discipline. Additional experience... ...algorithms on GPU hardware architectures, specifically NVIDIA based...SuggestedFor contractorsWork at office- ...Overview: PA based -Supply Chain and distribution company Remote Visa- any 10 months duration Need a strong Sr Lead Data Engineer/ Architect for clients Azure Digital Data platform Core skills - Azure Digital Data platform Supply Chain and...SuggestedRemote work
$140k - $200k
...reading is never a barrier to learning. Over 50 million people... ...include frontend and backend engineers, AI research scientists, and... ...We're looking to hire for our Data side of our AI team at Speechify... ...~ BS/MS/PhD in Computer Science or a related field. ~5+ years...SuggestedFull timeWork at officeShift work- ...Blue Mountain Quality Resources, Inc. is seeking a Full Stack Software Engineer to develop features for their leading asset management software. The candidate will work with React on the front end and .NET MVC on the back end, aiming to enhance the product with AI technologies...Remote work
- Blue Mountain Quality Resources, Inc. is seeking a Quality Engineer to enhance product releases using generative AI. You'll collaborate with the quality engineering team, focusing on automated testing and maintaining testing infrastructure. With at least 2 years of experience...Remote work
- ...help organizations harness data to make better decisions. As... ...regulations. Partner with engineering and IT to embed governance directly... ...workflows APIs and system architectures. Evaluate AI Tools... ...Bachelors degree in Computer Science Engineering Information Systems...Full timeTemporary workImmediate startRemote workVisa sponsorshipFlexible hours
Do you want to receive more vacancies?
Subscribe and receive similar vacancies to Data Engineer - Data Architecture for Data Science & Machine Learning. Be the first to apply!
- data technician State College, PA
- data cabling State College, PA
- data internship State College, PA
- clinical data State College, PA
- data intern State College, PA
- data recovery State College, PA
- data collection researcher State College, PA
- clinical data coordinator remote State College, PA
- data network cabling State College, PA
- provider data management State College, PA


