Data Engineer
Robots and Pencils
Robots & Pencils is an applied AI engineering firm building the next frontier of business architecture. We design and ship AI co-workers that integrate into enterprise operations and deliver measurable results for our clients. We're all in on AWS, combining deep UX capability with senior engineering talent to get AI into production fast and keep it there.
We've earned the trust of leaders across Consumer Products and Retail, Education, Energy, Financial Services, Healthcare, and Manufacturing and more, and earned a reputation as the nimble alternative to traditional global systems integrators. Founded in 2009, with delivery centers in Canada, the United States, Eastern Europe, and Latin America, we are smaller, faster, and more senior by design. Our teams average 15+ years of experience. We move fast, sweat the details, and build things that actually ship.
Position Overview
In this role, you will design scalable data lakes, warehouses, and pipelines, define governance and quality standards, and drive data platform modernization across real, in-flight work where performance, reliability, and security are critical. You'll mentor more junior engineers, partner with leadership on data strategy, and bring an AI-forward mindset.
Why This Role Matters At Robots & Pencils, we design AI systems for a human world. Our name says it all. Robots and pencils means engineering paired with creativity, because every agent we ship has to work for real people in real workflows. That balance is baked into how we operate.
Every role here contributes directly to that mission. Here, you shape how AI systems integrate into enterprise operations, how teams move at real velocity, and how products create measurable impact for clients and the people they serve. We ship production-ready AI in 30 to 45 days. That pace demands people who take ownership, lead with craft, and care deeply about what they put their name on.
What You'll Do
Craft & Delivery
- Define data architecture and platform strategy, leading design across pipelines, warehouses, and data lakes
- Build and optimize scalable data pipelines supporting batch and real-time processing
- Define and enforce data governance, quality standards, and compliance frameworks across the platform
- Build monitoring, logging, and alerting for data pipelines and services, and contribute to CI/CD workflows for data deployment and automation
- Drive data platform modernization, optimizing for performance, cost, and scalability
- Bring an AI-forward mindset to your daily work, using tools like Claude, Cursor, and other modern AI assistants to ship higher-quality work at pace
- Design and implement data contracts and event flows in collaboration with backend, platform, and engineering teams
- Lead the design and implementation of data pipelines for production AI/ML systems, including embeddings, vector stores, RAG data preparation, feature stores, and training/inference data flows
- Integrate data services with APIs, middleware, and third-party systems to support end-to-end data consumption
- Partner with leadership on data strategy, translating technical depth into decisions others can act on
- Collaborate closely with engineering, analytics, AI, and product teams to align data platforms with broader goals
- Advocate for data quality, governance, and platform best practices across teams and engagements
- Establish data engineering standards that lift the quality and consistency of work across the team
- Mentor junior and mid-level engineers, helping them grow their craft, confidence, and impact
- Make high-stakes architectural decisions with clear ownership and consideration of long-term tradeoffs
- 7+ years of professional data engineering experience, with experience leading complex data platform initiatives
- Strong system architecture background with expertise in distributed data systems
- Expert proficiency in Python, Scala, and SQL
- Deep expertise with cloud-native data platforms and enterprise data warehousing
- Strong expertise in data pipeline orchestration and processing
- Strong experience with streaming platforms and real-time data processing (e.g., Kafka, Kinesis, Pub/Sub)
- Strong data modeling expertise and experience with data transformation
- Strong experience with data quality, governance, and compliance frameworks
- Strong experience with container orchestration and CI/CD for data systems
- Strong experience building data pipelines for production AI/ML systems, including embeddings, vector stores, RAG data preparation, feature stores, and training/inference data flows
- Demonstrated leadership and technical mentoring experience across a team or organization
- Strong stakeholder communication skills, with the ability to translate technical depth across audiences
- Demonstrable, day-to-day usage and expert knowledge of AI-forward coding tools such as Claude and Cursor
- Excellent problem-solving skills and the ability to navigate highly ambiguous technical and business challenges with sound judgment
- Experience with data mesh or data fabric concepts, lakehouse architectures, or governance framework implementation is a plus
- Experience with handling and modeling data in the healthcare industry is a plus
- AWS certifications, like Certified Data Engineer - Associate, strongly preferred
- A doer. You see something broken and fix it. You'd rather move on clarity than wait for certainty.
- A fast learner who knows you don't know everything. The AI landscape changes weekly. You're senior enough to know better and curious enough to keep learning anyway.
- Direct in a way that makes the work better. You give honest feedback. You'd rather have the hard conversation than blow smoke.
- Obsessed with craft. You know genius is in the details. You ship exceptional, not perfect, and you don't put your name on work you wouldn't stand behind.
- Built for ownership. You honor commitments, admit mistakes fast, and back your teammates when a decision costs something. No handoffs, no finger-pointing.
- All in. You treat clients' businesses like your own. You take the work seriously without taking yourself seriously.
- Resourceful when the budget, timeline, or team is tight. Constraints don't slow you down. They sharpen you.
- Glad to be in the room with people who care as much as you do. Our teams average fifteen-plus years of experience. We hire people who push each other to do better work.
- ...Project Overview This project is a high‑impact business process engine designed to optimize customer pharmacy procurement. The system analyzes... ...role Expert-level proficiency in Python and PySpark for big data processing Strong experience with Azure Databricks and Azure...SuggestedFull timeImmediate startRemote work
- ...Lead Data Engineer Ciklum is looking for a Lead Data Engineer to join our team full-time in Ukraine. We are a custom product engineering company that supports both multinational organizations and scaling startups to solve their most complex business challenges. With...SuggestedFull timeWork at officeRemote workShift work
$173.1k - $276.8k
...do work that matters - to you, to your community, and to the world. Progress starts with you. Job Description The Lead Data Engineer is a senior technical leader responsible for guiding the design, development, and optimization of Visa's largescale data platforms...SuggestedWork at officeLocal area- ...our mission is to monetize audiences , across every device. Our data-driven tech delivers 30% higher revenues for our clients on... ...products, and culture on our website Your Job As our Lead Data Engineer, your mission is to architect, scale, and maintain the backbone...SuggestedRemote workFlexible hours
$140k - $170k
...Lead Data Engineer Fully Remote • Windsor Mill, MD 21224 Overview Salary Range $140,000.00 - $170,000.00 Salary Position Type Full Time Education Level Not Specified Description About Us: At RELI Group, our work is grounded in purpose. We partner with government...SuggestedFull timeLive inRemote work- ...Lead, Data Engineer L3Harris Enterprise Data and AI team is seeking a Data Engineer with experience in managing enterprise-level data life cycle processes. This role includes overseeing data ETL/ELT pipelines, ensuring adherence to data standards, maintaining data frameworks...Remote work
- ...divh2Lead Data Engineer-DE/h2pLocation: Scottsdale AZ (day 1 onsite)/ppDuration: Fulltime/ppJob Description:/ppMust have skill set: Java, Scala, Python, Spark, S3, Glue, Redshift/pulliYou have 6-8 years of relevant software development experience./liliYou have hands-on...Full time
$138.8k - $231.3k
...Lead Data Engineer (Operations / KTLO) We are seeking a technically strong and operationally focused Lead Data Engineer to manage the day-to-day operations of PSaS Data & Analytics platforms. This role focuses on Keep the Lights On (KTLO) activities, ensuring reliable...Remote work- ...Lead Data Engineer - Master Data Management (MDM) Location: Remote, Canada Job Type: Contract Mandatory Skills - MDM, Databricks, Kafka, Snowflakes, GraphQL, SQL, Python, Microservices Job Description: We are seeking highly skilled Azure Data Engineer with...Contract workRemote work
- ...Sr. Data Engineer: Remote To perform this job successfully, an individual must be able to perform each essential duty satisfactorily. The requirements listed below are representative of the knowledge, skill, and/or ability required. Reasonable accommodation may be made...Remote work
$106.61k - $284.28k
...Lead Data Privacy Engineer We're building a world of health around every individual — shaping a more connected, convenient and compassionate health experience. At CVS Health®, you'll be surrounded by passionate colleagues who care deeply, innovate with purpose, hold...Hourly payFull timeTemporary workWork experience placementLocal areaRemote work- ...Overview The Infosys Data and Analytics (DNA) unit is at the forefront of transforming data into actionable insights, driving business... ..., mentor other team members in their efforts to build data engineering skillsets Assist team management in defining projects, including...Temporary workRelocation
- ...Owning the reliability and operational excellence of the data platform, the full-time Lead Data Engineer will build, maintain, and operate data pipelines using Snowflake, Airflow, AWS, Python, and SQL, while also leading technical initiatives and optimizing costs in a...Full timeRemote work
- ...Position: Lead Data Engineer with MarTech Location: SFO, CA (Hybrid 2 days a week) Key Responsibilities Lead end to end MarTech engineering initiatives across orchestration, data processing, and activation pipelines. Architect scalable, event...Remote work2 days per week
$117.6k - $161.7k
...Become a part of our caring community This role leads the architecture, engineering, and operationalization of the centralized Finance reporting data platform supporting CenterWell Finance reporting, analytics, and month-end close processes. We are looking for deep...Bi-weekly payFull timeTemporary workApprenticeshipWork at officeRemote workWork from homeHome office- ...Genesis10 is seeking a Lead Data Engineer. This is a hybrid 3-month contract-to-hire position with a client located in Columbus, OH. This role pays $60.00-$68.00 per hour, W2, based on skill and experience level. Job Description: The Data Engineer Lead plays a...Hourly payPermanent employmentContract workRemote work
$130k - $176k
...assume the sponsorship of an employment visa at this time". Selective Insurance is seeking an energetic and collaborative Data Engineer Team lead to work on data and analytics projects supporting the Claims team within the Information Management group. This group...Work experience placement$86.62k - $101.9k
...Job Title Lead Data Engineer Job Description Summary Job Description The Lead Data Engineer helps architect and lead the development of enterprise-scale data platforms and advanced analytics solutions across multiple business units and subject areas. In addition...Minimum wageFlexible hours- ...Lead Data Engineer Location: Remote Duration: 6 Months Contract to Hire (Quarterly Onsite for a few days in Dallas TX) BigR.io is a technology consulting firm empowering data to drive innovation and advanced analytics. We specialize in cutting-edge Big Data, Machine...Contract workRemote work
- ...Lead Data/Software Engineer (Real Time Streaming) In this position the candidate will have an opportunity to drive technical standards, platform design, and technical ownership related to a modernization and build out of a foundational data platform effort. The candidate...Remote work
$120k - $140k
...Lead Data Engineer Reports To: Principal Data Engineer Location: Chicago, IL Environment: Remote Status: Exempt, Salaried Recognized by Gartner in their Modern 4PL Market Guide, Redwood Logistics is at the forefront of industry innovation. Our cutting-edge...Full timeTemporary workRemote workMonday to FridayFlexible hours- Okay with relocation. 5 Openings total. USC/GC/H4 Only! Client : Fidelity location : Jersey City, NJ (Hybrid) Duration : 12 Month+ Pay : $68/Hr W2 Need LinkedIn Must Have Skills: Skills wise we need a Senior Level Oracle Database Developer...Remote workRelocation
- ...Lead Data Engineer We are Lennar Lennar is one of the nations leading homebuilders, dedicated to making an impact and creating an extraordinary experience for their Homeowners, Communities, and Associates by building quality homes and providing exceptional customer service...Live inLocal area
- ...JOB SUMMARY Responsible for technical design, writing, support and operations of efficient, scalable, and extensible data integrations and ingestions. Implement effective solutions from the ground up and investigate anomalies in a mature running data platform and environment...Weekend work
- ...Snowflake Senior Data Engineer Accomplished Tech Visionary: Embark on an exciting journey into the realm of software development with 3Pillar! We extend an invitation for you to join our team and gear up for a thrilling adventure. At 3Pillar, our focus is on leveraging...Work at officeRemote workFlexible hours
- ...Lead Data Engineer, Technology | Data & Analytics We're looking for a Lead Data Engineer to own the design and evolution of our modern data platform at Catalyst Brands, powering data-driven decisions across our portfolio of retail and consumer brands. You'll lead complex...Remote work
$150k - $160k
...Company Description BLEND360 is an acclaimed, forward-thinking Data, Digital Marketing, & AI Solutions Company, dedicated to fueling... ...excellence. Job Description We are seeking a Lead Data Engineer to support a large-scale healthcare data platform initiative...Remote work- ...Lead Data Engineer We are Lennar Lennar is one of the nation's leading homebuilders, dedicated to making an impact and creating an extraordinary experience for their Homeowners, Communities, and Associates by building quality homes and providing exceptional customer...Live inLocal area
- ...Lead Data Engineer Pearl works with the top 1% of candidates from around the world and connects them with the best startups in the US and EU. Our clients have raised over $5B in aggregate and are backed by companies like OpenAI, a16z, and Founders Fund. They're looking...Temporary workRemote work
- ...To support the advancement of data solutions in a hybrid work environment, the full-time Lead Data Developer will design and implement... ...Bachelor's degree in Computer Science, Information Systems, Engineering, or related field (or equivalent experience) 8+ years of experience...Full timeRemote work
Do you want to receive more vacancies?
Subscribe and receive similar vacancies to Data Engineer. Be the first to apply!
- bi data engineer United States
- staff data engineer United States
- data visualization developer United States
- data science developer United States
- senior data center engineer United States
- sr information security engineer United States
- IT data engineer United States
- junior big data engineer United States
- entry level big data engineer United States
- data engineer contract United States

