Data Engineer
Apify
Apify is the largest marketplace of tools for AI. Its 30,000+ Actors help people and agents get real-time web data, track competitors, generate leads, or integrate their apps. Actors are built by a global creator community that now earns more than $1M every month.
Join us to help people put the web to work. Apify can find missing children, protect consumers from fake discounts across the EU, and feed data to AI chatbots.
We're looking for a Data Engineer to own the integration layer between Snowflake and the operational tools that run Apify's go-to-market and product motion: HubSpot, Intercom, Mixpanel, and Segment. You'll make sure the right data lands in the right system at the right time, with the right shape, so Sales, Marketing, Customer Success, and Product teams can act on it.
You'll be the 9th member of the data team - joining a mix of analytics engineers, analysts, and data scientists - at the moment Segment is being rolled out as Apify's CDP. That's yours to land end-to-end.
What You'll Be Working On
Own the integration domain end to end - all pipelines, transformations, and Snowflake models that connect HubSpot, Intercom, Mixpanel, and Segment to the rest of the platform, in both directions.
Design event tracking and the CDP layer with the RevOps team as Segment becomes the source of truth for behavioral data flowing into product, marketing, and CRM systems.
Build reliable, observable pipelines in Keboola and dbt - with clear data contracts, schema tests, freshness guarantees, and alerting.
Model integration data in Snowflake so HubSpot, Intercom, Mixpanel, and Segment data lands in well-defined tables that downstream consumers can trust, with documentation that analysts and scientists can actually use.
Power lifecycle automations - PQA scores back into HubSpot, behavioral campaigns in Intercom and customer.io, product usage signals - by shipping the data they depend on.
Diagnose and resolve pipeline incidents independently - trace lineage across multiple components, find root causes, fix, and write the runbook so it doesn't bite the next person.
Tech Stack
Snowflake - data warehouse
Keboola - extractors, writers, and orchestration
dbt - transformations on Snowflake (orchestrated by Keboola; this is where we're actively migrating existing transformation logic)
Tableau and Redash - BI
n8n - workflow automation
Segment - CDP, currently being rolled out end-to-end
Who We're Looking For
3+ years of data engineering experience, with meaningful time spent on integrations between a cloud warehouse and operational SaaS tools (HubSpot, Salesforce, Intercom, Zendesk, Mixpanel, Amplitude, Segment, RudderStack, or similar).
Fluent in SQL (window functions, CTEs, complex multi-source joins, query optimization) and comfortable in Python for the parts a no-code tool can't handle.
Production experience with Snowflake (or BigQuery, Databricks, Redshift), and an understanding of the cost, performance, and access-control tradeoffs of a usage-based warehouse.
Experience building end-to-end pipelines combining an orchestration or ELT platform (Keboola, Fivetran, Airflow, Dagster, Prefect, Matillion) with a transformation framework like dbt.
Hands-on experience with a CDP (Segment, RudderStack, mParticle) - tracking plans, schemas, identity resolution, downstream consumers - not just installing the snippet.
You think in data contracts - schema stability, freshness SLAs, documented field definitions - and treat the boundary between your domain and downstream consumers as a first-class interface.
Comfortable with reverse ETL (Census, Keboola, or hand-rolled), and you understand what it means to write back to a CRM that humans are also editing.
Pragmatic about tooling - happy to use n8n for the right job, and equally happy to write proper code when that's the right call.
Able to explain why a dashboard moved and what it means to non-technical stakeholders in Sales, Marketing, and Customer Success, in English, both in writing and in person.
Nice To Have
Experience with usage-based billing or product-led growth data models.
Exposure to LLM-assisted workflows in the data stack.
Prior experience at a SaaS company between 50 and 500 people.
By The End Of The First Month, We Expect You To
Know the data team, the RevOps and Growth stakeholders who depend on the integration layer, and the workflows that flow through HubSpot, Intercom, Mixpanel, and Segment.
Work through the existing Keboola components and dbt models to understand what's in place, what's fragile, and where the silent failures live.
Trace a typical record from each source system through to the Snowflake tables analysts use.
By The End Of The First 3 Months, We Expect You To
Have a complete map of the integration domain - what flows where, what's owned by whom, where the silent failures are - and a documented six-month plan for the work ahead.
Have at least one end-to-end improvement shipped with monitoring in place.
Be the go-to person on the data team for HubSpot, Intercom, Mixpanel, and Segment data questions.
By The End Of The First 6 Months, We Expect You To
Have Segment operating as the durable CDP for Apify, with a published tracking plan and reliable event flows into Snowflake and downstream tools.
Have core tables from HubSpot, Intercom, Mixpanel, and Segment with documented data contracts - schema, freshness SLA, ownership - and tests and alerting in place.
Have driven measurable improvements in data freshness, pipeline reliability, and incident response time, tracked publicly, and shipped at least one cross-team initiative where the data integration unlocked a business outcome (conversion lift, churn reduction, ops automation).
Why Should You Work At Apify?
Space, support, and autonomy for personal growth, with a direct impact on our success
Full-time position in Prague (Lucerna Palace)
Flexible working hours (perfect for both night owls and early birds)
Nobody counts holidays as long as the work gets done
Unlimited Claude for every Apifier. We don't count tokens. Just use them well
Stock options and profit sharing
Free Multisport card
We welcome pets, kids, and bikes in the office
Epic team buildings and offsites with biking, canoeing, and other adventures
Solid education and training budget, conference tickets, internal "Eat & Learn" sessions, and the possibility to work across teams
Generous hardware budget
Free lunches every day when working from the office
Unlimited supply of coffee and snacks
Free entry to the


