Data Engineer

Apify

Apify is the largest marketplace of tools for AI. Its 30,000+ Actors help people and agents get real-time web data, track competitors, generate leads, or integrate their apps. Actors are built by a global creator community that now earns more than $1M every month.

Join us to help people put the web to work. Apify can find missing children, protect consumers from fake discounts across the EU, and feed data to AI chatbots.

We're looking for a Data Engineer to own the integration layer between Snowflake and the operational tools that run Apify's go-to-market and product motion: HubSpot, Intercom, Mixpanel, and Segment. You'll make sure the right data lands in the right system at the right time, with the right shape, so Sales, Marketing, Customer Success, and Product teams can act on it.

You'll be the 9th member of the data team - joining a mix of analytics engineers, analysts, and data scientists - at the moment Segment is being rolled out as Apify's CDP. That's yours to land end-to-end.

What You'll Be Working On
  • Own the integration domain end to end - all pipelines, transformations, and Snowflake models that connect HubSpot, Intercom, Mixpanel, and Segment to the rest of the platform, in both directions.

  • Design event tracking and the CDP layer with the RevOps team as Segment becomes the source of truth for behavioral data flowing into product, marketing, and CRM systems.

  • Build reliable, observable pipelines in Keboola and dbt - with clear data contracts, schema tests, freshness guarantees, and alerting.

  • Model integration data in Snowflake so HubSpot, Intercom, Mixpanel, and Segment data lands in well-defined tables that downstream consumers can trust, with documentation that analysts and scientists can actually use.

  • Power lifecycle automations - PQA scores back into HubSpot, behavioral campaigns in Intercom and customer.io, product usage signals - by shipping the data they depend on.

  • Diagnose and resolve pipeline incidents independently - trace lineage across multiple components, find root causes, fix, and write the runbook so it doesn't bite the next person.

Tech Stack
  • Snowflake - data warehouse

  • Keboola - extractors, writers, and orchestration

  • dbt - transformations on Snowflake (orchestrated by Keboola; this is where we're actively migrating existing transformation logic)

  • Tableau and Redash - BI

  • n8n - workflow automation

  • Segment - CDP, currently being rolled out end-to-end

Who We're Looking For
  • 3+ years of data engineering experience, with meaningful time spent on integrations between a cloud warehouse and operational SaaS tools (HubSpot, Salesforce, Intercom, Zendesk, Mixpanel, Amplitude, Segment, RudderStack, or similar).

  • Fluent in SQL (window functions, CTEs, complex multi-source joins, query optimization) and comfortable in Python for the parts a no-code tool can't handle.

  • Production experience with Snowflake (or BigQuery, Databricks, Redshift), and an understanding of the cost, performance, and access-control tradeoffs of a usage-based warehouse.

  • Experience building end-to-end pipelines combining an orchestration or ELT platform (Keboola, Fivetran, Airflow, Dagster, Prefect, Matillion) with a transformation framework like dbt.

  • Hands-on experience with a CDP (Segment, RudderStack, mParticle) - tracking plans, schemas, identity resolution, downstream consumers - not just installing the snippet.

  • You think in data contracts - schema stability, freshness SLAs, documented field definitions - and treat the boundary between your domain and downstream consumers as a first-class interface.

  • Comfortable with reverse ETL (Census, Keboola, or hand-rolled), and you understand what it means to write back to a CRM that humans are also editing.

  • Pragmatic about tooling - happy to use n8n for the right job, and equally happy to write proper code when that's the right call.

  • Able to explain to non-technical stakeholders in Sales, Marketing, and Customer Success why a dashboard moved and what it means, in English, both in writing and in person.

Nice To Have
  • Experience with usage-based billing or product-led growth data models.

  • Exposure to LLM-assisted workflows in the data stack.

  • Prior experience at a SaaS company between 50 and 500 people.

By The End Of The First Month, We Expect You To
  • Know the data team, the RevOps and Growth stakeholders who depend on the integration layer, and the workflows that flow through HubSpot, Intercom, Mixpanel, and Segment.

  • Work through the existing Keboola components and dbt models to understand what's in place, what's fragile, and where the silent failures live.

  • Trace a typical record from each source system through to the Snowflake tables analysts use.

By The End Of The First 3 Months, We Expect You To
  • Have a complete map of the integration domain - what flows where, what's owned by whom, where the silent failures are - and a documented six-month plan for the work ahead.

  • Have at least one end-to-end improvement shipped with monitoring in place.

  • Be the go-to person on the data team for HubSpot, Intercom, Mixpanel, and Segment data questions.

By The End Of The First 6 Months, We Expect You To
  • Have Segment operating as the durable CDP for Apify, with a published tracking plan and reliable event flows into Snowflake and downstream tools.

  • Have core tables from HubSpot, Intercom, Mixpanel, and Segment with documented data contracts - schema, freshness SLA, ownership - and tests and alerting in place.

  • Have driven measurable improvements in data freshness, pipeline reliability, and incident response time, tracked publicly, and shipped at least one cross-team initiative where the data integration unlocked a business outcome (conversion lift, churn reduction, ops automation).

Why Should You Work At Apify?
  • Space, support, and autonomy for personal growth, with a direct impact on our success

  • Full-time position in Prague (Lucerna Palace)

  • Flexible working hours (perfect for both night owls and early birds)

  • Nobody counts holidays as long as the work gets done

  • Unlimited Claude for every Apifier. We don't count tokens. Just use them well

  • Stock options and profit sharing

  • Free Multisport card

  • We welcome pets, kids, and bikes in the office

  • Epic team buildings and offsites with biking, canoeing, and other adventures

  • Solid education and training budget, conference tickets, internal "Eat & Learn" sessions, and the possibility to work across teams

  • Generous hardware budget

  • Free lunches every day when working from the office

  • Unlimited supply of coffee and snacks

  • Free entry to the

Vacancy posted 4 days ago