Data Engineer

$180k - $280k

Sunsets HQ Corp.

About Sunset

Sunset is building the data layer for real-world AI training. We work with frontier labs to turn messy, multi-modal enterprise data into the highest-quality training data on the market - sourced from the hundreds of venture-backed startups we've helped wind down.

We're a fast-growing team based in-person in Dumbo, Brooklyn. Backed by Floodgate, Afore Capital, Hustle Fund, and incredible entrepreneurs.


The Role

As a Data Engineer at Sunset, you'll own the pipeline that turns raw, chaotic enterprise data into the highest-quality training data on the market. One of our core technical challenges is entity resolution and de-identification across different sources and modalities. An even deeper challenge is understanding the node structures and linkages well enough to effectively reconstruct the business world this data comes from. All of this happens on sensitive data, which means security and privacy aren't a separate workstream but are built into every pipeline, system, and decision we make.

What You'll Work On

You'll own problems end-to-end. Some examples of what you might tackle in your first 90 days:
  • Designing the de-identification layer that replaces PII with stable pseudonyms while preserving every relationship across every source
  • Building coreference resolution across Slack threads, email chains, and Linear comments so that "me," "him," and first-name mentions all resolve to the right canonical entity
  • Hardening how we ingest, store, and process sensitive client data - from encryption and access controls to audit trails and isolation boundaries
  • Extending our entity resolution pipeline to handle new modalities - think audio, video, design files, or embedded references inside documents
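
To make the first bullet concrete, here is a minimal sketch of one common way to produce stable pseudonyms: a keyed hash (HMAC-SHA256) over a normalized identifier. This is an illustration only, not a description of Sunset's pipeline. The function name `pseudonymize`, the `PERSON` prefix, the lowercase-and-trim normalization, and the demo key are all assumptions made for the example.

```python
import hashlib
import hmac


def pseudonymize(value: str, secret_key: bytes, prefix: str = "PERSON") -> str:
    """Map a PII value to a stable pseudonym using HMAC-SHA256.

    Hypothetical helper: the same normalized input and key always give the
    same pseudonym, so records from different sources still join on it.
    """
    # Assumed normalization: trim whitespace and lowercase. Real entity
    # resolution would need far more than this.
    normalized = value.strip().lower()
    digest = hmac.new(secret_key, normalized.encode("utf-8"), hashlib.sha256).hexdigest()
    # Truncated for readability; a shorter token raises collision risk.
    return f"{prefix}_{digest[:12]}"


# Demo key for illustration only; a real key would live in a secrets manager.
key = b"demo-key-not-for-production"

slack_author = pseudonymize("Jane.Doe@example.com", key)
email_sender = pseudonymize(" jane.doe@example.com ", key)
other_person = pseudonymize("john.roe@example.com", key)

# The same person seen in two sources maps to one pseudonym,
# while a different person maps to a different one.
print(slack_author == email_sender)
print(slack_author != other_person)
```

Because the mapping depends on a secret key, the pseudonyms cannot be recomputed from the raw values without it, while links between records that share an identifier are kept.
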
You Might Be a Fit If
  • You are a product-minded engineer and have shipped data pipelines at scale
  • You have strong Python skills and are comfortable with NER, record linkage, and coreference resolution
  • You take security and privacy seriously and have built systems where getting it wrong wasn't an option
  • You want to own a hard, ambiguous problem end-to-end rather than wait for a PRD
  • AI is deeply integrated into your workflow and life
This Role Might Not Be a Fit If
  • You want to work remote or hybrid - we're in-person 5 days/week in Dumbo
  • You want to do novel ML research - this role is applied, not research
  • You prefer long planning cycles or narrow ownership
Our Stack

Python, Postgres, Redis, AWS. We pick tools based on the problem, not the other way around.

Compensation & Benefits
  • $180K-$280K base + meaningful equity
  • 100% covered medical, dental, and vision
  • Unlimited PTO
  • $500 in-office setup allowance
How We Hire
  1. Intro Chat (20 min) - mutual fit and interests
  2. Technical Session (1hr) - collaborative problem-solving
  3. Onsite (2-3 hrs) - product deep dive, system design, meet the team
  4. Quick references → Offer
Vacancy posted 1 day ago