Data Engineer
$180k - $280k
Sunsets HQ Corp.
About Sunset
Sunset is building the data layer for real-world AI training. We work with frontier labs to turn messy, multi-modal enterprise data into the highest-quality training data on the market - sourced from the hundreds of venture-backed startups we've helped wind down. We're a fast-growing team based in-person in Dumbo, Brooklyn. Backed by Floodgate, Afore Capital, Hustle Fund, and incredible entrepreneurs.
The Role
As a Data Engineer at Sunset, you'll own the pipeline that turns raw, chaotic enterprise data into the highest-quality training data on the market. One of our core technical challenges is entity resolution and de-identification across different sources and modalities. An even deeper challenge is understanding the node structures and linkages well enough to effectively reconstruct the business world this data comes from. All of this happens on sensitive data, which means security and privacy aren't a separate workstream but are built into every pipeline, system, and decision we make.
What You'll Work On
You'll own problems end-to-end. Some examples of what you might tackle in your first 90 days:
- Designing the de-identification layer that replaces PII with stable pseudonyms while preserving every relationship across every source
- Building coreference resolution across Slack threads, email chains, and Linear comments so that "me," "him," and first-name mentions all resolve to the right canonical entity
- Hardening how we ingest, store, and process sensitive client data - from encryption and access controls to audit trails and isolation boundaries
- Extending our entity resolution pipeline to handle new modalities - think audio, video, design files, or embedded references inside documents
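To make the "stable pseudonyms" idea in the first item concrete, here is a minimal sketch of one common approach, keyed hashing with HMAC. It is an illustration under stated assumptions, not a description of Sunset's actual pipeline: the function name `pseudonymize`, the `PERSON` prefix, the lowercase/strip normalization, and the demo key are all hypothetical.

```python
import hashlib
import hmac


def pseudonymize(value: str, secret_key: bytes, prefix: str = "PERSON") -> str:
    """Map a PII value to a stable pseudonym (hypothetical helper).

    The same normalized input and key always yield the same pseudonym,
    so references to one entity stay linked across sources.
    """
    # Assumption: trimming and lowercasing is enough normalization for this demo.
    normalized = value.strip().lower()
    digest = hmac.new(secret_key, normalized.encode("utf-8"), hashlib.sha256).hexdigest()
    # Truncated to 12 hex characters for readability; a real system would
    # weigh collision risk before truncating.
    return f"{prefix}_{digest[:12]}"


# Demo key only; a real key would come from a secrets manager.
key = b"demo-key-not-for-production"

# The same person seen in two sources with different formatting.
slack_author = pseudonymize("Alice@Example.com ", key)
email_sender = pseudonymize("alice@example.com", key)
other_person = pseudonymize("bob@example.com", key)

print(slack_author == email_sender)  # same entity, same pseudonym
print(slack_author == other_person)  # different entity, different pseudonym
```

Because the mapping depends on a secret key, the pseudonyms cannot be recomputed from the raw values without that key, while relationships between records that mention the same entity are preserved.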
Who You Are
- You are a product-minded engineer and have shipped data pipelines at scale
- You have strong Python and are comfortable across NER, record linkage, and coreference
- You take security and privacy seriously and have built systems where getting it wrong wasn't an option
- You want to own a hard, ambiguous problem end-to-end rather than wait for a PRD
- AI is deeply integrated into your workflow and life
Who This Isn't For
- You want to work remote or hybrid - we're in-person 5 days/week in Dumbo
- You want to do novel ML research - this role is applied, not research
- You prefer long planning cycles or narrow ownership
What We Offer
- $180K-$280K base + meaningful equity
- 100% covered medical, dental, and vision
- Unlimited PTO
- $500 in-office setup allowance
Interview Process
- Intro Chat (20 min) - mutual fit and interests
- Technical Session (1hr) - collaborative problem-solving
- Onsite (2-3 hrs) - product deep dive, system design, meet the team
- Quick references → Offer
Vacancy posted 1 day ago

