Data Engineer

$185k - $225k

You.com

At You.com, we are building the AI Search Infrastructure that powers modern AI systems. Our goal is to create the trusted knowledge layer that agents, applications, and enterprises rely on to retrieve real‑time, accurate, and citation‑backed information. Our platform combines proprietary vertical indexes with LLM‑optimized retrieval systems to power AI agents, applications, and enterprise workflows. We are solving hard problems across search, large language models, and large‑scale infrastructure to make AI systems more reliable, transparent, and useful.

Our team includes engineers, researchers, product builders, and operators who care about solving meaningful problems and delivering real‑world impact. Whether you are improving core infrastructure, shaping product experiences, or helping bring new AI capabilities to market, your work will help define how modern AI finds and uses knowledge.

About the Role

We are looking for a hands‑on Data Engineer to help build and scale our modern data platform. In this role, you will work closely with engineering, product, and analytics teams to develop reliable, high‑performance data pipelines and systems. You’ll contribute to both batch and real‑time data processing using technologies like Databricks, AWS, and Kafka, while helping ensure data quality, accessibility, and usability across the organization.

You’ll play a key role in enabling data activation, ensuring that high‑quality data flows not only into the warehouse but also outward to business tools like Salesforce, HubSpot, and Braze. Additionally, you will help power next‑generation AI‑driven applications, including agent‑based systems and retrieval‑augmented generation (RAG), by building robust data foundations and pipelines.

This is a great opportunity for someone who enjoys solving data challenges end‑to‑end from ingestion to insights.

Responsibilities
  • Build and maintain scalable data pipelines (batch and streaming) using tools like Databricks, Spark, Kafka, and AWS services
  • Design, develop, and optimize ETL/ELT workflows using DBT, PySpark, SQL and tools like Fivetran
  • Partner closely with marketing and growth teams to enable data use cases such as segmentation, campaign targeting, and lifecycle analytics
  • Develop and maintain reverse ETL pipelines to sync data from the warehouse to tools like Salesforce, HubSpot, Braze and other downstream systems
  • Create and manage curated datasets to support analytics, reporting, and go‑to‑market initiatives
  • Build and maintain dashboards and reporting layers to support marketing and business performance tracking
  • Support AI/ML and agent‑based applications by preparing and serving high‑quality datasets for RAG pipelines and MCP (Model Context Protocol) integrations
  • Monitor pipeline performance, troubleshoot issues, and ensure high data reliability and quality
  • Implement data quality checks, validations, and alerting mechanisms across both ingestion and activation layers
  • Collaborate with cross‑functional teams to define data contracts and ensure consistency across systems

Qualifications
  • 6+ years of experience in data engineering or a related field
  • Strong hands‑on experience with Databricks, AWS (S3, Glue, Athena, EMR, etc.) and Kafka
  • Proficiency in Python (PySpark) and SQL for large‑scale data processing
  • Experience building and maintaining ETL/ELT pipelines (DBT/Airflow or similar experience preferred)
  • Experience with data ingestion tools such as Fivetran (or similar)
  • Familiarity with reverse ETL / data activation workflows and syncing data to tools like Salesforce, HubSpot, Braze
  • Exposure to or experience with AI/ML data pipelines, including RAG architectures, vector databases, or embeddings workflows
  • Familiarity with agent‑based systems, MCP integrations, or LLM‑powered applications is a strong plus
  • Experience working with marketing, product, or growth teams on data use cases (segmentation, attribution, campaign analytics, etc.)
  • Understanding of data modeling and working with large‑scale datasets (batch and streaming)
  • Experience creating dashboards and supporting reporting workflows (BI tools) for both internal and external audiences
  • Strong problem‑solving skills and ability to debug production data issues
  • Strong communication skills and ability to work collaboratively across teams

Our salary bands are structured based on a combination of geographic tiers and internal leveling. Compensation is determined by multiple factors assessed during the interview process, with the final offer reflecting these considerations.

Salary Band

$185,000 - $225,000 USD

Company Perks:
  • Hubs in San Francisco and New York City offering regular in‑person gatherings and co‑working sessions
  • Flexible PTO with U.S. holidays observed and a week shutdown in December to rest and recharge*
  • A competitive health insurance plan covers 100% of the policyholder and 75% for dependents*
  • 12 weeks of paid parental leave in the U.S.*
  • 401(k) program, 3% match – vested immediately!*
  • $500 work‑from‑home stipend to be used up to a year of your start date*
  • $600 technology stipend to support a portion of our hybrid/remote team's cell phone and internet expenses*
  • $1,200 per year Health & Wellness Allowance to support your personal goals*
  • The chance to collaborate with a team at the forefront of AI research

*Certain perks and benefits are limited to full‑time employees only

You.com participates in E‑Verify. We will provide the Social Security Administration (SSA) and, if necessary, the Department of Homeland Security (DHS) with information from each new employee’s Form I‑9 to confirm work authorization. (English/Spanish: E‑Verify Participation / Right to Work).

We are also an inclusive, equitable, and accessible workplace. Please let us know if you require accommodation for any portion of the recruitment and hiring process.

Beware of recruiting scams: You.com will only contact you through official @You.com email addresses and will never ask for payment or sensitive personal information during the hiring process.

Vacancy posted 2 days ago