Sign up to access all features of our service.
  • Job search
  • Favorites
  • Create a CV
    New
  • Salaries
  • Subscriptions

Software Engineer, Distributed Data Systems (US)

$215k - $250k

Onehouse

Job Description

Job Description

About Onehouse

Onehouse is a mission-driven company dedicated to freeing data from data platform lock-in. We deliver the industry’s most interoperable data lakehouse through a cloud-native managed service built on Apache Hudi. Onehouse enables organizations to ingest data at scale with minute-level freshness, centrally store it, and make available to any downstream query engine and use case (from traditional analytics to real-time AI / ML).

We are a team of self-driven, inspired, and seasoned builders that have created large-scale data systems and globally distributed platforms that sit at the heart of some of the largest enterprises out there including Uber, Snowflake, AWS, Linkedin, Confluent and many more. Riding off a fresh $35M Series B backed by Craft, Greylock and Addition Ventures, we're now at $68M total funding and looking for rising talent to grow with us and become future leaders of the team. Come help us build the world's best fully managed and self-optimizing data lake platform!

*If not local to Bay Area, you must be willing to relocate within 45 days and onboard in person for one week. Relocation package provided.

The Community You Will Join

When you join Onehouse, you're joining a team of passionate professionals tackling the deeply technical challenges of building a 2-sided engineering product. Our engineering team serves as the bridge between the worlds of open source and enterprise: contributing directly to and growing Apache Hudi (already used at scale by global enterprises like Uber, Amazon, ByteDance etc) and concurrently defining a new industry category - the transactional data lake. The Data Infrastructure team is the grounding heartbeat to all of this. We live and breathe databases, building cornerstone infrastructure by working under Hudi's hood to solving incredibly complex optimization and systems problems.

The Impact You Will Drive:
  • As a foundational member of the Data Infrastructure team, you will productionize the next generation of our data tech stack by building the software and data features that actually process all of the data we ingest.
  • Accelerate our open source <> enterprise flywheel by working on the guts of Apache Hudi's transactional engine and optimizing it for diverse Onehouse customer workloads.
  • Act as a SME to deepen our teams' expertise on database internals, query engines, storage and/or stream processing.
A Typical Day:
  • Design new concurrency control and transactional capabilities that maximize throughput for competing writers.
  • Design and implement new indexing schemes, specifically optimized for incremental data processing and analytical query performance.
  • Design systems that help scale and streamline metadata and data access from different query/compute engines.
  • Solve hard optimization problems to improve the efficiency (increase performance and lower cost) of distributed data processing algorithms over a Kubernetes cluster.
  • Leverage data from existing systems to find inefficiencies, and quickly build and validate prototypes.
  • Collaborate with other engineers to implement and deploy, safely rollout the optimized solutions in production.
What You Bring to the Table:
  • Strong, object-oriented design and coding skills (Java and/or C/C++ preferably on a UNIX or Linux platform).
  • Experience with inner workings of distributed (multi-tiered) systems, algorithms, and relational databases.
  • You embrace ambiguous/undefined problems with an ability to think abstractly and articulate technical challenges and solutions.
  • An ability to prioritize across feature development and tech debt with urgency and speed.
  • An ability to solve complex programming/optimization problems.
  • An ability to quickly prototype optimization solutions and analyze large/complex data.
  • Robust and clear communication skills.
  • Nice to haves (but not required):
  • Experience working with database systems, Query Engines or Spark codebases.
  • Experience in optimization mathematics (linear programming, nonlinear optimization).
  • Existing publications of optimizing large-scale data systems in top-tier distributed system conferences.
  • PhD degree with 2+ years industry experience in solving and delivering high-impact optimization projects.

How We'll Take Care of You

- Competitive Compensation; the estimated base salary range for this role is $215,000 - $250,000

-Equity Compensation; our success is your success with eligible participation in our company equity plan

- Health & Well-being; we'll invest in your physical and mental well-being with up to 90% health coverage (50% for spouses/dependents) including comprehensive medical, dental & vision benefits

- Financial Future; we'll invest in your financial well-being by making this role eligible to contribute to our company 401(k) or Roth 401(k) retirement plan

- Location; we are a remote-friendly company (internationally distributed across N. America + India), though some roles will be subject to in-person requirements in alignment with the needs of the business

- Generous Time Off; unlimited PTO (mandatory 1 week/year minimum), uncapped sick days and 11 paid company holidays

- Company Camaraderie; Annual company offsites and Quarterly team onsites @Sunnyvale HQ

- Food & Meal Allowance; weekly lunch stipend, in-office snacks/drinks

- Equipment; we'll provide you with the equipment you need to be successful and a one-time $500 stipend for your initial desk setup

- Child Bonding!; 8 weeks off for parents (birthing, non-birthing, adoptive, foster, child placement, new guardianship) - fully paid so you can focus your energy on your newest addition

House Values

One Team

Optimize for the company, your team, self - in that order. We may fight long and hard in the trenches, take care of your co-workers with empathy. We give more than we take to build the one house, that everyone dreams of being part of.

Tough & Persevering

We are building our company in a very large, fast-growing but highly competitive space. Life will get tough sometimes. We take hardships in the stride, be positive, focus all energy on the path forward and develop a champion's mindset to overcome odds. Always day one!

Keep Making It Better Always

Rome was not built in a day; If we can get 1% better each day for one year, we'll end up thirty-seven times better. This means being organized, communicating promptly, taking even small tasks seriously, tracking all small ideas, and paying it forward.

Think Big, Act Fast

We have tremendous scope for innovation, but we will still be judged by impact over time. Big, bold ideas still need to be strategized against priorities, broken down, set in rapid motion, measure, refine, repeat. Great execution is what separates promising companies from proven unicorns.

Be Customer Obsessed

Everyone has the responsibility to drive towards the best experience for the customer, be an OSS user or a paid customer. If something is broken, own it, say something, do something; never ignore. Be the change that you want to see in the company.

Pay Range Transparency

Onehouse is committed to fair and equitable compensation practices. Our job titles may span more than one career level. The pay range(s) for this role is listed above and represents the base salary range for non-commissionable roles or on-target earnings for commissionable roles. Actual compensation packages are dependent upon several factors that are unique to each candidate, including but not limited to: job-related skills, depth of transferable experience, relevant certifications and training, business needs, market demands and specific work location. Based on the factors above, Onehouse utilizes the full width of the range; the base pay range is subject to change and may be modified in the future. The total compensation package for this position will also include eligibility for equity options and the benefits listed above.

We may use artificial intelligence (AI) tools to support parts of the hiring process, such as reviewing applications, analyzing resumes, or assessing responses and identifying potential inconsistencies or verification signals in application materials based on available information. These tools assist our recruitment team but do not replace human judgment. Final hiring decisions are ultimately made by humans. If you would like more information about how your data is processed, please contact us.

Vacancy posted 8 days ago
Similar jobs that could be interesting for youBased on the Software Engineer, Distributed Data Systems (US) in Sunnyvale, CA vacancy
  • $215k - $250k

     ...Onehouse Data Infrastructure Engineer Onehouse is a mission-driven company...  ...created large-scale data systems and globally distributed platforms that sit at...  ...rising talent to grow with us and become future...  ...tech stack by building the software and data features that actually... 
    Suggested
    Odd job
    Work at office
    Local area
    Remote work
    Relocation
    Relocation package

    OneHouse LLC

    Sunnyvale, CA
    5 days ago
  • $192k - $260k

     ...obsessed with enabling data teams to solve the...  ...workloads, making us one of the fastest...  ...the world. Our engineering teams build highly...  ...the largest scale software platforms. The...  ...network, and operating system faults, and our...  ...PhD in databases, distributed systems.... 
    Suggested
    Work at office
    Local area

    Menlo Ventures

    Mountain View, CA
    3 days ago
  • $192k - $260k

    Databricks is looking for a seasoned engineer with over 8 years of experience in Java,...  ...candidate will contribute to our innovative data and AI infrastructure platform,...  ...a strong foundation in algorithms and distributed systems. We offer a generous salary range of $1... 
    Suggested

    Menlo Ventures

    Mountain View, CA
    4 days ago
  • $140k - $240k

     ...Cerebras Systems builds the world's largest...  ...security-first based engineering. Cerebras cluster...  ...management software stack - all the way...  ...management role in distributed systems security....  ...making skills with data and trade-off analysis...  ...of our team tell us there are five main... 
    Suggested

    CEREBRAS SYSTEMS INC.

    Sunnyvale, CA
    4 days ago
  • Senior Software Engineer - Distributed Data Systems
    Suggested

    Databricks

    Mountain View, CA
    3 days ago
  • Nuro, based in Mountain View, is seeking senior engineers to build and scale its large-scale computing infrastructure. The role involves...  ...applications. The ideal candidate has experience with distributed applications and holds a bachelor's degree in Computer Science... 

    I did my part and supported the Regular Toilet

    Mountain View, CA
    1 day ago
  •  ...experiences-from AI and data centers, to PCs, gaming and embedded systems. Grounded in a culture...  ...diverse perspectives. Join us as we shape the future...  ...for a strategic software engineering lead who is passionate...  ...optimize inference like distributed kv-cache, disaggregation... 

    Advanced Micro Devices , Inc.

    Santa Clara, CA
    3 days ago
  •  ...About Us : In today's world, where data spreads across various clouds and devices, traditional security...  ...We’re looking for a  Staff Software Engineer to join our Confidential Computing...  ...services powering secure, distributed systems at scale. This is a  high-impact... 
    H1b
    Worldwide

    Fortanix

    Santa Clara, CA
    5 days ago
  • $136.3k - $231.7k

    ## Software Engineer II (Distributed / Scalable Systems)Applylocations: Milpitas, CAtime type: Full timeposted on: Posted...  ...have made it into your hands without us. KLA invents systems and solutions...  ...teams of physicists, engineers, data scientists and problem-solvers... 
    Minimum wage
    Temporary work
    Work experience placement
    Flexible hours

    KLA-Belgium

    Milpitas, CA
    2 days ago
  • $160.36k - $240.54k

     ...About the Role We’re looking for senior engineers to build/scale Nuro's large-scale computing infrastructure in the cloud/data center. This system is the foundation of many critical...  ...building and developing large-scale distributed applications (e.g. Kubernetes). You’... 

    Icehouseventures

    Mountain View, CA
    1 day ago
  • $175k - $263k

     ...Technical Lead, Distributed Systems, Portworx Santa Clara, California We...  ...fundamentally reshaping the data storage industry. Here, you lead...  ...thinking, grow along with us, and join the smartest team in...  ...scalable and production quality software ~ Proven design sensibility... 
    Work at office
    Flexible hours

    Pure Storage

    Santa Clara, CA
    a month ago
  • $230k - $315k

     ...we invite you to join us! We believe collaboration...  .... As a Distinguished Engineer on the Enterprise DLP...  ...architecting and scaling the data platform that underpins...  ...the standards and systems necessary to process and...  ...trade-offs for distributed systems ~ Demonstrated... 
    Full time
    Work at office

    Palo Alto Networks

    Santa Clara, CA
    4 days ago
  • $140k - $265k

     ...Software Engineer, Data Foundations Glean is the Work AI platform that helps...  ...works, you'll help build systems used daily across Microsoft...  ...deep integrations that allow us to automate tasks, perform...  ...backpressure, and retries across distributed queues, workers, and... 
    Work at office
    Home office
    Flexible hours

    Glean.info

    Mountain View, CA
    2 days ago
  • $213k - $263k

     ...contribute to Waymo's data infrastructure...  ...the data needs, data distributions, data quality, data value...  ...experience in the field of software engineering ~ Experience...  ...scalable distributed systems We prefer:...  ...time position across US locations is listed below... 
    Full time
    Remote work

    Waymo

    Mountain View, CA
    2 days ago
  • $165k - $242k

     ...Senior Software Engineer, Data Center Infrastructure Tooling CoreWeave is...  ...Integrations with internal/external systems and data sources that feed...  ...structured cabling, power distribution, and cooling. You don't...  ...from you, too. Come join us! The base salary range for... 

    CoreWeave

    Sunnyvale, CA
    5 days ago
  • $160.36k - $240.54k

     ...Software Engineer, ML Data Infrastructure Mountain View, California (HQ) Nuro...  ...access to the world around us, that's why we're building...  ...technology. In an ML-first system, the overall system...  ...Experience working with large-scale distributed data systems Experience... 
    Work experience placement
    Immediate start
    Flexible hours

    Nuro

    Mountain View, CA
    5 days ago
  • $181.1k - $272.1k

    Sr Full-stack Software Engineer, AIML Data Operations Cupertino, California, United...  ...build and operate large-scale distributed data-processing pipelines....  ...at scale, come join us. Minimum Qualifications Bachelor...  ...production-grade software systems including meaningful... 
    Relocation

    Apple Inc.

    Cupertino, CA
    5 days ago
  • $193.93k - $291.15k

     ...to the world around us, that's why we're building...  ...where ML and systems engineering converge to push autonomy...  .... As a Perception ML Data Engineer, you'll bridge...  ...real-world driving distributions Develop high fidelity...  ...4+ years of industry software engineering... 
    Immediate start
    Flexible hours

    Nuro

    Mountain View, CA
    a month ago
  • $210k - $267k

     ...ingest large‑scale data—weather, prices,...  ...scale battery storage systems more efficient,...  ...re looking for an engineer to help lead the scaling...  ...schemas. Strong distributed systems and...  ...Temporal. Strong software engineering skills...  ...Click below or email us at careers@... 
    Work at office
    Remote work
    Work from home
    Home office
    Flexible hours
    3 days per week

    Gridmatic

    Cupertino, CA
    3 days ago
  •  ...Distributed Software Engineer Bengaluru, Karnataka, India; Sunnyvale CA or Toronto Canada Cerebras Systems builds the world's largest AI chip, 56 times larger than GPUs. Our novel wafer...  ...business. Members of our team tell us there are five main reasons they joined... 

    CEREBRAS SYSTEMS INC.

    Sunnyvale, CA
    5 days ago
  • $200k - $220k

     ...energy, manufacturing, data center...  ..., come build with us at Crusoe. About...  ...as a Senior Data Engineer, an early and pivotal...  ...and implement the systems that make those pipelines...  ...: Partner with software engineers, data scientists...  ...frameworks. Distributed Systems Knowledge:... 
    Full time
    Temporary work
    Work at office
    Remote work

    G2 Venture Partners

    Sunnyvale, CA
    2 days ago
  • $213k - $263k

     ...Senior Software Engineer, ML/Eval Data Platforms & Infrastructure Waymo is an autonomous driving technology...  ...and data scientists to help us improve how we characterize and...  ...software engineering experience. Distributed Systems: Hands-on experience with systems that... 
    Full time
    Remote work

    Waymo

    Mountain View, CA
    1 day ago
  • $147k - $211k

    Software Engineer, Distributed Rate Limiting Services Experience driving progress, solving...  ..., distributed systems or networks, or experience...  ...2 years of experience with data structures or algorithms....  ...critical business problems. The US base salary range for this... 
    Full time

    Google Inc.

    Sunnyvale, CA
    3 days ago
  • $184k - $287.5k

     ...Infrastructure organization is seeking a Senior System Software Engineer to lead the evolution of our next-generation Data & Observability Platform. We serve and...  ...engineers rely on to visualize chip telemetry, debug distributed pipelines, and ensure platform reliability.... 

    NVIDIA Gruppe

    Santa Clara, CA
    1 day ago
  • $165k - $242k

     ...the role: The Data Platforms Team serves...  ...seeking a senior engineer with...  ...processing who can help us fulfill the goal of...  ...of experience in a software or infrastructure...  ...familiar with one of the distributed NewSQL datastores...  ...and operating these systems at scale. You'... 
    Permanent employment
    Temporary work
    Casual work
    Work at office
    Flexible hours

    CoreWeave

    Sunnyvale, CA
    a month ago
  • $174k - $252k

    Senior Software Engineer, Infrastructure, Google Cloud Data Management Google Sunnyvale, CA, USA Qualifications Bachelor...  ...large-scale infrastructure, distributed systems or networks, or experience with...  ...most critical business problems. US base salary range for this full-... 
    Full time

    Google Inc.

    Sunnyvale, CA
    4 days ago
  • $320k

    Overview NVIDIA data center systems, such as DGX and HGX, are core to our...  ...these products at the system software level, covering firmware,...  ...Computer Science, Electrical Engineering, or related field (or...  ...storage architectures and distributed parallel processing paradigms... 
    Shift work

    NVIDIA Gruppe

    Santa Clara, CA
    1 day ago
  • $160k - $240k

     ...the world around us, that's why we're...  ...for self-motivated engineers to build the next-...  ...Platform team, and systems/safety team to make...  ...Work Work on distributed systems inside the...  ...infrastructure and data collection frameworks...  ...with other software teams to build foundational... 
    Immediate start
    Flexible hours

    Nuro

    Mountain View, CA
    a month ago
  • $135.96k - $197.76k

     ..., an automotive software development team with...  ...like you to help us create code that moves...  ...The Sr Software Engineer, Integration is...  ...where applicable), distribution, and flashing of ADAS...  ...vehicles and lab systems Work with ECU...  ...Experience using data/metrics to improve... 
    Permanent employment
    Temporary work
    Casual work
    Local area

    Cariad, Inc.

    Mountain View, CA
    5 days ago
  •  ...converse with all of their business systems through natural language to...  ...with Moveworks' Reasoning Engine and natural language capabilities...  ...world-class talent to help us extend agentic AI to every employee...  ...is not an ML role. This is a distributed systems engineering role at... 
    Work at office
    Remote work
    Flexible hours

    ServiceNow

    Mountain View, CA
    10 days ago

Do you want to receive more vacancies?

Subscribe and receive similar vacancies to Software Engineer, Distributed Data Systems (US). Be the first to apply!