Sign up to access all features of our service.
  • Job search
  • Favorites
  • Create a CV
    New
  • Salaries
  • Subscriptions

Software Engineer, Data Infrastructure

$180k - $250k

datologyai

About the Company

Companies want to train their own large models on their own data. The current industry standard is to train on a random sample of your data, which is inefficient at best and actively harmful to model quality at worst. There is compelling research showing that smarter data selection can train better models faster—we know because we did much of this research. Given the high costs of training, this presents a huge market opportunity. We founded DatologyAI to translate this research into tools that enable enterprise customers to identify the right data on which to train, resulting in better models for cheaper. Our team has pioneered deep learning data research, built startups, and created tools for enterprise ML. For more details, check out our recent blog posts sharing our high-level results for text models and image-text models.

We've raised over $57M in funding from top investors like Radical Ventures, Amplify Partners, Felicis, Microsoft, Amazon, and notable angels like Jeff Dean, Geoff Hinton, Yann LeCun and Elad Gil. We're rapidly scaling our team and computing resources to revolutionize data curation across modalities.

This role is based in Redwood City, CA. We are in office 4 days a week.

About the Role

We're looking for an experienced Data Platform Engineer to join as a member of our core Datology AI team. As one of our early senior hires, you will partner closely with our founders on the direction of our product and drive business-critical technical decisions. You will lead the development of our core product and data platform. These are key components of our stack that allow us to process customer data and apply state of the art research for identifying the most informative data points in large-scale datasets. You will have a broad impact over the technology, product, and our company's culture. We provide visa sponsorship for candidates selected for this role.

What You'll Work On
  • Design, build and maintain highly scalable data processing solutions, while ensuring scalability, reliability, and security.

  • Architect, build, and deploy the back-end systems and services that power our data curation platform.

  • Partner with researchers and engineers to bring new features and research capabilities to our customers.

  • Ensure that our systems are reliable, secure, and worthy of our customers' trust.

About You
  • Have meaningful experience with leading and building production data systems to deliver on major product initiatives.

    • You have built and managed highly scalable data processing solutions (e.g. Spark, Flink), data lakes or warehouses (e.g. Snowflake, Hive), authored queries (SQL), distributed storage systems (e.g., HDFS, S3), used workflow management (e.g. Airflow, Dagster), and have experience maintaining the infra that supports these.

  • Proficiency in at least one programming language commonly used within Data Engineering, such as Python, Scala, or Java.

  • Expertise with any of ETL schedulers such as Airflow, Dagster, or similar frameworks.

  • Experience maintaining a high quality bar for design, correctness, and testing.

  • Take pride in building and operating scalable, reliable, secure systems.

  • Have a humble attitude, an eagerness to help your colleagues, and a desire to do whatever it takes to make the team succeed.

  • Own problems end-to-end, and are willing to pick up whatever knowledge you're missing to get the job done.

  • You have experience being the technical lead of a Data Engineering / Platform / Infrastructure Team.

  • Experience building ML/DL systems and/or data infrastructure that feeds into training large ML models.

Don’t meet every single requirement? We still encourage you to apply. If you’re excited about our mission and eager to learn, we want to hear from you!

Compensation

At DatologyAI, we are dedicated to rewarding talent with highly competitive salary and significant equity. The base salary for this position ranges from $180,000 to $250,000.

  • The candidate's starting pay will be determined based on job-related skills, experience, qualifications, and interview performance.

We offer a comprehensive benefits package to support our employees' well-being and professional growth:

  • 100% covered health benefits (medical, vision, and dental).

  • 401(k) plan with a generous 4% company match.

  • Unlimited PTO policy.

  • Annual $2,000 wellness stipend.

  • Annual $1,000 learning and development stipend.

  • Daily lunches and snacks are provided in our office!

  • Relocation assistance for employees moving to the Bay Area.

#J-18808-Ljbffr
Vacancy posted 3 days ago
Similar jobs that could be interesting for youBased on the Software Engineer, Data Infrastructure in Redwood City, CA vacancy
  •  ...About the Role As a Data Infrastructure Engineer in Research at Luma, you will play a critical role in building and scaling the data infrastructure that supports our cutting-edge multimodal AI systems. Your work will focus on developing high-throughput, large-scale... 
    Suggested

    Luma AI

    Redwood City, CA
    2 days ago
  •  ...Software Engineer, Data Infrastructure, Entry Level Join to apply for the Software Engineer, Data Infrastructure, Entry Level role at Jobright.ai Software Engineer, Data Infrastructure, Entry Level Join to apply for the Software Engineer, Data Infrastructure... 
    Suggested
    Full time
    H1b

    jobright.com

    Mountain View, CA
    3 days ago
  • $193.93k - $291.15k

     ...Sr. Software Engineer, Perception Data Infrastructure Mountain View, California (HQ) About the Role We are a team of high-output generalists where ML and systems engineering converge to push autonomy performance forward. As a Senior Perception ML Data Infrastructure... 
    Suggested

    Nuro

    Mountain View, CA
    3 days ago
  • $155k - $185k

    The Opportunity We are looking for an experienced Software Engineer with a passion for building robust and scalable data infrastructure to join our Data Platform team. In this role, you'll design and develop the foundational systems that power the flow of data across the... 
    Suggested
    Permanent employment

    Otter.ai

    Mountain View, CA
    3 days ago
  • $213k - $263k

     ...Waymo ML Platform team, builds tools and infrastructure to realize the ML flywheel at Waymo....  ...Develop and contribute to Waymo's data infrastructure platform to enable plant...  ...professional experience in the field of software engineering ~ Experience programming in C++ ~... 
    Suggested
    Full time
    Remote work

    Waymo

    Mountain View, CA
    3 days ago
  • $160.36k - $240.54k

     ...Software Engineer, ML Data Infrastructure Mountain View, California (HQ) Nuro is a self-driving technology company on a mission to make autonomy accessible to all. Founded in 2016, Nuro is building the world's most scalable driver, combining cutting-edge AI with... 
    Work experience placement

    Nuro

    Mountain View, CA
    11 days ago
  • $147.2k - $200.9k

     ...Software Engineer, Data Infrastructure Mountain View, California Intrinsic is an AI robotics group at Google aiming to reimagine the potential of industrial robotics. Our team believes that advances in AI, perception and simulation will redefine what's possible... 
    Full time
    Local area

    Intrinsic

    Mountain View, CA
    4 days ago
  • $214k - $295k

     ...Staff Software Engineer, Data Infrastructure, AI Compute Platform Redwood City, CA (Hybrid) Biohub is the first large-scale initiative bringing frontier AI models, massive compute, and frontier experimental capabilities under one roof. We're building a general-purpose... 
    Work at office
    Worldwide
    Relocation package
    Flexible hours
    3 days per week

    Biohub

    Redwood City, CA
    1 day ago
  • $153k - $222k

     ...exception.) About the role We are looking for infrastructure engineers with expertise in scaling open-source data infrastructure to join the Data & ML infra group...  ...integration hooks. Develop and deploy high-quality software using modern tooling and frameworks, especially... 
    Full time
    For contractors
    For subcontractor
    Casual work
    Work at office
    Remote work
    Day shift

    Decisive Point

    Mountain View, CA
    10 hours ago
  • $147k - $211k

    Software Engineer, Infrastructure and Data AI, Ads Platform Google Mountain View, CA, USA Bachelor’s degree in Computer Science, a related technical field, or equivalent practical experience. 2 years of experience with software development in Java, or 1 year of experience... 
    Full time
    Local area

    Google Inc.

    Mountain View, CA
    10 hours ago
  •  ...A tech-forward robotics company located in California is seeking a Software Engineer for their Data Infrastructure team. This role focuses on designing and maintaining data pipelines that power innovative robotics systems, integrating real-world data into machine learning... 
    Full time

    Company

    Mountain View, CA
    3 days ago
  •  ...updated 3D information about the places, infrastructure, terrain, and activity that shape...  ...designed to create high-resolution 3D data products of the Earth at unprecedented...  ...Earth.   About the Job As Staff Software Engineer for data infrastructure, you will play... 
    Permanent employment
    Full time
    Remote work
    Night shift

    Array Labs

    Redwood City, CA
    3 days ago
  • $180k - $250k

     ...A tech-driven AI company in Redwood City is seeking an Infrastructure Engineer to develop core infrastructure and support multi-cloud environments. The ideal candidate has experience in large-scale infrastructure, proficiency with tools such as Kubernetes, and a passion... 

    Datology

    Redwood City, CA
    4 days ago
  • $150k - $300k

    As Staff Software Engineer for data infrastructure, you will play a crucial role in designing and implementing the systems that process, analyze, and serve our satellite constellation’s data to end‑users. You will have the opportunity to shape highly reliable backend infrastructure... 
    Permanent employment
    Full time
    Remote work

    Array Labs Inc.

    Redwood City, CA
    1 day ago
  • $145k - $187k

     ...is the Enterprise AI application software company. C3 AI delivers a family...  ...AI is looking for Senior Software Engineers to join the rapidly growing Data org within the Platform Engineering...  ...-scale distributed systems, data infrastructure, and machine learning. You will... 
    Work experience placement

    C3 AI

    Redwood City, CA
    3 days ago
  • $119.8k - $234.7k

     ...are building a large-scale, productized data platform that powers critical insights...  ...across Azure-based services for AI Infrastructure. This platform will process terabytes...  ...-term evolution. As a Senior Software Engineer - Data Platform, AI Framework you will... 
    Ongoing contract
    Local area

    Microsoft Corporation

    Mountain View, CA
    9 days ago
  • $192k - $240k

     ...the model, it starts with the data. We're on a mission to...  ...organizations to empower scientists, engineers, financial experts, product...  ...standards to codebases, infrastructure, and processes. Work a...  ...developing and shipping enterprise software products , specifically... 
    Work at office
    Local area
    3 days per week

    Snorkel AI

    Redwood City, CA
    1 day ago
  •  ...landscape of robotic automation. Position Overview As a Software Engineer, Data Infra you are the architect of the "Laboratory" where...  ...a high-impact, hands-on role where you will design the infrastructure to visualize model performance, automate data labeling,... 

    DYNA Robotics Inc

    Redwood City, CA
    2 days ago
  •  ...to one-of-a-kind vintage and luxury. The Big Data team is a central player in the Poshmark organization...  ...new business critical initiatives. The Data Engineering team at Poshmark is looking for an experienced software engineer to scale Datalake, ensuring real-time... 

    Poshmark

    Redwood City, CA
    4 days ago
  •  ...Senior Software Engineer - Ai Data & Analytics HOAi is the fastest-growing company revolutionizing the community association management...  ..., not just inputs Customer-focused: Understands how infrastructure decisions impact end-user experience Data-driven: Makes... 
    Work at office
    Remote work
    Flexible hours

    Vantaca

    Redwood City, CA
    12 hours ago
  •  ...innovation, we'd love to hear from you. What to Expect Data is our lifeblood. We use AR headsets and Skill Capture Gloves™ to capture human motion at scale, and your role as a Full Stack Engineer for Data Operations is to grow the platform that manages this pipeline... 
    Remote work
    Shift work

    Sunday

    Redwood City, CA
    4 days ago
  •  ...What to Expect You are the bridge between raw data and robotic intelligence. As a Full Stack Engineer, ML Data & Evals, you will build the "Laboratory"...  ...the research-to-production loop, creating the infrastructure to launch on-robot evaluations and visualize model... 
    Shift work

    Sunday

    Redwood City, CA
    4 days ago
  • $180k - $250k

     ...training compute is wasted training on data that are already learned, irrelevant, or...  ...on both data research and data engineering necessary to solve this incredibly challenging...  ...Role We're looking for an experienced Infrastructure Engineer to join as a member of our core... 
    Work at office
    Relocation package

    Datology

    Redwood City, CA
    3 days ago
  •  ...at one of the fastest-growing voice AI startups. Let's build the future together. About The Role As a Senior Software Engineer - Infrastructure, you'll be the owner of our build, release, and runtime foundations. You'll design and automate deployment pipelines... 
    H1b
    Work at office
    Relocation

    Retell AI

    Redwood City, CA
    1 day ago
  • $145k - $215k

     ...Software Engineer In Test - Infrastructure Redwood City, CA (Hybrid); San Francisco, CA (Hybrid) At Snorkel, we believe meaningful AI doesn't start with the model, it starts with the data. We're on a mission to help enterprises transform expert knowledge into... 
    Local area

    Snorkel AI

    Redwood City, CA
    16 days ago
  • $180k - $250k

     ...A fast-growing tech startup in Redwood City, CA is seeking an experienced Infrastructure Engineer to lead the development of their data infrastructure. The role involves architecting core systems across multiple cloud providers and ensuring reliability at scale. The ideal... 

    datologyai

    Redwood City, CA
    3 days ago
  •  ...Snowflake is hiring a Senior Software Engineer for the Observe team in Menlo Park, California. The ideal candidate has over 7 years of software...  ..., and stream processing. Responsibilities include owning the data modeling surface, designing APIs, and leading technical teams... 

    Snowflake Computing

    Menlo Park, CA
    3 days ago
  •  ...operate highly performant and scalable batch and stream data processing infrastructure and solutions to support day to day ML operations...  ...bring to the table: ~6+ years of experience as senior/software engineer ~ Experience with Python or Golang or Java or C++ ~... 

    Moveworks.ai

    Mountain View, CA
    3 days ago
  • $200k - $250k

     ...commerce company is looking for a Senior Software Engineer to architect and scale systems that process large volumes of social data from platforms like Instagram and TikTok....  ...a collaborative team dedicated to shaping AI-native marketing infrastructure. #J-18808-Ljbffr... 

    Nectar Social

    Palo Alto, CA
    3 days ago
  • $185k - $250k

     ...impact contributions to Mudflap’s explosive growth. Senior Software Engineer, Data Platforms Location Palo Alto, CA Employment Type...  ...’ll play a critical role in designing and operating the infrastructure, frameworks, and services that enable teams to ingest,... 
    Full time
    Work at office
    Remote work

    Mudflapinc

    Palo Alto, CA
    3 days ago

Do you want to receive more vacancies?

Subscribe and receive similar vacancies to Software Engineer, Data Infrastructure. Be the first to apply!