Software Engineer, Data Infrastructure
$180k - $250kDatologyAI
About the Company Companies want to train their own large models on their own data. The current industry standard is to train on a random sample of your data, which is inefficient at best and actively harmful to model quality at worst. There is compelling research showing that smarter data selection can train better models faster—we know because we did much of this research. Given the high costs of training, this presents a huge market opportunity. We founded DatologyAI to translate this research into tools that enable enterprise customers to identify the right data on which to train, resulting in better models for cheaper. Our team has pioneered deep learning data research, built startups, and created tools for enterprise ML. For more details, check out our recent blog posts sharing our high-level results for text models and image-text models. We've raised over $57M in funding from top investors like Radical Ventures, Amplify Partners, Felicis, Microsoft, Amazon, and notable angels like Jeff Dean, Geoff Hinton, Yann LeCun and Elad Gil. We're rapidly scaling our team and computing resources to revolutionize data curation across modalities. This role is based in Redwood City, CA. We are in office 4 days a week. About the Role We're looking for an experienced Data Platform Engineer to join as a member of our core Datology AI team. As one of our early senior hires, you will partner closely with our founders on the direction of our product and drive business-critical technical decisions. You will lead the development of our core product and data platform. These are key components of our stack that allow us to process customer data and apply state of the art research for identifying the most informative data points in large-scale datasets. You will have a broad impact over the technology, product, and our company's culture. We provide visa sponsorship for candidates selected for this role. What You'll Work On Design, build and maintain highly scalable data processing solutions, while ensuring scalability, reliability, and security. Architect, build, and deploy the back-end systems and services that power our data curation platform. Partner with researchers and engineers to bring new features and research capabilities to our customers. Ensure that our systems are reliable, secure, and worthy of our customers' trust. About You Have meaningful experience with leading and building production data systems to deliver on major product initiatives. You have built and managed highly scalable data processing solutions (e.g. Spark, Flink), data lakes or warehouses (e.g. Snowflake, Hive), authored queries (SQL), distributed storage systems (e.g., HDFS, S3), used workflow management (e.g. Airflow, Dagster), and have experience maintaining the infra that supports these. Proficiency in at least one programming language commonly used within Data Engineering, such as Python, Scala, or Java. Expertise with any of ETL schedulers such as Airflow, Dagster, or similar frameworks. Experience maintaining a high quality bar for design, correctness, and testing. Take pride in building and operating scalable, reliable, secure systems. Have a humble attitude, an eagerness to help your colleagues, and a desire to do whatever it takes to make the team succeed. Own problems end-to-end, and are willing to pick up whatever knowledge you're missing to get the job done. You have experience being the technical lead of a Data Engineering / Platform / Infrastructure Team. Experience building ML/DL systems and/or data infrastructure that feeds into training large ML models. Don’t meet every single requirement? We still encourage you to apply. If you’re excited about our mission and eager to learn, we want to hear from you! Compensation At DatologyAI, we are dedicated to rewarding talent with highly competitive salary and significant equity. The base salary for this position ranges from $180,000 to $250,000. The candidate's starting pay will be determined based on job-related skills, experience, qualifications, and interview performance. We offer a comprehensive benefits package to support our employees' well-being and professional growth: 100% covered health benefits (medical, vision, and dental). 401(k) plan with a generous 4% company match. Unlimited PTO policy. Annual $2,000 wellness stipend. Annual $1,000 learning and development stipend. Daily lunches and snacks are provided in our office! Relocation assistance for employees moving to the Bay Area. #J-18808-Ljbffr DatologyAI
$147.2k - $200.9k
...Software Engineer, Data Infrastructure Mountain View, California Intrinsic is an AI robotics group at Google aiming to reimagine the potential of industrial robotics. Our team believes that advances in AI, perception and simulation will redefine what's possible...SuggestedFull timeLocal area$155k - $185k
The Opportunity We are looking for an experienced Software Engineer with a passion for building robust and scalable data infrastructure to join our Data Platform team. In this role, you'll design and develop the foundational systems that power the flow of data across the...SuggestedPermanent employment$213k - $263k
...Waymo ML Platform team, builds tools and infrastructure to realize the ML flywheel at Waymo.... ...Develop and contribute to Waymo's data infrastructure platform to enable plant... ...professional experience in the field of software engineering ~ Experience programming in C++ ~...SuggestedFull timeRemote work$193.93k - $291.15k
...Sr. Software Engineer, Perception Data Infrastructure Mountain View, California (HQ) Who We Are Nuro believes self-driving vehicles are the most immediate and profound opportunity for AI to drive positive change in the physical world. Safer streets, more time...SuggestedImmediate startFlexible hours$160.36k - $240.54k
...Software Engineer, ML Data Infrastructure Mountain View, California (HQ) Nuro believes self-driving vehicles are the most immediate and profound opportunity for AI to drive positive change in the physical world. Safer streets, more time for what matters, and easier...SuggestedWork experience placementImmediate startFlexible hours$214k - $295k
...Staff Software Engineer, Data Infrastructure, AI Compute Platform Redwood City, CA (Hybrid) Biohub is the first large-scale initiative bringing frontier AI models, massive compute, and frontier experimental capabilities under one roof. We're building a general-purpose...Work at officeWorldwideRelocation packageFlexible hours3 days per week- ...is the Enterprise AI application software company. C3 AI delivers a family... ...AIis looking for Senior Software Engineers to join the rapidly growing Data org within the Platform Engineering... ...-scale distributed systems, data infrastructure, and machine learning. You will...Work experience placement
$180k - $250k
A tech-driven AI company in Redwood City is seeking an Infrastructure Engineer to develop core infrastructure and support multi-cloud environments. The ideal candidate has experience in large-scale infrastructure, proficiency with tools such as Kubernetes, and a passion...$190k - $240k
...model, it starts with the data. We’re on a mission to help... ...organizations to empower scientists, engineers, financial experts, product... ...standards to codebases, infrastructure, and processes. * Work a... ...and shipping enterprise software products, specifically those...Work at officeLocal area3 days per week- ...updated 3D information about the places, infrastructure, terrain, and activity that shape... ...designed to create high-resolution 3D data products of the Earth at unprecedented... ...Earth. About the Job As Staff Software Engineer for data infrastructure, you will play...Permanent employmentFull timeRemote workNight shift
- ...just accounting and management software, Vantaca is intelligent... ...generate an enormous amount of data: every task they execute,... ...anomaly detection, and the engineering rigor that makes numbers trustworthy... ...-focused: Understands how infrastructure decisions impact end-user...Work at officeRemote workFlexible hours
- ...to one-of-a-kind vintage and luxury. The Big Data team is a central player in the Poshmark organization... ...new business critical initiatives. The Data Engineering team at Poshmark is looking for an experienced software engineer to scale Datalake, ensuring real-time...
- ...landscape of robotic automation. Position Overview As a Software Engineer, Data Infra you are the architect of the "Laboratory" where... ...a high-impact, hands-on role where you will design the infrastructure to visualize model performance, automate data labeling,...
- ...at one of the fastest-growing voice AI startups. Let's build the future together. About The Role As a Senior Software Engineer - Infrastructure, you'll be the owner of our build, release, and runtime foundations. You'll design and automate deployment pipelines...H1bWork at officeRelocation
$150k - $200k
...Software Engineer In Test - Infrastructure Redwood City, CA (Hybrid); San Francisco, CA (Hybrid) At Snorkel, we believe meaningful AI doesn't start with the model, it starts with the data. We're on a mission to help enterprises transform expert knowledge into...Local area$180k - $300k
...training compute is wasted training on data that are already learned, irrelevant, or... ...on both data research and data engineering necessary to solve this incredibly challenging... ...We're looking for an experienced Infrastructure Engineer to join as a member of our core...Work at officeRelocation package$228.6k - $314.25k
Databricks is looking for an experienced engineer to join the ManagedTables team. You'll drive the development of storage solutions, optimize large production clusters, and mentor fellow engineers. With 15+ years in distributed systems, you’ll work on enhancing database...$228.6k - $314.25k
Databricks is seeking an experienced software engineer to work on enterprise-grade analytical data systems, focusing on distributed systems and performance optimization. In this role, you will be responsible for delivering scalable architectures and mentoring team members...$213k - $263k
...Senior Software Engineer, ML/Eval Data Platforms & Infrastructure Waymo is an autonomous driving technology company with the mission to be the world's most trusted driver. Since its start as the Google Self-Driving Car Project in 2009, Waymo has focused on building...Full timeRemote work$185k - $215k
.... We are building the foundational data platform that powers reliable, scalable... ...Mudflap's systems. As a Senior Software Engineer, Data Platforms , you'll play a critical... ...critical role in designing and operating the infrastructure, frameworks, and services that enable...Remote work$150k - $200k
...lack great design, make limited use of software, and are difficult and stressful to adopt... ...embedded C++ on custom devices, cloud infrastructure, and mobile apps. Right now, too much... ...the code. We're looking for a software engineer to own our developer infrastructure: the...Full timeWork at officeLocal areaRelocation packageShift work3 days per week- ...vintage and luxury. The Core Infrastructure team drives developer... ..., release, run, and support software; for example, ensuring automated... ...developer experience a.k.a. engineering enablement at Poshmark,... ...with strong competencies in data structures, algorithms, and...
$180k - $250k
A fast-growing tech startup in Redwood City, CA is seeking an experienced Infrastructure Engineer to lead the development of their data infrastructure. The role involves architecting core systems across multiple cloud providers and ensuring reliability at scale. The ideal...$154.4k - $212.3k
About the role This role sits within our Data Layer and Marketing AI (MAI) platform,... ..., distributed compute, and platform engineering. Key Responsibilities Design and build... ...search. Experience building platform or infrastructure layers supporting multiple teams. Benefits...- ...the Role At Solace, we are building the data foundation that will power patient outcomes... ...moves fast. We are looking for a Data Engineer who loves solving hard problems with... ...environment. In this role, you will architect the infrastructure that allows us to scale. You will be a...
$160.36k - $240.54k
...its training and evaluation data. The team plays a crucial role... ...scalable and reliable data infrastructure. This infrastructure is... ...collaborates closely with system engineers to thoroughly validate the... ...best practices across broader software organization. A bachelor's...Work experience placement$160.36k - $240.54k
...its training and evaluation data. The team plays a crucial... ...scalable and reliable data infrastructure. This infrastructure is designed... ...closely with system engineers to thoroughly validate the autonomous... ...practices across broader software organization A bachelor's...Work experience placementImmediate startFlexible hours$220k - $260k
...the model, it starts with the data. We're on a mission to... ...organizations to empower scientists, engineers, financial experts, product... ...We are seeking a Senior Software Engineer to evolve Snorkel's... ...posture across our cloud infrastructure, developer platform, and product...Local area$180k - $250k
...training compute is wasted training on data that are already learned, irrelevant, or... ...on both data research and data engineering necessary to solve this incredibly challenging... ...We’re looking for an experienced Cloud Infrastructure Engineer to join our core team at DatologyAI...Work at officeRelocation package$139.8k - $205.04k
A leading electric vehicle manufacturer is seeking an Engineering Manager to lead a team focused on data engineering and infrastructure. Responsibilities include managing teams, guiding technical direction, and ensuring the reliable operation of data systems. Candidates...
Do you want to receive more vacancies?
Subscribe and receive similar vacancies to Software Engineer, Data Infrastructure. Be the first to apply!
- federal - software developer Redwood City, CA
- software engineer healthcare Redwood City, CA
- network software engineer Redwood City, CA
- ngo software engineer Redwood City, CA
- software development engineer aws Redwood City, CA
- software developer fintech Redwood City, CA
- software data engineer Redwood City, CA
- senior software engineer remote Redwood City, CA
- intel software engineer Redwood City, CA
- software engineer Redwood City, CA


