Software Engineer, Data Infrastructure
$180k - $300k
DatologyAI
About the Company

Models are what they eat. But a large portion of training compute is wasted training on data that are already learned, irrelevant, or even harmful, leading to worse models that cost more to train and deploy. At DatologyAI, we've built a state-of-the-art data curation suite to automatically curate and optimize petabytes of data to create the best possible training data for your models. Training on curated data can dramatically reduce training time and cost (7-40x faster training depending on the use case), dramatically increase model performance as if you had trained on >10x more raw data without increasing the cost of training, and allow smaller models with fewer than half the parameters to outperform larger models despite using far less compute at inference time, substantially reducing the cost of deployment. For more details, check out our recent blog posts sharing our high-level results for text models and image-text models.

We raised a total of $57.5M in two rounds, a Seed and a Series A. Our investors include Felicis Ventures, Radical Ventures, Amplify Partners, Microsoft, Amazon, and AI visionaries like Geoff Hinton, Yann LeCun, Jeff Dean, and many others who deeply understand the importance and difficulty of identifying and optimizing the best possible training data for models.

Our team has pioneered this frontier research area and has the deep expertise in both data research and data engineering necessary to solve this incredibly challenging problem and make data curation easy for anyone who wants to train their own model on their own data.

This role is based in Redwood City, CA. We are in office 4 days a week.

About the Role

We're looking for an experienced Data Platform Engineer to join our core DatologyAI team. As one of our early senior hires, you will partner closely with our founders on the direction of our product and drive business-critical technical decisions.

You will lead the development of our core product and data platform. These are key components of our stack that allow us to process customer data and apply state-of-the-art research for identifying the most informative data points in large-scale datasets. You will have a broad impact on our technology, our product, and our company's culture.

We provide visa sponsorship for candidates selected for this role.

What You'll Work On
- Design, build, and maintain highly scalable data processing solutions, while ensuring reliability and security
- Architect, build, and deploy the back-end systems and services that power our data curation platform
- Partner with researchers and engineers to bring new features and research capabilities to our customers
- Ensure that our systems are reliable, secure, and worthy of our customers' trust
- Have meaningful experience with leading and building production data systems to deliver on major product initiatives.
- You have built and managed highly scalable data processing solutions (e.g., Spark, Flink), data lakes or warehouses (e.g., Snowflake, Hive), and distributed storage systems (e.g., HDFS, S3); authored SQL queries; used workflow management tools (e.g., Airflow, Dagster); and maintained the infrastructure that supports these.
- Proficiency in at least one programming language commonly used within Data Engineering, such as Python, Scala, or Java.
- Expertise with an ETL scheduler such as Airflow, Dagster, or a similar framework.
- Experience maintaining a high quality bar for design, correctness, and testing.
- Take pride in building and operating scalable, reliable, secure systems
- Have a humble attitude, an eagerness to help your colleagues, and a desire to do whatever it takes to make the team succeed
- Own problems end-to-end, and are willing to pick up whatever knowledge you're missing to get the job done
- You have experience being the technical lead of a Data Engineering / Platform / Infrastructure Team.
- Experience building ML/DL systems and/or data infrastructure that feeds into training large ML models
- The candidate's starting pay will be determined based on job-related skills, experience, qualifications, and interview performance.
- 100% covered health benefits (medical, vision, and dental).
- 401(k) plan with a generous 4% company match.
- Unlimited PTO policy
- Annual $2,000 wellness stipend.
- Annual $1,000 learning and development stipend.
- Daily lunches and snacks are provided in our office!
- Relocation assistance for employees moving to the Bay Area.
Vacancy posted 2 days ago



