Sign up to access all features of our service.
  • Job search
  • Favorites
  • Create a CV
    New
  • Salaries
  • Subscriptions

Software Engineer, Data Infrastructure

Cartesia, Inc.

About Cartesia Our mission is to architect AI that learns from and interacts with the world like humans do. We’re pioneering the model architectures that will make this possible. Our founding team met as PhDs at the Stanford AI Lab, where we invented State Space Models or SSMs, a new primitive for training efficient, large-scale foundation models. Our team combines deep expertise in model innovation and systems engineering paired with a design‑minded product engineering team to build and ship cutting edge models and experiences. We’re funded by leading investors at Index Ventures and Lightspeed Venture Partners, along with Factory, Conviction, A Star, General Catalyst, SV Angel, Databricks and others. We’re fortunate to have the support of many amazing advisors, and 90+ angels across many industries, including the world’s foremost experts in AI. About the Role Data is the lifeblood of our models, and we’re looking for a Software Engineer to help build the training data and ML data infrastructure at Cartesia. This role sits at the intersection of data systems, model training, and inference — it is not a siloed data org. You’ll design and ship the pipelines, datasets, and infrastructure that feed our pre‑training and post‑training, with particular depth in audio and other multimodal data. Your work will directly shape the capabilities and quality of our foundation models. This is a hands‑on technical role. We’re looking for someone fluent at the application and ML infrastructure layer, who ships modern, well‑tested code and partners closely with research and inference teams. This is not a traditional data warehousing, analytics, or BI engineering role. Your Impact Contribute to Cartesia’s multi‑modal data strategy across pre‑training and post‑training, spanning human, synthetic, and web‑scale sources, with particular depth in audio. Design and build scalable, high‑throughput data pipelines for text, audio, and video — covering ingestion, preprocessing, augmentation, dataset versioning, and data loading for training. Partner closely with research and inference teams so data systems are co‑designed with training and serving infrastructure (batching, GPU‑aware loading, evaluation pipelines). Drive rigorous standards for data quality, with a tight feedback loop between dataset characteristics and model behavior. Identify and integrate novel datasets, including working with external data vendors and partners. What You Bring Hands‑on experience with ML data infrastructure: training data pipelines, dataset versioning, large‑scale data loading, and the interplay between data systems and model training and inference. Working knowledge of multimodal data, i.e. audio: formats, preprocessing, augmentation, and large‑scale storage and streaming patterns. Strong modern engineering execution: clean, well‑tested code, fluency with current tools, and a willingness to pick the right tool for the problem rather than defaulting to familiar patterns. Track record of driving significant technical projects end‑to‑end in a fast‑moving, research‑driven environment. Familiarity with building and evaluating datasets for generative models and reasonable working knowledge of how they’re trained and inference. More Details In‑office policy: We’re an in‑person team based out of offices in San Francisco, London and Bangalore. We love being in the office, hanging out together, and learning from each other every day. Visa sponsorship: We provide visa sponsorship support and assess each circumstance on a case‑by‑case basis. However, visa sponsorship is dependent on many factors, including the role you are applying for and the location you are going to be based, and so we can’t always guarantee success. Your Recruiter will work with you to understand your visa sponsorship needs from the first call. We ship fast. All of our work is novel and cutting edge, and execution speed is paramount. We have a high bar, and we don’t sacrifice quality or design along the way. We support each other. We have an open & inclusive culture that’s focused on giving everyone the resources they need to succeed. Our Benefits Compensation. Competitive base salary alongside attractive equity package. Commuter Allowance . A monthly stipend to help you get to and from the office. Flexible PTO . Take as much time as you need to recharge your batteries. Meals & Snacks . Lunch, dinner and plenty of snacks, provided daily. Your own personal Yoshi. #J-18808-Ljbffr Cartesia, Inc.

Vacancy posted 5 days ago
Similar jobs that could be interesting for youBased on the Software Engineer, Data Infrastructure in San Francisco, CA vacancy
  • $191k - $225k

     ...The Community You Will Join: Data represents the voice of...  ...at scale. The Data Warehouse Infrastructure team is responsible for the...  ...which is used by hundreds of engineers to collect, manage, and analyze...  ...and contribute to open source software, and have industry impact.... 
    Suggested
    Work experience placement

    Nerdleveltech

    San Francisco, CA
    2 days ago
  •  ...running OpenAI’s LLM training and inference infrastructure that powers frontier models at massive...  .... About the Role We are looking for an engineer to design and implement the dataset...  ...dataset APIs, including for multimodal (MM) data that cannot fit in memory. Build proactive... 
    Suggested

    Slope

    San Francisco, CA
    1 day ago
  • $200k - $400k

     ...Senior Data Infrastructure Engineer Decagon is the leading conversational AI platform empowering every brand to deliver concierge customer experiences. Our technology enables industry-defining enterprises like Avis Budget Group, Block's Cash App and Square, Chime,... 
    Suggested
    Full time
    Work at office
    Local area

    Decagon

    San Francisco, CA
    14 days ago
  •  ...London and Amsterdam. Making data driven decisions is key to...  ...and guidance to teams across engineering, product, and business and help...  .... Engineers on Data Infrastructure are domain experts in Data Warehouse...  ...Qualifications 5+ years of software engineering experience... 
    Suggested
    Work experience placement
    Local area

    Plaid

    San Francisco, CA
    9 hours ago
  • $160k - $225k

     ...agentic platform synthesizes complex employee data, pinpoints risky behaviours, and deploys...  ...Join Us Build and scale the foundational data infrastructure powering a category‑defining product Work closely with engineering, data science, and product teams to operationalize... 
    Suggested
    Work experience placement
    Relocation package
    Flexible hours

    Fable Security LLP

    San Francisco, CA
    4 days ago
  • $140k - $200k

     ...include frontend and backend engineers, AI research scientists, and...  ...'re looking to hire for our Data side of our AI team at...  ...through a tight integration of infrastructure, engineering, and research work...  ...are looking for a skilled Software Engineer to join us. What You... 
    Full time
    Work at office
    Shift work

    Clutch Canada

    San Francisco, CA
    2 days ago
  • $175k

     ...building: models that are trained to use software and take actions just as a person...  ...large-scale models requires performant data infrastructure to create and store the datasets used...  ...optimize for company value Partner with engineers and research scientists to facilitate... 
    Work at office
    Remote work

    I did my part and supported the Regular Toilet

    San Francisco, CA
    4 days ago
  •  ...for exceptional people to join us! About the Role As an engineer on the Data Infrastructure team at Persona, you will play a key role in designing,...  ...What you’ll bring to Persona 3+ years of experience in software engineering, with a focus on data infrastructure or large... 
    Full time
    For contractors
    Internship

    Persona

    San Francisco, CA
    2 days ago
  • $162k - $216k

     ...Software Engineer - Infrastructure, Data Platform San Francisco, California, United States Who We Are Baton is Ryder's in-house product development group focused on harnessing emerging technologies to redefine transportation and logistics. With $10B in freight... 
    Full time
    Work at office
    Immediate start
    Remote work
    Monday to Friday

    Baton

    San Francisco, CA
    2 days ago
  • $222k - $256k

     ...Rewards philosophy. About the Role: Gusto’s Data team leverages Gusto’s rich dataset to...  ..., you will work with senior Product, Engineering and Design stakeholders to build better...  ...a track record of delivering impactful software solutions. Strong expertise in machine learning... 
    Work at office
    Local area
    2 days per week
    3 days per week

    Prudence Holdings Inc

    San Francisco, CA
    1 day ago
  •  ...A data intelligence platform is seeking a Senior Software Engineer in San Francisco to drive projects from ideation to production while working with cutting-edge technology in a high-autonomy environment. The ideal candidate has over 6 years of full-stack experience,... 
    Remote work

    Metriport

    San Francisco, CA
    1 day ago
  • $197.3k - $313.7k

    ## Staff Software Engineer, Data InfrastructureApplyremote type: Office Tech-Flexiblelocations: California - San Francisco: Washington - Seattle...  ...looking for a Staff Software Engineer to join the **Data Infrastructure** team within the broader Data Engineering organization.... 
    Permanent employment
    Work at office

    Slack Enterprise

    San Francisco, CA
    3 days ago
  • Job Description Slack is looking for a Staff Software Engineer to join the Data Infrastructure team within the broader Data Engineering organization. The mission is to build secure, reliable, performant, scalable, and cost‑efficient infrastructure that powers Slack’s data... 

    100 Salesforce, Inc.

    San Francisco, CA
    1 day ago
  •  ...Making data driven decisions is key to Plaid's culture. To support...  ...guidance to teams across engineering, product, and business and...  ...the data and machine learning infrastructure to enable Plaid engineers to...  ...~5+ years of software engineering experience  ~ Extensive... 

    Plaid

    San Francisco, CA
    more than 2 months ago
  • $160k - $180k

     ...What You’ll Be Doing Build and maintain data infrastructure processing petabytes of data across...  ...end-to-end with guidance from senior engineers. From design through deployment and production...  ...What You Should Have 3+ years of software engineering experience with strong... 
    Full time
    Relocation
    Relocation package

    United States Digital Space LLC

    San Francisco, CA
    5 days ago
  • $200k - $220k

     ...only vertically integrated AI infrastructure company built from the ground...  ...energy, manufacturing, data center construction, and cloud...  ...Crusoe Energy as a Senior Data Engineer, an early and pivotal hire on...  ...Engineering Teams: Partner with software engineers, data scientists,... 
    Full time
    Temporary work
    Work at office
    Remote work

    Crusoe

    San Francisco, CA
    3 days ago
  • $250k - $380k

     ...running OpenAI’s LLM training and inference infrastructure that powers frontier models at massive...  ...About the Role We are looking for an engineer to design and implement the dataset...  ...dataset APIs, including for multimodal (MM) data that cannot fit in memory. Build... 
    Work at office
    Local area
    Relocation package
    Flexible hours

    OpenAI

    San Francisco, CA
    more than 2 months ago
  • $140k - $260k

     ...Software Engineer, Data Platform Profound is on a mission to help companies understand and control their AI presence. We are looking for...  ...Engineer, Data Platform to design, build, and scale the infrastructure that powers data across our organization. You will architect... 
    Work at office
    Visa sponsorship

    Profound

    San Francisco, CA
    8 hours ago
  • $196k - $245k

     ...technical vision for the company's data platform, enabling scalable,...  ..., and optimize data infrastructure to process and analyze petabytes...  ...with data scientists, data engineers, product managers, and...  ...Requirements 5+ years of experience in software engineering with a focus on... 
    Work at office
    Relocation
    Relocation package

    United States Digital Space LLC

    San Francisco, CA
    5 days ago
  •  ...our growing team. About the role We are hiring a Senior Software Engineer to own the data platform that powers Plenful’s automation engine. You...  ...software engineering experience building backend or data infrastructure in production Deep expertise in relational databases:... 
    Work at office
    Flexible hours
    2 days per week

    Plenful

    San Francisco, CA
    5 days ago
  • $200k - $236k

     ...Software Engineer, Data Platform Hybrid - SF Bay Area About GlossGenius GlossGenius is the AI-powered system behind the world's...  ...platform powering all aspects of GlossGenius' data platform and infrastructure. Core responsibilities include building out the... 
    Work at office
    Home office
    Flexible hours
    3 days per week

    GlossGenius

    San Francisco, CA
    3 days ago
  • $170k - $230k

     ...Senior Software Engineer - Data Platform San Francisco About Highnote Founded in 2020 by a team of leaders from Braintree, PayPal, and Lending Club, Highnote is an embedded finance company that sets the standard in modern card platform management. As an all-in... 
    Work at office
    Local area
    Home office
    Flexible hours

    HighNote

    San Francisco, CA
    5 days ago
  • $230k - $265k

     ...financial tools they need. About the Position: We're looking for a seasoned software engineer to join Parafin's Infrastructure team and lead the development of our next-generation Data Platform. This role is critical to ensuring that our data infrastructure is... 
    Work from home
    Flexible hours

    Parafin Inc

    San Francisco, CA
    4 days ago
  •  ...Whatnot updates on our news and engineering blogs and join us as we...  ...commerce. Role The Data Platform team at Whatnot builds...  ..., multi-tenant streaming infrastructure rather than simply building...  ...understanding You As our next Software Engineer, Data Platform, you... 
    Local area
    Remote work
    Work from home
    Home office

    Whatnot

    San Francisco, CA
    2 days ago
  • $147.4k - $272.1k

    Sr Software Engineer, AI & Data Platforms (AiDP) San Francisco Bay Area, California, United States Software and Services Description Join our team and help us build intelligent, scalable solutions. You'll be at the forefront, architecting, designing, and implementing... 
    Relocation

    Apple

    San Francisco, CA
    4 days ago
  •  ...do You’ll join a full‑stack data team building the systems making...  ...our real‑time eventing infrastructure, streaming and batch ETL pipelines...  ...of these areas: Platform Engineering You have designed, built,...  .... Shared Qualities Strong software engineering background with... 
    Full time
    Flexible hours

    Neura Market

    San Francisco, CA
    4 days ago
  • Epoch Biodesign in San Francisco is seeking a Senior Data Engineer to architect and build foundational data platform infrastructure for AI operations. This full-time position requires proficiency in Python and systems-level languages, experience with data platforms, and... 
    Full time

    Epoch Biodesign

    San Francisco, CA
    3 days ago
  • $250k - $280k

    A leading technological company is seeking a Sales Engineer to join their rapidly growing team in San Francisco. The ideal candidate will...  ...clients to understand their needs and educate them on WEKA's advanced data management solutions, focusing on high performance workloads and... 

    WekaIO

    San Francisco, CA
    3 days ago
  • $255k - $405k

     ...About the Team The Agent Infrastructure team at OpenAI is responsible...  ...code, debug issues, and develop software just as human SWEs do. Our...  ...About the Role As a Software Engineer on the Agent Infrastructure...  ...your possession (including the data contained therein) upon... 
    Work at office
    Worldwide
    Relocation package

    OpenAI

    San Francisco, CA
    4 days ago
  •  ...About the Team Data is at the foundation of DoorDash success. The Data Engineering team builds database solutions for various use cases including reporting, product...  ...powerhouse to help us scale our data infrastructure, automation and tools to meet growing business... 
    Hourly pay
    Local area
    Flexible hours

    Fairygodboss

    San Francisco, CA
    5 days ago

Do you want to receive more vacancies?

Subscribe and receive similar vacancies to Software Engineer, Data Infrastructure. Be the first to apply!