Sign up to access all features of our service.
  • Job search
  • Favorites
  • Create a CV
    New
  • Salaries
  • Subscriptions

Research Engineer, Data

Cartesia

About Cartesia Our mission is to build the next generation of AI: ubiquitous, interactive intelligence that runs wherever you are. Today, not even the best models can continuously process and reason over a year-long stream of audio, video and text—1B text tokens, 10B audio tokens and 1T video tokens—let alone do this on-device. We’re pioneering the model architectures that will make this possible. Our founding team met as PhDs at the Stanford AI Lab, where we invented State Space Models or SSMs, a new primitive for training efficient, large‑scale foundation models. Our team combines deep expertise in model innovation and systems engineering paired with a design‑minded product engineering team to build and ship cutting edge models and experiences. We’re funded by leading investors at Index Ventures and Lightspeed Venture Partners, along with Factory, Conviction, A Star, General Catalyst, SV Angel, Databricks and others. We’re fortunate to have the support of many amazing advisors, and 90+ angels across many industries, including the world’s foremost experts in AI. About The Role To build truly global AI, our models must be trained on data that reflects the world’s diversity of languages and cultures. We are searching for a Machine Learning Engineer to own the quality and coverage of the data behind our models. You will be our in‑house expert on global data, ensuring our models perform exceptionally well across dozens of languages. You have a keen eye for linguistic nuance, and a passion for building inclusive and representative datasets at scale. Your Impact Design and build large‑scale datasets for model training. Build evaluations of speech models, both via manual annotation and at scale with automated metrics. Implement techniques for steering data generation to improve model intelligence through data and mitigate bias. Build automated quality control systems to validate and filter generated data. Partner with product teams to ensure support for key languages and markets. What You Bring Experience building or working with large multilingual datasets. Experience with generative models (speech, text, or multimodal). Ability to help guide human annotation and evaluation across multiple languages. Strong applied ML background with a focus on data‑centric approaches. Excitement for building scalable systems that bridge research and production. What We Offer Lunch, dinner and snacks at the office. Fully covered medical, dental, and vision insurance for employees. 401(k). ✈️ Relocation and immigration support. Your own personal Yoshi. Our Culture We’re an in‑person team based out of San Francisco. We love being in the office, hanging out together, and learning from each other every day. We ship fast. All of our work is novel and cutting edge, and execution speed is paramount. We have a high bar, and we don’t sacrifice quality or design along the way. We support each other. We have an open & inclusive culture that’s focused on giving everyone the resources they need to succeed. #J-18808-Ljbffr Cartesia

Vacancy posted 2 days ago
Similar jobs that could be interesting for youBased on the Research Engineer, Data in San Francisco, CA vacancy
  • talentpluto is seeking a Research Engineer to enhance the quality assurance (QA) systems supporting training data for reinforcement learning. This position demands close collaboration with stakeholders to guarantee reliability and consistency in datasets. Key responsibilities... 
    Suggested

    talentpluto

    San Francisco, CA
    4 days ago
  • $250k - $350k

     ...of AI applications. For 9 years, Scale has been the leading AI data foundry, helping fuel the most exciting advancements in AI, including...  ...agents in enterprises around the world. The Enterprise ML Research Lab works on the front lines of this AI revolution. We are working... 
    Suggested
    Full time

    DiversityJobs Inc

    San Francisco, CA
    16 days ago
  •  ...petabytes of video, lidar, radar, and sensor data. But today's data platforms (Databricks,...  ...in 2022 to close it. Our open‑source engine, Daft, is the distributed data engine purpose...  ...Mission district office. Your Role As a Research Engineer on the Visual Understanding team... 
    Suggested
    Hourly pay
    Work at office
    Flexible hours
    Night shift
    1 day per week

    Eventual

    San Francisco, CA
    2 days ago
  •  ...great technology. The Liquid team is a community of world-class engineers, researchers, and builders creating the next generation of AI. Whether...  ...consolidating, gathering, and generating high-quality text data for pretraining, midtraining, SFT, and preference optimization... 
    Suggested

    Liquid AI

    San Francisco, CA
    1 day ago
  •  ...mission to democratize AI. Leveraging proprietary AI Studio and AI Engines, the company helps drive the clients’ AI Enterprise...  ...Overview We’re hiring a mid-to-senior Machine Learning Engineer / Data Scientist to build and deploy machine learning solutions that drive... 
    Suggested
    Full time
    H1b
    Local area
    Remote work

    GrabJobs

    San Francisco, CA
    1 day ago
  •  ...ML Engineer - Data Scientist (Enterprise) Hilbert is building the ML systems that power demand intelligence for the world's largest consumer companies - recommendation engines, demand forecasting, customer lifecycle models, and activation systems that must work across... 
    Live in
    Flexible hours
    Shift work

    Hilbert\'s AI

    San Francisco, CA
    4 days ago
  •  ...Washington D.C., London and Amsterdam. The Data Foundation and AI team within Plaid's...  ...and ongoing monitoring. As a Senior Research Scientist on the Data Foundation and AI...  ..., model serving infrastructure, feature engineering, and monitoring. In addition, you will... 
    Work experience placement
    Local area

    Plaid

    San Francisco, CA
    5 days ago
  • Cartesia is seeking a Research Engineer in San Francisco to develop large-scale datasets essential for training our AI models. This role focuses on ensuring data quality and linguistic representation to enhance performance across multiple languages. The ideal candidate... 
    Flexible hours

    Cartesia

    San Francisco, CA
    5 days ago
  • $72k - $184.44k

     ...They evaluate compliance with regulations including assessing governance and risk management processes and related controls. Those in data, analytics and technology solutions at PwC will assist clients in developing solutions that help build trust, drive improvement, and... 
    Full time
    H1b

    PwC

    San Francisco, CA
    11 days ago
  • $119k - $299.93k

     ...governance and risk management processes and related controls. Those in data, analytics and technology solutions at PwC will assist clients...  ...Degree - At least 8 years of professional AI/ML development, engineering, or testing experience What Sets You Apart - Master's... 
    Full time
    H1b

    PwC

    San Francisco, CA
    11 days ago
  •  ...largest consumer companies - recommendation engines, demand forecasting, customer lifecycle...  ...work across wildly different retailers, data environments, and business contexts. This...  ...ML systems in production - not just in research. You understand collaborative filtering,... 
    Live in
    Shift work

    Hilbert\'s AI

    San Francisco, CA
    3 days ago
  • $99k - $252.45k

     ...governance and risk management processes and related controls. Those in data, analytics and technology solutions at PwC will assist clients...  ...requirements. The Opportunity As part of the AI Engineering team, you will design, test, and deploy innovative AI/ML solutions... 
    Full time
    H1b

    PwC

    San Francisco, CA
    10 days ago
  • $181.1k - $318.4k

     ...Software Development Engineer (SAP Data Archiving Analyst), IS&T Enterprise Systems The people here at Apple don't just build products — we craft the kind of wonder that's revolutionized entire industries. It's the diversity of those people and their ideas that supports... 
    Work experience placement
    Relocation

    Apple

    San Francisco, CA
    2 days ago
  • $200k - $250k

    Founding Forward Deployed Data Scientist San Francisco, CA (4 days/week onsite) About the Role Our client is building an AI-native product engineer that helps teams understand what to build next. Instead of relying on instinct or fragmented analysis, their platform automatically... 
    H1b
    Remote work

    Career Mentors, LLC

    San Francisco, CA
    3 days ago
  •  ...techniques, while collaborating closely with product, design and research teams to bring your ideas to life. If you are passionate about...  ...models for specific use cases and domains. Build scalable and robust data pipelines for large-scale image and video datasets. Develop... 

    Apple

    San Francisco, CA
    5 days ago
  • $189.6k - $237k

     ...About This Role Lead applied ML engineering on Scale's Applied ML team, powering data infrastructure for leading agentic LLMs (ChatGPT, Gemini, Llama)...  ...reasoning and behaviors, scale human expertise, and drive research into real-world agent reliability failures despite... 
    Full time

    Scale AI

    San Francisco, CA
    7 days ago
  •  ...AI Research Scientist We're building the first truly private, personal AI that learns your skills, judgment, and preferences without big tech—or us—ever seeing your data. Our core ML challenge: how do we train the world's best personal models? What You'll Do... 
    Shift work

    Workshop Labs

    San Francisco, CA
    3 days ago
  •  ...Achira, we are building a team of world-class scientists, ML researchers, and engineers to work together to move beyond the beaten path in drug...  ...operate at the frontier scale of massive compute, massive data, and massive ambition. You'll own impactful work end-to-end... 
    Work at office

    Achira

    San Francisco, CA
    3 days ago
  •  ...About David AI David AI is the first audio data research company. We bring an R&D approach to data-developing datasets with the same...  ...David AI was founded in 2024 by a team of former Scale AI engineers and operators. In less than a year, we've brought on most FAANG... 
    Work at office

    David AI

    San Francisco, CA
    3 days ago
  •  ...Machine Learning Research Engineer We're assisting a profitable Enterprise AI Customer Support startup with their search for their first Machine Learning Research Engineer. In this position, you'll be responsible for building AI systems that can perform previously impossible... 
    Work at office

    DRH Search

    San Francisco, CA
    1 day ago
  • $147.4k - $220.9k

     ...AI/ML - Machine Learning Research Engineer, Machine Translation Work Locations (3) Submit Resume Summary The Apple Machine Translation team is seeking exceptional researchers and scientists to contribute to the development of the next generation of core machine... 
    Relocation

    Apple

    San Francisco, CA
    2 days ago
  • $185k

     ...while working closely with software and research partners to co-design hardware tightly integrated...  ...'re seeking a Research-Hardware Codesign Engineer to operate at the boundary between model...  ...in your possession (including the data contained therein) upon termination of employment... 
    Relocation package
    3 days per week

    OpenAI

    San Francisco, CA
    3 days ago
  •  ...Description: Job brief Join our San Francisco office as an ML Engineer focused on Data Engineering. Visa sponsorship available for global talent...  ...and videos. This position is essential to supporting our research and development teams by guaranteeing effective data access... 
    Full time
    H1b
    Work at office
    Immediate start
    Visa sponsorship

    Dfbooking Recruitment Services

    San Francisco, CA
    3 days ago
  • $185k - $235k

     ...current delivery mechanism. The real product is a scalable risk engine. We stay when traditional insurers exit. We model what others...  ...after it happens. It relies on coarse proxies, backward-looking data, and manual processes, then accepts damage as unavoidable. Stand... 
    Full time
    Temporary work
    H1b
    Work at office
    Remote work
    Visa sponsorship
    Work visa
    Flexible hours

    Stand Insurance

    San Francisco, CA
    2 days ago
  •  ...Machine Learning Engineer, Data & Training Infrastructure Rime builds voice AI for enterprises running customer experiences at scale...  ...Ventures, and we've built a team at the intersection of product, research, and craft. Building voice models is an art. We intend to... 
    Remote work
    Visa sponsorship

    Rime Labs

    San Francisco, CA
    3 days ago
  •  ...Data Engineer/Data Analyst Rootshell Enterprise Technologies Inc. is a recognized provider of professional IT Consulting services in the US. We are actively seeking a Data Engineer/Data Analyst for one of our clients. Locations: SAN FRANCISCO, CA - Onsite Only... 
    Local area

    Rootshell Inc

    San Francisco, CA
    3 days ago
  •  ...We are seeking a talented and innovative Synthetic Data Engineer. In this role, you will design and implement domain-specific synthetic data generation pipelines, ensuring high-quality data management for training loops. Your expertise will drive the success of data processing... 

    Hyphen Connect Limited

    San Francisco, CA
    3 days ago
  • $130k - $200k

     ...real growth. How we work at Aircall: We're customer-obsessed, data-driven, and focused on delivering meaningful outcomes. We value...  ..., you'll feel at home here. About the role The Data Engineering team at Aircall works on providing high-quality, reliable, and... 
    Worldwide

    Aircall

    San Francisco, CA
    1 day ago
  •  ...spatial profiling, imaging, genetics, and multi-modal experimental data; that integrate deep biological expertise with foundation modeling and agentic systems. We are seeking a Principal ML Research Engineer to be the founding engineering leader on this team . This... 

    Lila Sciences

    San Francisco, CA
    3 days ago
  • $105k - $125k

    10a Labs is seeking a Data Engineer to design and implement data pipelines for scraping and processing data. The role includes web scraping, data cleaning, and API development, collaborating with ML engineers and software developers. Candidates should have at least 2 years... 
    Remote work

    10a Labs

    San Francisco, CA
    5 days ago

Do you want to receive more vacancies?

Subscribe and receive similar vacancies to Research Engineer, Data. Be the first to apply!