Research Engineer, Data
Cartesia
About Cartesia

Our mission is to build the next generation of AI: ubiquitous, interactive intelligence that runs wherever you are. Today, not even the best models can continuously process and reason over a year-long stream of audio, video and text (1B text tokens, 10B audio tokens and 1T video tokens), let alone do this on-device.

We're pioneering the model architectures that will make this possible. Our founding team met as PhDs at the Stanford AI Lab, where we invented State Space Models (SSMs), a new primitive for training efficient, large-scale foundation models. Our team combines deep expertise in model innovation and systems engineering with a design-minded product engineering team to build and ship cutting-edge models and experiences.

We're funded by leading investors at Index Ventures and Lightspeed Venture Partners, along with Factory, Conviction, A Star, General Catalyst, SV Angel, Databricks and others. We're fortunate to have the support of many amazing advisors and 90+ angels across many industries, including the world's foremost experts in AI.

About The Role

To build truly global AI, our models must be trained on data that reflects the world's diversity of languages and cultures. We are searching for a Machine Learning Engineer to own the quality and coverage of the data behind our models. You will be our in-house expert on global data, ensuring our models perform exceptionally well across dozens of languages. You have a keen eye for linguistic nuance and a passion for building inclusive and representative datasets at scale.

Your Impact

- Design and build large-scale datasets for model training.
- Build evaluations of speech models, both via manual annotation and at scale with automated metrics.
- Implement techniques for steering data generation to improve model intelligence through data and mitigate bias.
- Build automated quality control systems to validate and filter generated data.
- Partner with product teams to ensure support for key languages and markets.

What You Bring

- Experience building or working with large multilingual datasets.
- Experience with generative models (speech, text, or multimodal).
- Ability to help guide human annotation and evaluation across multiple languages.
- Strong applied ML background with a focus on data-centric approaches.
- Excitement for building scalable systems that bridge research and production.

What We Offer

- Lunch, dinner and snacks at the office.
- Fully covered medical, dental, and vision insurance for employees.
- 401(k).
- Relocation and immigration support.
- Your own personal Yoshi.

Our Culture

- We're an in-person team based out of San Francisco. We love being in the office, hanging out together, and learning from each other every day.
- We ship fast. All of our work is novel and cutting edge, and execution speed is paramount.
- We have a high bar, and we don't sacrifice quality or design along the way.
- We support each other. We have an open and inclusive culture that's focused on giving everyone the resources they need to succeed.

