Research Scientist - Frontier Benchmarks
$200k - $325kNeura Market
About the Role We're looking for a Research Scientist to collaborate with partners and lead the development of the next frontier benchmarks and datasets. This is a highly visible, customer-facing role at the intersection of research, company strategy, and go-to-market. You'll design datasets taking into account frontier model performance and work with our academic partners, and then partner with delivery, product and go-to-market to scale out production. You will also serve as a credible technical partner for our customers, prospects, and drive results that impact the broader research community. This role reports directly to the Head of Research and is ideal for someone who is energized by cross‑functional work and wants to understand how startups operate across research, data operations, and commercial teams. Main Responsibilities Design state of the art datasets that drive frontier model training and evaluation based on current model performance and academic partnerships Translate benchmark insights into clear, compelling narratives that articulate the ROI of expert‑curated data for customer‑facing presentations, technical reports, and go‑to‑market materials. Work cross‑functionally with data operations, product, engineering, and strategy to surface research findings that inform the company roadmap. Stay at the frontier of LLM evaluation research and bring best practices into Snorkel's workflows. Represent Snorkel's research externally through publications, blog posts, conference talks, and customer engagements that advance the conversation around data‑centric AI. Preferred Qualifications Strong research background in AI/ML evaluation, NLP, or related fields, with a track record of rigorous experimental design — especially around measuring the impact of training and evaluation data on model behavior. Exceptional communication skills — able to present complex technical findings clearly to both technical and non‑technical audiences. Comfort operating in a fast‑moving, cross‑functional environment with ambiguous problem spaces. Genuine interest in GTM strategy, startup dynamics, and the commercial side of AI data services. Ph.D. in machine learning, NLP, or a related field preferred; equivalent industry or research lab experience considered. Salary Salary Range: $200,000 - $325,000 USD Equal Employment Opportunity Statement Snorkel AI is proud to be an Equal Employment Opportunity employer and is committed to building a team that represents a variety of backgrounds, perspectives, and skills. Snorkel AI embraces diversity and provides equal employment opportunities to all employees and applicants for employment. Snorkel AI prohibits discrimination and harassment of any type on the basis of race, color, religion, age, sex, national origin, disability status, genetics, protected veteran status, sexual orientation, gender identity or expression, or any other characteristic protected by federal, state, or local law. All employment is decided on the basis of qualifications, performance, merit, and business need. We will ensure that individuals with disabilities are provided reasonable accommodation to participate in the job application or interview process, to perform essential job functions, and to receive other benefits and privileges of employment. Please contact us to request accommodation. #J-18808-Ljbffr Neura Market
$200k - $325k
Neura Market is seeking a Research Scientist to lead the development of benchmarks and datasets that impact the research community. The role involves designing datasets, collaborating with partners, and impacting the company strategy. Candidates should have a strong AI...Suggested- ...computing offers a path through these bottlenecks. As an ML Research Scientist, you'll work at the frontier of generative modeling and quantum acceleration,... ..., denoising, and likelihood estimation Develop and benchmark novel solver methods for diffusion ODEs/SDEs Quantum...SuggestedFull timeCasual workVisa sponsorship
$250k
...data and evaluation infrastructure that frontier AI labs use to make their models better.... ...rigorous evaluations that go beyond static benchmarks. We are a small, early team (post Series... ...and measured. Working directly with research teams at top AI labs, you’ll experiment...Suggested$150k - $250k
Research Scientist - Frontier Data — AfterQuery Location: San Francisco, CA (Onsite) Compensation: $150,000 - $250,000 base | $250,000 - $450,00... ...identifying model failure modes Data slicing strategy, benchmark design, and evaluation framework construction Reward signal...SuggestedFull timeVisa sponsorship- Fleet AI, Inc. is seeking a Research Scientist to join their core research team in San Francisco. This role focuses on investigating... ...leading labs. Key responsibilities include generating benchmarks to evaluate frontier models, automating environment construction for agent...Suggested
$150k - $250k
...rearchitect critical operations for the frontier of AI. Our customers include the largest... ...goods, and global social organizations. We research and deploy technologies that power AI-... ...to drive incremental improvements on benchmarks or optimize an existing process but instead...Work at office3 days per week$150k - $250k
David Joseph & Company is seeking a Research Scientist in San Francisco, focusing on applied research to enhance frontier AI systems. The role involves designing datasets, developing measurement frameworks, and executing rapid experiments. Candidates should possess quantitative...- ...A leading AI evaluation company is looking for a Staff Machine Learning Research Scientist to advance LLM evaluation methodologies. This role involves designing benchmarks, collaborating with teams, and mentoring others. Ideal candidates have significant experience in...
- ...Research Scientist / Machine Learning Scientist Location:SF Bay Area/Hybrid / Remote Type:Full... ...protocols that go beyond traditional benchmarks • Analyze large-scale human voting and... ...referenced by industry leaders pushing the frontier of safe and reliable AI. Sundar Pichai...Full timeRemote work
- Member of Technical Staff, Research Scientist About Fleet Fleet studies how environments produce intelligence... ...survive, and what "good" looks like. We work with frontier labs on post-training across modalities — building benchmarks that expose where frontier models break,...
- ...Hedra is building a world-class Physical AI research team to push the boundaries of action-... ...who are excited to go beyond benchmarks and build models that operate in the real... ...move research into production Stay at the frontier of the field — synthesizing relevant literature...Work at office
$310k
...A leading AI research organization is seeking a Research Scientist to develop novel reinforcement learning techniques. The role involves researching algorithms, designing experiments, and collaborating on model training. Ideal candidates have a strong background in reinforcement...Flexible hours- ...enterprises. We aim to push the frontier of AI that understands real,... ...role is for an experienced scientist who thrives both in... ...and deep content extraction. Research, evaluate, and integrate the... ...product impact. Develop new benchmarks, datasets, and evaluation methodologies...
$200k - $250k
...Safety (CAIS) is a leading research and advocacy organization focused... ...Fund. As a Senior Research Scientist here, you will lead and... ...the safety and reliability of frontier AI systems, taking ownership... ...and maintain datasets and benchmarks. Run distributed training and...Work at officeLocal area- ...Senior / Principal ML Scientist Merge Labs is a frontier research lab with the mission of bridging biological and artificial intelligence to maximize... ...acquisition-strategy using internal and public datasets; benchmark and validate model performance. Integrate ML...
$150k - $250k
Research Scientist - Post Training — AfterQuery Location: San Francisco, CA (Onsite) Compensation... ...evaluation systems used by leading frontier AI labs. They work directly with top labs... ..., AI safety organizations, or benchmarking groups Experience running controlled...$225k - $400k
ABOUT THE ROLE This is a research-driven, high-impact role for ML researchers who want... ...into working prototypes. Evaluation & Benchmarking - Design novel evaluation frameworks, datasets... ...conversational quality. Advance the Frontier - Stay at the cutting edge of ML...H1bRelocation- Wheel the World seeks a full-time ML Research Scientist in San Francisco to advance generative AI and quantum computing. This role involves developing theoretical and practical implementations for quantum acceleration in generative models. Ideal candidates should have...Full timeVisa sponsorship
- .... Founded by engineers and researchers who dreamed of building the... ...evaluations that go beyond benchmarks. We've run a profitable business... .... The Role As a Research Scientist focused on data, you’ll work... ...the datasets that shape how frontier models behave. You’ll partner...
$166k - $230k
...missions. The Databricks AI Research organization enables companies... .... As a Sr. Research Scientist on the Scaling team, you will... ...and advancing the scientific frontier by creating new techniques that... ...empirically validate hypotheses and benchmark against state‑of‑the‑art...Local area$400k
A dynamic technology company in San Francisco is seeking innovative individuals to push the boundaries of AI research. Candidates should have a PhD and experience in long-horizon reasoning and reinforcement learning. The role involves building systems to outperform existing...- ...datasets. This is a rare intersection of frontier AI and real-world scientific impact.... ...mode. The Role We’re looking for research scientists who want to work at the intersection of... ...Evaluation: Contributing to meaningful benchmarks and evaluation methods for domain-specific...
$240k - $380k
...Staff Machine Learning Research Scientist, LLM Evals Ready to Apply? Join the team shaping the... ...leading data and evaluation partner for frontier AI companies, Scale is dedicated to advancing the evaluation and benchmarking of large language models (LLMs). We are building...Full time- ...whole brain, and a product that has the potential to improve people’s daily lives more than any other. The Research Team at Nudge is innovating on multiple frontiers in neuroscience and neurotechnology, and is primarily focused on developing ultrasound-based...Work experience placement
- ...compounds. Accelerate change - Ship fast, adapt faster, and move frontier ideas into production. Create win-wins - Creatively turn trade-... ...fail. But succeed an unfair amount. Job: Our first dedicated research hire - you will answer the question: how to train and scale a model...
- ...Team The Future of Computing Research team is an applied research team... ...of computing. We work at the frontier of multimodal AI, helping... ...looking for a Research Engineer / Scientist to join the Future of... ...: success is not just higher benchmark performance, but better model...Work at officeImmediate startRelocation package
$302.4k - $378k
...Senior / Staff Machine Learning Research Scientist, Agents San Francisco, CA; Seattle, WA;... ..., we're accelerating the abundance of frontier data to pave the road to Artificial General... ...environments and RL reward signals, benchmarking autonomous agent performance across...Full time$259.2k - $324k
...round, we’re accelerating the abundance of frontier data to pave the road to Artificial... ...Environments (ACE) team, part of Scale’s Research organization, brings together customer-... ...environments and RL reward signals, benchmarking autonomous agent performance across real...Full time- ...well known in the AI community for seminal research accomplishments at top AI labs, have run... ...a highly experienced AI Research Scientist to play a crucial role in the development... ...What We Offer Highly engaging work at the frontier of AI-driven drug discovery that will fundamentally...
$225k - $300k
Research Scientist About Latent Health Healthcare today is only truly personalized for two groups: those with wealth and access, and those... ...small group of researchers and engineers focused on pushing the frontier while shipping real systems into production. We are a small...Work at officeImmediate start
Do you want to receive more vacancies?
Subscribe and receive similar vacancies to Research Scientist - Frontier Benchmarks. Be the first to apply!
- safety scientist San Francisco, CA
- image scientist San Francisco, CA
- entry level research scientist San Francisco, CA
- regulatory scientist San Francisco, CA
- water quality scientist San Francisco, CA
- senior principal scientist San Francisco, CA
- pharmaceutical scientist San Francisco, CA
- remote scientist San Francisco, CA
- genomics scientist San Francisco, CA
- nlp research scientist San Francisco, CA

