Research Scientist - Frontier Benchmarks
$200k - $325kNeura Market
About the Role We're looking for a Research Scientist to collaborate with partners and lead the development of the next frontier benchmarks and datasets. This is a highly visible, customer-facing role at the intersection of research, company strategy, and go-to-market. You'll design datasets taking into account frontier model performance and work with our academic partners, and then partner with delivery, product and go-to-market to scale out production. You will also serve as a credible technical partner for our customers, prospects, and drive results that impact the broader research community. This role reports directly to the Head of Research and is ideal for someone who is energized by cross‑functional work and wants to understand how startups operate across research, data operations, and commercial teams. Main Responsibilities Design state of the art datasets that drive frontier model training and evaluation based on current model performance and academic partnerships Translate benchmark insights into clear, compelling narratives that articulate the ROI of expert‑curated data for customer‑facing presentations, technical reports, and go‑to‑market materials. Work cross‑functionally with data operations, product, engineering, and strategy to surface research findings that inform the company roadmap. Stay at the frontier of LLM evaluation research and bring best practices into Snorkel's workflows. Represent Snorkel's research externally through publications, blog posts, conference talks, and customer engagements that advance the conversation around data‑centric AI. Preferred Qualifications Strong research background in AI/ML evaluation, NLP, or related fields, with a track record of rigorous experimental design — especially around measuring the impact of training and evaluation data on model behavior. Exceptional communication skills — able to present complex technical findings clearly to both technical and non‑technical audiences. Comfort operating in a fast‑moving, cross‑functional environment with ambiguous problem spaces. Genuine interest in GTM strategy, startup dynamics, and the commercial side of AI data services. Ph.D. in machine learning, NLP, or a related field preferred; equivalent industry or research lab experience considered. Salary Salary Range: $200,000 - $325,000 USD Equal Employment Opportunity Statement Snorkel AI is proud to be an Equal Employment Opportunity employer and is committed to building a team that represents a variety of backgrounds, perspectives, and skills. Snorkel AI embraces diversity and provides equal employment opportunities to all employees and applicants for employment. Snorkel AI prohibits discrimination and harassment of any type on the basis of race, color, religion, age, sex, national origin, disability status, genetics, protected veteran status, sexual orientation, gender identity or expression, or any other characteristic protected by federal, state, or local law. All employment is decided on the basis of qualifications, performance, merit, and business need. We will ensure that individuals with disabilities are provided reasonable accommodation to participate in the job application or interview process, to perform essential job functions, and to receive other benefits and privileges of employment. Please contact us to request accommodation. #J-18808-Ljbffr
$200k - $325k
...Neura Market is seeking a Research Scientist to lead the development of benchmarks and datasets that impact the research community. The role involves designing datasets, collaborating with partners, and impacting the company strategy. Candidates should have a strong AI...Suggested$250k
...data and evaluation infrastructure that frontier AI labs use to make their models better.... ...rigorous evaluations that go beyond static benchmarks. We are a small, early team (post Series... ...and measured. Working directly with research teams at top AI labs, you’ll experiment...Suggested$150k - $250k
...Research Scientist – Frontier Data — AfterQuery Location: San Francisco, CA (Onsite) Compensation: $150,000 – $250,000 base | $250,000 – $450,0... ...and identifying model failure modes Data slicing strategy, benchmark design, and evaluation framework construction Reward...SuggestedFull timeVisa sponsorship- ...Fleet AI, Inc. is seeking a Research Scientist to join their core research team in San Francisco. This role focuses on investigating... ...leading labs. Key responsibilities include generating benchmarks to evaluate frontier models, automating environment construction for agent...Suggested
$216k - $270k
...Scale Labs, Research Scientist - Frontier Risk Evaluations As the leading data and evaluation partner for frontier AI companies, Scale plays... ...Nice to have: Experience in crafting evaluations and benchmarks, or a background in data science roles related to LLM technologies...SuggestedFull time- ...computing offers a path through these bottlenecks. As an ML Research Scientist, you'll work at the frontier of generative modeling and quantum acceleration,... ..., denoising, and likelihood estimation Develop and benchmark novel solver methods for diffusion ODEs/SDEs Quantum-...Full timeCasual workVisa sponsorship
$150k - $250k
...rearchitect critical operations for the frontier of AI. Our customers include the largest... ...goods, and global social organizations. We research and deploy technologies that power AI-... ...to drive incremental improvements on benchmarks or optimize an existing process but instead...Work at office3 days per week$150k - $250k
...David Joseph & Company is seeking a Research Scientist in San Francisco, focusing on applied research to enhance frontier AI systems. The role involves designing datasets, developing measurement frameworks, and executing rapid experiments. Candidates should possess quantitative...- ...A leading AI evaluation company is looking for a Staff Machine Learning Research Scientist to advance LLM evaluation methodologies. This role involves designing benchmarks, collaborating with teams, and mentoring others. Ideal candidates have significant experience in...
- ...Research Scientist / Machine Learning Scientist Location:SF Bay Area/Hybrid / Remote Type:Full... ...protocols that go beyond traditional benchmarks • Analyze large-scale human voting and... ...referenced by industry leaders pushing the frontier of safe and reliable AI. Sundar Pichai...Full timeRemote work
- ...Hedra is building a world-class Physical AI research team to push the boundaries of action-... ...who are excited to go beyond benchmarks and build models that operate in the real... ...move research into production Stay at the frontier of the field — synthesizing relevant literature...Work at office
- ...Member of Technical Staff, Research Scientist About Fleet Fleet studies how environments produce intelligence... ...survive, and what "good" looks like. We work with frontier labs on post-training across modalities — building benchmarks that expose where frontier models break,...
$200k - $250k
...Safety (CAIS) is a leading research and advocacy organization focused... ...Fund. As a Senior Research Scientist here, you will lead and... ...the safety and reliability of frontier AI systems, taking ownership... ...Build and maintain datasets and benchmarks. Run distributed training...Work at officeLocal area- ...enterprises. We aim to push the frontier of AI that understands real,... ...role is for an experienced scientist who thrives both in... ...and deep content extraction. Research, evaluate, and integrate the... ...product impact. Develop new benchmarks, datasets, and evaluation methodologies...
- ...Merge Labs is a frontier research lab with the mission of bridging biological and artificial... ...collaborate closely with engineers and data scientists. We work across disciplines to come up... ...using internal and public datasets; benchmark and validate performance. Serve to non...
- ...Wheel the World seeks a full-time ML Research Scientist in San Francisco to advance generative AI and quantum computing. This role involves developing theoretical and practical implementations for quantum acceleration in generative models. Ideal candidates should have...Full timeVisa sponsorship
$225k - $400k
...ABOUT THE ROLE This is a research-driven, high-impact role for ML researchers who want... ...into working prototypes. Evaluation & Benchmarking – Design novel evaluation frameworks, datasets... ...conversational quality. Advance the Frontier – Stay at the cutting edge of ML...H1bRelocation$166k - $230k
...missions. The Databricks AI Research organization enables companies... ...production. As a Sr. Research Scientist on the Scaling team, you will... ...and advancing the scientific frontier by creating new techniques... ...empirically validate hypotheses and benchmark against state‑of‑the‑art...Local area$150k - $250k
...Research Scientist – Post Training — AfterQuery Location: San Francisco, CA (Onsite) Compensation... ...evaluation systems used by leading frontier AI labs. They work directly with top labs... ..., AI safety organizations, or benchmarking groups Experience running controlled training...- ...brilliance. Founded by engineers and researchers who dreamed of building the... ...evaluations that go beyond benchmarks. We've run a profitable... .... The Role As a Research Scientist focused on data, you’ll work... ...the datasets that shape how frontier models behave. You’ll partner...
- ...datasets. This is a rare intersection of frontier AI and real-world scientific impact.... ...mode. The Role We’re looking for research scientists who want to work at the intersection of... ...Evaluation: Contributing to meaningful benchmarks and evaluation methods for domain-specific...
$400k
...A dynamic technology company in San Francisco is seeking innovative individuals to push the boundaries of AI research. Candidates should have a PhD and experience in long-horizon reasoning and reinforcement learning. The role involves building systems to outperform existing...$80 - $120 per hour
...remote basis. This role involves generating detailed trajectories for intricate physics problems while collaborating closely with a frontier LLM. Candidates should have a strong background in physics, fluency in LaTeX and Python, and the ability to work independently.The...Hourly payRemote work$240k - $380k
...Staff Machine Learning Research Scientist, LLM Evals Ready to Apply? Join the team shaping the future... ...leading data and evaluation partner for frontier AI companies, Scale is dedicated to advancing the evaluation and benchmarking of large language models (LLMs). We are building...Full time- ...whole brain, and a product that has the potential to improve people’s daily lives more than any other. The Research Team at Nudge is innovating on multiple frontiers in neuroscience and neurotechnology, and is primarily focused on developing ultrasound-based...Work experience placement
- ...Team The Future of Computing Research team is an applied research team... ...of computing. We work at the frontier of multimodal AI, helping... ...looking for a Research Engineer / Scientist to join the Future of... ...: success is not just higher benchmark performance, but better model...Work at officeImmediate startRelocation package
- ...compounds. Accelerate change - Ship fast, adapt faster, and move frontier ideas into production. Create win-wins - Creatively turn trade-... ...fail. But succeed an unfair amount. Job: Our first dedicated research hire - you will answer the question: how to train and scale a model...
- ...well known in the AI community for seminal research accomplishments at top AI labs, have run... ...a highly experienced AI Research Scientist to play a crucial role in the development... ...What We Offer Highly engaging work at the frontier of AI-driven drug discovery that will fundamentally...
- ...Worktrace AI, our our mission is to eliminate the AI divide. Frontier AI labs are developing AI that can free people to focus on... ...build those automations. The role We are hiring a Founding Research Scientist to develop AI that can close the growing chasm between agent...Visa sponsorship
- ...Granica’s mission is to remove that inefficiency. We combine new research in information theory , probabilistic modeling , and... ...focus on unstructured text or media, we are exploring the next frontier: systems that understand and reason over the information that runs...Flexible hours
Do you want to receive more vacancies?
Subscribe and receive similar vacancies to Research Scientist - Frontier Benchmarks. Be the first to apply!
- safety scientist San Francisco, CA
- image scientist San Francisco, CA
- regulatory scientist San Francisco, CA
- water quality scientist San Francisco, CA
- senior principal scientist San Francisco, CA
- pharmaceutical scientist San Francisco, CA
- remote scientist San Francisco, CA
- genomics scientist San Francisco, CA
- nlp research scientist San Francisco, CA
- scientist biology San Francisco, CA

