LLM Evaluation & Research Lead

$260k - $350k

Scale AI

A leading AI technology company in New York is seeking a Tech Lead Manager for its LLM Evals Research team. In this role, you will lead a team focused on developing innovative evaluation methodologies for large language models. The ideal candidate has extensive experience in NLP, previous leadership roles, and a track record of published research. A competitive salary of $260,000 to $350,000 is offered for this full-time position.


Vacancy posted 2 days ago

Similar jobs that could be interesting for you

Based on the LLM Evaluation & Research Lead in New York, NY vacancy

Senior Qualitative Research & Evaluation Lead
$103k - $174k
...Resource Innovations is hiring a Sr. Qualitative Research & Evaluation Team Lead to oversee qualitative evaluations for energy efficiency programs. The ideal candidate will have over 5 years of relevant experience and a Bachelor's degree in a related field, with proficiency...
Suggested
Remote Jobs
New York, NY
6 days ago
Remote Impactful Research & Evaluation Leader
$80k - $95k
...Prison Fellowship Financial is seeking a Research and Program Evaluation Manager to enhance program efficiency through evaluation and learning initiatives. The ideal candidate will have over 8 years of experience in measurement and evaluation within mission-driven organizations...
Suggested
Remote work
Prison Fellowship Financial
New York, NY
6 days ago
Lead Applied Scientist, Document Understanding
$147.6k - $274.2k
...Lead Applied Scientist, Document Understanding About the... ...model development, distillation, evaluation, and deployment. You publish,... ...taxonomies Develop LLM-based knowledge graph construction... ...systems into production - not research-only experience ~ Publications...
Suggested
Local area
Flexible hours
Thomson Reuters
New York, NY
6 days ago
Research Lead, New AI Products & Platforms
$140k - $155k
...Responsibilities: We are looking for a Research Lead with a passion for ensuring that reader-... ...for rapid prototype iteration. Own evaluation for quality and editorial integrity: Create... ...industry trends, new research methodologies, LLM advancements, and emerging technologies....
Suggested
Full time
Work at office
Local area
Remote work
Flexible hours
The New York Times
New York, NY
1 day ago
Equity Research Quality Lead (AI-Reviewed)
$150 - $180 per hour
...A financial services firm in the United States is hiring an Expert Equities Research Reviewer to evaluate AI-generated investment research reports. Responsibilities include reviewing equity research for accuracy, assessing investment theses, and providing structured feedback...
Suggested
Hourly pay
Great Value Hiring
New York, NY
6 days ago
Research Lead
$100k - $160k
...a way,” to redefine the future of pet ownership together. Research Lead Fi is hiring a Research Lead to own our research function... ...during the hiring process. As a fast‑growing Series B startup, Fi evaluates compensation opportunistically to align with the right...
Work at office
Local area
Flexible hours
Fi
New York, NY
2 days ago
Research Lead, Training Insights
...is a quickly growing group of committed researchers, engineers, policy experts, and business... ...About The Role As a Research Lead on the Training Insights team, you'll develop... ...you'll drive original research into new evaluation methodologies while leading a small team...
Work at office
Visa sponsorship
Flexible hours
Shift work
Anthropic
New York, NY
2 days ago
ML Lead: AI Evaluation & Benchmark Design
...NewtonX is seeking an ML Lead to serve as the primary technical point of contact for clients' applied science and ML teams. In this... ...in applied machine learning will enable you to deliver precise evaluation systems, fostering growth and innovation across diverse industries...
NewtonX
New York, NY
5 days ago
Research Lead
$220k - $270k
...Learn more about working at dYdX Responsibilities As the Research Lead at dYdX, you will drive strategic insights at the... ...continuously enhance data infrastructure by optimizing systems, evaluating new tools, and promoting data best practices across the organization...
Work experience placement
dYdX
New York, NY
2 days ago
AIML- Compliance & Policies Lead, Evaluation
$212k - $386.3k
...AIML- Compliance & Policies Lead, Evaluation New York City, New York, United States Machine Learning and AI Imagine what you could do here. At Apple, new ideas have a way of becoming great products, services, and customer experiences very quickly. Bring passion and dedication...
Relocation
Apple
New York, NY
2 days ago
Lead, LLM Customization & Data Science Team
...Capital One National Association is seeking a Manager for the Data Science - LLM Customization Team in New York. The focus is on utilizing AI technologies to create innovative financial solutions that enhance customer interactions. Candidates must possess a solid background...
Capital One National Association
New York, NY
2 days ago
Medical AI Evaluation Lead - Remote, Flexible Hours
...Turing is seeking a licensed physician to improve AI clinical reasoning. You'll design evaluation methods and assess AI performance on real medical challenges. This flexible remote role allows up to 30 hours per week over a month, with possible extensions based on performance...
Remote work
Flexible hours
Turing Inc
New York, NY
3 days ago
Senior Python Lead: AI Backend & LLM Orchestration
...EPAM Systems, Inc. is seeking a Lead Python Developer to guide the technical strategy and hands-on development within the team. This role emphasizes backend AI services and LLM orchestration, offering an opportunity to produce production-ready code while coaching fellow...
EPAM Systems Inc
New York, NY
5 days ago
Applied AI & ML Lead for Payments & LLM Solutions
TwinThread is seeking a highly skilled professional to contribute to the development of innovative AI solutions using Large Language Models (LLM) in New York. The role involves collaborating with diverse teams to address key challenges in the payments domain while maintaining...
Aumni
New York, NY
4 days ago
AI & ML Lead - LLM‑Driven Trading Tech Innovator
$150k - $250k
A leading financial technology company is seeking an experienced AI/LLM Product Engineer in New York City. The role involves designing and building systems that translate user intent into structured workflows using large language models. Candidates should have extensive...
Tradeweb
New York, NY
4 days ago
Evaluation Lead
...York City, NYRR is a 501(c)(3) organization. Description The Evaluation Lead is a key role that manages and implements NYRR’s evaluation... ...conducting formative and summative program evaluations or applied research 3+ years of experience with quantitative and qualitative...
Work at office
Local area
2 days per week
New York Road Runners
New York, NY
2 days ago
Medical Education Evaluation Lead
Scheurer is seeking an Evaluation Manager to lead the design and implementation of evaluations for medical educational programs. The role involves extensive collaboration with faculty, data analysis, and visualization of evaluation data. Candidates should have 3 to 5 years...
https://www.scheurer.org/careers/
Brooklyn, NY
1 day ago
Measurement Science Lead
$105.94k - $158.91k
...Universal Ads is seeking a Measurement Science Lead to help scale and operationalize how we... ...lift, holdout, DiD, reverse holdout). Evaluate test power, holdout allocation, duration,... ...(Programming Language) Experimental Research Design Stakeholder Influence Salary Primary...
Work experience placement
Comcast
New York, NY
4 days ago
Research Lead, LLMs, Games & Multi-Agent Environments
$140k - $250k
...Research Lead, LLMs, Games & Multi-Agent Environments Toronto/NYC • $140-250K The Opportunity Good Start Labs builds games that make... ...Published research in RL, multi-agent systems, game theory, or LLM training Strong Python and PyTorch/JAX skills—you implement...
Remote work
Visa sponsorship
Flexible hours
Good Start Labs
New York, NY
5 days ago
Lead AI Platform Architect for Scalable LLM Services
$187.5k - $281.24k
...services to define and evolve architectural direction. The ideal candidate will lead design and mentor engineers while building scalable API-driven solutions in Go, focusing on modern LLM systems. Qualifications include extensive experience in software engineering, proficiency...
Commerce.com US, Inc.
New York, NY
5 days ago
Agents Research Lead
...automate workflows - they replace them. We're looking for a researcher to lead our applied AI team to work towards our goal of fully... ...use, memory, planning, and multi-agent coordination Lead evaluations: build datasets, success criteria, and continuous benchmarks...
The General Intelligence Company of New York
New York, NY
6 days ago
Lead Python AI Architect for Backend & LLM Apps
...collaborate with teams on observability and reliability. The ideal candidate has over 5 years of experience, especially with running LLM platforms, and expert-level skills in Python. Join us to participate in projects with top brands across the globe...
EPAM Systems Inc
New York, NY
3 days ago
AI Model Evaluation Lead: Metrics, Bias & Fairness
...We are seeking an expert to evaluate and improve our AI models through comprehensive testing and analysis. You will be responsible for designing evaluation frameworks, conducting model assessments, and providing actionable insights for model improvement. Key Responsibilities...
MERIT Beauty
New York, NY
5 days ago
Lead, Training Insights & Model Evaluation
...A progressive AI research organization based in New York is looking for a Research Lead to develop and lead evaluations of AI model capabilities. In this influential role, you'll contribute to impactful research frameworks while mentoring a team of researchers. Ideal...
Anthropic
New York, NY
2 days ago
Remote Internal Medicine Physician - Case Evaluation Lead
$130 per hour
...who are board-certified or licensed and have strong evidence-based reasoning and communication skills. Responsibilities include evaluating clinical outputs and delivering feedback to enhance quality and accuracy. Compensation ranges from $130 to $300 per hour, reflecting...
Hourly pay
Remote work
CuraSenseAI
New York, NY
6 days ago
Clinical Psychology Associate Evaluations & Counseling Lead
...licensed psychologist to provide psychological services. In this role, you will design and administer psychological tests, conduct evaluations, lead individual and group counseling sessions, and supervise lower-level staff. Candidates must have a doctoral degree in clinical...
Work at office
NYS Office for People With Developmental Disabilities
New York, NY
2 days ago
Lead Project Evaluator
Lead Project Evaluator The Lead Project Evaluator will independently design, implement and supervise program evaluations, qualitative or quantitative research studies, needs assessments, data dashboards—including selecting evaluation approaches, developing logic models...
Local area
University of Georgia
Brooklyn, NY
3 days ago
Program Evaluation & Monitoring Lead (NYC TeenSpace)
...municipal health department in New York City seeks a qualified individual to provide analytic leadership for NYC TeenSpace, overseeing evaluation and support for mental health services for youth. Candidates should have a minimum of a master's degree in a relevant field and...
NYC Department of Health and Mental Hygiene
New York, NY
4 days ago
Lead Clinical Research Coordinator
$40 - $47 per hour
...Lead Clinical Research Coordinator - Contract - Bronx, NY Ready to be the heartbeat of medical research? Join our client as Lead Clinical Research... ...and administration, and lab specimen processing Evaluate and screen potential subjects for protocol eligibility; manage...
Hourly pay
Contract work
Work at office
Proclinical Group
New York, NY
3 days ago
Medical Technologist, Lead
$39.11 - $50 per hour
...Job Title: Medical Technologist, Lead Location: Jersey City Medical Center Department... ...preparation and capital equipment evaluation. Qualifications: Required:... ...high-quality patient care, education, and research to address both the clinical and social...
Hourly pay
Full time
Temporary work
Work experience placement
Local area
Flexible hours
Shift work
Afternoon shift
RWJBarnabas Health
Jersey City, NJ
7 days ago
