LLM Evaluation & Research Lead

$260k - $350k

Scale AI

A leading AI technology company in New York is seeking a Tech Lead Manager for its LLM Evals Research team. In this role, you will lead a team that develops evaluation methodologies for large language models. The ideal candidate has extensive experience in NLP, previous leadership experience, and a track record of published research. This full-time position offers a salary of $260,000 to $350,000.

Vacancy posted 2 days ago
Similar jobs that could be interesting for you
Based on the LLM Evaluation & Research Lead in New York, NY vacancy
  • $103k - $174k

     ...Resource Innovations is hiring a Sr. Qualitative Research & Evaluation Team Lead to oversee qualitative evaluations for energy efficiency programs. The ideal candidate will have over 5 years of relevant experience and a Bachelor's degree in a related field, with proficiency... 
    Suggested

    Remote Jobs

    New York, NY
    6 days ago
  • $80k - $95k

     ...Prison Fellowship Financial is seeking a Research and Program Evaluation Manager to enhance program efficiency through evaluation and learning initiatives. The ideal candidate will have over 8 years of experience in measurement and evaluation within mission-driven organizations... 
    Suggested
    Remote work

    Prison Fellowship Financial

    New York, NY
    6 days ago
  • $147.6k - $274.2k

     ...Lead Applied Scientist, Document Understanding About the...  ...model development, distillation, evaluation, and deployment. You publish,...  ...taxonomies Develop LLM-based knowledge graph construction...  ...systems into production - not research-only experience ~ Publications... 
    Suggested
    Local area
    Flexible hours

    Thomson Reuters

    New York, NY
    6 days ago
  • $140k - $155k

     ...Responsibilities: We are looking for a Research Lead with a passion for ensuring that reader-...  ...for rapid prototype iteration. Own evaluation for quality and editorial integrity: Create...  ...industry trends, new research methodologies, LLM advancements, and emerging technologies.... 
    Suggested
    Full time
    Work at office
    Local area
    Remote work
    Flexible hours

    The New York Times

    New York, NY
    1 day ago
  • $150 - $180 per hour

     ...A financial services firm in the United States is hiring an Expert Equities Research Reviewer to evaluate AI-generated investment research reports. Responsibilities include reviewing equity research for accuracy, assessing investment theses, and providing structured feedback... 
    Suggested
    Hourly pay

    Great Value Hiring

    New York, NY
    6 days ago
  • $100k - $160k

     ...a way,” to redefine the future of pet ownership together. Research Lead Fi is hiring a Research Lead to own our research function...  ...during the hiring process. As a fast‑growing Series B startup, Fi evaluates compensation opportunistically to align with the right... 
    Work at office
    Local area
    Flexible hours

    Fi

    New York, NY
    2 days ago
  •  ...is a quickly growing group of committed researchers, engineers, policy experts, and business...  ...About The Role As a Research Lead on the Training Insights team, you'll develop...  ...you'll drive original research into new evaluation methodologies while leading a small team... 
    Work at office
    Visa sponsorship
    Flexible hours
    Shift work

    Anthropic

    New York, NY
    2 days ago
  •  ...NewtonX is seeking an ML Lead to serve as the primary technical point of contact for clients' applied science and ML teams. In this...  ...in applied machine learning will enable you to deliver precise evaluation systems, fostering growth and innovation across diverse industries... 

    NewtonX

    New York, NY
    5 days ago
  • $220k - $270k

     ...Learn more about working at dYdX Responsibilities As the Research Lead at dYdX, you will drive strategic insights at the...  ...continuously enhance data infrastructure by optimizing systems, evaluating new tools, and promoting data best practices across the organization... 
    Work experience placement

    dYdX

    New York, NY
    2 days ago
  • $212k - $386.3k

     ...AIML- Compliance & Policies Lead, Evaluation New York City, New York, United States Machine Learning and AI Imagine what you could do here. At Apple, new ideas have a way of becoming great products, services, and customer experiences very quickly. Bring passion and dedication... 
    Relocation

    Apple

    New York, NY
    2 days ago
  •  ...Capital One National Association is seeking a Manager for the Data Science - LLM Customization Team in New York. The focus is on utilizing AI technologies to create innovative financial solutions that enhance customer interactions. Candidates must possess a solid background... 

    Capital One National Association

    New York, NY
    2 days ago
  •  ...Turing is seeking a licensed physician to improve AI clinical reasoning. You'll design evaluation methods and assess AI performance on real medical challenges. This flexible remote role allows up to 30 hours per week over a month, with possible extensions based on performance... 
    Remote work
    Flexible hours

    Turing Inc

    New York, NY
    3 days ago
  •  ...EPAM Systems, Inc. is seeking a Lead Python Developer to guide the technical strategy and hands-on development within the team. This role emphasizes backend AI services and LLM orchestration, offering an opportunity to produce production-ready code while coaching fellow... 

    EPAM Systems Inc

    New York, NY
    5 days ago
  • TwinThread is seeking a highly skilled professional to contribute to the development of innovative AI solutions using Large Language Models (LLM) in New York. The role involves collaborating with diverse teams to address key challenges in the payments domain while maintaining... 

    Aumni

    New York, NY
    4 days ago
  • $150k - $250k

    A leading financial technology company is seeking an experienced AI/LLM Product Engineer in New York City. The role involves designing and building systems that translate user intent into structured workflows using large language models. Candidates should have extensive... 

    Tradeweb

    New York, NY
    4 days ago
  •  ...York City, NYRR is a 501(c)(3) organization. Description The Evaluation Lead is a key role that manages and implements NYRR’s evaluation...  ...conducting formative and summative program evaluations or applied research 3+ years of experience with quantitative and qualitative... 
    Work at office
    Local area
    2 days per week

    New York Road Runners

    New York, NY
    2 days ago
  • Scheurer is seeking an Evaluation Manager to lead the design and implementation of evaluations for medical educational programs. The role involves extensive collaboration with faculty, data analysis, and visualization of evaluation data. Candidates should have 3 to 5 years... 

    https://www.scheurer.org/careers/

    Brooklyn, NY
    1 day ago
  • $105.94k - $158.91k

     ...Universal Ads is seeking a Measurement Science Lead to help scale and operationalize how we...  ...lift, holdout, DiD, reverse holdout). Evaluate test power, holdout allocation, duration,...  ...(Programming Language) Experimental Research Design Stakeholder Influence Salary Primary... 
    Work experience placement

    Comcast

    New York, NY
    4 days ago
  • $140k - $250k

     ...Research Lead, LLMs, Games & Multi-Agent Environments Toronto/NYC • $140-250K The Opportunity Good Start Labs builds games that make...  ...Published research in RL, multi-agent systems, game theory, or LLM training Strong Python and PyTorch/JAX skills—you implement... 
    Remote work
    Visa sponsorship
    Flexible hours

    Good Start Labs

    New York, NY
    5 days ago
  • $187.5k - $281.24k

     ...services to define and evolve architectural direction. The ideal candidate will lead design and mentor engineers while building scalable API-driven solutions in Go, focusing on modern LLM systems. Qualifications include extensive experience in software engineering, proficiency... 

    Commerce.com US, Inc.

    New York, NY
    5 days ago
  •  ...automate workflows - they replace them. We're looking for a researcher to lead our applied AI team to work towards our goal of fully...  ...use, memory, planning, and multi-agent coordination Lead evaluations: build datasets, success criteria, and continuous benchmarks... 

    The General Intelligence Company of New York

    New York, NY
    6 days ago
  •  ...collaborate with teams on observability and reliability. The ideal candidate has over 5 years of experience, especially with running LLM platforms, and expert-level skills in Python. Join us to participate in projects with top brands across the globe... 

    EPAM Systems Inc

    New York, NY
    3 days ago
  •  ...We are seeking an expert to evaluate and improve our AI models through comprehensive testing and analysis. You will be responsible for designing evaluation frameworks, conducting model assessments, and providing actionable insights for model improvement. Key Responsibilities... 

    MERIT Beauty

    New York, NY
    5 days ago
  •  ...A progressive AI research organization based in New York is looking for a Research Lead to develop and lead evaluations of AI model capabilities. In this influential role, you'll contribute to impactful research frameworks while mentoring a team of researchers. Ideal... 

    Anthropic

    New York, NY
    2 days ago
  • $130 per hour

     ...who are board-certified or licensed and have strong evidence-based reasoning and communication skills. Responsibilities include evaluating clinical outputs and delivering feedback to enhance quality and accuracy. Compensation ranges from $130 to $300 per hour, reflecting... 
    Hourly pay
    Remote work

    CuraSenseAI

    New York, NY
    6 days ago
  •  ...licensed psychologist to provide psychological services. In this role, you will design and administer psychological tests, conduct evaluations, lead individual and group counseling sessions, and supervise lower-level staff. Candidates must have a doctoral degree in clinical... 
    Work at office

    NYS Office for People With Developmental Disabilities

    New York, NY
    2 days ago
  • Lead Project Evaluator The Lead Project Evaluator will independently design, implement and supervise program evaluations, qualitative or quantitative research studies, needs assessments, data dashboards—including selecting evaluation approaches, developing logic models... 
    Local area

    University of Georgia

    Brooklyn, NY
    3 days ago
  •  ...municipal health department in New York City seeks a qualified individual to provide analytic leadership for NYC TeenSpace, overseeing evaluation and support for mental health services for youth. Candidates should have a minimum of a master's degree in a relevant field and... 

    NYC Department of Health and Mental Hygiene

    New York, NY
    4 days ago
  • $40 - $47 per hour

     ...Lead Clinical Research Coordinator - Contract - Bronx, NY Ready to be the heartbeat of medical research? Join our client as Lead Clinical Research...  ...and administration, and lab specimen processing Evaluate and screen potential subjects for protocol eligibility; manage... 
    Hourly pay
    Contract work
    Work at office

    Proclinical Group

    New York, NY
    3 days ago
  • $39.11 - $50 per hour

     ...Job Title: Medical Technologist, Lead Location: Jersey City Medical Center Department...  ...preparation and capital equipment evaluation. Qualifications: Required:...  ...high-quality patient care, education, and research to address both the clinical and social... 
    Hourly pay
    Full time
    Temporary work
    Work experience placement
    Local area
    Flexible hours
    Shift work
    Afternoon shift

    RWJBarnabas Health

    Jersey City, NJ
    7 days ago
