LLM Evaluation & Research Lead
$260k - $350kScale AI
A leading AI technology company in New York is seeking a Tech Lead Manager for their LLM Evals Research team. In this role, you will lead a team focused on developing innovative evaluation methodologies for large language models. The ideal candidate has extensive experience in NLP, previous leadership roles, and a track record of published research. Competitive salary of $260,000 — $350,000 is offered for this full-time position. #J-18808-Ljbffr
$103k - $174k
...Resource Innovations is hiring a Sr. Qualitative Research & Evaluation Team Lead to oversee qualitative evaluations for energy efficiency programs. The ideal candidate will have over 5 years of relevant experience and a Bachelor's degree in a related field, with proficiency...Suggested$80k - $95k
...Prison Fellowship Financial is seeking a Research and Program Evaluation Manager to enhance program efficiency through evaluation and learning initiatives. The ideal candidate will have over 8 years of experience in measurement and evaluation within mission-driven organizations...SuggestedRemote work$147.6k - $274.2k
...Lead Applied Scientist, Document Understanding About the... ...model development, distillation, evaluation, and deployment. You publish,... ...taxonomies Develop LLM-based knowledge graph construction... ...systems into production - not research-only experience ~ Publications...SuggestedLocal areaFlexible hours$140k - $155k
...Responsibilities: We are looking for a Research Lead with a passion for ensuring that reader-... ...for rapid prototype iteration. Own evaluation for quality and editorial integrity: Create... ...industry trends, new research methodologies, LLM advancements, and emerging technologies....SuggestedFull timeWork at officeLocal areaRemote workFlexible hours$150 - $180 per hour
...A financial services firm in the United States is hiring an Expert Equities Research Reviewer to evaluate AI-generated investment research reports. Responsibilities include reviewing equity research for accuracy, assessing investment theses, and providing structured feedback...SuggestedHourly pay$100k - $160k
...a way,” to redefine the future of pet ownership together. Research Lead Fi is hiring a Research Lead to own our research function... ...during the hiring process. As a fast‑growing Series B startup, Fi evaluates compensation opportunistically to align with the right...Work at officeLocal areaFlexible hours- ...is a quickly growing group of committed researchers, engineers, policy experts, and business... ...About The Role As a Research Lead on the Training Insights team, you'll develop... ...you'll drive original research into new evaluation methodologies while leading a small team...Work at officeVisa sponsorshipFlexible hoursShift work
- ...NewtonX is seeking an ML Lead to serve as the primary technical point of contact for clients' applied science and ML teams. In this... ...in applied machine learning will enable you to deliver precise evaluation systems, fostering growth and innovation across diverse industries...
$220k - $270k
...Learn more about working at dYdX Responsibilities As the Research Lead at dYdX, you will drive strategic insights at the... ...continuously enhance data infrastructure by optimizing systems, evaluating new tools, and promoting data best practices across the organization...Work experience placement$212k - $386.3k
...AIML- Compliance & Policies Lead, Evaluation New York City, New York, United States Machine Learning and AI Imagine what you could do here. At Apple, new ideas have a way of becoming great products, services, and customer experiences very quickly. Bring passion and dedication...Relocation- ...Capital One National Association is seeking a Manager for the Data Science - LLM Customization Team in New York. The focus is on utilizing AI technologies to create innovative financial solutions that enhance customer interactions. Candidates must possess a solid background...
- ...Turing is seeking a licensed physician to improve AI clinical reasoning. You'll design evaluation methods and assess AI performance on real medical challenges. This flexible remote role allows up to 30 hours per week over a month, with possible extensions based on performance...Remote workFlexible hours
- ...EPAM Systems, Inc. is seeking a Lead Python Developer to guide the technical strategy and hands-on development within the team. This role emphasizes backend AI services and LLM orchestration, offering an opportunity to produce production-ready code while coaching fellow...
- TwinThread is seeking a highly skilled professional to contribute to the development of innovative AI solutions using Large Language Models (LLM) in New York. The role involves collaborating with diverse teams to address key challenges in the payments domain while maintaining...
$150k - $250k
A leading financial technology company is seeking an experienced AI/LLM Product Engineer in New York City. The role involves designing and building systems that translate user intent into structured workflows using large language models. Candidates should have extensive...- ...York City, NYRR is a 501(c)(3) organization. Description The Evaluation Lead is a key role that manages and implements NYRR’s evaluation... ...conducting formative and summative program evaluations or applied research 3+ years of experience with quantitative and qualitative...Work at officeLocal area2 days per week
- Scheurer is seeking an Evaluation Manager to lead the design and implementation of evaluations for medical educational programs. The role involves extensive collaboration with faculty, data analysis, and visualization of evaluation data. Candidates should have 3 to 5 years...
$105.94k - $158.91k
...Universal Ads is seeking a Measurement Science Lead to help scale and operationalize how we... ...lift, holdout, DiD, reverse holdout). Evaluate test power, holdout allocation, duration,... ...(Programming Language) Experimental Research Design Stakeholder Influence Salary Primary...Work experience placement$140k - $250k
...Research Lead, LLMs, Games & Multi-Agent Environments Toronto/NYC • $140-250K The Opportunity Good Start Labs builds games that make... ...Published research in RL, multi-agent systems, game theory, or LLM training Strong Python and PyTorch/JAX skills—you implement...Remote workVisa sponsorshipFlexible hours$187.5k - $281.24k
...services to define and evolve architectural direction. The ideal candidate will lead design and mentor engineers while building scalable API-driven solutions in Go, focusing on modern LLM systems. Qualifications include extensive experience in software engineering, proficiency...- ...automate workflows - they replace them. We're looking for a researcher to lead our applied AI team to work towards our goal of fully... ...use, memory, planning, and multi-agent coordination Lead evaluations: build datasets, success criteria, and continuous benchmarks...
- ...collaborate with teams on observability and reliability. The ideal candidate has over 5 years of experience, especially with running LLM platforms, and expert-level skills in Python. Join us to participate in projects with top brands across the globe. #J-18808-Ljbffr...
- ...We are seeking an expert to evaluate and improve our AI models through comprehensive testing and analysis. You will be responsible for designing evaluation frameworks, conducting model assessments, and providing actionable insights for model improvement. Key Responsibilities...
- ...A progressive AI research organization based in New York is looking for a Research Lead to develop and lead evaluations of AI model capabilities. In this influential role, you'll contribute to impactful research frameworks while mentoring a team of researchers. Ideal...
$130 per hour
...who are board-certified or licensed and have strong evidence-based reasoning and communication skills. Responsibilities include evaluating clinical outputs and delivering feedback to enhance quality and accuracy. Compensation ranges from $130 to $300 per hour, reflecting...Hourly payRemote work- ...licensed psychologist to provide psychological services. In this role, you will design and administer psychological tests, conduct evaluations, lead individual and group counseling sessions, and supervise lower-level staff. Candidates must have a doctoral degree in clinical...Work at office
- Lead Project Evaluator The Lead Project Evaluator will independently design, implement and supervise program evaluations, qualitative or quantitative research studies, needs assessments, data dashboards—including selecting evaluation approaches, developing logic models...Local area
- ...municipal health department in New York City seeks a qualified individual to provide analytic leadership for NYC TeenSpace, overseeing evaluation and support for mental health services for youth. Candidates should have a minimum of a master's degree in a relevant field and...
$40 - $47 per hour
...Lead Clinical Research Coordinator - Contract - Bronx, NY Ready to be the heartbeat of medical research? Join our client as Lead Clinical Research... ...and administration, and lab specimen processing Evaluate and screen potential subjects for protocol eligibility; manage...Hourly payContract workWork at office$39.11 - $50 per hour
...Job Title: Medical Technologist, Lead Location: Jersey City Medical Center Department... ...preparation and capital equipment evaluation. Qualifications: Required:... ...high-quality patient care, education, and research to address both the clinical and social...Hourly payFull timeTemporary workWork experience placementLocal areaFlexible hoursShift workAfternoon shift
Do you want to receive more vacancies?
Subscribe and receive similar vacancies to LLM Evaluation & Research Lead. Be the first to apply!
- anthropology research New York, NY
- research dietitian New York, NY
- history research New York, NY
- education policy research New York, NY
- research pharmacist New York, NY
- research professional New York, NY
- student research intern New York, NY
- research intern New York, NY
- physics research New York, NY
- nutrition research New York, NY

