Staff Applied Researcher, AI Quality

$140.4k - $372.3k

GitHub, Inc.

Locations
In this role you can work from Remote, United States

Overview
At GitHub, we’re building the next generation of AI‑powered developer experiences. We’re looking for a Staff Applied Researcher with deep expertise in Large Language Model (LLM) evaluation, LLM agents, strong engineering instincts, and a bias for action to help shape the future of GitHub Copilot and our AI platform. This is a high‑impact role where you will design evaluation systems that directly influence how millions of developers experience AI every day.

Responsibilities

Lead Model Quality & Evaluation
- Design next‑generation evaluation frameworks for code generation, reasoning, safety, multimodal tasks, and agentic workflows.
- Develop scalable automatic metrics, LLM‑judge systems, reward models, and human‑in‑the‑loop evaluation pipelines.
- Establish high‑signal, repeatable methodologies that influence product decisions across GitHub AI.

Drive Applied Research & Engineering
- Build and optimize evaluation tooling, datasets, benchmarking systems, and experimentation pipelines.
- Create and onboard new benchmarks for the hardest coding‑agent tasks.
- Collaborate closely with engineering teams to productionize research, validate improvements, and accelerate model iteration cycles.
- Own end‑to‑end quality insights for the models behind GitHub Copilot and new AI features.
- Work closely with product development, engineering, and design teams to integrate advanced research findings into practical applications, ensuring alignment with product goals and user needs.

Influence, Mentor & Lead
- Shape GitHub’s strategy for model quality, alignment, and evaluation.
- Mentor other researchers and engineers, helping elevate technical standards across the organization.
- Drive clarity in ambiguous problem spaces and champion fast, high‑quality execution.

Qualifications

Required Qualifications
- Bachelor’s degree in Data Science, Mathematics, Physics, Statistics, Economics, Operations Research, Computer Science, or related field AND 8+ years’ experience in data science (e.g., managing structured and unstructured data, applying statistical techniques) or related field
- OR master’s degree in Data Science, Mathematics, Physics, Statistics, Economics, Operations Research, Computer Science, or related field AND 6+ years’ experience in data science (e.g., managing structured and unstructured data, applying statistical techniques) or related field
- OR doctorate in Data Science, Mathematics, Physics, Statistics, Economics, Operations Research, Computer Science, or related field AND 4+ years’ experience in data science (e.g., managing structured and unstructured data, applying statistical techniques) or related field
- OR equivalent experience.
- 3+ years of strong engineering skills in Python/TypeScript and experience building production‑grade evaluation or data/ML pipelines at scale.
- Proven track record shipping research or evaluation systems in production environments.
- Strong cross‑functional communication and influence skills.

Preferred Qualifications
- Experience with LLM judge systems, reward modeling, alignment, or safety evaluations.
- Background in code generation, developer tools, or AI‑assisted programming.
- Experience with large‑scale experimentation and online/offline evaluation strategies.
- Open‑source contributions or experience working with developer communities.
- Experience designing and leading complex research projects from ideation to implementation.
- Ability to define and articulate data‑driven strategies that consider cross‑functional impacts and align with organizational priorities, particularly in a software development platform context.

Compensation
The base salary range for this job is USD $140,400.00 - USD $372,300.00 /Yr. These pay ranges are intended to cover roles based across the United States. An individual’s base pay depends on various factors, including geographical location and a review of the applicant’s experience, knowledge, skills, and abilities.

At GitHub certain roles are eligible for benefits and additional rewards, including annual bonus and stock. These rewards are allocated based on individual impact in role. In addition, certain roles also have the opportunity to earn sales incentives based on revenue or utilization, depending on the terms of the plan and the employee’s role.

GitHub values
- Customer-obsessed
- Ship to learn
- Growth mindset
- Own the outcome
- Better together
- Diverse and inclusive

Manager fundamentals
- Model
- Coach
- Care

Leadership principles
- Create clarity
- Generate energy
- Deliver success

Who We Are
GitHub is the world’s leading AI-powered developer platform with 150 million developers and counting. We’re also home to the biggest open-source community on earth (and 99% of the world’s software has open-source code in its DNA). Many of the apps and programs you use every day are built on GitHub. Our teams are dreamers, doers, and pioneers, leading the way in AI, driving humanitarian efforts around the globe, and even sending open source to Mars (and beyond!).

At GitHub, our goal is to create the space you need to do your best work. We’re remote-first and offer competitive pay, generous learning and growth opportunities, and excellent benefits to support you, wherever you are, because we know that people flourish when they can work on their own terms. Join us, and let’s change the world, together.

EEO Statement
GitHub is made up of people from a wide variety of backgrounds and lifestyles. We embrace diversity and invite applications from people of all walks of life. We don’t discriminate against employees or applicants based on gender identity or expression, sexual orientation, race, religion, age, national origin, citizenship, disability, pregnancy status, veteran status, or any other differences. Also, if you have a disability, please let us know if there’s any way we can make the interview process better for you; we’re happy to accommodate!


Vacancy posted 4 days ago

