Staff Applied Researcher, AI Quality
$140.4k - $372.3kGitHub, Inc.
Locations In this role you can work from Remote, United States Overview At GitHub, we’re building the next generation of AI‑powered developer experiences. We’re looking for a Staff Applied Researcher with deep expertise in Large Language Model (LLM) evaluation, LLM agents, strong engineering instincts, and a bias for action to help shape the future of GitHub Copilot and our AI platform. This is a high‑impact role where you will design evaluation systems that directly influence how millions of developers experience AI every day. Responsibilities Lead Model Quality & Evaluation Design next‑generation evaluation frameworks for code generation, reasoning, safety, multimodal tasks, and agentic workflows. Develop scalable automatic metrics, LLM‑judge systems, reward models, and human‑in‑the‑loop evaluation pipelines. Establish high‑signal, repeatable methodologies that influence product decisions across GitHub AI. Drive Applied Research & Engineering Build and optimize evaluation tooling, datasets, benchmarking systems, and experimentation pipelines. Create and onboard new benchmarks for the hardest tasks for the coding agents. Collaborate closely with engineering teams to productionize research, validate improvements, and accelerate model iteration cycles. Own end‑to‑end quality insights for the models behind GitHub Copilot and new AI features. Work closely with product development, engineering, and design teams to integrate advanced research findings into practical applications, ensuring alignment with product goals and user needs. Influence, Mentor & Lead Shape GitHub’s strategy for model quality, alignment, and evaluation. Mentor other researchers and engineers, helping elevate technical standards across the organization. Drive clarity in ambiguous problem spaces and champion fast, high‑quality execution. Qualifications Required Qualifications Bachelor\'s degree in Data Science, Mathematics, Physics, Statistics, Economics, Operations Research, Computer Science, or related field AND 8+ years\' experience in data science (e.g., managing structured and unstructured data, applying statistical techniques) or related field OR master\'s degree in Data Science, Mathematics, Physics, Statistics, Economics, Operations Research, Computer Science, or related field AND 6+ years\' experience in data science (e.g., managing structured and unstructured data, applying statistical techniques) or related field OR doctorate in Data Science, Mathematics, Physics, Statistics, Economics, Operations Research, Computer Science, or related field AND 4+ years\' experience in data science (e.g., managing structured and unstructured data, applying statistical techniques) or related field OR equivalent experience. 3+ years of strong engineering skills in Python/Typescript and experience building production grade evaluation or data/ML pipelines at scale. Proven track record shipping research or evaluation systems in production environments. Strong cross‑functional communication and influence skills. Preferred Qualifications Experience with LLM judge systems, reward modeling, alignment, or safety evaluations. Background in code generation, developer tools, or AI‑assisted programming. Experience with large‑scale experimentation and online/offline evaluation strategies. Open‑source contributions or experience working with developer communities. Experience designing and leading complex research projects from ideation to implementation Ability to define and articulate data‑driven strategies that consider cross‑functional impacts and align with organizational priorities, particularly in a software development platform context Compensation The base salary range for this job is USD $140,400.00 - USD $372,300.00 /Yr. These pay ranges are intended to cover roles based across the United States. An individual\'s base pay depends on various factors including geographical location and review of experience, knowledge, skills, abilities of the applicant. At GitHub certain roles are eligible for benefits and additional rewards, including annual bonus and stock. These rewards are allocated based on individual impact in role. In addition, certain roles also have the opportunity to earn sales incentives based on revenue or utilization, depending on the terms of the plan and the employee\'s role. GitHub values Customer-obsessed Ship to learn Growth mindset Own the outcome Better together Diverse and inclusive Manager fundamentals Model Coach Care Leadership principles Create clarity Generate energy Deliver success Who We Are GitHub is the world’s leading AI-powered developer platform with 150 million developers and counting. We’re also home to the biggest open-source community on earth (and 99% of the world’s software has open-source code in its DNA). Many of the apps and programs you use every day are built on GitHub. Our teams are dreamers, doers, and pioneers, leading the way in AI, driving humanitarian efforts around the globe, and even sending open source to Mars (and beyond!). At GitHub, our goal is to create the space you need to do your best work. We’re remote-first and offer competitive pay, generous learning and growth opportunities, and excellent benefits to support you, wherever you are—because we know that people flourish when they can work on their own terms. Join us, and let’s change the world, together. EEO Statement GitHub is made up of people from a wide variety of backgrounds and lifestyles. We embrace diversity and invite applications from people of all walks of life. We don\'t discriminate against employees or applicants based on gender identity or expression, sexual orientation, race, religion, age, national origin, citizenship, disability, pregnancy status, veteran status, or any other differences. Also, if you have a disability, please let us know if there\'s any way we can make the interview process better for you; we\'re happy to accommodate! #J-18808-Ljbffr GitHub, Inc.
$150k - $250k
...Distyl AI Job Opportunity Distyl is an applied AI technology company partnering with the world's most ambitious... ...global social organizations. We research and deploy technologies that power... ...capture reasoning depth, interaction quality, reliability, and operational impact...QualityWork at office3 days per week$125k - $160k
About the Role We’re looking for an AI Applied Researcher to join our growing team and help push the boundaries of how we build and improve our... ...team tackle their toughest problems and level up the quality of their work. This is not a customer‑facing role — you’ll be...Quality- We are seeking a Staff User Researcher to define how research operates across... ...prioritization processes, governance, quality standards, participant... ...judgment on when and how to apply them Experience designing... ...modern research tools and AI‑assisted workflows to increase...QualityWork at officeLocal areaRemote workFlexible hours
$189k - $274k
Staff UX Quantitative Researcher, Search AI, Quality and Trust You will have an opportunity to share your preferred working location from the following: Mountain... ...practical experience. 8 years of experience in an applied research setting, or similar. Experience in...QualityFull time$218.7k - $249.6k
...Overview Applied Researcher I (AI Foundations) Overview: At Capital One, we are creating trustworthy and reliable AI systems, changing... ...A professional with a track record of coming up with high quality ideas or improving upon existing ideas in machine learning,...QualityFull timePart timeLocal areaFlexible hours$218.7k - $249.6k
...Overview Applied Researcher I (AI Foundations, LLM Customization, Finetuning, Reinforcement Learning) Overview: At Capital One, we are... ...A professional with a track record of coming up with high quality ideas or improving upon existing ideas in machine learning,...QualityFull timePart timeLocal areaFlexible hours- Google Inc. in New York seeks a Staff UX Quantitative Researcher to influence product design through empirical user research. The role requires 8+ years of experience in research settings and programming skills in Python, R, and other languages. Candidates should have...Quality
- ...distributed systems to be able to consume new research on consensus & scaling (should have, for... ...methods or an eagerness to learn and apply them Understanding of the fundamentals of... ...curiosity. Bonuses for those who have published quality research in one of the topics mentioned...QualityRemote job
- ...we are creating trustworthy and reliable AI systems, changing banking for good. We are... ...to banking. We are building world-class applied science and engineering teams and scalable... ...to life, touching every aspect of the research lifecycle, from partnering with academia...Full timePart time
$262.5k - $299.6k
Applied Researcher II (AI Foundations, LLM Core and Agentic AI) At Capital One, we are creating trustworthy and reliable AI systems, changing banking for good. For years, Capital One has been leading the industry in using machine learning to create real‑time, intelligent...Full timePart timeLocal areaFlexible hours- A leading financial institution is seeking an Applied Researcher I to help develop trustworthy AI systems. This role requires collaboration with data scientists and engineers to deliver innovative products that transform customer interactions. Candidates should have a strong...
- A leading financial services company is seeking an Applied Researcher II to enhance AI solutions that transform customer interactions. This role in New York requires collaboration with various stakeholders and applying cutting-edge AI techniques. Ideal candidates will have...
$19.23 per hour
...Casual Staff - Rabbinic Researcher Job no: 498341 Work type: Staff Part-time, Temporary and/or Hourly Department: RIETS... ...professional development, YU is committed to ensuring a high quality of life for its employees, offering a work environment that...QualityHourly payTemporary workPart timeCasual workLocal areaWorldwide- Applied Research - Artificial Intelligence - Associate Applied Research - Artificial Intelligence... ...management services. They are seeking an Applied AI Researcher to drive the adoption of... ..., testing, and maintaining high-quality, production-ready code. Demonstrating technical...QualityFull time
$96.13k - $155.95k
...Job Description: The Applied Machine Learning Scientist II... ...lead role in the development of AI/ML systems to solve a range of... ...emerging industry trends, academic research, and evolving AI technologies,... ...service to the business, quality, innovation and teamwork and ensures...QualityWork experience placementWork at officeLocal areaWork from homeFlexible hours- ...content, meal logging, recipe/meal planning, AI chat, and more. We already have... ...here. About The Role We’re looking for a Staff User Researcher to lead strategic research across... ...experience Key Qualifications 8-12+ years of applied user research experience in high‑growth...Full timeLive inWork at officeRemote work
$45 - $55 per hour
Title: Applied Scientist AI/ML Intern Location: Remote/United States About Us Wealth.com is the industry... ...and improve the end‑to‑end AI quality of our production systems. Effectively... ...g., statistics, predictive analytics, research) Experience building sophisticated RAG...QualityInternshipRemote work- ...Senior Customer AI Researcher Bluefish is on the cutting edge of AI-powered SaaS, helping brands understand their performance and optimization... ...Commitment to craftsmanship—you take pride in delivering high-quality work, paying attention to the small details while keeping the...QualityImmediate start
- ...commercial opportunities in the broader physical AI realm. You will conceptualize, validate,... ...teams at Runway, especially with applied research. What you’ll do Research and identify... ...-growth environment, with a focus on quality, scale, and automation Ability to work...QualityContract workRemote work
$40 per hour
A technology firm is seeking an Applied Mathematician to train AI models in a remote role. Responsibilities include providing complex math problems... ...chatbots, measuring their performance, and ensuring output quality. Ideal candidates should have strong mathematical skills...QualityRemote jobHourly payContract workFlexible hours$20 per hour
A leading AI training company is seeking individuals to help train AI chatbots. This... ...generating diverse conversations, writing quality responses, and fact-checking. Candidates... ...degree, and possess excellent writing and research abilities. Compensation starts at $20+ USD...QualityRemote jobHourly pay$144k - $286k
...agency partners Verify the quality of their digital... ...DoubleVerify is hiring a Staff Enterprise Architect to... ...reusable patterns. AI / LLM enablement... ...systems and how they can be applied pragmatically in an enterprise... ...Not-so-fun fact: Research shows that while men apply...QualityLive in$20 per hour
Join us as a UX Researcher at DataAnnotation . We are looking for a UX... ...train and improve cutting‑edge AI models. You’ll assess and... ...aesthetics, usability, and creative quality. Your insights will directly... ...this stage, designers should apply through our core assessment and...QualityFull timeContract workPart timeRemote work$17 - $34.12 per hour
...Job Title Research Program Coordinator Job Description The... ...Assistants and other itinerant staff needed for research projects... ...our time — discovering and applying new scientific learning and knowledge... ...by delivering high-quality care to all who need it. Through...QualityHourly payFull timeTraineeshipWork at officeLocal areaShift work- A leading data services company is seeking an Applied Mathematician to evaluate AI models by providing complex mathematical problems to chatbots and assessing their outputs for quality and performance. This role offers flexibility with fully remote work and allows you...QualityRemote jobHourly pay
- ...security-first enterprise AI company. We build... ...customers. Cohere is a team of researchers, engineers, designers,... ...This is not a typical "Applied Scientist" or "ML... ...a Member of Technical Staff, Applied ML, you will:... ...engineering and modeling quality across the organization...QualityFull timeWork at officeLocal areaRemote workHome officeFlexible hours
- A leading financial tech firm is seeking an expert in natural language processing (NLP) to lead AI-driven projects. The ideal candidate will hold a PhD or equivalent experience, with a proven track record in developing large-scale deep learning models and multiple publications...
$197k - $290k
...of any specified location above. We are AI Native We are building an AI native company... ...the tradeoffs between latency, cost, and quality. You build the eval harness that tells us... ...We encourage people of all backgrounds to apply. We believe that a diversity of perspectives...QualityWork at officeRemote workFlexible hours- ...playbook exists. Quantitative researchers are central to that mission.... ...user experience questions, applying the right methodology to the... ...fielding logistics - ensuring data quality and participant criteria are... ..., digital twin modeling, and AI-augmented research approaches...QualityLocal areaImmediate start
$192k - $278k
...cybersecurity category built for the way people and AI agents work today. As one of the most... ...are scalable, secure, and meet high quality standards. You have experience setting technical... ...practices, identifying opportunities to apply AI in meaningful ways, and driving...QualityCurrently hiringLocal areaImmediate startRemote workWork from home
Do you want to receive more vacancies?
Subscribe and receive similar vacancies to Staff Applied Researcher, AI Quality. Be the first to apply!
- visiting researcher New York, NY
- court researcher New York, NY
- remote researcher New York, NY
- independent researcher New York, NY
- machine learning researcher New York, NY
- qualitative researcher New York, NY
- security researcher New York, NY
- title researcher New York, NY
- trend researcher New York, NY
- product researcher New York, NY

