Sign up to access all features of our service.
  • Job search
  • Favorites
  • Create a CV
    New
  • Salaries
  • Subscriptions

Staff Applied Researcher, AI Quality

$140.4k - $372.3k

GitHub, Inc.

Locations In this role you can work from Remote, United States Overview At GitHub, we’re building the next generation of AI‑powered developer experiences. We’re looking for a Staff Applied Researcher with deep expertise in Large Language Model (LLM) evaluation, LLM agents, strong engineering instincts, and a bias for action to help shape the future of GitHub Copilot and our AI platform. This is a high‑impact role where you will design evaluation systems that directly influence how millions of developers experience AI every day. Responsibilities Lead Model Quality & Evaluation Design next‑generation evaluation frameworks for code generation, reasoning, safety, multimodal tasks, and agentic workflows. Develop scalable automatic metrics, LLM‑judge systems, reward models, and human‑in‑the‑loop evaluation pipelines. Establish high‑signal, repeatable methodologies that influence product decisions across GitHub AI. Drive Applied Research & Engineering Build and optimize evaluation tooling, datasets, benchmarking systems, and experimentation pipelines. Create and onboard new benchmarks for the hardest tasks for the coding agents. Collaborate closely with engineering teams to productionize research, validate improvements, and accelerate model iteration cycles. Own end‑to‑end quality insights for the models behind GitHub Copilot and new AI features. Work closely with product development, engineering, and design teams to integrate advanced research findings into practical applications, ensuring alignment with product goals and user needs. Influence, Mentor & Lead Shape GitHub’s strategy for model quality, alignment, and evaluation. Mentor other researchers and engineers, helping elevate technical standards across the organization. Drive clarity in ambiguous problem spaces and champion fast, high‑quality execution. Qualifications Required Qualifications Bachelor\'s degree in Data Science, Mathematics, Physics, Statistics, Economics, Operations Research, Computer Science, or related field AND 8+ years\' experience in data science (e.g., managing structured and unstructured data, applying statistical techniques) or related field OR master\'s degree in Data Science, Mathematics, Physics, Statistics, Economics, Operations Research, Computer Science, or related field AND 6+ years\' experience in data science (e.g., managing structured and unstructured data, applying statistical techniques) or related field OR doctorate in Data Science, Mathematics, Physics, Statistics, Economics, Operations Research, Computer Science, or related field AND 4+ years\' experience in data science (e.g., managing structured and unstructured data, applying statistical techniques) or related field OR equivalent experience. 3+ years of strong engineering skills in Python/Typescript and experience building production grade evaluation or data/ML pipelines at scale. Proven track record shipping research or evaluation systems in production environments. Strong cross‑functional communication and influence skills. Preferred Qualifications Experience with LLM judge systems, reward modeling, alignment, or safety evaluations. Background in code generation, developer tools, or AI‑assisted programming. Experience with large‑scale experimentation and online/offline evaluation strategies. Open‑source contributions or experience working with developer communities. Experience designing and leading complex research projects from ideation to implementation Ability to define and articulate data‑driven strategies that consider cross‑functional impacts and align with organizational priorities, particularly in a software development platform context Compensation The base salary range for this job is USD $140,400.00 - USD $372,300.00 /Yr. These pay ranges are intended to cover roles based across the United States. An individual\'s base pay depends on various factors including geographical location and review of experience, knowledge, skills, abilities of the applicant. At GitHub certain roles are eligible for benefits and additional rewards, including annual bonus and stock. These rewards are allocated based on individual impact in role. In addition, certain roles also have the opportunity to earn sales incentives based on revenue or utilization, depending on the terms of the plan and the employee\'s role. GitHub values Customer-obsessed Ship to learn Growth mindset Own the outcome Better together Diverse and inclusive Manager fundamentals Model Coach Care Leadership principles Create clarity Generate energy Deliver success Who We Are GitHub is the world’s leading AI-powered developer platform with 150 million developers and counting. We’re also home to the biggest open-source community on earth (and 99% of the world’s software has open-source code in its DNA). Many of the apps and programs you use every day are built on GitHub. Our teams are dreamers, doers, and pioneers, leading the way in AI, driving humanitarian efforts around the globe, and even sending open source to Mars (and beyond!). At GitHub, our goal is to create the space you need to do your best work. We’re remote-first and offer competitive pay, generous learning and growth opportunities, and excellent benefits to support you, wherever you are—because we know that people flourish when they can work on their own terms. Join us, and let’s change the world, together. EEO Statement GitHub is made up of people from a wide variety of backgrounds and lifestyles. We embrace diversity and invite applications from people of all walks of life. We don\'t discriminate against employees or applicants based on gender identity or expression, sexual orientation, race, religion, age, national origin, citizenship, disability, pregnancy status, veteran status, or any other differences. Also, if you have a disability, please let us know if there\'s any way we can make the interview process better for you; we\'re happy to accommodate! #J-18808-Ljbffr GitHub, Inc.

Vacancy posted 4 days ago
Similar jobs that could be interesting for youBased on the Staff Applied Researcher, AI Quality in New York, NY vacancy
  • $150k - $250k

     ...Distyl AI Job Opportunity Distyl is an applied AI technology company partnering with the world's most ambitious...  ...global social organizations. We research and deploy technologies that power...  ...capture reasoning depth, interaction quality, reliability, and operational impact... 
    Quality
    Work at office
    3 days per week

    Distyl AI

    New York, NY
    3 days ago
  • $125k - $160k

    About the Role We’re looking for an AI Applied Researcher to join our growing team and help push the boundaries of how we build and improve our...  ...team tackle their toughest problems and level up the quality of their work. This is not a customer‑facing role — you’ll be... 
    Quality

    Hello Patient

    New York, NY
    3 days ago
  • We are seeking a Staff User Researcher to define how research operates across...  ...prioritization processes, governance, quality standards, participant...  ...judgment on when and how to apply them Experience designing...  ...modern research tools and AI‑assisted workflows to increase... 
    Quality
    Work at office
    Local area
    Remote work
    Flexible hours

    Buyers Edge Platform, LLC

    New York, NY
    4 days ago
  • $189k - $274k

    Staff UX Quantitative Researcher, Search AI, Quality and Trust You will have an opportunity to share your preferred working location from the following: Mountain...  ...practical experience. 8 years of experience in an applied research setting, or similar. Experience in... 
    Quality
    Full time

    Google Inc.

    New York, NY
    2 days ago
  • $218.7k - $249.6k

     ...Overview Applied Researcher I (AI Foundations) Overview: At Capital One, we are creating trustworthy and reliable AI systems, changing...  ...A professional with a track record of coming up with high quality ideas or improving upon existing ideas in machine learning,... 
    Quality
    Full time
    Part time
    Local area
    Flexible hours

    Capital One

    New York, NY
    6 days ago
  • $218.7k - $249.6k

     ...Overview Applied Researcher I (AI Foundations, LLM Customization, Finetuning, Reinforcement Learning) Overview: At Capital One, we are...  ...A professional with a track record of coming up with high quality ideas or improving upon existing ideas in machine learning,... 
    Quality
    Full time
    Part time
    Local area
    Flexible hours

    Capital One

    New York, NY
    a month ago
  • Google Inc. in New York seeks a Staff UX Quantitative Researcher to influence product design through empirical user research. The role requires 8+ years of experience in research settings and programming skills in Python, R, and other languages. Candidates should have... 
    Quality

    Google Inc.

    New York, NY
    3 days ago
  •  ...distributed systems to be able to consume new research on consensus & scaling (should have, for...  ...methods or an eagerness to learn and apply them Understanding of the fundamentals of...  ...curiosity. Bonuses for those who have published quality research in one of the topics mentioned... 
    Quality
    Remote job

    WorksHub

    New York, NY
    4 days ago
  •  ...we are creating trustworthy and reliable AI systems, changing banking for good. We are...  ...to banking. We are building world-class applied science and engineering teams and scalable...  ...to life, touching every aspect of the research lifecycle, from partnering with academia... 
    Full time
    Part time

    Capital One

    New York, NY
    4 days ago
  • $262.5k - $299.6k

    Applied Researcher II (AI Foundations, LLM Core and Agentic AI) At Capital One, we are creating trustworthy and reliable AI systems, changing banking for good. For years, Capital One has been leading the industry in using machine learning to create real‑time, intelligent... 
    Full time
    Part time
    Local area
    Flexible hours

    Capital One National Association

    New York, NY
    1 day ago
  • A leading financial institution is seeking an Applied Researcher I to help develop trustworthy AI systems. This role requires collaboration with data scientists and engineers to deliver innovative products that transform customer interactions. Candidates should have a strong... 

    Capital One National Association

    New York, NY
    1 day ago
  • A leading financial services company is seeking an Applied Researcher II to enhance AI solutions that transform customer interactions. This role in New York requires collaboration with various stakeholders and applying cutting-edge AI techniques. Ideal candidates will have... 

    Capital One National Association

    New York, NY
    1 day ago
  • $19.23 per hour

     ...Casual Staff - Rabbinic Researcher Job no: 498341 Work type: Staff Part-time, Temporary and/or Hourly Department: RIETS...  ...professional development, YU is committed to ensuring a high quality of life for its employees, offering a work environment that... 
    Quality
    Hourly pay
    Temporary work
    Part time
    Casual work
    Local area
    Worldwide

    Yeshiva University

    New York, NY
    1 day ago
  • Applied Research - Artificial Intelligence - Associate Applied Research - Artificial Intelligence...  ...management services. They are seeking an Applied AI Researcher to drive the adoption of...  ..., testing, and maintaining high-quality, production-ready code. Demonstrating technical... 
    Quality
    Full time

    Jobright.ai

    New York, NY
    11 hours ago
  • $96.13k - $155.95k

     ...Job Description: The Applied Machine Learning Scientist II...  ...lead role in the development of AI/ML systems to solve a range of...  ...emerging industry trends, academic research, and evolving AI technologies,...  ...service to the business, quality, innovation and teamwork and ensures... 
    Quality
    Work experience placement
    Work at office
    Local area
    Work from home
    Flexible hours

    TD Bank Group

    New York, NY
    3 days ago
  •  ...content, meal logging, recipe/meal planning, AI chat, and more. We already have...  ...here. About The Role We’re looking for a Staff User Researcher to lead strategic research across...  ...experience Key Qualifications 8-12+ years of applied user research experience in high‑growth... 
    Full time
    Live in
    Work at office
    Remote work

    Nourish

    New York, NY
    4 days ago
  • $45 - $55 per hour

    Title: Applied Scientist AI/ML Intern Location: Remote/United States About Us Wealth.com is the industry...  ...and improve the end‑to‑end AI quality of our production systems. Effectively...  ...g., statistics, predictive analytics, research) Experience building sophisticated RAG... 
    Quality
    Internship
    Remote work

    Wealth.com

    New York, NY
    4 days ago
  •  ...Senior Customer AI Researcher Bluefish is on the cutting edge of AI-powered SaaS, helping brands understand their performance and optimization...  ...Commitment to craftsmanship—you take pride in delivering high-quality work, paying attention to the small details while keeping the... 
    Quality
    Immediate start

    Bluefish AI

    New York, NY
    2 days ago
  •  ...commercial opportunities in the broader physical AI realm. You will conceptualize, validate,...  ...teams at Runway, especially with applied research. What you’ll do Research and identify...  ...-growth environment, with a focus on quality, scale, and automation Ability to work... 
    Quality
    Contract work
    Remote work

    runwayml.com

    New York, NY
    4 days ago
  • $40 per hour

    A technology firm is seeking an Applied Mathematician to train AI models in a remote role. Responsibilities include providing complex math problems...  ...chatbots, measuring their performance, and ensuring output quality. Ideal candidates should have strong mathematical skills... 
    Quality
    Remote job
    Hourly pay
    Contract work
    Flexible hours

    DataAnnotation

    New York, NY
    1 day ago
  • $20 per hour

    A leading AI training company is seeking individuals to help train AI chatbots. This...  ...generating diverse conversations, writing quality responses, and fact-checking. Candidates...  ...degree, and possess excellent writing and research abilities. Compensation starts at $20+ USD... 
    Quality
    Remote job
    Hourly pay

    DataAnnotation

    New York, NY
    4 days ago
  • $144k - $286k

     ...agency partners Verify the quality of their digital...  ...DoubleVerify is hiring a Staff Enterprise Architect to...  ...reusable patterns. AI / LLM enablement...  ...systems and how they can be applied pragmatically in an enterprise...  ...Not-so-fun fact: Research shows that while men apply... 
    Quality
    Live in

    DoubleVerify

    New York, NY
    3 days ago
  • $20 per hour

    Join us as a UX Researcher at DataAnnotation . We are looking for a UX...  ...train and improve cutting‑edge AI models. You’ll assess and...  ...aesthetics, usability, and creative quality. Your insights will directly...  ...this stage, designers should apply through our core assessment and... 
    Quality
    Full time
    Contract work
    Part time
    Remote work

    DataAnnotation

    Brooklyn, NY
    2 days ago
  • $17 - $34.12 per hour

     ...Job Title Research Program Coordinator Job Description The...  ...Assistants and other itinerant staff needed for research projects...  ...our time — discovering and applying new scientific learning and knowledge...  ...by delivering high-quality care to all who need it. Through... 
    Quality
    Hourly pay
    Full time
    Traineeship
    Work at office
    Local area
    Shift work

    Mount Sinai Hospital

    Elmhurst, NY
    8 days ago
  • A leading data services company is seeking an Applied Mathematician to evaluate AI models by providing complex mathematical problems to chatbots and assessing their outputs for quality and performance. This role offers flexibility with fully remote work and allows you... 
    Quality
    Remote job
    Hourly pay

    DataAnnotation

    New York, NY
    4 days ago
  •  ...security-first enterprise AI company. We build...  ...customers. Cohere is a team of researchers, engineers, designers,...  ...This is not a typical "Applied Scientist" or "ML...  ...a Member of Technical Staff, Applied ML, you will:...  ...engineering and modeling quality across the organization... 
    Quality
    Full time
    Work at office
    Local area
    Remote work
    Home office
    Flexible hours

    Cohere

    New York, NY
    1 day ago
  • A leading financial tech firm is seeking an expert in natural language processing (NLP) to lead AI-driven projects. The ideal candidate will hold a PhD or equivalent experience, with a proven track record in developing large-scale deep learning models and multiple publications... 

    Capital One

    New York, NY
    4 days ago
  • $197k - $290k

     ...of any specified location above. We are AI Native We are building an AI native company...  ...the tradeoffs between latency, cost, and quality. You build the eval harness that tells us...  ...We encourage people of all backgrounds to apply. We believe that a diversity of perspectives... 
    Quality
    Work at office
    Remote work
    Flexible hours

    Life360

    New York, NY
    4 days ago
  •  ...playbook exists. Quantitative researchers are central to that mission....  ...user experience questions, applying the right methodology to the...  ...fielding logistics - ensuring data quality and participant criteria are...  ..., digital twin modeling, and AI-augmented research approaches... 
    Quality
    Local area
    Immediate start

    AnswerLab

    New York, NY
    4 days ago
  • $192k - $278k

     ...cybersecurity category built for the way people and AI agents work today. As one of the most...  ...are scalable, secure, and meet high quality standards. You have experience setting technical...  ...practices, identifying opportunities to apply AI in meaningful ways, and driving... 
    Quality
    Currently hiring
    Local area
    Immediate start
    Remote work
    Work from home

    1Password

    New York, NY
    4 days ago

Do you want to receive more vacancies?

Subscribe and receive similar vacancies to Staff Applied Researcher, AI Quality. Be the first to apply!