User Researcher, AI Evaluations

$196k - $230k

Who We Are

Notion is the collaborative AI workspace where teams and agents think together. We’re building one place where your knowledge, projects, meetings, and AI tools live side by side, so work is faster, clearer, and less fragmented. Millions of individuals, small teams, and large companies run their work on Notion. Notinos (our employees) are customer zero in bringing this future of work to life.

We care about craft, building things that last, and the belief that great work is still fundamentally human. Our goal isn’t to ship the next feature. Each and every team of Notinos is working to set the standard for how humans work together in the AI era. From building a business’s system of record to making and managing AI agents to automating away the busy work, we care deeply about giving our customers more time for their life’s work.

About the Role:

We’re seeking an experienced UX Researcher to define and scale how we evaluate Notion’s AI-powered experiences, focusing on what “good” looks like not only for model output quality, but for the end-to-end product experience where people discover, set goals, delegate work, review results, and build trust over time with AI. This role sits at the intersection of research craft and evaluation operations: you’ll run studies that uncover user mental models, expectations, and failure/recovery behaviors, then translate those insights into reusable rubrics, workflows, and measurement approaches that product, design, engineering, and data science can apply consistently.

This role can be based in either San Francisco or New York City. We work from our offices on Mondays, Tuesdays and Thursdays (our Anchor Days) because we do our best thinking and building together in person. We’re looking for someone who’s excited to work alongside the team during those days.
What You’ll Achieve:

  • Define what “good” looks like (frameworks & rubrics): Establish clear, reusable evaluation criteria that reflect real user expectations: helpfulness, trust, tone, control, and transparency. You’ll translate qualitative insight into scoring guidance that can be applied consistently across teams and over time.
  • Run recurring evals (longitudinal & feature-specific): Run recurring longitudinal and feature-specific surveys and studies to measure experience quality over time against defined rubrics. Lead qualitative studies, side-by-side comparisons, and human-in-the-loop evaluation efforts to deepen understanding of where experiences break down and how they can improve. You’ll help teams spot regressions, benchmark improvements, and understand when expectations shift.
  • Anchor evaluation in real workflows (context > isolated feedback): Ensure evals reflect jobs-to-be-done, user intent, and the full interaction journey (goal setting, delegation, review, iteration), not just decontextualized thumbs up/down. You’ll help teams understand who is evaluating, what they’re trying to do, and why outputs succeed or fail.
  • Identify failure modes & recovery behavior (guardrails): Uncover breakdowns, regressions, and edge cases across the system, from model behavior to UI and integrations, and study how people notice issues, correct them, and continue their work. You’ll turn these insights into actionable guidance for guardrails, fixes, and prioritization.
  • Operationalize evaluation with partners (process & tooling): Collaborate closely with Product, Design, Engineering, and Data Science to align on target use cases and build scalable evaluation loops (human-in-the-loop review, longitudinal studies, and calibration of automated/LLM-judge approaches against human judgment).
Skills You’ll Need to Bring:

  • Ability to operationalize insight into measurement: You’re comfortable turning “soft” user expectations (trust, tone, usefulness, clarity) into concrete rubrics, scoring guidelines, and observable metrics.
  • AI fluency and systems thinking: You’re curious and hands-on with AI products, and can reason about how model behavior, uncertainty, and system constraints shape user experience. You also have experience evaluating AI-enabled products (LLMs, agents, generative UI/workflow automation) and working with Data Science/ML partners on measurement strategy and evaluation tooling.
  • Clear communication and impact orientation: You can align diverse partners around shared definitions of quality and create artifacts that enable teams to act consistently. You tailor storytelling to different audiences, connect research to business outcomes, and drive follow-through so insights translate into product change.
  • Strong UX research craft (quant + qual): You can choose the right methods for the question (interviews, benchmarking, surveys, experiments) and synthesize the findings into actionable guidance.
  • Pragmatism in fast-moving environments: You can prioritize ruthlessly, work through ambiguity, and balance scrappy iteration with deep dives when needed.
  • Experience: 5+ years doing UX research in industry

Nice to Haves:

  • Familiarity with LLM-as-judge methods, prompt design for evaluators, or “golden dataset” creation
  • Experience using AI research tooling for rapid synthesis and communication (e.g., Dovetail, Listen Labs, Maze, Outset, etc.), as well as AI observability tooling like Braintrust
  • Experience using data querying languages (e.g., SQL), scripting languages (e.g., Python), or statistical/mathematical software (e.g., R, SAS, Matlab, etc.)
  • Master’s or PhD in HCI, Psychology, Behavioral Science, Anthropology, Sociology, or a related field
  • You’re familiar with the work of computing heroes like Douglas Engelbart, Alan Kay, Bret Victor, etc., and understand why we’re big fans.

Notion is committed to providing highly competitive cash compensation, equity, and benefits. The compensation offered for this role will be based on multiple factors such as location, the role’s scope and complexity, and the candidate’s experience and expertise, and may vary from the range provided below. For roles based in San Francisco or New York City, the estimated base salary range for this role is $196,000-$230,000 per year.

A Note on AI

You don’t need deep AI expertise for every role, but we do expect every Notino to be intellectually curious, drawn to tinkering and discovery, and excited to use AI as a real collaborator in their work. For some roles, AI fluency is a core requirement; when that’s the case, we’ll say so explicitly in the qualifications. People who thrive here don’t treat AI as a novelty. They use it to think better, and make their work easier for others to build on.

Equal Opportunity & Accommodations

We hire talented people from a wide range of backgrounds. If you’re excited about this role but don’t meet every bullet, we still encourage you to apply. Notion is an equal opportunity employer and does not discriminate on the basis of any legally protected characteristic. Consistent with applicable law, we will consider for employment qualified applicants with arrest and conviction records. Notion provides reasonable accommodations during the application process; if you need one, please let your recruiter know.

Notion is proud to be an equal opportunity employer. We do not discriminate in hiring or any employment decision based on race, color, religion, national origin, age, sex (including pregnancy, childbirth, or related medical conditions), marital status, ancestry, physical or mental disability, genetic information, veteran status, gender identity or expression, sexual orientation, or other applicable legally protected characteristic. Notion considers qualified applicants with criminal histories, consistent with applicable federal, state and local law. Notion is also committed to providing reasonable accommodations for qualified individuals with disabilities and disabled veterans in our job application procedures. If you need assistance or an accommodation due to a disability, please let your recruiter know.

Vacancy posted 1 day ago