Sign up to access all features of our service.
  • Job search
  • Favorites
  • Create a CV
    New
  • Salaries
  • Subscriptions

User Researcher, AI Evaluations

$196k - $230k

Hack Chicago

Who We Are Notion is the collaborative AI workspace where teams and agents think together. We’re building one place where your knowledge, projects, meetings, and AI tools live side by side so work is faster, clearer, and less fragmented. Millions of individuals, small teams, and large companies run their work on Notion. Notinos (our employees) are customer zero in bringing this future of work to life. We care about craft, building things that last, and the belief that great work is still fundamentally human. Our goal isn’t to ship the next feature. Each and every team of Notinos is working to set the standard for how humans work together in the AI era. From building a business’s system of record to making and managing AI agents to automating away the busy work, we care deeply about giving our customers more time for their life’s work. About The Role We’re seeking an experienced UX Researcher to define and scale how we evaluate Notion’s AI‑powered experiences—focusing on what “good” looks like not only for model output quality, but for the end‑to‑end product experience where people discover, set goals, delegate work, review results, and build trust over time with AI. This role sits at the intersection of research craft and evaluation operations. You’ll run studies that uncover user mental models, expectations, and failure/recovery behaviors, then translate those insights into reusable rubrics, workflows, and measurement approaches that product, design, engineering, and data science can apply consistently. This role can be based in either San Francisco or New York City. We work from our offices on Mondays, Tuesdays, and Thursdays (our Anchor Days) because we do our best thinking and building together in person. We’re looking for someone who’s excited to work alongside the team during those days. What You’ll Achieve Define what “good” looks like (frameworks & rubrics): Establish clear, reusable evaluation criteria that reflect real user expectations—helpfulness, trust, tone, control, and transparency. Translate qualitative insight into scoring guidance that can be applied consistently across teams and over time. Run recurring evals (longitudinal & feature‑specific): Conduct longitudinal and feature‑specific surveys and studies to measure experience quality over time against defined rubrics. Lead qualitative studies, side‑by‑side comparisons, and human‑in‑the‑loop evaluation efforts to deepen understanding of where experiences break down and how they can improve. Help teams spot regressions, benchmark improvements, and understand when expectations shift. Anchor evaluation in real workflows (context > isolated feedback): Ensure evals reflect jobs‑to‑be‑done, user intent, and the full interaction journey (goal setting, delegation, review, iteration), not just decontextualized thumbs up/down. Help teams understand who is evaluating, what they’re trying to do, and why outputs succeed or fail. Identify failure modes & recovery behavior (guardrails): Uncover breakdowns, regressions, and edge cases across the system—from model behavior to UI and integrations—and study how people notice issues, correct them, and continue their work. Turn these insights into actionable guidance for guardrails, fixes, and prioritization. Operationalize evaluation with partners (process & tooling): Collaborate closely with Product, Design, Engineering, and Data Science to align on target use cases and build scalable evaluation loops (human‑in‑the‑loop review, longitudinal studies, and calibration of automated/LLM‑judge approaches against human judgment). Skills You’ll Need To Bring Ability to operationalize insight into measurement: Comfortable turning “soft” user expectations (trust, tone, usefulness, clarity) into concrete rubrics, scoring guidelines, and observable metrics. AI fluency and systems thinking: Curious and hands‑on with AI products, capable of reasoning about how model behavior, uncertainty, and system constraints shape user experience. Experience evaluating AI‑enabled products (LLMs, agents, generative UI/workflow automation) and working with Data Science/ML partners on measurement strategy and evaluation tooling. Clear communication and impact orientation: Can align diverse partners around shared definitions of quality and create artifacts that enable teams to act consistently. Tailors storytelling to different audiences, connects research to business outcomes, and drives follow‑through so insights translate into product change. Strong UX research craft (quant+qual): Chooses the right methods for the question—interviews, benchmarking, surveys, experiments—and synthesizes into actionable guidance. Prioritizes ruthlessly, works through ambiguity, and balances scrappy iteration with deep dives when needed. Pragmatism in fast‑moving environments: Prioritizes ruthlessly, works through ambiguity, and balances scrappy iteration with deep dives when needed. Experience: 5+ years doing UX research in industry. Nice To Haves Familiarity with LLM‑as‑judge methods, prompt design for evaluators, or “golden dataset” creation. Experience using AI research tooling for rapid synthesis and communication (e.g., Dovetail, Listen Labs, Maze, Outset, etc.) and AI observability tooling like Braintrust. Experience using data querying languages (e.g., SQL), scripting languages (e.g., Python), or statistical/mathematical software (e.g., R, SAS, Matlab, etc.). Master’s or PhD in HCI, Psychology, Behavioral Science, Anthropology, Sociology, or a related field. Familiarity with the work of computing heroes like Douglas Engelbart, Alan Kay, Bret Victor, etc. — and understanding why we’re big fans. Notion is committed to providing highly competitive cash compensation, equity, and benefits. The compensation offered for this role will be based on multiple factors such as location, the role’s scope and complexity, and the candidate’s experience and expertise, and may vary from the range provided below. For roles based in San Francisco or New York City, the estimated base salary range for this role is $196,000–$230,000 per year. By clicking “Submit Application”, I understand and agree that Notion and its affiliates and subsidiaries will collect and process my information in accordance with Notion’s Global Recruiting Privacy Policy and NYLL 144. Equal Opportunity & Accommodations We hire talented people from a wide range of backgrounds. If you’re excited about this role but don’t meet every bullet, we still encourage you to apply. Notion is an equal opportunity employer and does not discriminate on the basis of any legally protected characteristic. Consistent with applicable law, we will consider for employment qualified applicants with arrest and conviction records. Notion provides reasonable accommodations during the application process; if you need one, please let your recruiter know. Notion is proud to be an equal opportunity employer. We do not discriminate in hiring or any employment decision based on race, color, religion, national origin, age, sex (including pregnancy, childbirth, or related medical conditions), marital status, ancestry, physical or mental disability, genetic information, veteran status, gender identity or expression, sexual orientation, or other applicable legally protected characteristic. Notion considers qualified applicants with criminal histories, consistent with applicable federal, state and local law. Notion is also committed to providing reasonable accommodations for qualified individuals with disabilities and disabled veterans in our job application procedures. If you need assistance or an accommodation due to a disability, please let your recruiter know. #J-18808-Ljbffr Hack Chicago

Vacancy posted 2 days ago
Similar jobs that could be interesting for youBased on the User Researcher, AI Evaluations in San Francisco, CA vacancy
  • Solinius is searching for an experienced UX Researcher to define how to evaluate AI-powered experiences. This role involves running studies to uncover user mental models and translating insights into reusable rubrics for product and design teams. The role is hybrid, based... 
    Suggested

    Solinius

    San Francisco, CA
    5 days ago
  • Notion is hiring a UX Researcher based in San Francisco to evaluate AI-powered experiences and define user expectations. The role involves conducting studies to improve product interactions and working closely with various teams to create evaluation frameworks. The ideal... 
    Suggested

    Hack Chicago

    San Francisco, CA
    1 day ago
  • $196k - $230k

     ...studio, based in San Francisco, is seeking an experienced UX Researcher to evaluate and scale AI-powered experiences. You will run studies to establish...  ...evaluations and ensuring evaluations represent real user interactions. The ideal candidate should have 5+ years in... 
    Suggested

    mmmanyfold dev studio

    San Francisco, CA
    5 days ago
  • $164k - $190k

     ...Who We Are Notion is the collaborative AI workspace where teams and agents think together...  ...the Role: We’re seeking an experienced User Researcher to deliver insights that improve Notion’...  .../usability testing, and post‑ship evaluation — with clear recommendations that drive... 
    Suggested
    Local area

    Verana Health

    San Francisco, CA
    1 day ago
  • $120k - $135k

     ...future of CliftonStrengths through research that anchors what we build and why. As a senior user researcher at Gallup, you’ll...  ...decision‑making. Lead generative and evaluative research using a range of...  ...impact. Thoughtfully incorporate AI tools into research workflows, such... 
    Suggested
    Work at office
    3 days per week

    Gallup

    San Francisco, CA
    3 days ago
  • $164k - $190k

    Who We Are Notion is the collaborative AI workspace where teams and agents think together...  ...About the Role: We’re seeking a Rolling User Researcher (Rolling UXR) to deliver fast, high‑...  ...with AI. Concept testing: Help teams evaluate early ideas (storyboards, low/high‑fidelity... 
    Local area

    Monograph

    San Francisco, CA
    1 day ago
  • $185k - $210k

    Senior User Researcher, Enterprise & Platform About Glean: Glean is the Work AI platform that helps everyone work smarter with AI. What began as the industry’s most...  ...the product lifecycle—foundational, generative, evaluative, and post‑launch. Help shape critical product... 
    Work at office
    Home office
    Flexible hours

    Glean Technologies, Inc.

    San Francisco, CA
    5 days ago
  •  ...Grammarly is now part of Superhuman, the AI productivity platform on a mission to...  ...measurable ROI. We’re seeking a Staff User Researcher to help define the future of Go: how it...  ...This Role, You Will Lead exploratory and evaluative research to shape the strategy and roadmap... 
    Worldwide
    Home office
    Flexible hours

    Superhuman

    San Francisco, CA
    5 days ago
  • Drata is seeking an Applied AI Engineer in San Francisco, California to drive the effectiveness of AI systems through rigorous research and experimentation. You will optimize retrieval strategies and build evaluation frameworks, ensuring our AI delivers accurate results... 
    Flexible hours

    Drata

    San Francisco, CA
    4 days ago
  •  ...was not designed for them. The research questions here are not...  ...‑level stakeholders Leverage AI tools and workflows to extend...  ...years of experience conducting user research, with demonstrated experience...  ...statistical methods used to evaluate research data (significance testing... 
    Local area
    Remote work

    Hinge Health

    San Francisco, CA
    1 day ago
  • Gallup is seeking a Senior User Researcher to drive evidence-based product decisions across the CliftonStrengths platform. The role involves...  ...experience, strong communication skills, and the ability to leverage AI tools to enhance workflow. The position requires on-site work... 
    3 days per week

    Gallup

    San Francisco, CA
    3 days ago
  • Monograph is looking for a Rolling User Researcher to conduct fast, insightful research that enhances product experiences. You will manage usability...  ...years of UX research experience and a strong familiarity with AI products. This role is based in San Francisco and requires in-... 
    Work at office

    Monograph

    San Francisco, CA
    1 day ago
  • Superhuman is seeking a Staff User Researcher to shape the future of its AI assistant platform, Go. This role, based in San Francisco, involves leading user research to enhance AI-driven experiences for knowledge workers. Ideally, candidates should have 8+ years of experience... 

    Superhuman

    San Francisco, CA
    5 days ago
  • $164.74k - $237.95k

     ...full-scale UX design efforts to include research, blueprinting, and evaluating existing systems. -Manage the...  ...paced environment. -Lead all phases of user research and analysis needed to inform...  ...We may use artificial intelligence (AI) tools to support parts of the hiring... 
    Full time
    Work at office
    Flexible hours

    Contact Government Services, LLC

    San Francisco, CA
    1 day ago
  • $159k - $231k

     ...years of experience in an applied research setting (e.g., product or...  ...qualitative and quantitative user research, user experience measurement...  ...with the unique challenges of AI UX, including mental models of...  ...goals. Lead foundational and evaluative research studies to understand... 

    Google Inc.

    San Francisco, CA
    3 days ago
  • $50 - $55 per hour

     ...create consistent and scalable user experiences. Collaborate...  ...closely with product designers, UX researchers, and engineers to ensure the...  ...of Artificial Intelligence (AI): We may use Artificial...  ...including sourcing, screening, and evaluating candidates. AI helps assess... 
    Contract work
    Temporary work

    TEKsystems

    San Francisco, CA
    5 days ago
  • $50 - $125 per hour

     ...creative and technical talent with leading AI research labs. Headquartered in San Francisco,...  ...Commitment 15+ hours/week Role Responsibilities Evaluate product design decisions using...  ...Sketch, or Adobe tools. Provide structured User Experience/User Journey feedback to... 
    Contract work
    Summer work
    Remote work

    Mercor Inc

    San Francisco, CA
    21 hours ago
  •  ...About the role We’re looking for a User Researcher to join our growing research function. This person will help us better understand our...  ...Accelerate product learning: Help teams quickly test hypotheses, evaluate ideas, and gather feedback on features and experiences.... 

    The Consulting Solutions

    San Francisco, CA
    22 hours ago
  • $50 - $125 per hour

     ...experience at a prestigious tech firm. The selected candidate will evaluate and enhance product design decisions, develop mockups using Figma, Sketch, or Adobe tools, and collaborate with AI research teams. Candidates must be based in the US, UK, or Canada and are expected... 
    Remote job
    Contract work

    Mercor

    San Francisco, CA
    5 days ago
  • $188k - $275k

     ...Experience using Generative AI for creative concepting and asset...  ...vital premise: "Focus on the user and all else will follow." Google...  ...teams of UX Designers, Researchers, Writers, Content Strategists,...  ...Partner with User Research to evaluate the emotional resonance and usability... 
    Temporary work
    Remote work

    Google

    San Francisco, CA
    4 days ago
  • Join Hinge Health as a Senior UX Researcher in San Francisco, focusing on digital care for musculoskeletal conditions. You will lead user research efforts across product pods and collaborate...  ...significantly while leveraging innovative AI tools. #J-18808-Ljbffr Hinge Health

    Hinge Health

    San Francisco, CA
    5 days ago
  • $1,000 per month

     ...tools and services that provide instant access to cash and help users build credit. Our goal is to empower every customer to achieve...  ...that improve people’s lives. We are looking for a Senior User Researcher to lead how we understand our customers, their needs, and opportunities... 
    Immediate start
    Flexible hours

    Credit Genie

    San Francisco, CA
    2 days ago
  • $164k - $190k

    Verana Health is looking for an experienced User Researcher to enhance AI-powered experiences. The role involves conducting end-to-end research to provide insights that will drive product decisions. Candidates should have at least 3 years of UX research experience and be... 

    Verana Health

    San Francisco, CA
    3 days ago
  • $159k - $231k

    Google Inc. is seeking a User Experience Researcher (UXR) in San Francisco, California to join their UX team. In this role, you'll conduct primary...  ...help streamline complex workflows and engage in cutting-edge AI projects. The role offers a salary range of $159,000 to $231... 

    Google Inc.

    San Francisco, CA
    3 days ago
  • Sunny Health AI Inc. is seeking a UX Researcher to join their San Francisco team. In this role, you will uncover deep user insights and balance user needs with healthcare regulations. You'll be responsible for planning and conducting research studies, synthesizing findings... 

    Sunny Health AI Inc.

    San Francisco, CA
    4 days ago
  • Notion is looking for an experienced User Researcher to enhance AI experiences within their products. This position focuses on research that guides improvements in how users engage with AI features. Collaborative partnerships with design and engineering teams form a crucial... 

    Apply

    San Francisco, CA
    2 days ago
  • UX Researcher — On-site (San Francisco, CA) Job Description We’re looking for a UX Researcher to uncover deep user insights and ensure our products are clear, respectful, and trustworthy....  ...benefits and professional growth opportunities. #J-18808-Ljbffr Sunny Health AI Inc.

    Sunny Health AI Inc.

    San Francisco, CA
    4 days ago
  •  ...Product UI/UX Designer OpenArt is an AI-native visual storytelling platform used by millions...  ..., intuitive, and expressive for real users. Our work sits at the intersection of creativity...  ...and product-oriented, spanning user research, UX architecture, interaction design, and... 
    Remote work
    Worldwide
    Visa sponsorship

    Embedding VC

    San Francisco, CA
    2 days ago
  •  ...in global workforce solutions, specializing in Cloud Technology, AI/ML, Software Development, Technical Writing, and Digital Transformation...  ...collaborating with business stakeholders to deliver intuitive, user-friendly, visually appealing interfaces and conducting demos to... 

    Trilyon, Inc.

    San Francisco, CA
    2 days ago
  • $135k - $198k

     ...teams, and cross‑functional stakeholders to create user‑centered website experiences, evolve our design...  ...user experiences Passionate about leveraging AI‑powered tools and workflows to improve design speed, quality, research, ideation, and scalability Strong understanding... 
    Permanent employment
    Temporary work
    Casual work
    Work at office
    Flexible hours

    CoreWeave

    San Francisco, CA
    3 days ago

Do you want to receive more vacancies?

Subscribe and receive similar vacancies to User Researcher, AI Evaluations. Be the first to apply!