User Researcher, AI Evaluations

$196k - $230k

Apply

Who We Are Notion is the collaborative AI workspace where teams and agents think together. We're building one place where your knowledge, projects, meetings, and AI tools live side by side, so work is faster, clearer, and less fragmented. Millions of individuals, small teams, and large companies run their work on Notion. Notinos (our employees) are customer zero in bringing this future of work to life. We care about craft, building things that last, and the belief that great work is still fundamentally human. Our goal isn’t to ship the next feature. Each and every team of Notinos is working to set the standard for how humans work together in the AI era. From building a business’s system of record to making and managing AI agents to automating away the busy work, we care deeply about giving our customers more time for their life’s work. About the Role: We’re seeking an experienced UX Researcher to define and scale how we evaluate Notion’s AI-powered experiences—focusing on what “good” looks like not only for model output quality, but for the end-to-end product experience where people discover, set goals, delegate work, review results, and build trust over time with AI. This role sits at the intersection of research craft and evaluation operations: you’ll run studies that uncover user mental models, expectations, and failure/recovery behaviors, then translate those insights into reusable rubrics, workflows, and measurement approaches that product, design, engineering, and data science can apply consistently. This role can be based in either San Francisco or New York City. We work from our offices on Mondays, Tuesdays and Thursdays (our Anchor Days) because we do our best thinking and building together in person. We’re looking for someone who’s excited to work alongside the team during those days. What You’ll Achieve: Define what “good” looks like (frameworks & rubrics): Establish clear, reusable evaluation criteria that reflect real user expectations—helpfulness, trust, tone, control, and transparency. You’ll translate qualitative insight into scoring guidance that can be applied consistently across teams and over time. Run recurring evals (longitudinal & feature-specific): Run recurring longitudinal and feature-specific surveys and studies to measure experience quality over time against defined rubrics. Lead qualitative studies, side-by-side comparisons, and human-in-the-loop evaluation efforts to deepen understanding of where experiences break down and how they can improve. You’ll help teams spot regressions, benchmark improvements, and understand when expectations shift. Anchor evaluation in real workflows (context > isolated feedback): Ensure evals reflect jobs-to-be-done, user intent, and the full interaction journey (goal setting, delegation, review, iteration), not just decontextualized thumbs up/down. You’ll help teams understand who is evaluating, what they’re trying to do, and why outputs succeed or fail. Identify failure modes & recovery behavior (guardrails): Uncover breakdowns, regressions, and edge cases across the system—from model behavior to UI and integrations—and study how people notice issues, correct them, and continue their work. You’ll turn these insights into actionable guidance for guardrails, fixes, and prioritization. Operationalize evaluation with partners (process & tooling): Collaborate closely with Product, Design, Engineering, and Data Science to align on target use cases and build scalable evaluation loops (human-in-the-loop review, longitudinal studies, and calibration of automated/LLM-judge approaches against human judgment). Skills You’ll Need to Bring: Ability to operationalize insight into measurement: You’re comfortable turning “soft” user expectations (trust, tone, usefulness, clarity) into concrete rubrics, scoring guidelines, and observable metrics. AI fluency and systems thinking: You’re curious and hands-on with AI products, and can reason about how model behavior, uncertainty, and system constraints shape user experience. You also have experience evaluating AI-enabled products (LLMs, agents, generative UI/workflow automation) and working with Data Science/ML partners on measurement strategy and evaluation tooling. Clear communication and impact orientation: You can align diverse partners around shared definitions of quality and create artifacts that enable teams to act consistently. You tailor storytelling to different audiences, connect research to business outcomes, and drive follow-through so insights translate into product change. Strong UX research craft (quant + qual): You can choose the right methods for the question— interviews, benchmarking, surveys, experiments—and synthesize into actionable guidance. You also can prioritize ruthlessly, work through ambiguity, and balance scrappy iteration with deep dives when needed. Pragmatism in fast-moving environments: You can prioritize ruthlessly, work through ambiguity, and balance scrappy iteration with deep dives when needed. Experience: 5+ years doing UX research in industry Nice to Haves: Familiarity with LLM-as-judge methods, prompt design for evaluators, or “golden dataset” creation Experience using AI research tooling for rapid synthesis and communication (e.g., Dovetail, Listen Labs, Maze, Outset, etc.), as well as AI observability tooling like Braintrust Experience using data querying languages (e.g., SQL), scripting languages (e.g., Python), or statistical/mathematical software (e.g., R, SAS, Matlab, etc.) Master’s or PhD in HCI, Psychology, Behavioral Science, Anthropology, Sociology, or a related field You’re familiar with the work of computing heroes like Douglas Engelbart, Alan Kay, Bret Victor, etc. — and understand why we’re big fans. Notion is committed to providing highly competitive cash compensation, equity, and benefits. The compensation offered for this role will be based on multiple factors such as location, the role’s scope and complexity, and the candidate’s experience and expertise, and may vary from the range provided below. For roles based in San Francisco or New York City, the estimated base salary range for this role is $196,000-$230,000 per year. A Note on AI You don’t need deep AI expertise for every role, but we do expect every Notino to be intellectually curious, drawn to tinkering and discovery, and excited to use AI as a real collaborator in their work. For some roles, AI fluency is a core requirement — when that’s the case, we’ll say so explicitly in the qualifications. People who thrive here don’t treat AI as a novelty. They use it to think better, and make their work easier for others to build on. Equal Opportunity & Accommodations We hire talented people from a wide range of backgrounds. If you’re excited about this role but don’t meet every bullet, we still encourage you to apply. Notion is an equal opportunity employer and does not discriminate on the basis of any legally protected characteristic. Consistent with applicable law, we will consider for employment qualified applicants with arrest and conviction records. Notion provides reasonable accommodations during the application process; if you need one, please let your recruiter know. Notion is proud to be an equal opportunity employer. We do not discriminate in hiring or any employment decision based on race, color, religion, national origin, age, sex (including pregnancy, childbirth, or related medical conditions), marital status, ancestry, physical or mental disability, genetic information, veteran status, gender identity or expression, sexual orientation, or other applicable legally protected characteristic. Notion considers qualified applicants with criminal histories, consistent with applicable federal, state and local law. Notion is also committed to providing reasonable accommodations for qualified individuals with disabilities and disabled veterans in our job application procedures. If you need assistance or an accommodation due to a disability, please let your recruiter know. #J-18808-Ljbffr Apply

Apply

Vacancy posted 3 days ago

Similar jobs that could be interesting for youBased on the User Researcher, AI Evaluations in New York, NY vacancy

UX Researcher — AI Evaluation & Experience
Notion is looking for an experienced UX Researcher in New York City to define and create evaluation frameworks for AI products. The role involves translating user insights into actionable guidelines and collaborating with cross-functional teams to improve product experiences...
Suggested
Apply
New York, NY
3 days ago
Remote AI UX Researcher — Design Evaluation & Training
$20 per hour
A leading technology firm is seeking a UX Researcher to train and enhance AI models in understanding design principles. This role involves evaluating AI outputs and offering feedback to refine aesthetic and usability qualities. Successful candidates will work remotely on...
Suggested
Remote job
Contract work
Flexible hours
DataAnnotation
New York, NY
2 days ago
Remote UX Researcher: AI Design and UI Evaluation
$20 per hour
A design-focused AI training company is seeking a UX Researcher to evaluate AI outputs and help shape future design tools. This flexible remote position offers $20+ USD/hr for general projects and $40+ USD/hr for design-focused tasks. Ideal candidates should have a strong...
Suggested
Remote job
For contractors
Flexible hours
DataAnnotation
New York, NY
4 days ago
UX Researcher for AI Evaluation & Experience
$196k - $230k
Notion is seeking an experienced UX Researcher based in either San Francisco or New York City. This role focuses on evaluating AI-powered experiences, ensuring quality through defined evaluation criteria and user insights. The successful candidate will work closely with...
Suggested
Notion
New York, NY
4 days ago
Lead UI/UX Designer for AI Training & Evaluation
$25 - $30 per hour
DataAnnotation is seeking a Lead Product Designer to train AI models by evaluating their outputs and providing critique on designs. Your expertise will enhance the next generation of AI tools and ensure high-quality models. This position offers the flexibility to choose...
Suggested
Remote work
DataAnnotation
New York, NY
2 days ago
Lead UI/UX Designer for AI Training & Evaluation
$25 - $40 per hour
DataAnnotation is seeking a Lead Product Designer to help train AI models in Idaho, United States. You will evaluate and critique AI-generated UI/UX designs, ensuring they meet visual and usability standards. Your insights will help shape AI tools to better support designers...
For contractors
Work from home
DataAnnotation
New York, NY
3 days ago
Remote UI Designer for AI Training & Evaluation
$40 per hour
DataAnnotation, located in the United States, is seeking an experienced UI Designer to aid in the training of AI models. This role involves evaluating AI chatbots, providing feedback, and refining design outputs to enhance usability and creativity. The position allows...
Remote job
Work from home
Flexible hours
DataAnnotation
New York, NY
2 days ago
User Researcher
$164k - $190k
Who We Are Notion is the collaborative AI workspace where teams and agents think together... ...the Role: We’re seeking an experienced User Researcher to deliver insights that improve Notion’... .../usability testing, and post‑ship evaluation — with clear recommendations that drive...
Local area
Notion, LLC
New York, NY
1 day ago
Senior User Experience Researcher
$120k - $150k
Role Summary Senior User Experience Researcher - Montefiore’s Experience Design and End‑User Research team... ...Research Lead generative and evaluative research across web, mobile, voice, and... ...research for emerging technologies such as AI, wearables, or voice interfaces....
Monday to Friday
Shift work
Montefiore Medical
New York, NY
2 days ago
Principal Quantitative User Experience Researcher, AI
$224k
...building a more open world. Join us. Principal Quantitative User Experience Researcher, AI (R-106384) Location New York, Austin, Los Angeles, San... ...psychometric instruments, experience metrics, and AI evaluation rubrics — that apply statistical rigor to challenges like...
Flexible hours
Expedia, Inc.
New York, NY
20 hours ago
Remote 3D Slicer Specialist for AI Evaluation
A flexible AI research firm is seeking skilled 3D Slicer users to support their AI research through flexible, hourly contract work. You'll leverage your hands-on experience with medical image analysis to evaluate AI-generated content and provide feedback to enhance AI understanding...
Remote job
Hourly pay
Contract work
Flexible hours
Handshake
New York, NY
2 days ago
Lead UI/UX Designer for AI Training & Evaluation
$40 per hour
DataAnnotation is seeking a Lead Product Designer in Georgia to assist in training AI models by critiquing their designs and enhancing usability. This position allows you to work on your own schedule from home while engaging in general and design-specific AI training projects...
DataAnnotation
New York, NY
2 days ago
Remote AI Psychiatry Researcher — Clinical Scenarios & Evaluation
$130 - $180 per hour
Call For Referral is offering a remote opportunity for experienced Psychiatry professionals to engage in advanced AI research and evaluation projects. In this role, you will apply your clinical expertise to improve the quality and safety of next-generation AI systems by...
Remote job
Hourly pay
Call For Referral
New York, NY
2 days ago
Mobile UI Designer for AI UX Evaluation (Remote)
$20 per hour
A leading AI training company in Georgia seeks a Mobile UI Designer to enhance AI model capabilities in understanding and generating design. Responsibilities include reviewing AI-generated designs and providing critiques, while qualifications include proficiency in UI/...
Remote job
Contract work
Flexible hours
DataAnnotation
New York, NY
1 day ago
UX Researcher
$89.7k - $166.5k
...Overview The Product Design, UX Researcher will bring rigor, curiosity,... ...Design team, connecting user behavior to product strategy through... ...shift roadmaps, and you use AI tools to work faster without... ...generative discovery through evaluative testing of shipped features....
Temporary work
Freelance
Work at office
Local area
Immediate start
Flexible hours
Shift work
Publicis Groupe
New York, NY
4 days ago
Remote UX Designer for AI Design Evaluation
$20 per hour
A tech company specializing in AI is seeking a UX Web Designer for a remote position focusing on enhancing AI's understanding of design. You will assess and critique AI-generated designs, helping refine models to improve aesthetics and usability. Candidates should have...
Remote job
Flexible hours
DataAnnotation
New York, NY
3 days ago
UX Researcher
$80 - $85 per hour
...Immediate need for a talented UX Researcher . This is a 06+ Months... ...strategic design and execution of user-centered research and... ...project metrics and conducting evaluations. Mentor and assist the UX... ...UX associates. Integrate AI-driven research tools to streamline...
Contract work
Work at office
Local area
Immediate start
Pyramid Consulting
New York, NY
3 days ago
Sr. UX Researcher
$150k - $195k
...Job Description The Senior Researcher for Commerce and Growth User Experience will play a pivotal role in defining... ...will lead research validating and evaluating product decisions, as well as the work... ...research methodologies and AI‑enabled tools to streamline and scale...
Work at office
Flexible hours
NBCUniversal
New York, NY
1 day ago
AI Security Researcher: Threat & Vulnerability Evaluator
$40 per hour
A cybersecurity solutions provider is looking for experienced professionals to join their team and help train AI models. The role involves evaluating AI-generated security content and providing technical problem-solving to enhance AI systems. Candidates should have over...
Hourly pay
Remote work
Flexible hours
DataAnnotation
New York, NY
2 days ago
UX Researcher
$89.7k - $166.5k
...Overview The Product Design, UX Researcher will bring rigor, curiosity,... ...Product Design team, connecting user behavior to product strategy... ...and shift roadmaps, and you use AI tools to work faster without... ...generative discovery through evaluative testing of shipped features. Contribute...
Temporary work
Freelance
Work at office
Local area
Immediate start
Flexible hours
Shift work
Publicis Groupe
New York, NY
1 day ago
User Experience Researcher
$55 - $65 per hour
...ours. We are searching for a Part-Time UX Researcher for our faith-based tech client. In this... ...If you’re passionate about understanding user needs and bringing purpose-driven work to... ...insights directly into each article, started with the help of AI. #J-18808-Ljbffr...
Contract work
Part time
Remote work
52 Limited
New York, NY
2 days ago
Senior User Researcher, Insights (Contractor)
$45 - $50 per hour
...Senior User Researcher, Insights (Contractor) Senior User Researcher, Insights (Contractor) This range is provided by Swell Partners. Your... ...community knowledge in a new way. Experts add insights directly into each article, started with the help of AI. #J-18808-Ljbffr...
Full time
Contract work
For contractors
Freelance
Remote work
Swell Partners
New York, NY
4 days ago
Lead UX Researcher
$146.8k - $272.6k
...Lead UX Researcher Role Summary: About the UX Team Come and... ...by focusing on the individual user. We are looking for curious, smart... ...generative research and evaluative studies, shaping how the team... ...in roadmap prioritization. AI & Research Acceleration...
Work at office
Local area
Remote work
Flexible hours
2 days per week
3 days per week
Thomson Reuters
New York, NY
3 days ago
Remote Interaction Designer for AI UI/UX Evaluation
$20 per hour
A leading tech company is seeking an Interaction Designer to enhance AI models. You'll review AI-generated UI/UX designs and provide feedback to improve aesthetic quality. Candidates should have a strong background in design, particularly UI/UX, and fluency in English....
Remote job
Flexible hours
DataAnnotation
New York, NY
20 hours ago
Senior Mixed Methods UX Researcher, Google Lens
$159k - $231k
...years of experience in an applied research setting (e.g., product or... ...Job At Google, we "Focus on the user and all else will follow." As... ...evolution to a more Multimodal, AI-forward experience. You will... ...recommendations. Lead teams to define and evaluate product, service, ecosystem...
Google
New York, NY
2 days ago
Senior User Researcher — AI-Driven Care Journeys
A Place for Mom is seeking a Senior User Researcher to enhance the understanding of family and provider experiences in aging care. This role... ...user research and be proficient in both qualitative methods and AI tool applications. A commitment to improving family care...
A Place for Mom
New York, NY
3 days ago
Sr. User Researcher
$173.48k - $213.96k
Peloton is seeking a Senior User Researcher with a strong foundation in hardware research to support the innovation, iteration, and optimization... ...person user research methodologies. Comfortable incorporating AI tools into research workflows to support efficiency and...
Temporary work
Live in
Work at office
Local area
Remote work
Peloton Interactive
New York, NY
4 days ago
Senior Staff UX Researcher, Google Labs
$236k - $330k
...years of experience in an applied research setting, or similar.... ...conducting product research in the AI, machine learning, gaming, or... ...Job At Google, we "Focus on the user and all else will follow." As... ...recommendations. Lead teams to define and evaluate product, service, and...
Google
New York, NY
2 days ago
Remote User Researcher - AI Model QA & Content R&D
$20 per hour
A leading AI training company in Canada is looking for motivated individuals to help train AI chatbots. This position offers remote... ...conversations, writing high-quality responses to prompts, and conducting research. Applicants should be fluent in English and have a bachelor's...
Remote job
Hourly pay
DataAnnotation
New York, NY
2 days ago
Remote UI/UX Product Designer for AI Training
$20 per hour
A technology-focused AI company is seeking a UI/UX Product Designer to enhance and evaluate AI systems' design capabilities. This remote role allows you to choose projects and work on your own schedule. Ideal candidates should have a strong background in UI/UX design,...
Remote job
DataAnnotation
New York, NY
3 days ago

Do you want to receive more vacancies?

Subscribe and receive similar vacancies to User Researcher, AI Evaluations. Be the first to apply!