Research Program Manager - Model Evals and Safety

Reflection

Our Mission

Reflection’s mission is to build open superintelligence and make it accessible to all. We’re developing open weight models for individuals, agents, enterprises, and even nation states. Our team of AI researchers and company builders comes from DeepMind, OpenAI, Google Brain, Meta, Character.AI, Anthropic and beyond.

About The Role

Research Program Managers at Reflection are high‑leverage leaders and operators who embed directly with research and infrastructure teams to accelerate the pace of frontier model development. They are not project trackers. They are force multipliers who bring clarity to ambiguity, drive decisions when the path forward is unclear, and ensure that the work happening across multiple teams connects into a coherent whole.

This is a foundational role. Reflection is building model evals and safety from the ground up, and this RPM will be at the center of that effort. You won’t be stepping into an established function with existing processes and tooling. You will be the person who figures out what this function needs to look like, stands it up, and makes it real. That means defining the evaluation frameworks, building the operational infrastructure for model safety, establishing the processes that connect evals to the model development lifecycle, and laying the groundwork for how Reflection interfaces with the broader safety ecosystem. This is 0‑to‑1 work in its purest form.

You bring a first‑responder mentality. When things go sideways, you don’t wait to be asked. You jump in, assess the situation, cut through noise, align the people who need to be aligned, and drive resolution.

What You’ll Do

  • Build the foundational infrastructure for model evals and safety at Reflection. Define the evaluation frameworks, tooling requirements, and operational processes that will underpin how we assess model capabilities, risks, and readiness for release.
  • Stand up model safety operations as a function, including establishing the workflows, review cadences, and decision frameworks that connect safety evaluation to the model development and release lifecycle.
  • Partner with research and engineering leads across pre‑training, mid‑training, and post‑training to embed safety and evaluation checkpoints into the development process in a way that is rigorous without being a bottleneck.
  • Drive the scoping and prioritization of eval science and eval infrastructure investments, working with technical leads to determine what to build in‑house, what to adopt, and where to invest research effort.
  • Establish Reflection’s engagement with the external safety ecosystem, including third‑party assessments, academic partnerships, and industry safety frameworks. Represent the company’s safety posture to external stakeholders with credibility and clarity.
  • Create visibility and reporting structures that give leadership a clear, honest picture of model safety status, evaluation coverage, and open risks, so they can make informed decisions at the pace the business requires.
  • Champion a culture of blameless post‑mortems and continuous learning, turning every safety‑relevant finding into a concrete improvement to our systems and processes.

About You

  • 7+ years of experience in technical program management, research operations, or ML engineering, with demonstrated experience standing up new functions, teams, or programs from scratch.
  • Familiar with the landscape of model evaluation and AI safety, including evaluation methodologies, red‑teaming, alignment research, and the evolving regulatory and industry safety ecosystem.
  • Deep enough technically to engage with researchers and engineers on topics like model behavior, evaluation design, data pipelines, and safety‑critical system architecture.
  • Proven ability to build structure where none exists. You’ve taken ambiguous mandates and turned them into functioning programs with clear ownership, measurable outcomes, and durable processes.
  • Strong stakeholder management skills spanning deeply technical ICs, research leadership, and external partners. You build trust through competence and follow‑through.
  • Excited to build from zero to one. We are a small, fast‑moving team and this role will help define how model safety and evaluation work at Reflection.
  • Motivated by enabling researchers and engineers to build the world’s most capable open‑weight AI systems, responsibly.

What We Offer

We believe that to build superintelligence that is truly open, you need to start at the foundation. Joining Reflection means building from the ground up as part of a small talent‑dense team. You will help define our future as a company, and help define the frontier of open foundational models.

We want you to do the most impactful work of your career with the confidence that you and the people you care about most are supported.

  • Top‑tier compensation: Salary and equity structured to recognize and retain the best talent globally.
  • Health & wellness: Comprehensive medical, dental, vision, life, and disability insurance.
  • Life & family: Fully paid parental leave for all new parents, including adoptive and surrogate journeys. Financial support for family planning.
  • Benefits & balance: Paid time off when you need it, relocation support, and more perks that optimize your time.
  • Opportunities to connect with teammates: Lunch and dinner are provided daily. We have regular off‑sites and team celebrations.

Vacancy posted 1 day ago
Similar jobs that could be interesting for you
Based on the Research Program Manager - Model Evals and Safety in New York, NY vacancy
  • $290k - $365k

     ...growing group of committed researchers, engineers, policy experts,...  ...organization works across the full model development lifecycle — from...  ..., interpretability, and safety — each operating at the...  ...development. As a Technical Program Manager for Research, you'll define... 
    Suggested
    Work at office
    Visa sponsorship
    Flexible hours
    Shift work

    Menlo Ventures

    New York, NY
    3 days ago
  • $84.58k - $109.2k

    Position Summary We have an exciting opportunity to join our team as a Research Program Manager. In this role, the successful candidate will provide a broad range of complex program coordination, develop and promote assigned programs, design, budget, staff, logistics and... 
    Suggested
    Work at office

    NYU Langone Hospitals

    New York, NY
    2 days ago
  •  ...and Engagement (COE) ensures that MCC’s research, programs, and collaborations are responsive to...  ...prevention and control. The Senior COE Manager leads the operational execution of this...  ...based approaches such as the Kirkpatrick model, RE-AIM model, and other evaluative... 
    Suggested
    Traineeship
    Work at office
    Local area
    Flexible hours
    Afternoon shift

    Weill Cornell Medicine - Qatar

    New York, NY
    4 days ago
  • $93k

    New York University is seeking a Research Program Manager for the ACAD study focusing on older adults of Asian ancestry. The successful candidate will coordinate research operations, engage with stakeholders, and manage data collection. This full-time role requires a U... 
    Suggested
    Full time

    New York University

    New York, NY
    23 hours ago
  • $84.58k - $122.69k

     ...improving the human condition through medical education, scientific research, and direct patient care. At NYU Langone Health, equity and...  ...: We have an exciting opportunity to join our team as a Program Manager. In this role, the successful candidate is responsible for managing... 
    Suggested
    Contract work
    For contractors
    Work at office
    Remote work

    NYU Langone Health

    New York, NY
    1 day ago
  • Thank you for considering a career with the Research Foundation of The City University of New York (RFCUNY)! We are thrilled that you...  ...Unit: No. We invite applications for a Research Scientist Program Manager (Part-Time, Postdoctoral Level) under the supervision of Dr.... 
    Part time
    Local area
    Remote work

    Research Foundation Cuny

    New York, NY
    1 day ago
  • $100k - $160k

    A nonprofit applied AI research organization is looking for a Technical Program Manager to drive ambitious research projects in New York. The role involves managing technical project roadmaps, facilitating team meetings, and ensuring project alignment with strategic goals... 
    Full time

    Basis Research Institute

    New York, NY
    2 days ago
  •  ...and deploying frontier models for developers and...  ...Cohere is a team of researchers, engineers, designers,...  ...Overview We are seeking a Safety Research PM to bridge...  ...5+ years of product management or research operations...  ...you don't need to run evals yourself, but you need... 
    Full time
    Work at office
    Remote work
    Flexible hours

    Cohere

    New York, NY
    23 hours ago
  • $111k - $150.7k

     ...and lives better. As a member of the Network Innovations team, our Network Innovations Program Manager, Primary Care Programs leads and oversees the full lifecycle of innovation models and programs — from inception through sustainment — ensuring deliverables, KPIs, and... 
    Immediate start
    Work from home
    Flexible hours

    Cambia Health Solutions

    New York, NY
    23 hours ago
  • $101.72k - $120k

     ...Overview The Program Manager will be responsible for leadership, development, implementation,...  ...work environments. Sustainability and safety are foundational to our culture and influence...  ...Vendor Analysis, Process and Policy Research, etc.). Work with internal and external... 
    Temporary work
    For contractors
    Work experience placement
    Local area
    Remote work
    Flexible hours

    Casey Industrial

    New York, NY
    1 day ago
  • $72k

     ...solutions that help businesses maintain top standards in quality, safety, and compliance. As a family-owned company, we believe in...  ..., we’d love to have you join us! Job Summary: The VMI Program Operations Manager is responsible for leading the daily execution, long-term... 
    Full time
    Work at office
    Remote work

    Recruiting - Original Template

    New York, NY
    2 days ago
  • $130k - $250k

    Goldman Sachs's Model Risk Management (MRM) team plays a critical role in ensuring the safety and soundness of the firm's models. MRM...  ...You will work closely with AI researchers, AI model developers, and...  ...related quantitative field. Programming expertise in Python, including... 
    Full time
    Temporary work
    Part time

    Goldman Sachs Group, Inc.

    New York, NY
    3 days ago
  • $204k - $259k

    Senior Research Scientist, Foundation Model (LLM/VLM) Waymo Position type: Full‑time Location...  ...to improve mobility and safety. The Applied Research team...  ...to a Technical Lead Manager. Responsibilities Conduct...  ...generous company benefits program. #J-18808-Ljbffr SupportFinity... 
    Full time

    SupportFinity™

    New York, NY
    2 days ago
  •  ...Icahn School of Medicine at Mount Sinai is seeking a Clinical Research Nurse Manager to lead a growing portfolio of cellular therapy research....  ...malignant and malignant CAR‑T studies, ensuring participant safety and adherence to protocols. The ideal candidate will have 5... 

    Icahn School of Medicine at Mount Sinai

    New York, NY
    2 days ago
  •  ...Job Title : Program Manager Location : Kings County Hospital Center - 451 Clarkson Ave, Brooklyn, NY 11203 Duration : 2+ Months Contract...  ...-functional teams including Facilities, IT, Supply Chain, Safety, and external vendors to support the successful launch and ongoing... 
    Contract work
    Part time
    Shift work

    ProKatchers LLC

    Brooklyn, NY
    1 hour ago
  • $88.28k - $103.22k

     ..., helping to restore health, safety, education, economic wellbeing...  ...testing, and overall Project Management in an effort to help the...  ...help build and deliver. As IT Program Manager, you'll manage the digital...  ...team is working in a hybrid model with one day per week in the... 
    Full time
    Part time
    Work at office
    Immediate start
    Remote work
    1 day per week

    International Rescue Committee

    New York, NY
    23 hours ago
  • A leading AI research firm in New York, NY, is looking for a specialist to oversee the red-...  ...adversarial evaluation pipeline for their models. The ideal candidate should possess a...  ...field and have a deep understanding of LLM safety and adversarial techniques. A strong software... 

    Reflection AI

    New York, NY
    3 days ago
  • $54k - $68k

     ...the workplace to anyplace. The National Safety Council is America's leading nonprofit...  ...injuries and deaths. Through leadership, research, education, and advocacy, NSC focuses...  ...occur. We are currently looking for a Program Manager - Our Driving Concern (FL) to join us in... 
    Remote work

    National Safety Council

    New York, NY
    1 day ago
  •  ...pursuing society’s most critical safety and justice issues with our...  ...that provides GPT‑class models, chat assistants, and secure...  ...build, and deliver training programs (workshops, office hours, lunch...  ...covering topics like secrets management, data classification, prompt... 
    Work at office

    Axon

    New York, NY
    23 hours ago
  • Global Paint Program Manager. Locations: McConnellsburg, Pennsylvania, United States...  ...John L. Grove set out to resolve growing safety concerns in the construction industry....  ...world. As the Global Paint Program Manager at Oshkosh Corporation, you will serve... 
    Permanent employment

    Oshkosh Corporation, Inc.

    New York, NY
    23 hours ago
  • $130k - $150k

     ...innovateteam.com . Secure Information: We prioritize your data safety. We never request bank information via email or text. Instead...  ...differs from this process, it’s not from Innovate. GIS Program Manager Location: Remote Apply for this Position Position: GIS Program... 
    Full time
    Contract work
    Local area
    Remote work

    Innovate!, Inc.

    New York, NY
    1 day ago
  • $70k - $75k

     ...offering more than 160 integrated programs in: Education & Youth...  ...BIVO is part of the NYC Crisis Management System (CMS). CMS is the city...  ...implement the Cure Violence Model. The Cure Violence model is a...  ...helping organize Brownsville Safety Alliance resource fairs with... 
    Full time
    Contract work
    Work at office
    Local area

    CAMBA

    New York, NY
    1 day ago
  • Hackensack Meridian Health Inc. seeks a Manager, Assistant Member CDI in Nutley, NJ. This role involves conducting scientific research and improving patient outcomes through innovative...  ...publications. Responsibilities include program development, managing staff, and engaging... 

    Hackensack Meridian Health Inc.

    Nutley, NJ
    3 days ago
  • New York State Psychiatry Institute is hiring a full-time Program Manager to support the Columbia University Department of Psychiatry. This...  ...within the PMASS and CogPsy programs, focusing on clinical-research activities, organizing seminars, and data analysis. Candidates... 
    Full time

    New York State Psychiatry Institute

    New York, NY
    2 days ago
  • The University of Cincinnati is seeking a Program Manager to enhance support for graduate research. This role includes overseeing various programs, facilitating communication, and organizing events such as the Three Minute Thesis competition. The ideal candidate will have... 

    University of Cincinnati

    New York, NY
    1 day ago
  • $85k - $90k

     ...the Future of Clinical Research Adams Clinical is a leading...  ...data quality, safety, and diverse enrollment...  ...with extensive experience managing multiple complex or high...  ...quality study execution. Model best practices in...  ...insurance Employee assistance program Flexible spending... 
    Local area
    Flexible hours

    Adams Clinical

    New York, NY
    2 days ago
  • $75k - $80k

     ...Brooklyn Clinical Research was founded to increase access to clinical trials for underrepresented...  ...help improve therapeutic efficacy and safety for all individuals. Position Overview:...  ...Coordinator (“CRC”) monitors data management and regulatory aspects of clinical-trial... 
    Immediate start

    Brooklyn Clinical Research

    New York, NY
    4 days ago
  •  ...Deepgram's voice-native foundation models are accessed through cloud...  ...today's reality. While our research scientists pioneer new...  ...we need exceptional technical program leadership to turn these breakthroughs...  ..., precise dependency management, and relentless focus on delivering... 
    Home office
    Flexible hours
    Shift work

    Remote Jobs

    New York, NY
    1 day ago
  •  ...our associates, please click here. Overview The Clinical Research Program Manager, under the supervision of the Director of Quality Assurance...  ...Procedures (SOPs), the Montefiore Einstein Cancer Center Data and Safety Monitoring Plan (DSMP), and any other applicable regulatory... 
    Work experience placement
    Work at office

    ViziRecruiter,LLC.

    New York, NY
    1 day ago
  • $84k - $105k

    Program Manager Clinical Research. Locations: 111 East 210th Street. Time type: Full time. Posted...  ...$84,000.00-$105,000.00. The Clinical Research Program Manager, under the...  ...Montefiore Einstein Cancer Center Data and Safety Monitoring Plan (DSMP), and any other applicable... 
    Work experience placement
    Work at office
    Shift work

    Montefiore Hudson Valley Collaborative

    New York, NY
    1 day ago
