Scientific Evals
$160k - $300k
Edison Scientific Inc.
About
Edison Scientific builds and commercializes AI agents for science. Scientific discovery moves too slowly, and autonomous AI agents are how we intend to fix that. We're assembling a team of top researchers and engineers across AI and biology to build an AI scientist.

Role
We are seeking an ambitious, scientifically grounded person to join our team focused on developing rigorous benchmarks and training datasets that advance AI capabilities in biology. This role sits at the intersection of biology, data curation, and machine learning, and is ideal for someone with deep scientific training who is excited to shape how frontier AI systems learn to do science.

This role is on-site at our San Francisco office in the Dogpatch neighborhood. Our office is a converted warehouse with high ceilings, open space, and a team that genuinely believes in what they're building. This position is part of the Evals team.

Responsibilities
- Design benchmarks that capture the complexity of real biological research, drawing on your domain expertise to identify what makes scientific reasoning hard. This includes designing open-ended scientific benchmarks and building on prior work such as LAB-Bench and BixBench.
- Curate and vet biological datasets to ensure scientific rigor.
- Analyze model outputs, identify failure modes, and contribute to iterative improvements in both datasets and evaluation criteria.
- Collaborate with AI/ML researchers to translate scientific intuition into training signal, helping AI systems learn not just facts but how scientists think.
- Coordinate operations and manage workflows, including working with domain experts, tracking task progress, and maintaining documentation.

Qualifications
- Graduate-level training in biology, biochemistry, computational biology, or a related field, with hands-on research experience.
- Working knowledge of machine learning concepts, particularly deep learning and large language models.
- Comfortable with Python and building workflows for data processing, analysis, and experimentation.
- Strong scientific taste and the ability to identify what distinguishes expert-level reasoning from surface-level pattern matching.
- Detail-oriented and willing to take on high-value but occasionally tedious work.
- Energized by ambiguous, open-ended problems that require creativity, collaboration, and first-principles thinking to solve.
- Organized and communicative, able to manage multiple workstreams and coordinate across teams.

Bonus points for
- Prior experience creating evaluation datasets, annotation guidelines, or working on human-in-the-loop data pipelines.
- Experience with bioinformatics pipelines, biological databases, or sequence analysis tools.
- Hands-on experience fine-tuning or evaluating large language models, or familiarity with RLHF and preference-based training.
- Publications or research experience in areas relevant to AI for science.

Salary
$160,000 - $300,000
Offers equity

Why join us?
- Competitive salary and equity
- Full healthcare coverage: we pay 100% of premiums for you and your dependents
- Support for growing families, including a yearly new parent stipend and fertility coverage through Carrot
- 401(k) company matching
- $300 health and wellness benefit
- Lunch is on us every day you're in the office, and dinner is on us when you're working late
- Regular team offsites and company events
- A fast-moving, mission-driven culture where smart people do their best work and actually enjoy doing it

