Sign up to access all features of our service.
  • Job search
  • Favorites
  • Create a CV
    New
  • Salaries
  • Subscriptions

Remote AI Benchmark & Datasets Engineer

Pathway Genomics Corporation

Palo Alto, CA
  • Remote job

An innovative AI startup is seeking a Benchmark Specialist to design and execute rigorous benchmarks and evaluate datasets for their post-transformer models. This role requires strong experience in ML/LLM evaluation and the ability to communicate technical specifications to both engineers and customers. The position is full-time and offers remote work flexibility. If you are passionate about high-quality data and groundbreaking research, this opportunity could be for you. #J-18808-Ljbffr Pathway Genomics Corporation

Vacancy posted 3 days ago
Similar jobs that could be interesting for youBased on the Remote AI Benchmark & Datasets Engineer in Palo Alto, CA vacancy
  • $70 - $85 per hour

     ...Industrial Technical Visual Reasoning Benchmark, an evaluation dataset designed to assess AI models'' reasoning capabilities over authentic engineering visuals. This role invites...  ...finalization. This is a part-time, remote, hourly position ideal for practicing... 
    Remote job
    Hourly pay
    Part time

    SaidGig

    Remote
    2 days ago
  • $70 - $85 per hour

     ...Industrial Technical Visual Reasoning Benchmark, an evaluation dataset designed to assess AI models'' reasoning capabilities over authentic engineering visuals. As a Process Engineering...  ...publication. This is a part-time, remote, hourly position ideal for practicing... 
    Remote job
    Hourly pay
    Part time

    SaidGig

    Remote
    7 days ago
  •  ...frontier model that solves AI's fundamental memory problem...  ...fastest data processing engine on the market, Pathway enables...  ...and execute rigorous benchmarks and define dataset standards. Collaborating closely...  ...and location. Location : Remote work. Possibility to work or... 
    Remote work
    Permanent employment
    Full time
    Contract work
    Immediate start

    Pathway

    Palo Alto, CA
    3 days ago
  •  ...Langfuse Open Source LLM Engineering Platform that helps teams build useful AI applications via...  ...work). Workplace: Remote-friendly. European roles...  ...evals, prompt management, datasets, metrics, self-hosting,...  ..., guides, cookbooks, benchmarks, demos, webpages, docs,... 
    Remote work
    Work at office

    Langfuse GmbH

    San Francisco, CA
    4 days ago
  •  ...technical Quality Assurance Engineer with strong...  ...data architectures, XBRL datasets, and performance-optimized...  ...proficiency in AI tools and AI-driven workflows...  ...opportunity is 100% remote.  Key Responsibilities...  ...Query performance benchmarking Data freshness and... 
    Remote work

    Anika Systems

    Leesburg, VA
    3 days ago
  • $165k - $205k

     ...Manager, Quality Engineering & AI Validation The Manager of Quality Engineering and AI...  ...hybrid role with the flexibility to work remotely 2 days per week (also eligible for...  ...approach for AI-led products, including benchmark datasets, scoring rubrics, regression... 
    Remote work
    Work at office
    Local area
    2 days per week

    Accordion USA

    Dallas, TX
    4 days ago
  •  ...Machine Learning Test Engineer Location: United...  ...Francisco, Toronto, and remotely. Autodesk is a hybrid-...  ..., metrics, and test datasets Evaluating CAD RL...  ...engineering or QA for ML/AI systems ~ Strong...  ...methods, metrics, and benchmarking Passion for learning... 
    Remote work
    For contractors
    Work at office

    Autodesk

    United States
    2 days ago
  • $70 - $85 per hour

     ...technical talent with leading AI research labs. Headquartered...  ...Francisco, our investors include Benchmark , General Catalyst ,...  ...Dorsey . Position: Process Engineering Expert - Visual Reasoning...  ...$85/hour Location: Remote Role Responsibilities... 
    Remote job
    Contract work
    Summer work

    Mercor

    San Francisco, CA
    4 days ago
  •  ...is seeking a Software Engineer to join the NeuralPLexer...  .... This is a remote position based on the...  ...systems Build and expand benchmarking systems for running models...  ...and affinity datasets, computing metrics, and...  ...novel medicines using its AI-driven discovery and development... 
    Remote work
    Flexible hours

    GrabJobs

    United States
    1 day ago
  • $50 - $175 per hour

    Title: AI Safety and Evaluations Engineer Job Type: Contract Contract Length: 12 Months Pay Range: $50/...  ...175/hr Start Date: ASAP Location: Remote About the Opportunity: Our client...  ...toxicity. Creating automated "Eval" datasets to benchmark new models before they are... 
    Remote work
    Contract work
    Immediate start

    DeWinter Group

    Campbell, CA
    3 days ago
  • $85.4k - $143.2k

     ...are understood across engineering, diagnostics, and service...  ...state-of-the-art AI-powered Embedded Vehicle...  ...diverse global ecosystem of remote users, 3rd-party...  ...s rigorous production benchmarks. Engineering for Personas...  ...and cleaning datasets exceeding 10,000+ records... 
    Remote work
    Local area
    Immediate start
    Flexible hours
    Shift work

    Ford Motor Company

    Dearborn, MI
    5 days ago
  • $157k - $175k

     ...In 2025, we started Handshake AI and built the fastest-growing...  ...create evaluations, publish benchmarks, and push the boundary of data...  ...Work together with engineers, scientists, operators, and more...  ...stipend, ongoing development Remote & Office: Internet, commuting... 
    Remote work
    Full time
    Work at office
    Flexible hours

    Handshake

    San Francisco, CA
    a month ago
  •  ...Machine Learning Engineer Innovation starts from the heart. At Edwards Lifesciences, we...  ...and enhancing technological solutions. AI Factory team focuses in developing and delivering...  ..., processing, and analyzing large-scale datasets. Mentor junior engineers and... 
    Remote work

    Edwards Lifesciences

    United States
    5 days ago
  • $70 - $85 per hour

     ...technical talent with leading AI research labs. Headquartered...  ...Francisco, our investors include Benchmark , General Catalyst ,...  ...Dorsey . Position: Civil Engineering Expert - Visual Reasoning Benchmark...  ...$70–$85/hour Location: Remote Role Responsibilities... 
    Remote work
    Contract work
    Summer work
    Flexible hours

    Mercor

    San Francisco, CA
    3 days ago
  •  ...is looking for an NLP Engineer to design and develop NLP applications and work with various AI/ML projects. Responsibilities include...  ...science prototypes, selecting datasets, and training models. Candidates...  ...offers flexible hours and a remote work option. Ideal candidates... 
    Remote job
    Internship
    Flexible hours

    www.WorkAsAService.ai

    Chicago, IL
    1 day ago
  • $80 - $140 per hour

     ...researchers to author and verify golden reference solutions for the CritPt benchmark. Applicants will solve research-level physics problems, audit expert solutions, and contribute to benchmark datasets. The role offers flexible hours at a pay rate of $80-$140 per hour,... 
    Remote job
    Hourly pay
    10 hours per week
    Flexible hours

    Mercor

    Mountain View, CA
    5 days ago
  • $80k - $89k

     ...distance may be considered on a remote basis. SUMMARY: Avetta...  ..., safety and performance benchmarking in a single, integrated experience...  ...120 countries, Avetta blends AI-driven insights and human...  ...scale with certainty. The QA Engineer II is a fully contributing... 
    Remote work
    Work at office
    Work from home

    Avetta

    United States
    1 day ago
  •  ...Developer Relations Engineering (Events & Community) at Langfuse Langfuse...  ...fits into the way they build AI applications. A lot of that...  ..., evals, prompt management, datasets, metrics, and related LLM...  ...: Quick intro and logistics, remote # Founder Call: Marketing... 
    Remote work
    Part time
    Work at office

    Langfuse

    San Francisco, CA
    1 day ago
  •  ...Electronics Expert - DesignSpark PCB to work remotely. In this role, you will utilize your expertise to train AI systems, focusing on analyzing and...  ...a solid foundation in electronics engineering. Responsibilities include creating datasets, simulating circuits, and... 
    Remote job

    YO IT Consulting

    San Antonio, TX
    3 days ago
  • $187.6k - $257.95k

     ...Senior Manager, Clinical Engineering & Data Analytics This role...  ...clinical engineering outcomes, AI-enabled insights, and best-in...  ...dashboards, and high-fidelity datasets. Strong business acumen...  ...benefits, Solventum regularly benchmarks with other companies that are... 
    Remote work
    H1b
    Relocation package
    Flexible hours

    Solventum

    United States
    3 days ago
  • Alignerr is seeking a Mechanical Engineering Expert to work remotely on cutting-edge AI projects. You will design complex engineering problems and develop solutions for AI learning benchmarks. Ideal candidates hold a Master’s or Ph.D. in related fields, possess strong... 
    Remote job

    Alignerr

    Atlanta, GA
    2 days ago
  • $320k

    Distinguished Engineer - High Performance AI page is loaded## Distinguished Engineer - High Performance AIlocations...  ...: US, CA, Santa Clara: US, GA, Remote: US, TX, Austin: US, TX, Remote: US,...  ...optimizations, evidenced by benchmark wins or published results.* Publications... 
    Remote work

    NVIDIA Corporation

    Austin, TX
    1 day ago
  • $65 - $85 per hour

     ...prominent recruiting agency is seeking a Computational Biologist for a remote role with a pay range of $65 to $85 per hour. In this position, you will design benchmark problems and analyze biological datasets. Applicants must possess an advanced degree in a relevant field... 
    Remote job
    Hourly pay
    Contract work
    10 hours per week
    Flexible hours

    Crossing Hurdles

    New York, NY
    3 days ago
  •  ...Senior AI Technologist Essential Functions: Identify...  ...completeness, and accuracy across AI datasets and workflows. Build and...  ...in computer science, Data Engineering, Information Systems,...  ...equivalent experience REMOTE WORK NOTICE: This position may... 
    Remote work
    Work at office

    ARA Brand

    United States
    2 days ago
  •  ...About Langfuse Open Source LLM Engineering Platform that helps teams build useful AI applications via tracing,...  ...office (how we work). Workplace: Remote-friendly. European roles are remote...  ..., evals, prompt management, datasets, metrics, and related LLM engineering... 
    Remote work
    Part time
    Work at office

    Langfuse GmbH

    San Francisco, CA
    4 days ago
  •  ...Ubuntu Linux Kernel Test Engineer Home Based - APAC; Office Based...  ...: This role will be based remotely in the APAC region, except for...  ...validation Conduct performance benchmarking and regression detection...  ...projects and the platform for AI, IoT and the cloud, we are changing... 
    Remote work
    Work at office
    Work from home

    Canonical

    United States
    4 days ago
  • $125k - $205k

     ...investments. Built on a native, AI-powered platform and more...  ...: As a Senior Security Engineer at Later, you will play a critical...  ...factors. Our salaries are benchmarked against market Total Cash...  ...we are open to hiring fully remote candidates. We post our positions... 
    Remote work
    Permanent employment
    Local area

    Later

    Boston, MA
    3 days ago
  •  ...Quality Assurance Engineer – Av Behavior Simulation Testing Austin...  ...reliable testing processes, and datasets. Our main goal is to detect...  ...offering relocation sponsorship, and remote work options are not available...  ...essential functions of a job, please email ****@*****.***.ai.... 
    Remote work
    Relocation

    Avride

    Austin, TX
    25 days ago
  • $189.6k - $312.73k

    The vLLM and LLM-D Engineering team at Red Hat is looking for a customer...  ...by running performance benchmarks, tuning vLLM parameters, and...  ...troubleshoot complex CNI failures. AI Inference Proficiency : You...  ...or equity. For positions with Remote‑US locations, the actual... 
    Remote work
    Permanent employment
    Full time
    Contract work
    Work experience placement
    Work at office
    Flexible hours

    Red Hat, Inc.

    Albany, NY
    1 day ago
  •  ...you will establish and lead an AI Systems & Performance Lab...  ...and custom accelerators). Benchmarking & Infrastructure Development...  ...small, highly skilled team of engineers and researchers. Drive thought...  ..., Colorado, New York or remote jobs that can be performed in... 
    Remote work
    Temporary work
    Flexible hours
    Shift work

    SanDisk

    Milpitas, CA
    5 days ago

Do you want to receive more vacancies?

Subscribe and receive similar vacancies to Remote AI Benchmark & Datasets Engineer. Be the first to apply!