Remote AI Benchmark & Datasets Engineer
Pathway Genomics Corporation
- Remote job
An innovative AI startup is seeking a Benchmark Specialist to design and execute rigorous benchmarks and evaluate datasets for their post-transformer models. This role requires strong experience in ML/LLM evaluation and the ability to communicate technical specifications to both engineers and customers. The position is full-time and offers remote work flexibility. If you are passionate about high-quality data and groundbreaking research, this opportunity could be for you. #J-18808-Ljbffr Pathway Genomics Corporation
$70 - $85 per hour
...Industrial Technical Visual Reasoning Benchmark, an evaluation dataset designed to assess AI models'' reasoning capabilities over authentic engineering visuals. This role invites... ...finalization. This is a part-time, remote, hourly position ideal for practicing...Remote jobHourly payPart time$70 - $85 per hour
...Industrial Technical Visual Reasoning Benchmark, an evaluation dataset designed to assess AI models'' reasoning capabilities over authentic engineering visuals. As a Process Engineering... ...publication. This is a part-time, remote, hourly position ideal for practicing...Remote jobHourly payPart time- ...frontier model that solves AI's fundamental memory problem... ...fastest data processing engine on the market, Pathway enables... ...and execute rigorous benchmarks and define dataset standards. Collaborating closely... ...and location. Location : Remote work. Possibility to work or...Remote workPermanent employmentFull timeContract workImmediate start
- ...Langfuse Open Source LLM Engineering Platform that helps teams build useful AI applications via... ...work). Workplace: Remote-friendly. European roles... ...evals, prompt management, datasets, metrics, self-hosting,... ..., guides, cookbooks, benchmarks, demos, webpages, docs,...Remote workWork at office
- ...technical Quality Assurance Engineer with strong... ...data architectures, XBRL datasets, and performance-optimized... ...proficiency in AI tools and AI-driven workflows... ...opportunity is 100% remote. Key Responsibilities... ...Query performance benchmarking Data freshness and...Remote work
$165k - $205k
...Manager, Quality Engineering & AI Validation The Manager of Quality Engineering and AI... ...hybrid role with the flexibility to work remotely 2 days per week (also eligible for... ...approach for AI-led products, including benchmark datasets, scoring rubrics, regression...Remote workWork at officeLocal area2 days per week- ...Machine Learning Test Engineer Location: United... ...Francisco, Toronto, and remotely. Autodesk is a hybrid-... ..., metrics, and test datasets Evaluating CAD RL... ...engineering or QA for ML/AI systems ~ Strong... ...methods, metrics, and benchmarking Passion for learning...Remote workFor contractorsWork at office
$70 - $85 per hour
...technical talent with leading AI research labs. Headquartered... ...Francisco, our investors include Benchmark , General Catalyst ,... ...Dorsey . Position: Process Engineering Expert - Visual Reasoning... ...$85/hour Location: Remote Role Responsibilities...Remote jobContract workSummer work- ...is seeking a Software Engineer to join the NeuralPLexer... .... This is a remote position based on the... ...systems Build and expand benchmarking systems for running models... ...and affinity datasets, computing metrics, and... ...novel medicines using its AI-driven discovery and development...Remote workFlexible hours
$50 - $175 per hour
Title: AI Safety and Evaluations Engineer Job Type: Contract Contract Length: 12 Months Pay Range: $50/... ...175/hr Start Date: ASAP Location: Remote About the Opportunity: Our client... ...toxicity. Creating automated "Eval" datasets to benchmark new models before they are...Remote workContract workImmediate start$85.4k - $143.2k
...are understood across engineering, diagnostics, and service... ...state-of-the-art AI-powered Embedded Vehicle... ...diverse global ecosystem of remote users, 3rd-party... ...s rigorous production benchmarks. Engineering for Personas... ...and cleaning datasets exceeding 10,000+ records...Remote workLocal areaImmediate startFlexible hoursShift work$157k - $175k
...In 2025, we started Handshake AI and built the fastest-growing... ...create evaluations, publish benchmarks, and push the boundary of data... ...Work together with engineers, scientists, operators, and more... ...stipend, ongoing development Remote & Office: Internet, commuting...Remote workFull timeWork at officeFlexible hours- ...Machine Learning Engineer Innovation starts from the heart. At Edwards Lifesciences, we... ...and enhancing technological solutions. AI Factory team focuses in developing and delivering... ..., processing, and analyzing large-scale datasets. Mentor junior engineers and...Remote work
$70 - $85 per hour
...technical talent with leading AI research labs. Headquartered... ...Francisco, our investors include Benchmark , General Catalyst ,... ...Dorsey . Position: Civil Engineering Expert - Visual Reasoning Benchmark... ...$70–$85/hour Location: Remote Role Responsibilities...Remote workContract workSummer workFlexible hours- ...is looking for an NLP Engineer to design and develop NLP applications and work with various AI/ML projects. Responsibilities include... ...science prototypes, selecting datasets, and training models. Candidates... ...offers flexible hours and a remote work option. Ideal candidates...Remote jobInternshipFlexible hours
$80 - $140 per hour
...researchers to author and verify golden reference solutions for the CritPt benchmark. Applicants will solve research-level physics problems, audit expert solutions, and contribute to benchmark datasets. The role offers flexible hours at a pay rate of $80-$140 per hour,...Remote jobHourly pay10 hours per weekFlexible hours$80k - $89k
...distance may be considered on a remote basis. SUMMARY: Avetta... ..., safety and performance benchmarking in a single, integrated experience... ...120 countries, Avetta blends AI-driven insights and human... ...scale with certainty. The QA Engineer II is a fully contributing...Remote workWork at officeWork from home- ...Developer Relations Engineering (Events & Community) at Langfuse Langfuse... ...fits into the way they build AI applications. A lot of that... ..., evals, prompt management, datasets, metrics, and related LLM... ...: Quick intro and logistics, remote # Founder Call: Marketing...Remote workPart timeWork at office
- ...Electronics Expert - DesignSpark PCB to work remotely. In this role, you will utilize your expertise to train AI systems, focusing on analyzing and... ...a solid foundation in electronics engineering. Responsibilities include creating datasets, simulating circuits, and...Remote job
$187.6k - $257.95k
...Senior Manager, Clinical Engineering & Data Analytics This role... ...clinical engineering outcomes, AI-enabled insights, and best-in... ...dashboards, and high-fidelity datasets. Strong business acumen... ...benefits, Solventum regularly benchmarks with other companies that are...Remote workH1bRelocation packageFlexible hours- Alignerr is seeking a Mechanical Engineering Expert to work remotely on cutting-edge AI projects. You will design complex engineering problems and develop solutions for AI learning benchmarks. Ideal candidates hold a Master’s or Ph.D. in related fields, possess strong...Remote job
$320k
Distinguished Engineer - High Performance AI page is loaded## Distinguished Engineer - High Performance AIlocations... ...: US, CA, Santa Clara: US, GA, Remote: US, TX, Austin: US, TX, Remote: US,... ...optimizations, evidenced by benchmark wins or published results.* Publications...Remote work$65 - $85 per hour
...prominent recruiting agency is seeking a Computational Biologist for a remote role with a pay range of $65 to $85 per hour. In this position, you will design benchmark problems and analyze biological datasets. Applicants must possess an advanced degree in a relevant field...Remote jobHourly payContract work10 hours per weekFlexible hours- ...Senior AI Technologist Essential Functions: Identify... ...completeness, and accuracy across AI datasets and workflows. Build and... ...in computer science, Data Engineering, Information Systems,... ...equivalent experience REMOTE WORK NOTICE: This position may...Remote workWork at office
- ...About Langfuse Open Source LLM Engineering Platform that helps teams build useful AI applications via tracing,... ...office (how we work). Workplace: Remote-friendly. European roles are remote... ..., evals, prompt management, datasets, metrics, and related LLM engineering...Remote workPart timeWork at office
- ...Ubuntu Linux Kernel Test Engineer Home Based - APAC; Office Based... ...: This role will be based remotely in the APAC region, except for... ...validation Conduct performance benchmarking and regression detection... ...projects and the platform for AI, IoT and the cloud, we are changing...Remote workWork at officeWork from home
$125k - $205k
...investments. Built on a native, AI-powered platform and more... ...: As a Senior Security Engineer at Later, you will play a critical... ...factors. Our salaries are benchmarked against market Total Cash... ...we are open to hiring fully remote candidates. We post our positions...Remote workPermanent employmentLocal area- ...Quality Assurance Engineer – Av Behavior Simulation Testing Austin... ...reliable testing processes, and datasets. Our main goal is to detect... ...offering relocation sponsorship, and remote work options are not available... ...essential functions of a job, please email ****@*****.***.ai....Remote workRelocation
$189.6k - $312.73k
The vLLM and LLM-D Engineering team at Red Hat is looking for a customer... ...by running performance benchmarks, tuning vLLM parameters, and... ...troubleshoot complex CNI failures. AI Inference Proficiency : You... ...or equity. For positions with Remote‑US locations, the actual...Remote workPermanent employmentFull timeContract workWork experience placementWork at officeFlexible hours- ...you will establish and lead an AI Systems & Performance Lab... ...and custom accelerators). Benchmarking & Infrastructure Development... ...small, highly skilled team of engineers and researchers. Drive thought... ..., Colorado, New York or remote jobs that can be performed in...Remote workTemporary workFlexible hoursShift work
Do you want to receive more vacancies?
Subscribe and receive similar vacancies to Remote AI Benchmark & Datasets Engineer. Be the first to apply!
- senior ai engineer Palo Alto, CA
- ai ml engineer Palo Alto, CA
- ai engineer remote Palo Alto, CA
- ai engineer Palo Alto, CA
- ai prompt engineer Palo Alto, CA
- ai developer Palo Alto, CA
- ai research engineer Palo Alto, CA
- machine learning ai engineer Palo Alto, CA
- remote nonprofit Palo Alto, CA
- remote financial analyst Palo Alto, CA



