AI Benchmark & Datasets Engineer/ Researcher Internship
Pathway Vet Alliance
About Pathway Pathway builds the first post-transformer frontier model that solves AI's fundamental memory problem. While transformers wake up in the same state every time—like Groundhog Day—our architecture enables true continuous learning, infinite context reasoning, and real-time adaptation. We're not optimizing yesterday's technology; we're building what comes after transformers. Our breakthrough architecture outperforms Transformer and provides the enterprise with full visibility into how the model works. Combining the foundational model with the fastest data processing engine on the market, Pathway enables enterprises to move beyond incremental optimization and toward truly contextualized, experience-driven intelligence. We are trusted by organizations such as NATO, La Poste, and Formula 1 racing teams. Pathway is led by co-founder & CEO Zuzanna Stamirowska, a complexity scientist who created a team consisting of AI pioneers, including CTO Jan Chorowski who was the first person to apply Attention to speech and worked with Nobel laureate Geoff Hinton at Google Brain, as well as CSO Adrian Kosowski, a leading computer scientist and quantum physicist who obtained his PhD at the age of 20. The company is backed by leading investors and advisors, including Lukasz Kaiser, co-author of the Transformer (“the T” in ChatGPT) and a key researcher behind OpenAI’s reasoning models. Pathway is headquartered in Palo Alto, California. The Opportunity We are currently seeking AI Benchmark & Dataset Engineering interns to support the definition and execution of benchmarking processes for model evaluation. You Will Proactively identify, prioritize, and curate relevant public and client-driven benchmarks across our target use cases and markets. Evaluate candidate benchmarks for clarity, data quality, evaluation methodology, and fit with our model roadmap. Run benchmarks with baseline models to validate setup, uncover edge cases, and de‑risk R&D runs. Hand off “benchmark-ready” packages to R&D (specs, data, evaluation scripts, expected metrics, constraints) Maintain a shared vocabulary and documentation around benchmarks, datasets, and evaluation formats that GTM and R&D can both use. Track and organize benchmark results, model leaderboards, and “what good looks like” for different customers and scenarios. Contribute to demos and public‑facing proof points based on benchmark outcomes. You will play a key role in defining and driving the benchmarking process for AI model evaluation. Your work will directly influence what we build, how we talk about it, and how customers and the market experience BDH. Cover letter It's always a pleasure to say hi! If you could leave us 2-3 lines, we'd really appreciate that. You are expected to meet at least one of the following criteria: You were an ICPC World Finalist, or an IOI, IMO, IOAI or IPhO medalist in High School. You have published a research paper at an A-rated or A*-rated venue (according to ICORE). You have completed coding projects - ideally with a GitHub repository showcasing previous work. You were an intern at a leading Machine Learning research center (e.g. at: Google Brain / Deepmind, Apple, Meta, Anthropic, Nvidia, MILA). You can get a warm recommendation from you university faculty member. You Have experience with ML/LLM evaluation , data science, or technical product roles, ideally around benchmarks or experimentation. Are comfortable reading papers , leaderboards , and Github repos, and turning them into clear, repeatable benchmark specs. Can talk comfortably with both engineers and customers, and translate between technical detail and business value . Care about high‑quality data , reproducible experiments , and crisp documentation . Are respectful of others. Are fluent in English. Why You Should Apply Join an intellectually stimulating work environment. During your internship, you will collaborate on a cutting edge research project . Be a pioneer: you get to work with a new type of "Live AI" challenges. Be part of one of an early-stage AI startup that believes in impactful research and foundational changes. Further details Preferable joining date : July 2026. The positions are open until filled – please apply immediately. Duration : 3-6 months Compensation : based on profile and location. Location : Hybrid - regular presence in our office in Palo Alto, CA is required. Possibility to work or meet with other team members in one of our other offices: Paris, France or Wroclaw, Poland. As a general rule, permanent residence will be required in the EU, UK, US, or Canada. If you meet our broad requirements but are missing some experience, don’t hesitate to reach out to us. #J-18808-Ljbffr
- ...frontier model that solves AI's fundamental memory problem... ...fastest data processing engine on the market, Pathway enables... ...T” in ChatGPT) and a key researcher behind OpenAI’s reasoning models... ...and execute rigorous benchmarks and define dataset standards. Collaborating closely...SuggestedPermanent employmentFull timeContract workImmediate startRemote work
- ...An AI technology startup is seeking a Benchmarking Specialist in Palo Alto to design and execute ML evaluation benchmarks. You'll work closely with the R&D team to define data standards and maintain documentation. The ideal candidate has experience in ML/LLM evaluation...SuggestedFull timeImmediate startRemote work
$19 - $65 per hour
PlusAI is a Physical AI company pioneering AI-based virtual... ...growing teams. We are seeking a Research Engineer Intern to join the... ...detection on a curated long-tail benchmark. The project will first establish... .... \$19 - \$65 an hour Our internship hourly rates are a standard...InternshipHourly pay$174k - $252k
Research Engineer, Gemmaverse Variants Research, DeepMind corporate_fare DeepMind place Mountain... ...with Python, PyTorch, JAX, LLM datasets/benchmarks/metrics, etc. Preferred qualifications... ...to demonstrate the positive power of AI and establish Gemma as the definitive...SuggestedFull time- ...Bespoke Labs is an applied AI research lab pioneering data and RL environment... ...of the best open reasoning datasets used by multiple frontier... ...'re looking for a Research Engineer to bridge cutting-edge... ...Customize environment suites and benchmarks for different use cases and...Suggested
- ...on building next-generation Embodied AI systems and intelligent robotic platforms... ...deep experience in frontier AI research, robotics engineering, product development, and company building... ...subject: Application - [Full-time/Internship] - [Your Area, e.g., Embodied AI/...InternshipFull time
$174k - $252k
...Computer Science, Computer Engineering, Cybersecurity, a... ...professional or academic research experience applying machine... ...or curating large‑scale datasets for training or evaluating AI/ML models, particularly... ...functions, and evaluation benchmarks grounded in realistic...Full time- ...California About the Team The AI Research organization is dedicated to... ...group of researchers and engineers tackling some of the most complex... ...‑driving technology. This internship provides experience with... ...Experience working with large‑scale datasets and training ML models in...InternshipFull timeSummer internshipWork at officeLocal area
- ...SAP SE is offering a Summer 2026 internship for an AI Software Engineer within the SAP SuccessFactors iXp team. The role revolves around developing AI-driven solutions for enhanced performance analysis. Interns will engage in full-stack development using Java, Python,...InternshipSummer internship
- ...Research Engineer, Foundation Models About the Opportunity We are... ...next generation of large-scale AI systems. This role sits at... ..., from building large-scale datasets and training infrastructure... ...Research, prototype, and benchmark novel model architectures and...Visa sponsorshipRelocation packageFlexible hours
- ...for an intern for the SAP SuccessFactors iXp program as an AI Software Engineer for Summer 2026. The intern will work on AI Software Development... ...in full-stack development with Java, Python, or Go. The internship offers collaborative mentorship and a project-driven...InternshipSummer workSummer internship
- Gen Digital Inc. is offering a paid summer internship in Mountain View, CA, focusing on optimizing AI and data systems for a global sales organization. Interns will work on transitioning a CRM platform into a predictive tool, enhance AI applications, and have hands-on...InternshipSummer internship
$40 - $45 per hour
Coupang is seeking interns for Summer 2026 in Mountain View, California. The internship program focuses on disciplines like Data Science, AI, and Software Engineering. Interns will work on real projects, gain mentorship, and collaborate with teams to impact users positively...InternshipHourly payFull timeSummer workSummer internship$90 - $121.86 per hour
...Job Description Job Description LLM Research Engineer Key Responsibilities: Design, train, and... ...architectures, multi-modal models, and emergent AI behaviors. Collect, clean, and preprocess large-scale text datasets from diverse sources. Develop and...Hourly pay- ...systems and semiconductors where AI can design and create beyond... ...Stanford professors, SAIL researchers, Olympiad medalists (IPhO, IOI... ...’ll build tools, performance benchmarks, and integration layers that... ...closely with researchers and engineers, you’ll help make Voltai the...
$180k
...mission is to create AI systems that can accurately... ..., and focused on engineering excellence. This organization... .... All engineers and researchers are expected to have... ..., synthetic reasoning datasets, or hybrid techniques... ...: Develop creative benchmarks and metrics to assess...Work at officeWork from homeRelocation$19 - $65 per hour
PlusAI is a Physical AI company pioneering AI-based virtual driver software for factory... ...drive our automated map‑updating engine, keeping our fleet safe and scalable. Responsibilities... .... Compensation $19 - $65 an hour. Our internship hourly rates are a standard pay...InternshipHourly payPermanent employmentSummer internshipNight shift$145k - $185k
...Growth Intelligence Engineer (Ads & Revenue) Mountain... ...powered by advanced AI, recommendation... ...feature pipelines, training datasets, feedback loops) that... ...performance data, market benchmarks, and business targets... ...context Prior internship or work experience at...InternshipFull timeWork experience placementLocal areaWork from home- ...Mountain View, CA Summer 2026 Internship $120K – $150K annualized Join Corvic's AI team for a summer... ...data processing and feature engineering pipelines Contribute to benchmarking and evaluation of AI systems Collaborate with researchers on novel approaches to multimodal...InternshipSummer workSummer internship
- TryApplyNow is offering an Applied Data Science Summer Internship in Palo Alto, CA. This 10-week full-time program is designed for graduate... ...students and exceptional undergraduates who are passionate about AI and data science. Interns will engage in real enterprise...InternshipFull timeSummer internship
- ...experience in machine learning engineering or large-scale software... ...Experience working directly on AI safety, adversarial robustness... ...evaluation, or responsible AI research. Experience in Python and C++... ...building evaluation frameworks, benchmarks, or automated testing...
$158.8k - $218.1k
...Summary: The Robot Intelligence Lab at Samsung Research America is a new facility dedicated to... ...Our ideal candidate is to explore novel AI technologies towards generalizing robots... ...robotic systems Conduct experiments and benchmark performance against state-of-the-art approaches...Full timeWork at officeLocal area- ...At Rhoda AI, we’re building the next generation of generalist... ...possible by our cutting edge research and end-to-end system design.... ...Research Scientist or Research Engineer focused on model efficiency —... ...from the start Profile and benchmark models across hardware targets...
$141k - $202k
...is pursuing a ground‑breaking research program in materials, aiming... ...of artificial intelligence (AI) and computational simulation... ...experts, ML researchers, and engineers exploring a diverse set of important... ...visualising large and noisy datasets. Experience using Jax,...Full time- About Mistral At Mistral AI, we believe in the power of AI to simplify tasks, save... ...culture on Role Summary About the Research Engineering team The team spans Platform (shared infra... ...of GPUs). Design, implement and benchmark ML algorithms; write clear, efficient code...Work at officeVisa sponsorship
- Research Engineer - Multimodal AI Los Altos, CA About Orbifold AI Backed by top-tier VCs in the Silicon Valley and trusted by Fortune 500 companies... ...augment messy multimodal data into optimized, AI-ready datasets—powering the next generation of cutting‑edge visual and...Flexible hours
- ...dramatically overlooked. While AI is becoming more ubiquitous,... ...robots, and conduct necessary research to build products that just work... ...for a motivated perception engineer to join us on the ground floor... ...infrastructure to train on large datasets and simultaneously be...Immediate start
$150k
...Models We are a dedicated research lab for building,... ...the next generation of AI builders, and drive... ...data scientists, and engineers, tackling the most fundamental... ...across large-scale datasets. Key Responsibilities... ...Key Responsibilities: Benchmarking & Evaluation...Visa sponsorship- ...At Rhoda AI, we're building the full-stack foundation for the... ...robotics, and systems, with a research team that includes... ...Research Scientist or Research Engineer to advance dexterous manipulation... ...simulation environments and benchmarks for dexterous manipulation research...
- ...electronics systems and semiconductors where AI can design and create beyond human... ...of previous Stanford professors, SAIL researchers, Olympiad medalists (IPhO, IOI, etc.),... ...Building high-quality evaluation datasets and benchmarks for complex reasoning or design tasks...
Do you want to receive more vacancies?
Subscribe and receive similar vacancies to AI Benchmark & Datasets Engineer/ Researcher Internship. Be the first to apply!
- senior ai engineer Palo Alto, CA
- ai ml engineer Palo Alto, CA
- ai engineer remote Palo Alto, CA
- ai engineer Palo Alto, CA
- ai prompt engineer Palo Alto, CA
- ai developer Palo Alto, CA
- ai research engineer Palo Alto, CA
- machine learning ai engineer Palo Alto, CA
- deep learning research engineer Palo Alto, CA
- research software engineer Palo Alto, CA



