Senior Software Engineer for LLM Evaluation [Remote]

$50 per hour

SaidGig

Remote job

This role focuses on creating innovative datasets for training and advancing large language models, working closely with researchers to enhance AI-driven coding solutions. As a Software Engineering evaluator, you will curate code examples, provide precise solutions, and make corrections, primarily using Python, while also engaging with JavaScript (including ReactJS), C/C++, Java, Rust, and Go.

Key Responsibilities:

Curate code examples, build solutions, and correct code primarily in Python, with additional work in JavaScript (including ReactJS), C/C++, Java, Rust, and Go.
Evaluate and refine AI-generated code to ensure efficiency, scalability, and reliability.
Collaborate with cross-functional teams to enhance AI-driven coding solutions against industry performance benchmarks.
Develop agents and automated verification tools in Python to verify code quality and identify error patterns.
Hypothesize on software engineering cycle steps (prototyping, architecture design, API design, production implementation, launch, experiments, monitoring, operational maintenance) and evaluate model capabilities.
Design verification mechanisms to automatically verify solutions to software engineering tasks.

Qualifications:

Minimum of 3 years of software engineering experience.
Strong expertise in Python, with deep knowledge of frameworks, tooling, and best practices for building production-grade software.
Experience in building full-stack applications and deploying scalable software using modern languages and tools.
Deep understanding of software architecture, design, development, debugging, and code quality/review assessment.
Excellent oral and written communication skills for clear, structured evaluation rationales.

Work Terms:

Flexible engagement with a minimum commitment of 10 hours per week, up to 40 hours per week.
Contractor position (no medical/paid leave).
Initial duration of 1 month with potential extensions based on performance and fit.
Candidates must be based in the United States.

Compensation:

Details regarding compensation will be discussed during the interview process.

Eligibility:

The application process takes 15, 30 minutes.
Completion of an AI video interview is required.

Apply

Vacancy posted 14 days ago

Similar jobs that could be interesting for youBased on the Senior Software Engineer for LLM Evaluation [Remote] in United States vacancy

Senior Software Engineer for LLM Evaluation & Repository Validation
$40 per hour
...Role Overview: This position focuses on developing and evaluating large language model (LLM) training datasets to address realistic software engineering challenges. You will engage in creating verifiable software engineering tasks based on public repository histories,...
Senior
For contractors
Remote work
SaidGig
United States
14 days ago
Senior Software Engineer for LLM Evaluation & Repository Validation
$40 per hour
...In this role, you will contribute to the development of LLM evaluation and training datasets designed to tackle realistic software engineering challenges. Your expertise will help create verifiable software engineering tasks based on public repository histories, utilizing...
Senior
Contract work
For contractors
SaidGig
United States
14 days ago
Senior Software Engineer for LLM Evaluation & Repository Validation
$40 per hour
...Role Overview: This position offers a unique opportunity to engage in the development of LLM evaluation and training datasets aimed at solving realistic software engineering challenges. You will play a critical role in creating verifiable software engineering tasks based...
Senior
For contractors
SaidGig
United States
14 days ago
Senior Software Engineer for LLM Evaluation
$40 per hour
...Role Overview: This position focuses on building LLM evaluation and training datasets aimed at addressing realistic software engineering challenges. The role involves creating verifiable software engineering tasks based on public repository histories, utilizing a synthetic...
Senior
Remote job
Contract work
For contractors
SaidGig
United States
28 days ago
Senior Software Engineer, Ruby for LLM Evaluation
$40 per hour
...Join a pioneering team focused on building LLM evaluation and training datasets designed to tackle realistic software engineering challenges. This role offers the opportunity to contribute to the development of verifiable software engineering tasks based on public repository...
Senior
For contractors
SaidGig
United States
14 days ago
Senior Software Engineer for LLM Evaluation
$50 per hour
...Role Overview As a Software Engineering evaluator, you will create cutting-edge datasets for training, benchmarking, and advancing large language models, collaborating closely with researchers. This includes curating code examples, providing precise solutions, and making...
Senior
Remote job
Full time
For contractors
Flexible hours
SaidGig
United States
28 days ago
Remote Senior Python Engineer - LLM Evaluation (US-based)
...training pipelines, plus top AI researchers who specialize in software engineering, logical reasoning, STEM, multilinguality, multimodality,... ...pedigree. Project Overview What Does a Typical Day Look Like? Evaluate and refine AI-generated code across backend and frontend...
Senior
For contractors
Remote work
Flexible hours
Turing
Chicago, IL
1 day ago
Remote Senior Software Engineer (LLM) - 34953
$40 - $100 per hour
Remote Senior Software Engineer (LLM) - 34953 Remote Senior Software Engineer (LLM) - 34953 3 days ago Be among the first 25 applicants Get AI-powered... ...AI. Project Overview: We're building high-quality evaluation and training datasets to improve how Large Language Models...
Senior
Remote job
Full time
Contract work
For contractors
Turing
New York, NY
4 days ago
Senior Software Engineer - TensorRT Edge-LLM
...inference? Join NVIDIA’s TensorRT Edge‑LLM team and help shape the next generation... ...for automotive and robotics. We build the software stack that enables Large Language, Vision... ...Computer Science, Electrical/Computer Engineering, or a closely related field. 4+ years of...
Senior
NVIDIA Gruppe
Santa Clara, CA
4 days ago
Senior Software Engineer, Metrics and Evaluation - Autonomous Vehicles
$148k - $356.5k
Senior Software Engineer, Metrics and Evaluation - Autonomous Vehicles page is loaded Senior Software Engineer, Metrics and Evaluation - Autonomous Vehicles Apply locations US, CA, Santa Clara US, GA, Remote US, NC, Remote US, WA, Remote US, DC, Remote time type Full time...
Senior
Full time
Remote work
NVIDIA Corporation
Raleigh, NC
4 days ago
Senior Software Engineer, Go - LLM Team
$180k - $240k
...support the next generation of powerful, meaningful products built with AI. Job Overview We’re seeking an exceptional Senior Software Engineer to join our LLM team. This role is focused on building and maintaining our LLM gateway service—a unified API platform that...
Senior
Full time
Remote work
Easy work
AssemblyAI
New York, NY
3 days ago
Senior Software Engineer, Autonomy Evaluation
$144.7k - $221.4k
...analyses to introspect autonomous driving software performance at interfaces across the... ...closely with autonomy developers and systems engineers. Design and implement analysis... ...components in the autonomy stack, including evaluation of perception, prediction, and planning...
Senior
Local area
Remote work
Relocation
Relocation package
Flexible hours
General Motors
Sunnyvale, CA
2 days ago
(Senior) Software Engineer - Evaluation
$120k - $250k
...processing, curation, and multi-dimensional evaluation. Design and implement high-performance... .... Build and optimize downstream engineering workflows for Large Language Models (LLMs... ...skills in C/C++, Python, and software design Strong foundation in mathematics...
Senior
Temporary work
Pony.ai Inc.
Fremont, CA
6 days ago
Contract Senior Software Engineer - AI Code Review & Evaluation
$50 - $150 per hour
A leading AI company is seeking a software engineer to review and evaluate model-generated code. This contract role requires several years of software engineering experience, particularly as a full-stack engineer at notable tech firms. You will assess code quality and...
Senior
Hourly pay
Contract work
Flexible hours
Turing
San Francisco, CA
3 days ago
Senior Software Engineer, Trust & Safety - LLM Defenses
$196k - $294k
Nerdleveltech seeks a Senior Software Engineer to join their Trust & Safety team. Based in San Francisco, you'll protect millions of developers from... ...scale, using your skills in JavaScript, Python, and LLM methodologies. Responsibilities include analyzing threat actor...
Senior
Flexible hours
Nerdleveltech
San Francisco, CA
4 days ago
Senior Software Engineer: Sampling & Evaluation Pipelines
A leading autonomous technology company based in California seeks an experienced engineer to develop importance sampling techniques for evaluation pipelines. You will collaborate with teams to optimize objectives and drive data-driven decisions. Ideal candidates will have...
Senior
Waymo
Mountain View, CA
2 days ago
Senior Deep Learning Software Engineer, LLM Performance
$184k - $287.5k
We are now looking for a Senior Deep Learning Software Engineer, LLM Performance! NVIDIA is seeking an experienced Deep Learning Engineer passionate about analyzing and improving the performance of LLM inference. NVIDIA is rapidly growing our research and development for...
Senior
NVIDIA
Santa Clara, CA
4 days ago
Senior Python Engineer: LLM Eval & Repo Validation Remote
$19 - $20 per hour
A tech consulting firm is seeking a Senior Software Engineer specializing in Python to evaluate and validate LLM performance in real-world scenarios. This remote position involves analyzing GitHub issues, developing software solutions, and collaborating with researchers...
Senior
Remote job
Hourly pay
For contractors
Crossing Hurdles
New York, NY
4 days ago
Senior AI Test Engineer: Build Scalable AI & LLM Evaluations
TWG Global AI in New York is seeking a Senior or Staff AI Software Engineer in Test to develop test automation frameworks for AI products. The ideal candidate will have significant experience in Python and Java, along with a strong software engineering background focused...
Senior
TWG Global AI
New York, NY
2 days ago
Senior AI Systems Engineer - LLM in Production
$160k - $200k
Madrona Venture Labs is seeking experienced engineers passionate about AI to design and build LLM powered systems in Seattle, Washington. The role offers a... ...with large language models, retrieval systems, and evaluation frameworks. Flexible time off and meaningful equity...
Senior
Flexible hours
Madrona Venture Labs
Seattle, WA
2 days ago
Senior LLM Evaluation Data Scientist - Remote
Driverai is seeking an Applied Data Scientist with expertise in LLM evaluation to join its innovative team in Austin, TX. This role focuses on building the evaluation function from scratch and requires a strong background in statistics and machine learning. The successful...
Senior
Remote job
Driverai
Austin, TX
4 days ago
Senior AI Quality Engineer — Agent Evaluation & Testing
$176k - $253k
Harper is seeking a Senior Member of Technical Staff, AI... ...standards through robust evaluation processes. You'll build capability... ..., and work directly with engineers to ensure our AI systems... ...have 3-6 years of software experience, particularly in LLM and agent evaluations. Competitive...
Senior
Harper
San Francisco, CA
6 days ago
Senior Lead AI Engineer (FM Hosting, LLM Inference)
$229.9k - $262.4k
...Senior Lead AI Engineer (FM Hosting, LLM Inference) Overview: At Capital One, we are creating responsible... ..., test, deploy, and support AI software components including foundation model... ...search, guardrails, model evaluation, experimentation, governance, and...
Senior
Full time
Part time
Local area
Capital One
San Jose, CA
1 day ago
Senior Software Engineer - Siri Evaluation Platform
Apple Inc. is looking for a Senior Software Engineer to join the Siri Agentic Evaluation Platform team in Cambridge. You will build software platforms that define, measure, and report Siri quality and effectiveness, providing insights to development teams across Apple....
Senior
Apple Inc.
Cambridge, MA
4 days ago
Senior Software Engineer / AI Product / Python / LLM
$225k
This employer is hiring an AI Solutions Engineer for a direct hire role based in Kennett Square... ...position sits at the intersection of software engineering and business consulting, focused... ...You Will Be Doing Tech Breakdown AI / LLM-based development (RAG, agents, workflows...
Senior
2 days per week
Motion Recruitment Partners LLC
Kennett Square, PA
2 days ago
Senior AI Software Engineer (Agentic Systems & LLM Applications) | W2 Only | REMOTE |
Role Overview We are seeking an experienced AI‑focused Software Engineer to design, build, and scale intelligent applications powered by modern... ...in Python software development Hands‑on experience with LLM frameworks and orchestration tools (LangChain, LangGraph, AutoGen...
Senior
Remote job
Xlysi LLC.
Chicago, IL
3 days ago
Agentic AI & LLM Applications Software Development Engineer, Senior
...That is why we need you, an experienced Software Development Engineer who can operate at a system-of-... ...design and build agentic workflows, build LLM integrations, support tool‑calling... ...makes those decisions correctly Build evaluation pipelines for LLM quality, including...
Senior
Full time
Contract work
Part time
Work at office
Local area
Remote work
Phase2 Technology
Washington DC
5 days ago
Senior AI Engineer — LLM Platform & Production
Lendistry is seeking a Senior AI Engineer to enhance our AI capabilities and lead the development of LLM features. This hands-on role involves collaborating with cross-functional... ...in Python, LLM applications, and AI system evaluation. Our comprehensive benefits and supportive...
Senior
Lendistry
Los Angeles, CA
5 days ago
Senior Software Engineer
...CSC, we are seeking a skilled .NET Software Developer to join our engineering team. The role involves designing,... ...agents, and intelligent workflows. Evaluate, implement, and optimize AI-augmented... ...with large language model (LLM) architectures, prompt engineering,...
Senior
Full time
Local area
Remote work
Corporation Service Company
Wilmington, DE
3 days ago
Senior AI/ML Engineer - LLM and Agentic AI Systems - Fulltime
Senior AI/ML Engineer - LLM & Agentic AI Systems - Hybrid We are looking for a Senior AI/ML Engineer with strong expertise in Large Language... ...prompt engineering strategies Support LLM fine‑tuning, model evaluation, and continuous model improvement initiatives Build...
Senior
Full time
NLP PEOPLE
Dearborn, MI
4 days ago

Do you want to receive more vacancies?

Subscribe and receive similar vacancies to Senior Software Engineer for LLM Evaluation [Remote]. Be the first to apply!