Sign up to access all features of our service.
  • Job search
  • Favorites
  • Create a CV
    New
  • Salaries
  • Subscriptions

Remote Senior Software Engineer - LLM Evaluation (US-based)

Turing

About Us:

Based in San Francisco, California, Turing is the world’s leading research accelerator for frontier AI labs and a trusted partner for global enterprises deploying advanced AI systems. Turing supports customers in two ways: first, by accelerating frontier research with high-quality data, advanced training pipelines, plus top AI researchers who specialize in software engineering, logical reasoning, STEM, multilinguality, multimodality, and agents; and second, by applying that expertise to help enterprises transform AI from proof of concept into proprietary intelligence with systems that perform reliably, deliver measurable impact, and drive lasting results on the P&L.

Ideal Background:

This role is ideal for engineers who have built production systems at companies like Google, Microsoft, Apple, Amazon, Meta, or similar high-scale engineering organizations. We especially welcome graduates from leading programs such as Harvard, Columbia, Princeton, Yale, University of Pennsylvania, and comparable institutions — though exceptional experience and skill always take precedence over pedigree.

Project Overview:

As a Software Engineering evaluator, you will create cutting-edge datasets for training, benchmarking, and advancing large language models, collaborating closely with researchers. This includes curating code examples, providing precise solutions, and making corrections across the full stack — in Python for backend and ML workflows, and JavaScript (React, Node.js) for frontend and API layers, alongside C/C++, Java, Rust, and Go. You will evaluate and refine AI-generated code for efficiency, scalability, and reliability, and work with cross-functional teams to enhance enterprise-level AI-driven coding solutions.

What Does a Typical Day Look Like?

  • Work on AI model training initiatives by curating code examples, building solutions, and correcting code across both Python and JavaScript (React, Node.js), with additional work in C/C++, Java, Rust, and Go.
  • Evaluate and refine AI-generated code across backend and frontend contexts to ensure that it is efficient, scalable, and reliable.
  • Collaborate with cross-functional teams to enhance AI-driven coding solutions against industry performance benchmarks.
  • Build agents that can verify the quality of the code and identify error patterns across full-stack applications.
  • Hypothesize on steps in the software engineering cycle (prototyping, architecture design, API design, production implementation, launch, experiments, monitoring, operational maintenance) and evaluate model capabilities on them.
  • Design verification mechanisms that can automatically verify a solution to a software engineering task.

Required Skills:

  • Several years of software engineering experience (3 years or more)
  • Strong expertise in building full-stack applications using Python and JavaScript (React, Node.js), with the ability to work across backend and frontend codebases.
  • Experience deploying scalable, production-grade software using modern languages and tools.
  • Deep understanding of software architecture, design, development, debugging, and code quality/review assessment.
  • Excellent oral and written communication skills for clear, structured evaluation rationales.

Engagement Details:

  • Commitment: flexible engagement, minimum 10 hrs/week, up to 40 hrs/week
  • Type: Contractor (no medical/paid leave)
  • Duration: 1 month (potential extensions based on performance and fit)
  • Location: Candidates must be based in the United States

Evaluation Process:

  • The application process takes 15–30 minutes.
  • Completion of an AI video interview is required.

Note: As part of assessments you will go through an AI video interview.

After applying, you will receive an email with a login link. Please use that link to access the portal and complete your profile.

Know amazing talent? Refer them at turing.com/referrals , and earn money from your network.

Vacancy posted 7 hours ago
Similar jobs that could be interesting for youBased on the Remote Senior Software Engineer - LLM Evaluation (US-based) in Gresham, OR vacancy
  •  ...About Us: Based in San Francisco, California, Turing is the world’s leading research...  ...top AI researchers who specialize in software engineering, logical reasoning, STEM, multilinguality...  ...Overview: As a Software Engineering evaluator, you will create cutting-edge datasets... 
    Remote work
    Senior
    For contractors
    Flexible hours

    Turing

    Allentown, PA
    7 hours ago
  • $175k - $245k

     ...Senior Software Engineer II - Applied AI and Evaluations (Remote Eligible) -REMOTE, USA- For over 20 years, Smartsheet...  ...at the intersection of LLM evaluation, prompt and...  ...work and productivity US employees are automatically...  ...provides a competitive base salary range for roles... 
    Remote work
    Senior
    Full time
    Temporary work
    Local area
    Immediate start

    Smartsheet

    New York, NY
    2 days ago
  • Senior Agentic AI Software Engineer - Hybrid US Job ID: 497243 Posted since: 04-Mar-2...  ...Full-time, Hybrid (Remote/Office), Permanent...  ...services, to reliability, evaluation, and long-term...  ...human-in-the-loop) based on problem...  ...experience building LLM-powered applications... 
    Remote work
    Senior
    Permanent employment
    Full time
    Work at office
    Local area
    Work from home

    Siemens Mobility

    Wilsonville, OR
    1 day ago
  • $40 - $100 per hour

     ...Remote Senior Software Engineer (LLM) - 34953 Remote Senior Software Engineer (LLM) - 34953 3 days ago Be among...  ...: We're building high-quality evaluation and training datasets to improve how...  ...starting next week; potential extensions based on performance and fit) Rates: $40... 
    Remote work
    Senior
    Full time
    Contract work
    For contractors

    Turing Inc

    New York, NY
    5 days ago
  • $180k - $240k

     ...re seeking an exceptional Senior Software Engineer to join our LLM team. This role is focused...  ...00 - $240,000The expected base compensation for this role...  ...flexibility of being fully remote.Working at AssemblyAIWe...  ...just fit in, but will help us define and build our company... 
    Remote work
    Senior
    Easy work

    AssemblyAI

    New York, NY
    2 days ago
  • $148k - $356.5k

     ...Senior Software Engineer, Metrics and Evaluation - Autonomous Vehicles page is loaded Senior Software Engineer...  ...Vehicles Apply locations US, CA, Santa Clara US, GA, Remote US, NC, Remote US, WA, Remote...  ...Linux (Ubuntu) or another Unix based system ~ Ability and enthusiasm... 
    Remote work
    Senior
    Full time

    NVIDIA

    Raleigh, NC
    5 days ago
  •  ...looking for experienced Software Engineers to design and...  ...pipelines used to evaluate frontier AI models...  .... This is a fully remote contract role. If...  ...and implementing LLM coding benchmarks...  ...Makes a Perfect Match Senior or Lead‑level...  ...benchmarking Why Join Us Work on cutting‑... 
    Remote work
    Senior
    Hourly pay
    Full time
    Contract work
    Freelance

    Alignerr

    New York, NY
    2 days ago
  • $140k - $220k

     ...As a Senior Software Engineer on our Advertising, Company Intelligence, and Intent...  ...at designing and implementing LLM‑powered systems such as RAG pipelines...  ...compensation offered will be based on factors such as the...  ...when you apply for jobs with us. Please review our Job... 
    Remote work
    Senior

    Zoom Information, Inc.

    Oklahoma City, OK
    4 days ago
  •  ...Senior Software Engineer, Knowledge Engine About Pinecone Pinecone is the leading...  ...friendly technology. Pinecone is based in New York and raised $138M...  ...unstructured data–to modern LLM-powered applications,...  ...Improve retrieval quality through evaluation and observability frameworks... 
    Remote work
    Senior
    Local area
    Work from home
    Flexible hours

    PineCone

    New York, NY
    5 days ago
  • $125k - $160k

     ...Full Stack Software Engineer (US Based Remote) Torus is headquartered in Utah and is expanding manufacturing at our 540,000-square-foot facility in...  ...enabled APIs or services into applications Understanding of evaluating AI outputs for accuracy and reliability Interest in... 
    Remote work
    Temporary work
    Casual work
    Work at office

    Torus

    Salt Lake City, UT
    5 days ago
  • $156k - $185k

     ...Senior Full Stack Software Engineer - Remote in US Knock is redefining the home buying and selling experience. We’re a...  ...into production applications—such as LLM integration (OpenAI, Anthropic),...  ...Philosophy: As a fully remote (U.S.‑based) workforce, our goal is to ensure... 
    Remote work
    Senior
    Full time
    Local area
    Flexible hours

    Knock

    New York, NY
    2 days ago
  • $136k - $199.2k

    ## Senior Software Engineer, Autonomy EvaluationApplyremote type: Remote/Hybridlocations: Sunnyvale, California, United...  ...scale. Join us to help deliver the next...  ...the Organization**The Evaluation team builds and evolves...  ...Remote:** This role can be based remotely but if you... 
    Remote work
    Senior
    Relocation
    Relocation package
    Flexible hours

    General Motors

    Sunnyvale, CA
    2 days ago
  •  ...more at As a Senior AI Infrastructure Engineer at Sword Health,...  ...From optimizing LLM inference and deploying...  ...strategies – evaluate and implement techniques...  ...your hours (remotely) with unlimited...  ..., check here. US - Sword Benefits...  ...valid EU visa and be based in Portugal... 
    Remote work
    Senior
    Full time
    Work from home
    Worldwide
    Relocation package
    Flexible hours
    Shift work

    SWORD Health

    New Bremen, OH
    5 days ago
  •  ...We are building LLM evaluation and training datasets...  ...on realistic software engineering problems. One of...  ...verifiable SWE tasks based on public...  ...quality Why Join Us? Turing is one of...  ...Work in a fully remote environment. Opportunity...  ...date as next week Seniority level Seniority... 
    Remote work
    Senior
    Contract work
    For contractors
    Freelance
    Internship

    Turing Inc

    New York, NY
    1 day ago
  • $200k

     ...of a high‑velocity engineering team that creates...  ...Role As an early Senior Full Stack Engineer...  ...Haves 5+ years of software development experience...  ...Worked in a fully remote environment with...  ...Worked in sprint‑based environments with...  ...with company match (US) Either SF, or... 
    Remote work
    Senior

    Short Story

    New York, NY
    2 days ago
  •  ...first AI Hardware Engineer. Our goal is...  ...Engineer, software that can design...  ...prompt. As a Senior Software Engineer...  ...Experience integrating LLM-based systems into...  ...in a fully remote, async team....  ...retrieval systems, evaluation pipelines, or...  ...flux.ai and tell us a little about... 
    Remote work
    Senior
    Internship
    Shift work

    Flux Enterprise

    San Francisco, CA
    5 days ago
  •  ...technology. What makes us different? Kraken is...  ...here. As a fully remote company, we have...  ...production-oriented team. Engineers here combine strong systems...  ...for model deployment, evaluation, and monitoring...  ...infrastructure for agent-based or LLM-powered systems... 
    Remote work
    Senior
    Local area

    Kraken

    New Bremen, OH
    5 days ago
  • $176k - $215k

     ...Senior Software Engineer, Merchandising (US Eastern) Remote - United States At Algolia, we’re proud to be a pioneer and market leader in AI Search, empowering 17...  ...Site Reliability Engineering Experience in Kubernetes based deployments Algolia is an Equal Opportunity... 
    Remote work
    Senior
    Work at office
    Flexible hours

    Algolia

    New York, NY
    2 days ago
  • $189.6k - $260.7k

     ...Senior Software Engineer, AI Tools & Security This range...  ...actual pay will be based on your skills...  ...what matters. Our remote‑first team spans...  ...started. Come join us for a whale of a...  ...Anthropic, or similar LLM APIs, and the MCP...  .... As part of the evaluation process we... 
    Remote work
    Senior
    Full time
    Home office

    Docker

    Seattle, WA
    5 days ago
  • $185.1k - $198.6k

     ...are looking for a hands‑on Senior Software Engineer to serve as the technical owner...  ...technical execution during US business hours, including...  ...AI solutions, including LLM‑based systems. Build and support...  .... Support model/service evaluation, monitoring, and... 
    Remote work
    Senior
    Fixed term contract
    Local area

    First American

    New York, NY
    4 days ago
  • $75k - $90k

     ...consulting company is seeking a Sr. AI Engineer to lead the creation of AI solutions...  ...development, dataset creation, and LLM-based AI applications. Ideal candidates...  ...000, healthcare, and paid time off. Remote work is not available, US citizenship or residency is required... 
    Remote work
    Senior

    Invonto LLC.

    Bridgewater, MA
    2 days ago
  •  ...Senior AI Engineer - LLM & Agentic Systems (Python) Remote Role Overview We are seeking a senior AI engineer...  ...and cloud platforms Establish evaluation, reliability, and performance...  ...skills with experience building and scaling cloud-based applications
    Remote work
    Senior

    RIT Solutions, Inc.

    Atlanta, GA
    3 days ago
  •  ...Senior Software Engineer (AI Applications) page is loaded##...  ..., AI-enabled web based educational software...  ...*Agent Quality & Evaluation:** Establish and...  ..., optimal LLM integration strategies...  ...we do, visit .Our Remote First approach...  ...results. It allows us to work collaboratively... 
    Remote work
    Senior
    Permanent employment
    Work at office

    Cambium Learning Group

    New York, NY
    2 days ago
  •  ...The AI Platform Engineering team is looking...  ...building high-quality software and adhering to...  ...or you may work remotely from anywhere in the US where Smartsheet...  ...rigorous evaluation pipelines, you will...  ...building or extending LLM evaluation...  ...scheduling, auto-scaling based on token... 
    Remote work
    Senior
    Full time
    Temporary work
    Work at office
    Local area

    Smartsheet

    New York, NY
    2 days ago
  •  ...At GOAT Group, the Engineering team is an integral part of...  ...solve problems and build software. From launching...  ...boutiques.We are looking for a Senior Backend Engineer to...  ...Voluntary Self-Identification (US)In an effort to improve...  ...law.Candidates based outside of the United States... 
    Remote work
    Senior
    Local area
    Immediate start

    GOAT Group

    New York, NY
    5 days ago
  • $156k - $234k

     ...and the world around us. As a Fortune 500...  ...develops pioneering software empowering federal...  ...Are you a software engineer that wants to learn...  ...working with bright senior engineers ? Do...  ...The annualized base salary ranges for the...  ...in-person time and remote. Our approach enables... 
    Remote work
    Senior
    Work at office
    Immediate start
    Home office
    Visa sponsorship
    Work visa
    Flexible hours

    Workday

    Atlanta, GA
    4 days ago
  • $132k - $190k

     ...outcomes. Our engineering teams build...  ...sophisticated software that makes a...  ...impact: As a Senior Front-End /...  ...modern component-based architectures...  ...engineering, evaluation techniques,...  ...integrating LLM-based...  ...USA Award for Remote Work! We have...  ...that connect us and that inspire... 
    Remote work
    Senior

    eClinical Solutions

    Mansfield, MA
    3 days ago
  •  ...Conversica is seeking a Senior AI Software Engineer to design, build,...  ...including agent evaluation, interpretability...  ...deploying LLM-powered systems/applications...  ...in modern cloud-based architectures...  ...and center. Show us what you’ve built...  ...Conversica is a remote-first company... 
    Remote work
    Senior
    Full time

    Conversica

    New York, NY
    2 days ago
  • $130k - $170k

     ...Software Engineer – Full Stack AI - US-RemoteRemoteWhat is PerfectServe?PerfectServe...  ...governance and evaluation frameworks, and partnering...  ...experience with LLM APIs, RAG systems,...  ...transparency and is based on market data, internal...  ...People Operations.Remote first work... 
    Remote work
    Immediate start
    Shift work

    PerfectServe

    Knoxville, TN
    5 days ago
  • $126.1k - $227k

    Job Family: Software Req ID: 484868 Siemens EDA is a global technology...  ...state of the art software engineering processes and development tools...  ...necessary export license. Why us? Working at Siemens Software...  ...employment decisions at Siemens are based on qualifications, merit, and... 
    Remote job
    Senior
    Full time
    Work at office
    Work from home
    Flexible hours

    Siemens AG

    Fremont, CA
    5 days ago

Do you want to receive more vacancies?

Subscribe and receive similar vacancies to Remote Senior Software Engineer - LLM Evaluation (US-based). Be the first to apply!