Remote Software Engineer - AI Research & Evaluation (US-based)
Turing
About Us:
Based in San Francisco, California, Turing is the world’s leading research accelerator for frontier AI labs and a trusted partner for global enterprises deploying advanced AI systems. Turing supports customers in two ways: first, by accelerating frontier research with high-quality data, advanced training pipelines, plus top AI researchers who specialize in software engineering, logical reasoning, STEM, multilinguality, multimodality, and agents; and second, by applying that expertise to help enterprises transform AI from proof of concept into proprietary intelligence with systems that perform reliably, deliver measurable impact, and drive lasting results on the P&L.
Ideal Background:
This role is ideal for engineers who have worked at the frontier of AI — at companies like OpenAI, NVIDIA, Databricks, Palantir, Snowflake, or similar organizations pushing the boundaries of intelligent systems. We especially welcome graduates from programs with strong CS foundations such as University of Washington, University of Illinois Urbana-Champaign, UT Austin, University of Michigan, Purdue, and comparable institutions — though exceptional experience and skill always take precedence over pedigree.
Project Overview:
As a Software Engineering evaluator, you will create cutting-edge datasets for training, benchmarking, and advancing large language models, collaborating closely with researchers. This includes curating code examples, providing precise solutions, and making corrections — with a primary focus on Python across backend services, data pipelines, and ML infrastructure, alongside JavaScript (including ReactJS), C/C++, Java, Rust, and Go. You will evaluate and refine AI-generated code for efficiency, scalability, and reliability, and work with cross-functional teams to enhance enterprise-level AI-driven coding solutions.
What Does a Typical Day Look Like?
- Work on AI model training initiatives by curating code examples, building solutions, and correcting code — primarily in Python, with additional work in JavaScript (including ReactJS), C/C++, Java, Rust, and Go.
- Evaluate and refine AI-generated code to ensure that it is efficient, scalable, and reliable.
- Collaborate with cross-functional teams to enhance AI-driven coding solutions against industry performance benchmarks.
- Build agents and automated verification tools in Python that can verify the quality of code and identify error patterns.
- Hypothesize on steps in the software engineering cycle (prototyping, architecture design, API design, production implementation, launch, experiments, monitoring, operational maintenance) and evaluate model capabilities on them.
- Design verification mechanisms that can automatically verify a solution to a software engineering task.
Required Skills:
- Several years of software engineering experience (3 years or more)
- Strong expertise in Python with deep knowledge of frameworks, tooling, and best practices for building production-grade software.
- Experience building full-stack applications and deploying scalable software using modern languages and tools.
- Deep understanding of software architecture, design, development, debugging, and code quality/review assessment.
- Excellent oral and written communication skills for clear, structured evaluation rationales.
Engagement Details:
- Commitment: flexible engagement, minimum 10 hrs/week, up to 40 hrs/week
- Type: Contractor (no medical/paid leave)
- Duration: 1 month (potential extensions based on performance and fit)
- Location: Candidates must be based in the United States
Evaluation Process:
- The application process takes 15–30 minutes.
- Completion of an AI video interview is required.
Note: As part of assessments you will go through an AI video interview.
After applying, you will receive an email with a login link. Please use that link to access the portal and complete your profile.
Know amazing talent? Refer them at turing.com/referrals, and earn money from your network.
- ...About Us: Based in San Francisco, California, Turing is the world’s leading research accelerator for frontier AI labs and a trusted partner for global... ...who specialize in software engineering, logical reasoning, STEM... ...Software Engineering evaluator, you will create...Remote workFor contractorsFlexible hours
$125k - $160k
...Full Stack Software Engineer (US Based Remote) Torus is headquartered in Utah and is expanding manufacturing... ...proper design and implementation AI Experience & Application Experience... ...into applications Understanding of evaluating AI outputs for accuracy and reliability...Remote workTemporary workCasual workWork at office$238k - $302k
...The Large Model Evaluation team is at the... ...of Waymo’s AI ambition . With... ...-minded engineers to research and propose new... ...metric computation based on large-scale... ...quantitative software engineering... ...position across US locations is listed... ...be performed remote, the specific...Remote workFull time$130k - $170k
...Software Engineer – Full Stack AI - US-Remote Remote What is PerfectServe? PerfectServe... ...like Gartner® and KLAS Research have consistently validated... ...governance and evaluation frameworks, and partnering... ...pay transparency and is based on market data, internal...Remote workImmediate startShift work- ...Based in San Francisco, California, Turing... ...the world’s leading research accelerator for frontier AI labs and a trusted... ...who specialize in software engineering, logical reasoning,... ...Typical Day Look Like? Evaluate and refine AI-... ...must be based out of US, Canada or WEU countries...Remote workFull timeFor contractorsFlexible hours
$175k - $215k
...art Generative AI to create a training... ...The Simulator Evaluation team faces the... ...for aSoftware Engineer to build the... ...to Senior Staff Software Engineering Manager... ...The expected base salary range... ...position across US locations is listed... ...be performed remote, the specific...Remote workFull time$204k - $259k
...art Generative AI to create a training... ...The Simulator Evaluation team faces the... ...for aSenior Software Engineer to build the metrics... ...with AI research and other simulation... ...are heuristic-based, physics-based,... ...across US locations is listed... ...can be performed remote, the specific salary...Remote workFull time$170k - $216k
...Driver. Our software allows the Waymo... .... We conduct research to address... ...enabling software engineers like you to... ...system Evaluate new hardware... ...in industrial AI applications... ...The expected base salary range... ...position across US locations is... ...be performed remote, the specific...Remote workFull time$148k - $356.5k
...Senior Software Engineer, Metrics and Evaluation - Autonomous Vehicles page is loaded... ...Vehicles Apply locations US, CA, Santa Clara US, GA, Remote US, NC, Remote US, WA,... ...) or another Unix based system ~ Ability and... ...can solve. Our work in AI and digital twins is transforming...Remote workFull time$140k - $160k
...Software Engineer III, iOS - Meal Planning - (Remote - US) Software Engineer III, iOS - Meal Planning - (... ...right opportunities through AI-driven job matching.... ...annually, plus performance-based bonuses Comprehensive... ...your fit for the role, evaluating each experience in detail...Remote workFull timeInternshipWorldwideFlexible hours$175k - $245k
...Senior Software Engineer II - Applied AI and Evaluations (Remote Eligible) -REMOTE, USA- For over 20 years, Smartsheet has helped... ...your work and productivity US employees are automatically covered... ...Smartsheet provides a competitive base salary range for roles that may be...Remote workFull timeTemporary workLocal areaImmediate start$137k - $205.4k
...Software Engineer Your work days are brighter here... ...the world around us. As a Fortune 500... ...and a leading AI platform for managing... ...as code. Research, evaluate, prototype and drive... ...Statement The annualized base salary ranges for... ...-person time and remote. Our approach...Remote workWork at officeHome officeFlexible hours$163.8k - $245.8k
...the world around us. As a Fortune 50... ...and a leading AI platform for managing... ...extraordinary engineers to build and... ...and develop core software modules for real... ...availability. Evaluate and implement new... ...The annualized base salary ranges for... ...person time and remote. Our approach...Remote workWork at officeHome officeFlexible hours$80 - $100 per hour
...and locations. For US applicants: This is a... ...benchmarks and evaluation pipelines used to test frontier AI models on real software engineering work: Design coding... ...Location: Fully remote — work from anywhere... ...Compensation: $80–$100/hr based on location and seniority...Remote workFull timeContract workFor contractors$136k - $199.2k
## Senior Software Engineer, Autonomy EvaluationApplyremote type: Remote/Hybridlocations: Sunnyvale... ...unprecedented scale. Join us to help deliver... ...Organization**The Evaluation team builds and... ...leveraging AI-assisted development... ...This role can be based remotely but if you...Remote workRelocationRelocation packageFlexible hours$150k - $180k
...Staff Software Engineer (Typescript) - US Based Remote Torus is headquartered in Utah and is expanding manufacturing at our 540,000-square-foot facility in... ...cloud-native development, IoT orchestration, and emerging AI capabilities. Who You Are Passionate about climate...Remote workTemporary workCasual workWork at office- Senior Agentic AI Software Engineer - Hybrid US Job ID: 497243 Posted since: 04-Mar-... ...type: Full-time, Hybrid (Remote/Office), Permanent Location... ...services, to reliability, evaluation, and long-term operation.... ...driven, human-in-the-loop) based on problem constraints and...Remote workPermanent employmentFull timeWork at officeLocal areaWork from home
- ...About us We intend to defy the cliches... ...workflows using AI. You will be... ...We're looking for software engineers with solid foundations... ...roles available with remote work flexibility,... ..., or on-site based on your preferences... ...Integrate model training, evaluation, deployment, and...Remote workFull time
$150k - $200k
...Technical Sales Lead - Engineering Background (New York based) New York,... ...the world’s #1 AI reading tutor, transform... ...working 100% remotely and enjoying... ...We’re seeking a software engineer or... ...Conduct market research to generate new... ...English fluency and US-based Strong organizational...Remote workPermanent employmentLocal areaDay shift$98.4k - $164k
...Description SummaryFull-Stack Software Engineer & Science (Virtual... ...that bridge physics, AI, and distributed... ...applications.Ability to evaluate and test the performance... ..., Modelica, graph-based modeling) and AI/ML model... ...Assistance Provided: NoLI-Remote - This is a remote...Remote workContract workWork experience placementRelocation package$136.3k - $273.9k
...users every day. As a Software Developer - iOS ,... ...products. As well, AI and automation are... ...your title and scope based on experience and... ...iOS, Android, Web; evaluate new Apple platform... ...arrangements (work-from-home, remote, or access to one... .... Dogfooding helps us experience what...Remote workTemporary workWork at officeWork from homeWorldwideFlexible hours- An innovative AI company based in the US is seeking a Mid-Senior level developer. The role involves developing and maintaining evaluation servers, implementing logic for agent actions, and creating... ...and Linux CLI. This part-time, remote opportunity offers competitive pay...Remote workPart time
$135k - $200k
...the world's leading software for data-driven... ...Deployed Software Engineers (FDSEs) work directly... ...workflows, deploying AI agents, advising customer... ...infrastructure based on AWS EC2 and S3.... ...community enables us to optimize our opportunities... ...that allow for "Remote" work on an...Remote workWork experience placementWork at officeWork from homeRelocation package$87.7k
...Position Title Research Software Engineer Hiring Range... ...NH Moore Hall Remote Work Eligibility? Hybrid... ...as part of the evaluation. Demonstrated efficiency with agentic AI systems for software... ...grant. Continuation is based upon available funds....Remote workFull timeWorldwide$177k - $245k
...build the best tools for AI developers. We founded... ...the process of evaluating, purchasing, and managing... ...business. As a Senior Software Engineer, you will lead initiatives... ...and Family Coverage Remote first culture with in-... ...will flourish with us. We are an equal opportunity...Remote workFull timeTemporary workWork at officeHome officeFlexible hours$156k - $185k
...Senior Full Stack Software Engineer - Remote in US Knock is redefining the home buying and selling experience... ...‑service communication. Contribute to AI‑powered features and initiatives, integrating... ...Philosophy: As a fully remote (U.S.‑based) workforce, our goal is to ensure that...Remote workFull timeLocal areaFlexible hours$176k - $215k
...pioneer and market leader in AI Search, empowering 17,000+... ...This strong foundation enables us to keep investing in our... ...solutions leveraging Search Engine, Rules, AI Studio, Analytics... ...Experience in Kubernetes based deployments #LI-Remote Algolia is an Equal Opportunity...Remote workWork at officeFlexible hours$163.8k - $245.8k
...the world around us. As a Fortune 500... ...company and a leading AI platform for... ...UX, design, and research to solve real user... .... As a Senior Software Development Engineer in IDX, you will:... ...Boot or similar JVM-based frameworks. ~... ...-person time and remote. Our approach...Remote workWork at officeHome officeFlexible hours- ...Our customers love us for reducing their... ...Position: Senior Software Engineer, Intelligence Services... ...: US (Hybrid or Remote) What you would do... ...projects from research, ideation and incubation... .... Practicingtrunk-based development by... ...Experience in leveraging AI to increase...Remote workContract workShift work
- ...Software Engineer GeoLogics is currently seeking a Software... ...Position can be fully remote/telework, fully on-site... ...and we can NOT do 1099. US CITIZENSHIP IS REQUIRED... ...within reasonable time based on requirements LOCATION... ...prompt engineering and AI tools to accelerate workloads...Remote workPermanent employmentContract workRelocation
Do you want to receive more vacancies?
Subscribe and receive similar vacancies to Remote Software Engineer - AI Research & Evaluation (US-based). Be the first to apply!
- remote entry level software developer Carol Stream, IL
- part time software developer remote Carol Stream, IL
- entry level software engineer remote Carol Stream, IL
- remote coding part time Carol Stream, IL
- telecommute Carol Stream, IL
- remote contract Carol Stream, IL
- customer service associate remote Carol Stream, IL
- remote data encoder Carol Stream, IL
- remote medical data entry Carol Stream, IL
- remote virtual Carol Stream, IL


