Remote Senior Python Engineer - LLM Evaluation (US-based)
Turing
- Remote job
Based in San Francisco, California, Turing is the world’s leading research accelerator for frontier AI labs and a trusted partner for global enterprises deploying advanced AI systems. Turing supports customers in two ways: first, by accelerating frontier research with high-quality data, advanced training pipelines, plus top AI researchers who specialize in software engineering, logical reasoning, STEM, multilinguality, multimodality, and agents; and second, by applying that expertise to help enterprises transform AI from proof of concept into proprietary intelligence with systems that perform reliably, deliver measurable impact, and drive lasting results on the P&L. Ideal Background This role is ideal for engineers who have built production systems at companies like Google, Microsoft, Apple, Amazon, Meta, or similar high-scale engineering organizations. We especially welcome graduates from leading programs such as Harvard, Columbia, Princeton, Yale, University of Pennsylvania, and comparable institutions — though exceptional experience and skill always take precedence over pedigree. Project Overview What Does a Typical Day Look Like? Evaluate and refine AI-generated code across backend and frontend contexts to ensure that it is efficient, scalable, and reliable. Collaborate with cross-functional teams to enhance AI-driven coding solutions against industry performance benchmarks. Build agents that can verify the quality of the code and identify error patterns across full-stack applications. Hypothesize on steps in the software engineering cycle (prototyping, architecture design, API design, production implementation, launch, experiments, monitoring, operational maintenance) and evaluate model capabilities on them. Design verification mechanisms that can automatically verify a solution to a software engineering task. Required Skills Several years of software engineering experience (3 years or more) Experience deploying scalable, production-grade software using modern languages and tools. Deep understanding of software architecture, design, development, debugging, and code quality/review assessment. Excellent oral and written communication skills for clear, structured evaluation rationales. Commitment: flexible engagement, minimum 10 hrs/week, up to 40 hrs/week Type: Contractor (no medical/paid leave) Duration: 1 month (potential extensions based on performance and fit) Location: Candidates must be based in the United States #J-18808-Ljbffr Turing
$204k - $259k
...Senior Machine Learning Engineer – VLM/LLM Evaluation Waymo is an autonomous driving technology company... ...programming skills (eg: Python, C/C++) We prefer:... ...to all eligible US based employees. Benefits for... ...the role can be performed remote, the specific salary range...Remote workSeniorFull timeTemporary work$80 - $100 per hour
...locations. For US applicants:... ...and evaluation pipelines used... ...real software engineering work: Design... ...~ Expert Python — clean, performant... ...implementing LLM coding... ...to Have Senior or Lead-level... ...Location: Fully remote — work from anywhere... ...$80–$100/hr based on location...Remote workSeniorFull timeContract workFor contractors- ...Senior Python Developer Join us at Provectus as part of a team dedicated... ...services, and data engineering, and we take pride in... ...Python services and LLM features (including... ...Experience with LLM evaluation frameworks (RAGAS, custom... ...; ~100% remote — with flexible hours...Remote workSeniorFlexible hours
- ...About Us At FunCodeNet, we are a global... ...connecting top-tier engineers with leading companies... ...**This is fully remote opportunities (candidates must be based in the US, Canada,... ...using Java and/or Python, Go * Develop high... ...*Nice to have** * LLM frameworks (LangChain...Remote workSeniorHourly payContract work
$19 - $20 per hour
...A tech consulting firm is seeking a Senior Software Engineer specializing in Python to evaluate and validate LLM performance in real-world scenarios. This remote position involves analyzing GitHub issues, developing software solutions, and collaborating with researchers...Remote workSeniorHourly payFor contractors- ...help develop Gradio ( a Python framework that lets users... ...experiences for Python-based web applications. Adapting to evolving engineering challenges and contributing... ...interested in joining us, but don't tick every... ...flexible working hours and remote options. We offer health...Remote workSeniorTemporary workWork at officeLocal areaFlexible hours
- ...Senior AI Engineer - LLM & Agentic Systems (Python) Remote Role Overview We are seeking a senior AI engineer... ...and cloud platforms Establish evaluation, reliability, and performance... ...skills with experience building and scaling cloud-based applicationsRemote workSenior
- ...Senior AI / LLM Backend Engineer Publicis Sapient is a digital transformation partner... ...You'll work hands-on with Python, LLMs, and cloud-native architectures... ...RAG pipelines and agent-based workflows. Develop and... ..., you may contact us at ****@*****.***....Remote workSenior
$50 per hour
...Role Overview As a Senior Python Engineer focused on LLM Evaluation, you will play a crucial role in creating innovative datasets for training and benchmarking... ...duration of 1 month, with potential extensions based on performance and fit. Candidates must be based in...Remote jobSeniorFor contractors10 hours per weekFlexible hours$132k - $149k
...Discord is looking for a Technical Sourcer to support their engineering roles by activating passive candidates. This role involves partnering... ...field, and proficiency in advanced sourcing techniques. The US base salary for this role ranges from $132,000 to $149,000 annually...Remote workSenior$160k - $190k
...email list. We’re a remote-first company... ...Reporting to the Senior Product Designer Manager... ..., this role is based out of San... ...our Vietnam based engineering team. If you have... ...off 401k match (US employees only) Flodesk... ...conduct performance evaluations. To monitor work eligibility...Remote workSeniorWork at officeRelocationHome officeFlexible hours3 days per week$150k - $250k
...Senior AI Engineer, Agentic Evaluation & V&V Remote At Slingshot Aerospace, we're on a... ...experience ~ Strong Python engineering skills... ..., or protocol-based integrations ~ Experience... ...workflows (e.g., LLM-based agents,... ...Location: Remote, US Salary: $150,000...Remote workSeniorFull timeCurrently hiring- ...Location: based in the USA (remote) About Xata... ...platform that helps engineering teams ship... ...Europe and the US (around 25 people... ...and the teams evaluating or adopting... ...programming ability in Python, TypeScript/... ...and walk a senior DBA through... ...architectures, LLM integrations...Remote workSeniorContract workLocal areaHome office
- ...Senior Data Scientist, LLM Buenos Aires, Argentina Xometry powers... ..., fine-tune, and evaluate Visual Language... ...Collaborate with data engineering and machine learning... ...visualization tools (such as Python, Jupyter Notebooks,... ...status. For US based roles: Xometry...Remote workSeniorContract work
- ...looking for a Senior Software Engineer to contribute... ...development and evaluation of AI training... ...coding tasks based on real‑world... ...annotation, or LLM evaluation projects... ...in a remote, asynchronous,... ...Experience with Python‑heavy workflows... ...codebase. Additional US Timezone...Remote workSenior
- ...Senior Agentic AI Software Engineer - Hybrid US Job ID: 497243 Posted since:... ...-time, Hybrid (Remote/Office), Permanent... ...reliability, evaluation, and long-term... ...human-in-the-loop) based on problem... ...experience building LLM-powered... ...proficiency in Python (or similar agent...Remote workSeniorPermanent employmentFull timeWork at officeLocal areaWork from home
- ...skilled Machine Learning Engineer who specializes in... ...) for automated evaluation and quality... ...pipelines for evaluating LLM outputs. Develop... ...consistency using LLM-based evaluations... ...programming skills in Python and SQL. ~ Experience... ...About us Grid Dynamics...Remote workWork at officeFlexible hours
- ...currently looking for a Senior Software Engineer (Python/.NET) in United... ...of Python-based services. Design... ...development tools, LLM-based workflows,... ...: ~ Fully remote work within the United... ...personal data to evaluate your candidacy... ...data is processed, please contact us....Remote jobSeniorFull timeFlexible hoursShift work
$80 per hour
...Very LLC is looking for a Senior Software Engineer to join their remote team in the United States. This role involves... ...will have extensive experience in Python backend development, microservices,... ...possibly approximating full-time hours based on performance. #J-18808-Ljbffr...Remote workSeniorHourly payFull timeContract work$89.44k - $143.1k
...Senior Health Integration Engineer - Remote based in US 4 days ago Be among the first 25 applicants Overview We are seeking an experienced Health Integration... ...operations—primarily through code (ObjectScript, Embedded Python, or similar), minimizing reliance on BPL. Interpret...Remote workSeniorRelocation packageFlexible hours- ...staffing and recruiting agency that pairs remote work with top-tier talent. We help individuals... ..., bookings, and foot traffic for service-based businesses where conversion paths are... ...environments Benefits Remote Working for US Company Competitive Salary #J-18808-Ljbffr...Remote workSeniorLocal area
- ...Job Title: Senior Helpdesk Technician (US Based) Location: Remote (US) Engagement Type: Contractor Department: Global IT Reports To: IT Operations Manager Overview We are seeking an experienced Senior Helpdesk Technician...Remote workSeniorFor contractors
$125k - $156.3k
...Senior Software Engineer (Data & AI Solutions) US Remote Job Summary Natera is seeking an experienced... ...proficiency in Python, SQL, and... ...development tools (e.g., LLM copilots) to... ...lifecycle Ability to evaluate emerging data and... ...semantic search, or RAG‑based architectures is a...Remote workSeniorImmediate startWorldwide- ...currently looking for a Senior Python Software Engineer, ML Developer Tools... ...a collaborative, remote-first environment, you... ...technologies into Python-based applications to... ...your personal data to evaluate your candidacy and share... ...your data is processed, please contact us....Remote jobSeniorFull timeWork at officeWorldwideFlexible hours
$200k - $225k
...Senior Python Engineer Remote - USA The Role We're looking for a Senior Software... ...training, inference, and evaluation Partner closely with ML... ...backend systems ML and LLM capabilities are seamlessly... ...& Benefits The expected base salary range for this role...Remote workSeniorFlexible hours- ...Engineering At Lawhive We are a team of 40 engineers and researchers... ...every day in the UK, US, and beyond. There are... ...Role We're looking for a Senior Python Engineer to join our AI Engineering... .... Nice to Have LLM Observability & Evaluation – familiarity with tools...Remote workSenior
$80 per hour
...specialists with project-based AI opportunities... ...on testing, evaluating, and improving AI... ...project is suited for a Senior Python developer with... ...experience as a Software Engineer (primarily Python)... ...understand with LLM many coding... ...by Toloka AI)Fully remote and flexible participation...Remote workSeniorPermanent employmentTemporary workFreelanceFlexible hours$40 per hour
...position focuses on building LLM evaluation and training datasets... ...realistic software engineering challenges. The role... ...engineering tasks based on public repository histories... ...following languages: Python, JavaScript, Java, Go,... ...position is fully remote. Open to candidates...Remote jobSeniorContract workFor contractors$119k - $179.75k
...Candidates must be a US Citizen or Green Card... ...This position is remote within the Greater Boston... .... We're looking for a Senior Python Engineer to join our ever evolving... ...reasonably expect to offer based on the role's... ...opportunity employer. We evaluate qualified applicants without...Remote workSeniorFull time$146k - $277k
...Medical Director/Medical Director - US & Canada Based Updated: Yesterday Location: USA-MS-Remote Job ID: 25107998 Description... ...studies. Interacts with senior management, customers, and project... ...materials, and site feasibility evaluations. Provides medical input into data...Remote workSeniorWorldwide
Do you want to receive more vacancies?
Subscribe and receive similar vacancies to Remote Senior Python Engineer - LLM Evaluation (US-based). Be the first to apply!
- python programmer Chicago, IL
- python developer data analytics Chicago, IL
- python engineer Chicago, IL
- python developer Chicago, IL
- senior python developer Chicago, IL
- backend python developer Chicago, IL
- python developer remote Chicago, IL
- full stack / python developer (remote) Chicago, IL
- remote education consultant Chicago, IL
- remote nonprofit Chicago, IL


