
Remote Senior Python Engineer - LLM Evaluation (US-based)

Turing

Chicago, IL
  • Remote job

Based in San Francisco, California, Turing is the world’s leading research accelerator for frontier AI labs and a trusted partner for global enterprises deploying advanced AI systems. Turing supports customers in two ways: first, by accelerating frontier research with high-quality data, advanced training pipelines, plus top AI researchers who specialize in software engineering, logical reasoning, STEM, multilinguality, multimodality, and agents; and second, by applying that expertise to help enterprises transform AI from proof of concept into proprietary intelligence with systems that perform reliably, deliver measurable impact, and drive lasting results on the P&L.

Ideal Background

This role is ideal for engineers who have built production systems at companies like Google, Microsoft, Apple, Amazon, Meta, or similar high-scale engineering organizations. We especially welcome graduates from leading programs such as Harvard, Columbia, Princeton, Yale, University of Pennsylvania, and comparable institutions, though exceptional experience and skill always take precedence over pedigree.

Project Overview

What Does a Typical Day Look Like?
  • Evaluate and refine AI-generated code across backend and frontend contexts to ensure that it is efficient, scalable, and reliable.
  • Collaborate with cross-functional teams to enhance AI-driven coding solutions against industry performance benchmarks.
  • Build agents that can verify the quality of the code and identify error patterns across full-stack applications.
  • Hypothesize on steps in the software engineering cycle (prototyping, architecture design, API design, production implementation, launch, experiments, monitoring, operational maintenance) and evaluate model capabilities on them.
  • Design verification mechanisms that can automatically verify a solution to a software engineering task.

Required Skills
  • Several years of software engineering experience (3 years or more)
  • Experience deploying scalable, production-grade software using modern languages and tools.
  • Deep understanding of software architecture, design, development, debugging, and code quality/review assessment.
  • Excellent oral and written communication skills for clear, structured evaluation rationales.

Commitment: flexible engagement, minimum 10 hrs/week, up to 40 hrs/week
Type: Contractor (no medical/paid leave)
Duration: 1 month (potential extensions based on performance and fit)
Location: Candidates must be based in the United States

Vacancy posted 5 days ago
Similar jobs that could be interesting for you
Based on the Remote Senior Python Engineer - LLM Evaluation (US-based) in Chicago, IL vacancy
  • $204k - $259k

     ...Senior Machine Learning Engineer – VLM/LLM Evaluation Waymo is an autonomous driving technology company...  ...programming skills (eg: Python, C/C++) We prefer:...  ...to all eligible US based employees. Benefits for...  ...the role can be performed remote, the specific salary range... 
    Remote work
    Senior
    Full time
    Temporary work

    Waymo

    Kirkland, WA
    5 days ago
  • $80 - $100 per hour

     ...locations. For US applicants:...  ...and evaluation pipelines used...  ...real software engineering work: Design...  ...~ Expert Python — clean, performant...  ...implementing LLM coding...  ...to Have Senior or Lead-level...  ...Location: Fully remote — work from anywhere...  ...$80–$100/hr based on location... 
    Remote work
    Senior
    Full time
    Contract work
    For contractors

    G2i Inc.

    United States
    2 days ago
  •  ...Senior Python Developer Join us at Provectus as part of a team dedicated...  ...services, and data engineering, and we take pride in...  ...Python services and LLM features (including...  ...Experience with LLM evaluation frameworks (RAGAS, custom...  ...; ~100% remote — with flexible hours... 
    Remote work
    Senior
    Flexible hours

    Provectus

    United States
    4 days ago
  •  ...About Us At FunCodeNet, we are a global...  ...connecting top-tier engineers with leading companies...  ...**This is fully remote opportunities (candidates must be based in the US, Canada,...  ...using Java and/or Python, Go * Develop high...  ...*Nice to have** * LLM frameworks (LangChain... 
    Remote work
    Senior
    Hourly pay
    Contract work

    FunCodeNet

    San Diego, CA
    a month ago
  • $19 - $20 per hour

     ...A tech consulting firm is seeking a Senior Software Engineer specializing in Python to evaluate and validate LLM performance in real-world scenarios. This remote position involves analyzing GitHub issues, developing software solutions, and collaborating with researchers... 
    Remote work
    Senior
    Hourly pay
    For contractors

    Crossing Hurdles

    New York, NY
    5 days ago
  •  ...help develop Gradio ( a Python framework that lets users...  ...experiences for Python-based web applications. Adapting to evolving engineering challenges and contributing...  ...interested in joining us, but don't tick every...  ...flexible working hours and remote options. We offer health... 
    Remote work
    Senior
    Temporary work
    Work at office
    Local area
    Flexible hours

    Hugging Face

    United States
    5 days ago
  •  ...Senior AI Engineer - LLM & Agentic Systems (Python) Remote Role Overview We are seeking a senior AI engineer...  ...and cloud platforms Establish evaluation, reliability, and performance...  ...skills with experience building and scaling cloud-based applications
    Remote work
    Senior

    RIT Solutions, Inc.

    Atlanta, GA
    5 days ago
  •  ...Senior AI / LLM Backend Engineer Publicis Sapient is a digital transformation partner...  ...You'll work hands-on with Python, LLMs, and cloud-native architectures...  ...RAG pipelines and agent-based workflows. Develop and...  ..., you may contact us at ****@*****.***.... 
    Remote work
    Senior

    MSLGROUP

    United States
    6 days ago
  • $50 per hour

     ...Role Overview As a Senior Python Engineer focused on LLM Evaluation, you will play a crucial role in creating innovative datasets for training and benchmarking...  ...duration of 1 month, with potential extensions based on performance and fit. Candidates must be based in... 
    Remote job
    Senior
    For contractors
    10 hours per week
    Flexible hours

    SaidGig

    Remote
    5 days ago
  • $132k - $149k

     ...Discord is looking for a Technical Sourcer to support their engineering roles by activating passive candidates. This role involves partnering...  ...field, and proficiency in advanced sourcing techniques. The US base salary for this role ranges from $132,000 to $149,000 annually... 
    Remote work
    Senior

    Ultimate LLC

    San Francisco, CA
    5 days ago
  • $160k - $190k

     ...email list. We’re a remote-first company...  ...Reporting to the Senior Product Designer Manager...  ..., this role is based out of San...  ...our Vietnam based engineering team. If you have...  ...off 401k match (US employees only) Flodesk...  ...conduct performance evaluations. To monitor work eligibility... 
    Remote work
    Senior
    Work at office
    Relocation
    Home office
    Flexible hours
    3 days per week

    Flodesk

    San Francisco, CA
    2 days ago
  • $150k - $250k

     ...Senior AI Engineer, Agentic Evaluation & V&V Remote At Slingshot Aerospace, we're on a...  ...experience ~ Strong Python engineering skills...  ..., or protocol-based integrations ~ Experience...  ...workflows (e.g., LLM-based agents,...  ...Location: Remote, US Salary: $150,000... 
    Remote work
    Senior
    Full time
    Currently hiring

    Slingshot Aerospace

    United States
    4 days ago
  •  ...Location: based in the USA (remote) About Xata...  ...platform that helps engineering teams ship...  ...Europe and the US (around 25 people...  ...and the teams evaluating or adopting...  ...programming ability in Python, TypeScript/...  ...and walk a senior DBA through...  ...architectures, LLM integrations... 
    Remote work
    Senior
    Contract work
    Local area
    Home office

    Xata

    Miami, FL
    a month ago
  •  ...Senior Data Scientist, LLM Buenos Aires, Argentina Xometry powers...  ..., fine-tune, and evaluate Visual Language...  ...Collaborate with data engineering and machine learning...  ...visualization tools (such as Python, Jupyter Notebooks,...  ...status. For US based roles: Xometry... 
    Remote work
    Senior
    Contract work

    Xometry

    United States
    4 days ago
  •  ...looking for a Senior Software Engineer to contribute...  ...development and evaluation of AI training...  ...coding tasks based on real‑world...  ...annotation, or LLM evaluation projects...  ...in a remote, asynchronous,...  ...Experience with Python‑heavy workflows...  ...codebase. Additional US Timezone... 
    Remote work
    Senior

    KAKE

    New York, NY
    2 days ago
  •  ...Senior Agentic AI Software Engineer - Hybrid US Job ID: 497243 Posted since:...  ...-time, Hybrid (Remote/Office), Permanent...  ...reliability, evaluation, and long-term...  ...human-in-the-loop) based on problem...  ...experience building LLM-powered...  ...proficiency in Python (or similar agent... 
    Remote work
    Senior
    Permanent employment
    Full time
    Work at office
    Local area
    Work from home

    Siemens Mobility

    Wilsonville, OR
    2 days ago
  •  ...skilled Machine Learning Engineer who specializes in...  ...) for automated evaluation and quality...  ...pipelines for evaluating LLM outputs. Develop...  ...consistency using LLM-based evaluations...  ...programming skills in Python and SQL. ~ Experience...  ...About us Grid Dynamics... 
    Remote work
    Work at office
    Flexible hours

    Grid Dynamics Holdings

    United States
    6 days ago
  •  ...currently looking for a Senior Software Engineer (Python/.NET) in United...  ...of Python-based services. Design...  ...development tools, LLM-based workflows,...  ...: ~ Fully remote work within the United...  ...personal data to evaluate your candidacy...  ...data is processed, please contact us.... 
    Remote job
    Senior
    Full time
    Flexible hours
    Shift work

    jobgether

    United States
    5 days ago
  • $80 per hour

     ...Very LLC is looking for a Senior Software Engineer to join their remote team in the United States. This role involves...  ...will have extensive experience in Python backend development, microservices,...  ...possibly approximating full-time hours based on performance... 
    Remote work
    Senior
    Hourly pay
    Full time
    Contract work

    VERY

    New York, NY
    5 days ago
  • $89.44k - $143.1k

     ...Senior Health Integration Engineer - Remote based in US 4 days ago Be among the first 25 applicants Overview We are seeking an experienced Health Integration...  ...operations—primarily through code (ObjectScript, Embedded Python, or similar), minimizing reliance on BPL. Interpret... 
    Remote work
    Senior
    Relocation package
    Flexible hours

    Conifer Health Solutions

    Frisco, TX
    1 day ago
  •  ...staffing and recruiting agency that pairs remote work with top-tier talent. We help individuals...  ..., bookings, and foot traffic for service-based businesses where conversion paths are...  ...environments Benefits Remote Working for US Company Competitive Salary... 
    Remote work
    Senior
    Local area

    Paired Inc

    New York, NY
    5 days ago
  •  ...Job Title: Senior Helpdesk Technician (US Based) Location: Remote (US) Engagement Type: Contractor Department: Global IT Reports To: IT Operations Manager Overview We are seeking an experienced Senior Helpdesk Technician... 
    Remote work
    Senior
    For contractors

    Saviance

    Boston, MA
    4 days ago
  • $125k - $156.3k

     ...Senior Software Engineer (Data & AI Solutions) US Remote Job Summary Natera is seeking an experienced...  ...proficiency in Python, SQL, and...  ...development tools (e.g., LLM copilots) to...  ...lifecycle Ability to evaluate emerging data and...  ...semantic search, or RAG‑based architectures is a... 
    Remote work
    Senior
    Immediate start
    Worldwide

    Natera

    New York, NY
    2 days ago
  •  ...currently looking for a Senior Python Software Engineer, ML Developer Tools...  ...a collaborative, remote-first environment, you...  ...technologies into Python-based applications to...  ...your personal data to evaluate your candidacy and share...  ...your data is processed, please contact us.... 
    Remote job
    Senior
    Full time
    Work at office
    Worldwide
    Flexible hours

    jobgether

    United States
    7 days ago
  • $200k - $225k

     ...Senior Python Engineer Remote - USA The Role We're looking for a Senior Software...  ...training, inference, and evaluation Partner closely with ML...  ...backend systems ML and LLM capabilities are seamlessly...  ...& Benefits The expected base salary range for this role... 
    Remote work
    Senior
    Flexible hours

    Wizard

    United States
    6 days ago
  •  ...Engineering At Lawhive We are a team of 40 engineers and researchers...  ...every day in the UK, US, and beyond. There are...  ...Role We're looking for a Senior Python Engineer to join our AI Engineering...  .... Nice to Have LLM Observability & Evaluation – familiarity with tools... 
    Remote work
    Senior

    Lawhive

    United States
    5 days ago
  • $80 per hour

     ...specialists with project-based AI opportunities...  ...on testing, evaluating, and improving AI...  ...project is suited for a Senior Python developer with...  ...experience as a Software Engineer (primarily Python)...  ...understand with LLM many coding...  ...by Toloka AI) Fully remote and flexible participation... 
    Remote work
    Senior
    Permanent employment
    Temporary work
    Freelance
    Flexible hours

    Mind Rift

    Houston, TX
    2 days ago
  • $40 per hour

     ...position focuses on building LLM evaluation and training datasets...  ...realistic software engineering challenges. The role...  ...engineering tasks based on public repository histories...  ...following languages: Python, JavaScript, Java, Go,...  ...position is fully remote. Open to candidates... 
    Remote job
    Senior
    Contract work
    For contractors

    SaidGig

    Remote
    4 days ago
  • $119k - $179.75k

     ...Candidates must be a US Citizen or Green Card...  ...This position is remote within the Greater Boston...  .... We're looking for a Senior Python Engineer to join our ever evolving...  ...reasonably expect to offer based on the role's...  ...opportunity employer. We evaluate qualified applicants without... 
    Remote work
    Senior
    Full time

    Worldpay

    Boston, MA
    5 days ago
  • $146k - $277k

     ...Medical Director/Medical Director - US & Canada Based Updated: Yesterday Location: USA-MS-Remote Job ID: 25107998 Description...  ...studies. Interacts with senior management, customers, and project...  ...materials, and site feasibility evaluations. Provides medical input into data... 
    Remote work
    Senior
    Worldwide

    Syneos Health Inc

    Jackson, MS
    1 day ago
