Sign up to access all features of our service.
  • Job search
  • Favorites
  • Create a CV
    New
  • Salaries
  • Subscriptions

Remote Senior Python Engineer - LLM Evaluation (US-based)

Turing

Denver, CO
  • Remote job

Based in San Francisco, California, Turing is the world’s leading research accelerator for frontier AI labs and a trusted partner for global enterprises deploying advanced AI systems. Turing supports customers in two ways: first, by accelerating frontier research with high-quality data, advanced training pipelines, plus top AI researchers who specialize in software engineering, logical reasoning, STEM, multilinguality, multimodality, and agents; and second, by applying that expertise to help enterprises transform AI from proof of concept into proprietary intelligence with systems that perform reliably, deliver measurable impact, and drive lasting results on the P&L. Ideal Background This role is ideal for engineers who have worked at the frontier of AI — at companies like OpenAI, NVIDIA, Databricks, Palantir, Snowflake, or similar organizations pushing the boundaries of intelligent systems. We especially welcome graduates from top computer science programs such as Stanford, MIT, Carnegie Mellon, UC Berkeley, Georgia Tech, and comparable institutions — though exceptional experience and skill always take precedence over pedigree. Responsibilities Evaluate and refine AI-generated code across backend and frontend contexts to ensure that it is efficient, scalable, and reliable. Collaborate with cross‑functional teams to enhance AI-driven coding solutions against industry performance benchmarks. Build agents that can verify the quality of the code and identify error patterns across full‑stack applications. Hypothesize on steps in the software engineering cycle (prototyping, architecture design, API design, production implementation, launch, experiments, monitoring, operational maintenance) and evaluate model capabilities on them. Design verification mechanisms that can automatically verify a solution to a software engineering task. Required Skills Several years of software engineering experience (3 years or more) Experience deploying scalable, production‑grade software using modern languages and tools. Deep understanding of software architecture, design, development, debugging, and code quality/review assessment. Excellent oral and written communication skills for clear, structured evaluation rationales. Commitment: flexible engagement, minimum 10 hrs/week, up to 40 hrs/week Type: Contractor (no medical/paid leave) Duration: 1 month (potential extensions based on performance and fit) Location: Candidates must be based in the United States #J-18808-Ljbffr Turing

Vacancy posted 16 hours ago
Similar jobs that could be interesting for youBased on the Remote Senior Python Engineer - LLM Evaluation (US-based) in Denver, CO vacancy
  • $204k - $259k

     ...Senior Machine Learning Engineer – VLM/LLM Evaluation Waymo is an autonomous driving technology company...  ...programming skills (eg: Python, C/C++) We prefer:...  ...to all eligible US based employees. Benefits for...  ...the role can be performed remote, the specific salary range... 
    Remote work
    Senior
    Full time
    Temporary work

    Waymo

    Mountain View, CA
    16 hours ago
  • $80 - $100 per hour

     ...locations. For US applicants:...  ...and evaluation pipelines used...  ...real software engineering work: Design...  ...~ Expert Python — clean, performant...  ...implementing LLM coding...  ...to Have Senior or Lead-level...  ...Location: Fully remote — work from anywhere...  ...$80–$100/hr based on location... 
    Remote work
    Senior
    Full time
    Contract work
    For contractors

    G2i Inc.

    United States
    2 days ago
  • Location: based in the USA (remote) About Xata Xata...  ...platform that helps engineering teams ship...  ...and the US (around 25 people...  ...and the teams evaluating or adopting Xata...  ...programming ability in Python, TypeScript/...  ...and walk a senior DBA through...  ..., LLM integrations... 
    Remote job
    Senior
    Contract work
    Local area
    Home office

    Xata

    Miami, FL
    2 days ago
  • $19 - $20 per hour

     ...A tech consulting firm is seeking a Senior Software Engineer specializing in Python to evaluate and validate LLM performance in real-world scenarios. This remote position involves analyzing GitHub issues, developing software solutions, and collaborating with researchers... 
    Remote work
    Senior
    Hourly pay
    For contractors

    Crossing Hurdles

    New York, NY
    16 hours ago
  •  ...Senior AI Engineer - LLM & Agentic Systems (Python) Remote Role Overview We are seeking a senior AI engineer...  ...and cloud platforms Establish evaluation, reliability, and performance...  ...skills with experience building and scaling cloud-based applications
    Remote work
    Senior

    RIT Solutions, Inc.

    Atlanta, GA
    16 hours ago
  • $213k - $263k

     ...Senior ML Engineer, LLM / VLM Distillation Waymo is an autonomous...  ...learning, and robust evaluation. This role follows...  ...experience. The expected base salary range for this...  ...-time position across US locations is listed...  ...role can be performed remote, the specific salary... 
    Remote work
    Senior
    Full time

    Waymo

    Mountain View, CA
    1 day ago
  •  ...About Us At FunCodeNet, we are a global...  ...connecting top-tier engineers with leading companies...  ...**This is fully remote opportunities (candidates must be based in the US, Canada,...  ...using Java and/or Python, Go * Develop high...  ...*Nice to have** * LLM frameworks (LangChain... 
    Remote work
    Senior
    Hourly pay
    Contract work

    FunCodeNet

    San Diego, CA
    24 days ago
  • $204k - $259k

     ...Senior Machine Learning Engineer, Perception LLM/VLM Waymo is an autonomous...  ...and rigorously evaluate metrics and methodologies...  ...proficiency in Python and deep...  ...The expected base salary range for...  ...across US locations is listed...  ...can be performed remote, the specific salary... 
    Remote work
    Senior
    Full time

    Waymo

    Mountain View, CA
    3 days ago
  • $160k - $190k

     ...email list. We’re a remote-first company...  ...Reporting to the Senior Product Designer Manager...  ..., this role is based out of San...  ...our Vietnam based engineering team. If you have...  ...off 401k match (US employees only) Flodesk...  ...conduct performance evaluations. To monitor work eligibility... 
    Remote work
    Senior
    Work at office
    Relocation
    Home office
    Flexible hours
    3 days per week

    Flodesk

    San Francisco, CA
    2 days ago
  • $184k - $287.5k

    Senior ML Evaluation Engineer - Autonomous Vehicles page is loaded##...  ...: US, CA, Santa Clara: US, GA, Remote: US, DC, Remote: US...  ...transition from rule-based to learned evaluation...  ...experience building LLM/VLM-based pipelines...  ...reviewable code in Python and C++* Experience... 
    Remote work
    Senior

    NVIDIA Corporation

    Santa Clara, CA
    3 days ago
  • $80 per hour

     ...Very LLC is looking for a Senior Software Engineer to join their remote team in the United States. This role involves...  ...will have extensive experience in Python backend development, microservices,...  ...possibly approximating full-time hours based on performance. #J-18808-Ljbffr... 
    Remote work
    Senior
    Hourly pay
    Full time
    Contract work

    VERY

    New York, NY
    16 hours ago
  •  ...staffing and recruiting agency that pairs remote work with top-tier talent. We help individuals...  ..., bookings, and foot traffic for service-based businesses where conversion paths are...  ...environments Benefits Remote Working for US Company Competitive Salary #J-18808-Ljbffr... 
    Remote work
    Senior
    Local area

    Paired Inc

    New York, NY
    16 hours ago
  • $127.2k - $209.8k

     ...and passion of all of us—from design and engineering to the manufacturing...  ...Systems (DS) regional based US business, focusing...  ...Contracting team, Senior Sales Leaders, Cross...  ...and work-life balance. Remote or field-based positions...  ...Employer. We evaluate applicants without regard... 
    Remote work
    Senior
    Hourly pay
    Contract work
    Work at office
    Local area
    Worldwide
    Shift work

    Becton Dickinson & Company

    Sparks Glencoe, MD
    16 hours ago
  •  ...Senior Helpdesk Technician (US Based) We are seeking an experienced Senior Helpdesk Technician to join our Global IT team. In this role, you'll act as a trusted technical partner, delivering high-quality support to our distributed workforce and administering our core... 
    Remote work
    Senior

    Saviance

    United States
    4 days ago
  • $60 - $70 per hour

     ...Machine Learning Engineer to join a high-...  ...on advancing LLM evaluation, NLP, and AI-driven...  ...: Mid-level to Senior Key...  ...and build LLM-based evaluation frameworks...  ...datasets using Python, SQL, and scalable...  ...is a fully remote position. Application...  ...partner with us for our scale,... 
    Remote work
    Contract work
    Temporary work
    3 days per week

    TEKsystems

    Seattle, WA
    4 days ago
  • $55k - $151.47k

     ...people in data and analytics engineering focus on leveraging advanced...  ...with PwC standards. As a Senior Associate you will analyze complex...  ...platform Executing LLM evaluation frameworks using defined metrics...  ...anticipated application deadlines: #LI-Remote #LI-Hybrid... 
    Remote work
    Senior
    Full time
    Work experience placement
    H1b

    PwC

    San Francisco, CA
    2 days ago
  • Senior Agentic AI Software Engineer - Hybrid US Job ID: 497243 Posted since:...  ...-time, Hybrid (Remote/Office), Permanent...  ...reliability, evaluation, and long-term...  ...human-in-the-loop) based on problem...  ...experience building LLM-powered...  ...proficiency in Python (or similar agent... 
    Remote work
    Senior
    Permanent employment
    Full time
    Work at office
    Local area
    Work from home

    Siemens Mobility

    Wilsonville, OR
    3 days ago
  • $80 per hour

     ...specialists with project-based AI opportunities...  ...on testing, evaluating, and improving AI...  ...project is suited for a Senior Python developer with...  ...experience as a Software Engineer (primarily Python)...  ...understand with LLM many coding...  ...Toloka AI) Fully remote and flexible participation... 
    Remote work
    Senior
    Permanent employment
    Temporary work
    Freelance
    Flexible hours

    Mind Rift

    Houston, TX
    2 days ago
  • $119k - $179.75k

     ...Candidates must be a US Citizen or Green Card...  ...This position is remote within the Greater Boston...  .... We're looking for a Senior Python Engineer to join our ever evolving...  ...reasonably expect to offer based on the role's...  ...opportunity employer. We evaluate qualified applicants without... 
    Remote work
    Senior
    Full time

    Worldpay

    Boston, MA
    16 hours ago
  • $170k - $260k

     ...clearance. Are you a Senior Python Software Engineer who is ready for a...  ...then dropped off on a remote contract and never...  ...principles. These provide us the capability to...  ...manual system security evaluation and authorization...  ...sustainment of various Python based REST end points,... 
    Remote work
    Senior
    Full time
    Contract work
    Work from home
    Relocation package
    Shift work

    GliaCell Technologies LLC

    Linthicum Heights, MD
    16 hours ago
  • $175k - $245k

     ...Senior Software Engineer II - Applied AI and Evaluations (Remote Eligible) -REMOTE, USA- For over 20...  ...the intersection of LLM evaluation, prompt...  ...quality Strong Python skills; comfortable...  ...and productivity US employees are automatically...  ...a competitive base salary range for... 
    Remote work
    Senior
    Full time
    Temporary work
    Local area
    Immediate start

    Smartsheet

    New York, NY
    16 hours ago
  • $144k - $244k

     ...Senior Client Executive Manufacturing (US based) Date: Feb 17, 2026 Company: NTT DATA Services NTT DATA strives to hire exceptional, innovative and passionate...  ...Executive to join our team. This position will work remotely from your home office located within the Dallas area.... 
    Remote work
    Senior
    Temporary work
    Work at office
    Work from home
    Home office
    Flexible hours

    NTT DATA

    Milwaukee, WI
    4 days ago
  • An innovative AI company based in the US is seeking a Mid-Senior level developer. The role...  ...developing and maintaining evaluation servers, implementing...  ...should have 4+ years of Python experience, solid API development...  ...CLI. This part-time, remote opportunity offers... 
    Remote work
    Part time

    Mind Rift

    Houston, TX
    7 days ago
  •  ...customer success, product, and engineering. We are a diverse and...  ...technology obsessed. Want to join us? Step 1: Apply & Showcase Your...  ...Workload Enjoy the flexibility of remote work and select how much you want...  ...connect with our New‑York based team, however a lot of our operations... 
    Remote work
    Senior
    Contract work
    Freelance
    Work at office

    Visory

    New York, NY
    2 days ago
  • $99.6k - $174k

     ...Senior Full Stack Engineer, AI Platform & Agents Build...  .... Location: US/Canada, Hybrid or Remote - Work Hours: Must...  ....js, React, Python, LangChain/LangGraph...  ...Apply current LLM patterns (RAG,...  ...and evaluation ~ Backend development...  ...range listed is based on primary... 
    Remote work
    Senior
    Work at office
    2 days per week

    Wolters Kluwer

    Chicago, IL
    4 days ago
  •  ...Miami and Oslo, and remote employees across the US and Europe,...  .... The role As a Senior Product Manager...  ...a core team of engineers and designers to...  ...role that must be based in Massachusetts...  ...Fluency with core LLM concepts and...  ...retrieval, and evaluation, and the judgment... 
    Remote work
    Senior
    Flexible hours

    Chooose

    New York, NY
    16 hours ago
  • $120k - $150k

    You are here: Home / Careers / Senior Assessor (CMMC) | US Based Apply Now Salary: $120,000 - $150,000 Work Type: Remote - 20% Travel to Client Sites Location: US Based (Preference for Eastern or Central Time Zones) About ControlCase ControlCase is a global leader in... 
    Remote work
    Senior
    Work from home
    Flexible hours

    ControlCase, LLC

    Fairfax, VA
    3 days ago
  •  ...Role: Senior AI Engineer - Agentic Systems and LLM Client Location: Mason, OH 100% Remote Job Description: We are seeking...  ...deployment—leveraging Python, modern LLM...  ...platforms Establish evaluation, reliability, and performance...  ...and scaling cloud-based applications... 
    Remote work
    Senior

    Vytwo

    Prosper, TX
    1 day ago
  • $70k - $85k

     ...Senior Insights Analyst, Financial Services - Remote, US-based only Escalent is an award-winning data analytics and advisory firm that helps clients understand human and market behaviors to navigate disruption. As catalysts of progress for more than 40 years, our strategies... 
    Remote work
    Senior
    Flexible hours

    Escalent

    New York, NY
    4 days ago
  • $75k - $90k

     ...company is seeking a Sr. AI Engineer to lead the creation of AI solutions...  ...expertise in full-stack Python development, dataset creation, and LLM-based AI applications. Ideal...  ...healthcare, and paid time off. Remote work is not available, US citizenship or residency is required... 
    Remote work
    Senior

    Invonto LLC.

    Bridgewater, MA
    4 days ago

Do you want to receive more vacancies?

Subscribe and receive similar vacancies to Remote Senior Python Engineer - LLM Evaluation (US-based). Be the first to apply!