Sign up to access all features of our service.
  • Job search
  • Favorites
  • Create a CV
    New
  • Salaries
  • Subscriptions

AI Evaluation Engineer - Ruby

Biz Tech Analytics

Review and validate AI benchmark tasks in real‑world repositories. Run containerized test suites, verify patches and solutions, debug flaky tests, and assess quality for difficulty, reproducibility, and correctness. Require strong CLI, Docker, Linux skills. Required Candidate profile 3–10 years in production software in the relevant language. Hands‑on Docker and Linux. Experience with testing, debugging, and dependency tools. Can navigate large codebases and deliver clear feedback. #J-18808-Ljbffr Biz Tech Analytics

Vacancy posted 2 days ago
Similar jobs that could be interesting for youBased on the AI Evaluation Engineer - Ruby in New York, NY vacancy
  • Biz Tech Analytics is looking for a skilled candidate to review and validate AI benchmark tasks in real-world repositories. Responsibilities include running containerized test suites, verifying patches, debugging flaky tests, and ensuring correctness and quality. The ideal... 
    Ruby

    Biz Tech Analytics

    New York, NY
    4 days ago
  • $40 per hour

    A leading AI security solutions provider is seeking experienced cybersecurity professionals to evaluate AI-generated security content and solve real-world technical problems. In this remote role, candidates will require over 2 years of cybersecurity experience, fluency... 
    Suggested
    Hourly pay
    Remote work

    DataAnnotation

    Brooklyn, NY
    5 days ago
  • Feedinkoo is looking for experienced software engineers to contribute their expertise to AI evaluation and improvement projects. This role involves applying software engineering skills to assess AI systems with no prior AI experience required, as training will be provided... 
    Suggested
    Remote job
    Freelance

    Feedinkoo

    New York, NY
    4 days ago
  • $60 per hour

    A leading AI development company seeks proficient programmers to contribute to cutting-edge AI systems while enjoying fully remote...  ...include solving coding problems, writing high-quality code, and evaluating AI-generated code. Ideal candidates should have a bachelor’s degree... 
    Suggested
    Remote job
    Hourly pay
    Flexible hours

    DataAnnotation

    New York, NY
    2 days ago
  • A leading technology company is seeking AI Developers to design and implement AI/ML features in a remote role. Responsibilities include...  ...building AI services, developing data pipelines, and creating evaluations for LLMs. Ideal candidates have mid-senior experience in AI... 
    Suggested
    Remote job
    Hourly pay

    Rex.zone

    New York, NY
    2 days ago
  • $150k - $250k

    Slingshot Aerospace is seeking a Senior AI Engineer to join our AI and Data Science team. This role involves developing evaluation frameworks for intelligent systems in mission-critical space operations. Responsibilities include maintaining our validation SDK, designing... 
    Remote job

    Slingshot Aerospace

    New York, NY
    2 days ago
  • Biz Tech Consultants is seeking an AI Evaluation Engineer to work part-time. The role involves reviewing and validating AI benchmark tasks in Python repositories, running Docker-based test suites, and debugging flaky tests. The ideal candidate should have strong skills... 
    Remote job
    Part time
    Freelance
    Work from home

    Biz Tech Consultants

    New York, NY
    2 days ago
  •  ...Member of Technical Staff - Evals located in New York, NY, where you will ensure our AI-powered features are high quality and reliable. Your responsibilities include designing evaluation frameworks, building automated tests, and developing tools for seamless evaluations.... 

    Entendre

    New York, NY
    4 days ago
  • $40 per hour

    A cybersecurity firm in the United States seeks experienced professionals to evaluate AI-generated security content and solve technical cybersecurity problems. In this remote role, you'll work on your own schedule, contributing to the next generation of AI security systems... 
    Hourly pay
    Remote work

    DataAnnotation

    New York, NY
    2 days ago
  • AI Evaluation Engineer Shell / Bash Scripting Biz-Tech Analytics offers specialized, human‑in‑the‑loop annotation, RLHF, and dataset creation—leveraging our network of vetted developers, STEM professionals, linguists, and medical experts. Review AI benchmark tasks in... 
    Part time
    Freelance

    Biz Tech Analytics

    New York, NY
    5 days ago
  • $40 per hour

    A technology consulting company is seeking experienced cybersecurity professionals for a remote position. In this role, you'll evaluate AI-generated cybersecurity content, solve technical problems, and provide valuable feedback to enhance AI models. Ideal candidates should... 
    Hourly pay
    Remote work

    DataAnnotation

    New York, NY
    2 days ago
  • $40 per hour

    A technology company is seeking a Web Engineer to train AI models remotely, offering either full-time or part-time options. The role involves evaluating AI chatbots' coding challenges and assessing the quality of AI outputs. Applicants should be fluent in English and detail... 
    Remote job
    Hourly pay
    Full time
    Part time

    DataAnnotation

    New York, NY
    2 days ago
  • $165k - $200k

     ...Opportunity Carrot Fertility is hiring an Applied AI Engineer to join our Enterprise Technology team....  ...‑based access control, prompt hygiene, evaluation frameworks, and observability throughout...  ...and TypeScript/JavaScript; comfort with Ruby a plus. You write clean, well‑structured... 
    Ruby
    Temporary work
    Worldwide

    Pantera Capital

    New York, NY
    3 days ago
  •  ...> International Corporate Contractor - AI Engineer International Corporate Contractor - AI...  ...could be Mandarin or English, Python or Ruby. Languages offer unique views of the world...  ...‑free experience. Model Benchmarking & Evaluation: Develop model evals to identify the... 
    Ruby
    Contract work
    For contractors
    Remote work

    Helloglobo

    New York, NY
    2 days ago
  • $40 per hour

    A cybersecurity AI company is looking for experienced professionals to evaluate AI-generated security content and solve technical problems to improve AI systems. This role requires a minimum of 2 years in cybersecurity and allows for remote work with flexible scheduling... 
    Hourly pay
    Remote work
    Flexible hours

    DataAnnotation

    New York, NY
    2 days ago
  • $40 per hour

    A cybersecurity company is seeking experienced professionals to evaluate AI-generated security content and solve technical issues. The role requires 2+ years in the field, coding experience, and strong analytical skills. Candidates should have a bachelor's degree and cybersecurity... 
    Hourly pay
    Full time
    Part time
    Remote work

    DataAnnotation

    New York, NY
    2 days ago
  • $40 per hour

    A leading AI training company is seeking a DevOps Engineer to join their remote team. In this role, you will provide coding challenges to AI chatbots and evaluate their outputs for correctness and performance. Candidates should be proficient in Python or JavaScript and... 
    Remote job
    Hourly pay

    DataAnnotation

    New York, NY
    4 days ago
  • Akraya, Inc. is looking for a skilled individual to develop AI-powered tools that support the creation and evaluation of agent skills and prompts. This remote role involves building user-friendly interfaces while integrating existing systems. The successful candidate will... 
    Remote job

    Akraya, Inc.

    New York, NY
    2 days ago
  • A cybersecurity solutions company is looking for experienced cybersecurity professionals to help train AI models. You will work remotely to evaluate AI-generated security content, solve technical problems, and provide feedback to improve AI systems. Ideal candidates have... 
    Remote job
    Flexible hours

    DataAnnotation

    New York, NY
    2 days ago
  • $40 per hour

    A leading AI-focused cybersecurity firm is looking for experienced cybersecurity professionals to evaluate AI-generated content and solve technical security problems. In this flexible role, you can work remotely and choose your projects. Ideal candidates will have 2+ years... 
    Remote job
    Hourly pay
    Flexible hours

    DataAnnotation

    New York, NY
    2 days ago
  • $40 per hour

    A cybersecurity firm is looking for experienced professionals to join its team. This remote role involves evaluating AI-generated security content and solving technical cybersecurity problems. Candidates should have over 2 years of hands-on experience in cybersecurity and... 
    Remote job
    Hourly pay
    Flexible hours

    DataAnnotation

    Brooklyn, NY
    5 days ago
  • A leading cybersecurity firm is seeking experienced professionals to evaluate AI-generated cybersecurity content and solve technical security problems. You will play a significant role in training AI models, providing critical feedback, and improving system accuracy. This... 
    Remote job
    Flexible hours

    DataAnnotation

    New York, NY
    5 days ago
  • $60 per hour

    A cutting-edge AI company is seeking quantitative professionals to evaluate AI-generated quantitative analysis and provide feedback that shapes future AI systems. The role offers flexible remote work, allowing professionals to choose projects and set their schedules. Candidates... 
    Remote job
    Flexible hours

    DataAnnotation

    New York, NY
    5 days ago
  • $400 per month

     ...looking for contributors to support a Frontier Code Agents project with a leading AI research lab. The role involves using AI coding agents to evaluate and improve machine learning and AI engineering tasks. Ideal candidates should have 2+ years of machine learning engineering... 
    Remote job

    Obsidian

    New York, NY
    4 days ago
  • A leading AI development company is looking for experienced quantitative professionals to join their remote team. In this role, you will evaluate AI-generated quantitative work, design quantitative problems, and provide valuable feedback for future AI models. Candidates... 
    Remote job
    Extra income
    Full time
    Flexible hours

    DataAnnotation

    New York, NY
    3 days ago
  • $30 per hour

    A technology company specializing in AI training is seeking a Cloud Platform Engineer to join their team. The position is remote and focuses on training and evaluating AI chatbots. The ideal candidate should be proficient in programming languages like JavaScript, Python... 
    Remote job
    Hourly pay
    Flexible hours

    DataAnnotation

    Brooklyn, NY
    5 days ago
  • $40 per hour

    A tech company in the United States is seeking a Web Platform Engineer to enhance AI chatbot models. This role involves evaluating AI outputs and requires proficiency in programming languages such as Python or JavaScript. Candidates should be detail-oriented, experienced... 
    Remote job
    Hourly pay
    Flexible hours

    DataAnnotation

    Brooklyn, NY
    4 days ago
  •  ...and unstructured data at scale including calls, chats, emails, and AI agent responses to build autonomous systems that adapt, detect,...  ...for unstructured text and audio. Strong coding background in Ruby or Python, with experience in data pipelines, AWS or GCP infrastructure... 
    Ruby

    Rulebase: The voice fraud defense system for financial servi...

    New York, NY
    3 days ago
  • $115k - $145k

    Beam Benefits is seeking a Senior Software Engineer to lead code delivery and improve employee benefits access through...  ...over 6 years of experience, is proficient in React and Ruby on Rails, and is familiar with AI tools to enhance development workflows. This position offers... 
    Ruby
    Remote job

    Beam Benefits

    New York, NY
    2 days ago
  • $80 per hour

    A leading AI project company in the United States is seeking experienced software developers for project-based opportunities. Candidates must have a degree in Computer Science or Software Engineering, at least 5 years of experience in Python, and Full-Stack development... 
    Contract work
    Flexible hours

    Mindrift

    Brooklyn, NY
    2 days ago

Do you want to receive more vacancies?

Subscribe and receive similar vacancies to AI Evaluation Engineer - Ruby. Be the first to apply!