AI Evaluation Engineer - Ruby
Biz Tech Analytics
Review and validate AI benchmark tasks in real‑world repositories. Run containerized test suites, verify patches and solutions, debug flaky tests, and assess quality for difficulty, reproducibility, and correctness. Require strong CLI, Docker, Linux skills. Required Candidate profile 3–10 years in production software in the relevant language. Hands‑on Docker and Linux. Experience with testing, debugging, and dependency tools. Can navigate large codebases and deliver clear feedback. #J-18808-Ljbffr Biz Tech Analytics
- Biz Tech Analytics is looking for a skilled candidate to review and validate AI benchmark tasks in real-world repositories. Responsibilities include running containerized test suites, verifying patches, debugging flaky tests, and ensuring correctness and quality. The ideal...Ruby
$40 per hour
A leading AI security solutions provider is seeking experienced cybersecurity professionals to evaluate AI-generated security content and solve real-world technical problems. In this remote role, candidates will require over 2 years of cybersecurity experience, fluency...SuggestedHourly payRemote work- Feedinkoo is looking for experienced software engineers to contribute their expertise to AI evaluation and improvement projects. This role involves applying software engineering skills to assess AI systems with no prior AI experience required, as training will be provided...SuggestedRemote jobFreelance
$60 per hour
A leading AI development company seeks proficient programmers to contribute to cutting-edge AI systems while enjoying fully remote... ...include solving coding problems, writing high-quality code, and evaluating AI-generated code. Ideal candidates should have a bachelor’s degree...SuggestedRemote jobHourly payFlexible hours- A leading technology company is seeking AI Developers to design and implement AI/ML features in a remote role. Responsibilities include... ...building AI services, developing data pipelines, and creating evaluations for LLMs. Ideal candidates have mid-senior experience in AI...SuggestedRemote jobHourly pay
$150k - $250k
Slingshot Aerospace is seeking a Senior AI Engineer to join our AI and Data Science team. This role involves developing evaluation frameworks for intelligent systems in mission-critical space operations. Responsibilities include maintaining our validation SDK, designing...Remote job- Biz Tech Consultants is seeking an AI Evaluation Engineer to work part-time. The role involves reviewing and validating AI benchmark tasks in Python repositories, running Docker-based test suites, and debugging flaky tests. The ideal candidate should have strong skills...Remote jobPart timeFreelanceWork from home
- ...Member of Technical Staff - Evals located in New York, NY, where you will ensure our AI-powered features are high quality and reliable. Your responsibilities include designing evaluation frameworks, building automated tests, and developing tools for seamless evaluations....
$40 per hour
A cybersecurity firm in the United States seeks experienced professionals to evaluate AI-generated security content and solve technical cybersecurity problems. In this remote role, you'll work on your own schedule, contributing to the next generation of AI security systems...Hourly payRemote work- AI Evaluation Engineer Shell / Bash Scripting Biz-Tech Analytics offers specialized, human‑in‑the‑loop annotation, RLHF, and dataset creation—leveraging our network of vetted developers, STEM professionals, linguists, and medical experts. Review AI benchmark tasks in...Part timeFreelance
$40 per hour
A technology consulting company is seeking experienced cybersecurity professionals for a remote position. In this role, you'll evaluate AI-generated cybersecurity content, solve technical problems, and provide valuable feedback to enhance AI models. Ideal candidates should...Hourly payRemote work$40 per hour
A technology company is seeking a Web Engineer to train AI models remotely, offering either full-time or part-time options. The role involves evaluating AI chatbots' coding challenges and assessing the quality of AI outputs. Applicants should be fluent in English and detail...Remote jobHourly payFull timePart time$165k - $200k
...Opportunity Carrot Fertility is hiring an Applied AI Engineer to join our Enterprise Technology team.... ...‑based access control, prompt hygiene, evaluation frameworks, and observability throughout... ...and TypeScript/JavaScript; comfort with Ruby a plus. You write clean, well‑structured...RubyTemporary workWorldwide- ...> International Corporate Contractor - AI Engineer International Corporate Contractor - AI... ...could be Mandarin or English, Python or Ruby. Languages offer unique views of the world... ...‑free experience. Model Benchmarking & Evaluation: Develop model evals to identify the...RubyContract workFor contractorsRemote work
$40 per hour
A cybersecurity AI company is looking for experienced professionals to evaluate AI-generated security content and solve technical problems to improve AI systems. This role requires a minimum of 2 years in cybersecurity and allows for remote work with flexible scheduling...Hourly payRemote workFlexible hours$40 per hour
A cybersecurity company is seeking experienced professionals to evaluate AI-generated security content and solve technical issues. The role requires 2+ years in the field, coding experience, and strong analytical skills. Candidates should have a bachelor's degree and cybersecurity...Hourly payFull timePart timeRemote work$40 per hour
A leading AI training company is seeking a DevOps Engineer to join their remote team. In this role, you will provide coding challenges to AI chatbots and evaluate their outputs for correctness and performance. Candidates should be proficient in Python or JavaScript and...Remote jobHourly pay- Akraya, Inc. is looking for a skilled individual to develop AI-powered tools that support the creation and evaluation of agent skills and prompts. This remote role involves building user-friendly interfaces while integrating existing systems. The successful candidate will...Remote job
- A cybersecurity solutions company is looking for experienced cybersecurity professionals to help train AI models. You will work remotely to evaluate AI-generated security content, solve technical problems, and provide feedback to improve AI systems. Ideal candidates have...Remote jobFlexible hours
$40 per hour
A leading AI-focused cybersecurity firm is looking for experienced cybersecurity professionals to evaluate AI-generated content and solve technical security problems. In this flexible role, you can work remotely and choose your projects. Ideal candidates will have 2+ years...Remote jobHourly payFlexible hours$40 per hour
A cybersecurity firm is looking for experienced professionals to join its team. This remote role involves evaluating AI-generated security content and solving technical cybersecurity problems. Candidates should have over 2 years of hands-on experience in cybersecurity and...Remote jobHourly payFlexible hours- A leading cybersecurity firm is seeking experienced professionals to evaluate AI-generated cybersecurity content and solve technical security problems. You will play a significant role in training AI models, providing critical feedback, and improving system accuracy. This...Remote jobFlexible hours
$60 per hour
A cutting-edge AI company is seeking quantitative professionals to evaluate AI-generated quantitative analysis and provide feedback that shapes future AI systems. The role offers flexible remote work, allowing professionals to choose projects and set their schedules. Candidates...Remote jobFlexible hours$400 per month
...looking for contributors to support a Frontier Code Agents project with a leading AI research lab. The role involves using AI coding agents to evaluate and improve machine learning and AI engineering tasks. Ideal candidates should have 2+ years of machine learning engineering...Remote job- A leading AI development company is looking for experienced quantitative professionals to join their remote team. In this role, you will evaluate AI-generated quantitative work, design quantitative problems, and provide valuable feedback for future AI models. Candidates...Remote jobExtra incomeFull timeFlexible hours
$30 per hour
A technology company specializing in AI training is seeking a Cloud Platform Engineer to join their team. The position is remote and focuses on training and evaluating AI chatbots. The ideal candidate should be proficient in programming languages like JavaScript, Python...Remote jobHourly payFlexible hours$40 per hour
A tech company in the United States is seeking a Web Platform Engineer to enhance AI chatbot models. This role involves evaluating AI outputs and requires proficiency in programming languages such as Python or JavaScript. Candidates should be detail-oriented, experienced...Remote jobHourly payFlexible hours- ...and unstructured data at scale including calls, chats, emails, and AI agent responses to build autonomous systems that adapt, detect,... ...for unstructured text and audio. Strong coding background in Ruby or Python, with experience in data pipelines, AWS or GCP infrastructure...Ruby
$115k - $145k
Beam Benefits is seeking a Senior Software Engineer to lead code delivery and improve employee benefits access through... ...over 6 years of experience, is proficient in React and Ruby on Rails, and is familiar with AI tools to enhance development workflows. This position offers...RubyRemote job$80 per hour
A leading AI project company in the United States is seeking experienced software developers for project-based opportunities. Candidates must have a degree in Computer Science or Software Engineering, at least 5 years of experience in Python, and Full-Stack development...Contract workFlexible hours
Do you want to receive more vacancies?
Subscribe and receive similar vacancies to AI Evaluation Engineer - Ruby. Be the first to apply!
- ai engineer New York, NY
- machine learning ai engineer New York, NY
- ai research engineer New York, NY
- ai ml engineer New York, NY
- senior ai engineer New York, NY
- ai prompt engineer New York, NY
- ai developer New York, NY
- ai engineer remote New York, NY
- remote ruby New York, NY
- junior ruby on rails New York, NY

