Security Engineer for AI Model Evaluation
$85 per hourSaidGig
This position offers a unique opportunity to contribute to a cutting-edge AI research initiative focused on enhancing frontier AI coding models. As a key contributor, you will engage in structured technical assessments that evaluate and improve the capabilities of AI coding agents within realistic security engineering workflows.
Key Responsibilities- Utilize frontier AI coding agents to execute and assess complex security engineering tasks.
- Review model-generated code and architectures to identify vulnerabilities, security risks, and implementation flaws.
- Detect bugs, edge cases, and security-related failure modes.
- Compare outputs from various frontier models, analyzing their strengths and weaknesses.
- Apply professional engineering judgment to realistic security engineering scenarios.
This is a sprint-based project requiring commitment in 12-24 hour intervals based on client needs.
CompensationCompensation is set at $400 per accepted task, with typical tasks taking approximately 2, 3 hours to complete after an initial ramp-up period.
Eligibility- Minimum of 2 years of professional experience in security engineering.
- Familiarity with application security, cloud security, infrastructure security, vulnerability management, or security tooling.
- Regular use of AI coding agents such as Cursor, Claude Code, Codex, Windsurf, Gemini CLI, or similar tools.
- Proven ability to identify flaws in model-generated code and architectures.
- Experience securing production systems is preferred.
$180k - $270k
...Finance, Counsel, Hardware, Information Security, Platform Infrastructure Engineering, and more — provide support and... ...is an equal opportunity employer evaluating all employees and job applicants without... ...has been reviewed. Use of AI in Our Hiring Process We may use automated...SuggestedFull timeLocal areaRemote workFlexible hours$165k - $215k
..., developer-oriented Senior Security Engineer to focus on securing our software... ...mobile applications, APIs, AI/CV platforms, and cloud-... ...security reviews and threat modeling * Lead secure SDLC and... ...decision tool (AEDT) to assess or evaluate your candidacy for...SuggestedFull timeTemporary workWork at officeLocal area$301.75k - $355k
...the only vertically integrated AI infrastructure company built... ...The Senior Director for the Model LifeCycle team will undertake... ...management: versioning, lineage, evaluation, and reproducible fine‑tuning... ...degree in Computer Science, Engineering, or a related field. ~10+...SuggestedTemporary work- ...dynamic project aimed at enhancing AI systems'' capabilities in... ...cybersecurity professionals, including security analysts, penetration testers,... ...will play a crucial role in evaluating and improving AI performance... ...to an hourly compensation model based on sustained quality and...SuggestedRemote jobHourly pay
- ...Overview Engage with leading AI labs as a telecommunications expert... ...and regulatory compliance to evaluate and improve AI performance on... ...networks (4G/LTE, 5G NR, RAN engineering, core network). ~ Wireline/... ...to an hourly compensation model based on sustained quality and...SuggestedHourly payNight shift
$171k - $273k
...We’re searching for a Staff Security Engineer, Enterprise Security Architecture... ..., and target-state operating model across enterprise and backend... ...emerging needs related to AI security, automation, infrastructure... ...security tooling strategy by evaluating architectural fit,...Full timeWork at officeLocal area3 days per week- Nuclear professionals can apply their expertise to evaluate and enhance AI models in their field through a flexible, part-time engagement. This role involves assessing AI-generated content related to nuclear science and providing structured feedback to improve the model...Part timeRemote workFlexible hours
$50 per hour
...of tax preparation, compliance, and corporate tax workflows within AI systems. Your expertise with TurboTax, ProConnect, GoSystem Tax RS, and ONESOURCE will be instrumental in creating and evaluating realistic tax scenarios. Key Responsibilities Create tax workflow...Contract workFor contractorsFreelanceRemote work$75 per hour
...Overview Experienced legal professionals are invited to evaluate and enhance the performance of AI systems in handling legal research, contracts,... ...realistic legal scenarios, assess AI outputs, and improve model accuracy. Key Responsibilities Create legal workflow...Contract workFor contractorsFreelanceRemote work- Medical professionals can leverage their expertise to contribute to AI research projects focused on enhancing model performance in their field. This role involves evaluating AI-generated content, assessing its relevance to your professional experience, and providing structured...Temporary workPart timeRemote workFlexible hours
- ...development. As the leader in AI code review and verification,... ...autonomous agents is reliable, secure, and maintainable. Integrating... ...As a Senior Security Engineer in Austin, you will provide senior... ...strategic security plan, including evaluating and introducing new tools and...RelocationFlexible hours
$100k - $228k
...Security Engineer - Governance Risk Compliance New York, NY; Palo Alto, CA; Washington, D.C.... ...About xAI xAI's mission is to create AI systems that can accurately understand... ...assessments, scenario analyses, and proactive evaluations of emerging threats, certifications,...Permanent employmentTemporary work$171k - $273k
...We're searching for a Staff Security Engineer to join our Enterprise Security... ...plans. Experience evaluating security posture and identifying... ...and identity-centric security models (BeyondCorp-style or similar... ...Familiarity with securing AI/ML platforms or applications...Full timeWork at officeLocal area3 days per weekEarly shift$165k - $215k
...Metropolis is seeking a Senior Security Engineer to establish and lead a... ...programming * Experience leveraging AI tools to transform static... ...members hold to our office-first model, which requires employees to... ...tool (AEDT) to assess or evaluate your candidacy for employment...Full timeTemporary workWork at officeLocal areaRemote work$80 per hour
...professionals can leverage their expertise in mineral title and upstream accounting workflows to contribute to AI research projects. This role involves evaluating AI-generated content and providing feedback to enhance AI''s understanding of upstream land management and...Hourly payContract workPart timeRemote workFlexible hours$350k
...role in shaping the future of AI-powered legal reasoning. This... ...of large language models, agentic systems, and legal workflows... ...the development of rigorous evaluation frameworks to measure and enhance... ...with AI researchers and engineers to evaluate and improve legal...Remote jobFull time$75 per hour
Cartographers and photogrammetrists can apply their expertise to evaluate AI models and enhance their understanding of geographical data. In this role, you will utilize your professional experience to assess AI-generated content, provide structured feedback, and contribute...Remote jobFlexible hours$50 per hour
...Physics problems to assess and enhance the capabilities of large language models. Your expertise will be pivotal in developing innovative evaluation benchmarks and providing insights that drive AI research forward. Key Responsibilities: Design and solve challenging...For contractorsRemote work$70 - $100 per hour
...Join a leading AI lab''s cutting-edge research team to be at the core of the AI... ...contribute to a project supporting a frontier-model evaluation effort focused on rigorous scientific... ..., Computer Science, Electrical Engineering, Mechanical Engineering, Civil Engineering...Contract workWeekday work$100 per hour
...Expert, you will play a pivotal role in the development and evaluation of advanced AI systems designed for capital markets compliance and market... ...generating high-quality training data for next-generation AI models. This position is ideal for professionals experienced with...$80 - $120 per hour
...As an expert Evaluator in Education, you will play a crucial role in reviewing and assessing AI-generated work products such as documents, spreadsheets, and slide decks. Your deep subject-matter expertise will be essential in ensuring accuracy, rigor, and domain quality...Hourly payWork at officeRemote work$80 - $120 per hour
...In this role, expert Evaluators in Clinical, Biomedical, or Pharma will review and assess AI-generated work products, including documents, spreadsheets, and slide decks, ensuring accuracy, rigor, and domain quality. Your deep subject-matter expertise will be crucial in...Remote jobHourly payWork at office$80 - $120 per hour
...In this role, you will leverage your expertise in healthcare operations to evaluate and assess AI-generated work products, including documents, spreadsheets, and slide decks. Your primary responsibility will be to ensure the accuracy, rigor, and quality of these outputs...Remote jobHourly payWork at office$150 per hour
Aerospace Engineering Professionals can apply their expertise to evaluate AI models and enhance their understanding of aerospace tasks and terminology. This role involves assessing AI-generated content related to your field and providing structured feedback to improve the...Hourly payPart timeRemote work$80 - $120 per hour
...This role involves evaluating and assessing AI-generated work products, including documents, spreadsheets, and slide decks, specifically within the context of nonprofit, philanthropy, and community programs. As an expert evaluator, you will leverage your deep subject-matter...Remote jobHourly payWork at office$60 - $90 per hour
...A unique opportunity awaits in a leading AI lab''s GenAI team, where your expertise in lighting and cinematography will play a pivotal role in shaping the evaluation of advanced AI models. This position allows you to influence how lighting is perceived and judged in AI...Hourly payPart timeFreelance$75 per hour
...Records Managers, including Archivists, Information Managers, Collections Managers, and Librarians, play a crucial role in evaluating AI models by leveraging their professional expertise. In this position, you will assess AI-generated content relevant to your field, providing...Remote workFlexible hours$80 per hour
Materials science and engineering professionals can leverage their expertise in computational modeling and materials simulation to support AI research through flexible, hourly contract work. This role involves evaluating AI-generated content and providing critical feedback...Remote jobHourly payContract workPart timeFlexible hours$80 - $120 per hour
...Role Overview As a Cybersecurity / IT GRC Evaluator, you will leverage your expertise to review and assess AI-generated work products, including documents, spreadsheets, and slide decks. Your role is crucial in ensuring the accuracy, rigor, and overall quality of these...Hourly payWork at officeRemote work$50 per hour
...will collaborate with researchers to enhance the performance of AI models in various financial domains. This role invites professionals... ...AI experience is necessary. Key Responsibilities : Evaluate LLM models to identify areas where financial models underperform...Hourly payFreelanceRemote work10 hours per weekFlexible hours
Do you want to receive more vacancies?
Subscribe and receive similar vacancies to Security Engineer for AI Model Evaluation. Be the first to apply!
- security support engineer United States
- entry level security engineer United States
- senior application security engineer United States
- security engineer intern United States
- staff security engineer United States
- offensive security engineer United States
- IT security engineer United States
- junior network security engineer United States
- cloud security engineer United States
- network security engineer United States

