Sign up to access all features of our service.
  • Job search
  • Favorites
  • Create a CV
    New
  • Salaries
  • Subscriptions

Software Engineer: Agentic Evaluation

$181.1k - $318.4k

Apple Inc.

Senior Software Engineer: Agentic Evaluation Cupertino, California, United States — Machine Learning and AI At Apple, great ideas have a way of becoming great products, services, and customer experiences very quickly. Bring passion and dedication to your job and there's no telling what you could accomplish. Do you want to help measure and improve the quality of Siri across the devices, features, and experiences people rely on every day? Apple's Agentic Evaluation Engineering organization builds the infrastructure that determines how Siri's quality is measured, trusted, and improved. You'll join a team focused on expanding what that platform can reach: the devices and environments we evaluate on, the features and interaction modalities we exercise, and the realistic, repeatable conditions we stage to ground each evaluation. The surface area is large and growing. You'll have real autonomy in how you tackle it, and you'll build infrastructure the team can rely on as priorities shift. Description In this role you'll contribute to the infrastructure, tooling, and pipelines that let us evaluate Siri reliably and at scale. You'll have meaningful autonomy in how you get there, and the work will move across several areas of expansion as priorities evolve. The specific platforms, frameworks, and components will change over time, so we're looking for someone who can transition smoothly across them and bring strong evaluation and systems engineering fundamentals to whatever the team needs next. Responsibilities Extending evaluation capabilities to new devices, platforms, and runtime environments, with designs that favor portability over any single target Supporting the evaluation of new Siri features and interaction modalities, working from ambiguous early requirements toward concrete, automated coverage Diagnosing failures across the stack, from environment provisioning through pipeline execution to scoring, enabling auto-diagnostics and driving durable fixes Contributing to architecture decisions for the team's evaluation systems Partnering across engineering, infrastructure, and program teams to align on interfaces, priorities, and shared standards Minimum Qualifications Strong programming skills in one or more compiled languages (Swift, C++ or Objective‑C). Python scripting skills for tooling and automation Solid understanding of computer science fundamentals Ability to quickly learn new technologies and adapt to evolving requirements Excellent communication skills and ability to collaborate across teams M.S. or B.S. in Computer Science, Machine Learning, or related field (or equivalent experience) Preferred Qualifications Experience staging, provisioning, or controlling test or evaluation environments to produce repeatable, deterministic conditions Experience evaluating ML, LLM or agent-based systems, including familiarity with metrics, scoring methodology, or trajectory and outcome analysis Experience designing or operating test infrastructure at scale, such as device provisioning, environment restore, warm pools, or continuous integration systems Proficiency with Python and Swift in a production setting A track record of approaching problems flexibly and cutting through ambiguity, adapting your approach to reach the right outcome and setting a clear path when requirements are not yet defined A talent for focusing and simplifying, stripping away what is not essential and distilling complex decisions down to the factors that matter A history of collaborating across teams and communicating effectively with both technical and program audiences Compensation and Benefits Base pay range for this role is between $181,100 and $318,400, depending on skills, qualifications, experience, and location. Employees may gain equity through Apple’s discretionary employee stock programs, discounted stock purchase plans, and other stock benefits. Additional benefits include comprehensive medical and dental coverage, retirement benefits, discounts on Apple products and services, and educational expense reimbursement. The role may be eligible for discretionary bonuses, commission payments, and relocation assistance. Equal Opportunity Apple is an equal opportunity employer that is committed to inclusion and diversity. We seek to promote equal opportunity for all applicants without regard to race, color, religion, sex, sexual orientation, gender identity, national origin, disability, Veteran status, or other legally protected characteristics. Accessibility Apple believes accessibility is a fundamental human right and strives to embed it in culture, benefits, and digital tools. Learn more about accessibility in Apple’s workplace. Reasonable accommodations are available for job applicants. #J-18808-Ljbffr Apple Inc.

Vacancy posted 3 days ago
Similar jobs that could be interesting for youBased on the Software Engineer: Agentic Evaluation in Cupertino, CA vacancy
  • $147.4k - $272.1k

     ...and AI We're a team at Apple building software that helps shape the next generation of...  ...infrastructure — including a strong focus on how we evaluate and measure the quality of what we ship....  .... Description You'll work alongside engineers, designers, and researchers to design... 
    Suggested
    Relocation package

    Apple Inc.

    Cupertino, CA
    2 days ago
  • Apple Inc. is seeking a Senior Software Engineer in Cupertino to improve Siri's evaluation infrastructure. The ideal candidate has strong programming skills, particularly in Swift and Python, and is committed to enhancing the quality of Siri across devices. In this role... 
    Suggested

    Apple Inc.

    Cupertino, CA
    3 days ago
  • $181.1k - $272.1k

    Cupertino, California, United States Software and Services We're a team at...  ...work sits at the intersection of agentic AI, developer tools, and evaluation infrastructure, driven by a belief...  ...Description You’ll work alongside engineers, designers, and researchers to design... 
    Suggested
    Relocation

    Apple Inc.

    Cupertino, CA
    11 hours ago
  • $170k - $265k

     ...employee. This foundation powers Glean's agentic capabilities - AI agents that automate...  ..., Google Gemini) and internal evaluation frameworks to improve quality and predictability...  .... You are: ~3+ years of software engineering experience building production distributed... 
    Suggested
    Home office
    Flexible hours
    3 days per week

    Glean.info

    Mountain View, CA
    1 day ago
  •  ...experiences. About the Organization The Evaluation team builds and evolves the evaluation...  ...-level results into clear feedback for engineering and leadership, and help accelerate...  ...to introspect autonomous driving software performance atinterfaces across the autonomy... 
    Suggested
    Local area
    Remote work
    Work from home
    Relocation
    Relocation package
    Flexible hours

    General Motors

    Sunnyvale, CA
    5 days ago
  • We are looking for a Senior Software Engineer to help build NeMo Platform, NVIDIA’s product for developing, evaluating, deploying, and operating AI systems at scale. This role...  ...confidence. Our roadmap is increasingly focused on agentic development and automated agent... 

    NVIDIA Gruppe

    Santa Clara, CA
    2 days ago
  • $181.1k - $318.4k

    Senior Software Engineer: Agentic Systems Cupertino, California, United States Software and Services Join a passionate, dynamic team responsible for creating experiences that enable users to get help with—and discover all the things they love about—their Apple products... 
    Relocation

    Apple Inc.

    Cupertino, CA
    2 days ago
  • $207k - $300k

    Staff Software Engineer, Agentic Data and Evals Location: Sunnyvale, CA, USA. You will design, develop, test, deploy, maintain, and enhance large‑scale software solutions for data generation and evaluation of Generative AI models. The role requires versatile engineering... 
    Full time

    Google Inc.

    Sunnyvale, CA
    3 days ago
  • $244.9k - $321.1k

     ...experiences, and intelligent software come together at massive...  ...team, you will help define how agentic AI systems are designed, built...  ...cases. This is a hands-on engineering role for someone who treats...  ...architecture through orchestration, evaluation, deployment, observability,... 
    Work at office
    Local area
    Remote work
    Monday to Thursday
    Flexible hours

    Roku

    San Jose, CA
    5 days ago
  • $143k - $191k

     ...Eightfold is building the next generation of Agentic AI products that help millions of...  ...talent decisions. We’re looking for a Software Engineer to design and build highly scalable, user...  ...with ML engineers to integrate, evaluate, and productionize models Write clean... 
    Work at office
    Remote work
    Flexible hours

    Eightfold LLC

    Santa Clara, CA
    3 days ago
  • $174k - $252k

    Senior Software Engineer, Generative AI, Agentic Creatives Optimization, YouTube Ads corporate_fare YouTube place Mountain View, CA, USA Required Qualifications...  ...with ML infrastructure (e.g., model deployment, model evaluation, optimization, data processing, debugging). 1 year of... 
    Full time

    Google Inc.

    Mountain View, CA
    3 days ago
  • $181.1k - $318.4k

    Senior Software Engineer, On-Device Health Agentic Systems Cupertino, California, United States Hardware We are seeking a senior iOS Software Engineer with deep experience building applications powered by Large Language Models (LLMs). This is a hands‑on development role... 
    Relocation

    Apple Inc.

    Cupertino, CA
    1 day ago
  • $184k - $287.5k

    We are looking for a skilled Agentic AI Software Engineer to join our team. The ideal candidate is passionate about building autonomous, multi‑...  ...by NVIDIA inference services Develop test harnesses and evaluation pipelines to validate agent behavior, tool‑call correctness... 

    NVIDIA Gruppe

    Santa Clara, CA
    2 days ago
  •  ...California, seeks a qualified candidate to design and evaluate AI systems with a focus on agentic functionalities. The ideal individual will hold a degree in computer science or engineering and have strong software engineering skills in Python. A competitive salary range... 

    Robert Bosch Group

    Sunnyvale, CA
    1 day ago
  • $165k - $180k

    Robert Bosch Group is seeking an AI Systems Engineer in Sunnyvale, California to design and evaluate agentic AI systems. The role focuses on building robust AI frameworks...  ...science or engineering and possess strong software engineering skills in Python. The position offers... 

    Robert Bosch Group

    Sunnyvale, CA
    1 day ago
  • $287.8k - $328.5k

     ...Distinguished AI Engineer (Agentic AI Platform Infrastructure) At Capital One, we are creating...  ...develop, test, deploy, and support AI software components including foundation model...  ...similarity search, guardrails, model evaluation, experimentation, governance, and observability... 
    Local area

    Capital One National Association

    San Jose, CA
    11 hours ago
  • $188k - $300k

     ...the transformation to AI-enabled software-defined vehicles. Traditional...  ...seeking a highly motivated Staff AI Engineer to define and develop next-generation agentic AI applications for software-...  ..., RAG, context engineering, and evaluation). Design and implement robust... 
    Work at office
    Worldwide
    Flexible hours
    Shift work

    Sonatus

    Sunnyvale, CA
    24 days ago
  • $147.4k - $272.1k

    Cupertino, California, United States Software and Services At Apple, great ideas have a...  ...Learning. We're a dynamic team seeking engineers who are enthusiastic about influencing the...  ...collect on-device signals, and advance the evaluation pipeline to the next level.... 
    Relocation

    Apple Inc.

    Cupertino, CA
    2 days ago
  • $174k - $252k

    Senior Software Engineer, Infrastructure, CoreOS Agentic Engineering Sunnyvale, CA, USA Mid Experience driving progress, solving problems, and mentoring more junior team members; deeper expertise and applied knowledge within relevant area. Apply Bachelor’s degree or... 
    Full time
    Worldwide

    Google Inc.

    Sunnyvale, CA
    4 days ago
  • $188k - $259.44k

     ...Senior Full-Stack Software Engineer Menlo Park, California, United States About Moloco: Moloco builds some of the most powerful AI...  ...-impact projects The Opportunity: Design and ship agentic software that helps advertisers manage campaigns through natural... 
    Temporary work
    Work at office
    Flexible hours

    Moloco

    Menlo Park, CA
    8 days ago
  • $272k - $431.25k

     ...to develop and expand enterprise-grade agentic AI systems at one of the world’s most advanced...  ...and accelerate business results across engineering, IT, supply chain, finance, HR, and...  ..., orchestration, tool integration, and evaluation pipelines.* Define and advance NVIDIA’s... 

    NVIDIA Corporation

    Santa Clara, CA
    11 hours ago
  • $125k - $222k

     ...management, and next‑generation agentic analytics. Our platform gives autonomy engineers complete visibility and control...  ...sensor logs, simulation runs, model evaluation results, KPIs, and more. It...  ...developing, testing, and shipping software with coding agents Bonus: experience... 
    Full time
    For contractors
    For subcontractor

    Decisive Point

    Sunnyvale, CA
    3 days ago
  • $162.78k - $221.47k

     ...seeking a Senior Kubernetes Engineer to help design and implement...  ...purpose-built for scientific and agentic workflows. This role is...  ..., and GitOps workflows Evaluate and introduce new technologies...  .../CD pipelines for scientific software, data processing workflows, and... 
    Worldwide
    Flexible hours
    Night shift

    Stanford University

    Menlo Park, CA
    2 days ago
  • $132k - $286k

     ...at JOIN TARGET AS A LEAD AI ENGINEER - ADVANCED AI About Us: Working...  ...applications, including agentic architectures where they add...  ...architecture and design decisions, evaluate appropriate models,...  ...agentic systems, and modern software engineering practices. A successful... 
    Full time
    Temporary work
    Work experience placement
    Flexible hours

    Target

    Sunnyvale, CA
    3 days ago
  • $229.9k - $262.4k

    Senior Lead AI Engineer (Gen AI Platform Services, Agentic AI) Overview: At Capital One,...  ...develop, test, deploy, and support AI software components including foundation...  ...search, guardrails, model evaluation, experimentation, governance, and... 
    Full time
    Part time
    Local area

    Capital One Financial Corp

    San Jose, CA
    4 days ago
  • $172.5k - $260.1k

     ...duplicating efforts. Job Category Software Engineering Job Details About Salesforce...  ...leading workforce transformation in the agentic era? You're in the right place! Agentforce...  ...to help our recruiters assess and evaluate candidates' resumes and qualifications... 

    Salesforce

    Palo Alto, CA
    4 days ago
  • $170k - $216k

     ...Software Engineer, Quantitative Evaluations Waymo is an autonomous driving technology company with the mission to be the world's most trusted driver. Since its start as the Google Self-Driving Car Project in 2009, Waymo has focused on building the Waymo Driver—The World... 
    Full time
    Remote work

    Waymo

    Mountain View, CA
    4 days ago
  • $170k - $216k

     ...Software Engineer, Statistical Evaluation and Sampling Waymo is an autonomous driving technology company with the mission to be the world's most trusted driver. Since its start as the Google Self-Driving Car Project in 2009, Waymo has focused on building the Waymo Driver... 
    Full time
    Remote work

    Waymo

    Mountain View, CA
    5 days ago
  • $178.42k - $230.5k

     ...maintaining the tools and services engineers here at GM use every day to...  ...Productivity Org, the AV Agentic Developer Engineer is responsible...  ...team's technical roadmap Evaluate new tools and technologies...  ...core company metrics Drive software engineering best practices... 
    Work experience placement
    Work at office
    Local area
    Remote work
    Work from home
    Relocation
    Relocation package
    Flexible hours
    3 days per week

    General Motors

    Sunnyvale, CA
    4 days ago
  • $165k - $180k

     ...ADAS) and Autonomous Systems, AI Systems Engineering, and Industry AI. We develop scalable,...  ...Responsibilities: Design, build, and evaluate agentic AI systems that can plan, reason, act,...  ...science or engineering Strong software engineering skills in Python Hands-on... 
    Full time
    Temporary work
    Work experience placement
    Worldwide

    Bosch Group

    Sunnyvale, CA
    4 hours ago

Do you want to receive more vacancies?

Subscribe and receive similar vacancies to Software Engineer: Agentic Evaluation. Be the first to apply!