
Principal AI Architect/Engineer

$110.7k

Pepsi Bottling Group

Overview
The AI Platform/Observability Architect is an execution-focused engineer who designs, builds, and operates observability capabilities within a defined domain of the enterprise AI observability platform. Working under the strategic direction of the Senior AI Observability Architect (L11), this role translates architecture blueprints into production-grade instrumentation, telemetry pipelines, dashboards, quality gates, and safety signals across agentic AI systems. The junior architect is a hands-on engineer who codes, integrates, tests, and iterates, owning feature-level delivery within one or more specialization tracks while developing a growing understanding of the full observability platform. They are a technical practitioner first, with an emerging architect mindset.

Responsibilities

Observability Platform Engineering & OTEL Integration (25%):
  • Implement OpenTelemetry (OTEL) instrumentation within assigned agent frameworks or platforms, including custom exporters, span enrichers, semantic conventions, and context propagation hooks.
  • Build and maintain telemetry pipeline components (collectors, processors, exporters) that route metrics, logs, traces, and semantic signals to central observability backends.
  • Integrate OTEL with enterprise agentic platforms as assigned (which may include Salesforce AgentForce, ServiceNow, Microsoft Agent 365, or internal frameworks), following architecture blueprints set by the L11.
  • Develop and maintain observability dashboards, alerting rules, and SLO/SLA definitions for the assigned sub-domain, ensuring signal quality and low false-positive rates.
  • Participate in on-call rotations and incident response for the observability platform, contributing to RCA documentation and runbook improvement.
  • Write unit, integration, and end-to-end tests for all telemetry components; maintain >80% test coverage across owned services.

Safety, Security & Red Teaming Support (15%):
  • Instrument safety-critical signal capture within assigned pipelines, including guardrail trigger rates, policy violation events, prompt injection detections, and hallucination flags.
  • Support red team exercises by building observability hooks that capture adversarial test results, attack surface telemetry, and behavioral deviation signals in real time.
  • Implement secure trace handling for sensitive AI decision events, applying data masking, PII redaction, and audit-log retention policies as defined by the security architecture.
  • Assist in maintaining the Security Observability Playbook: documenting findings, updating escalation paths, and contributing to incident classification procedures.
  • Monitor agent-to-agent protocol traffic (A2A, UCP, AP2) for anomalous communication patterns and flag deviations for review by the L11 architect and security team.

Responsible AI (RAI) & Governance Signal Instrumentation (10%):
  • Implement RAI signal collectors within assigned agent workflows, capturing fairness indicators, bias detection outputs, explainability scores, and content safety classifications.
  • Maintain RAI telemetry pipelines and ensure data quality, completeness, and timeliness of governance signals feeding into compliance dashboards.
  • Contribute to audit-readiness work by ensuring all AI decision traces within the assigned domain include required governance metadata and are retained per policy.
  • Support gap analyses by comparing current RAI signal coverage against governance framework requirements and flagging coverage gaps to the L11.

Quality Engineering for Agentic Solutions: Post Go-Live & Continuous QE (15%):
  • Build and maintain quality gate components within CI/CD pipelines, using observability data to detect performance regressions, behavioral drift, and SLA breaches before they reach production.
  • Instrument and monitor Skill Evaluations (evals) across the Memory, Skills, and MCP harness stack, collecting eval results, tracking pass/fail trends, and alerting on regression thresholds.
  • Implement continuous quality monitoring for post-go-live agentic solutions, tracking agent success rate, tool-call fidelity, latency distributions, and user outcome proxies.
  • Conduct structured testing of new agent capabilities using standardized eval harnesses, documenting results and feeding findings into quality improvement cycles.
  • Develop automated quality reports and quality metric dashboards for stakeholder review, surfacing trends and anomalies in agent behavior over time.

Memory, Skills, MCP & Harness Engineering Observability (10%):
  • Instrument agent memory operations (read/write latency, cache hit rates, memory drift) across episodic, semantic, and working memory backends within the assigned scope.
  • Add trace instrumentation to MCP server interactions, tagging tool registrations, skill invocations, context injections, and result returns with semantic OTEL attributes.
  • Capture harness execution telemetry for self-evolving and RL systems, logging reward signals, policy update events, environment transitions, and convergence indicators.
  • Monitor skill eval harness execution pipelines, detecting flaky evals, environment setup failures, and result inconsistencies that could mask real capability regressions.

Data Science & Python Engineering (10%):
  • Write production-grade Python for observability tooling (custom OTEL exporters, signal aggregators, anomaly detectors, and data transformation pipelines), adhering to team engineering standards.
  • Apply basic statistical and data science methods to telemetry data (time-series analysis, threshold tuning, distribution characterization) to improve signal quality and alerting precision.
  • Contribute to Python SDK and library development that simplifies OTEL onboarding for agent developers across the organization.
  • Participate in code reviews, apply test-driven development practices, and continuously improve the quality and maintainability of the observability codebase.

Agent Fleet, Physical AI & Multi-Modal Observability (5%):
  • Implement telemetry for agent fleet coordination, capturing spawn/termination events, inter-agent communication traces, load distribution metrics, and fleet health indicators.
  • Contribute to observability instrumentation for physical AI pipelines (edge inference, sensor fusion, robotics control loops) as directed, focusing on latency, reliability, and data quality signals.
  • Add OTEL instrumentation to multi-modal model pipelines, tracing vision, audio, and text input processing stages and capturing cross-modal alignment quality signals.

Agentic Marketplace, Registry & A2A / UCP / AP2 Observability (5%):
  • Instrument the Agentic Marketplace and Agent Registry with usage telemetry, tracking agent invocations, capability health scores, adoption trends, and dependency relationships.
  • Implement protocol-level observability for A2A (Agent-to-Agent), UCP, and AP2 communication flows, capturing message latency, error rates, retry patterns, and trust boundary crossings.
  • Contribute to Marketplace Observability Dashboard development, building data connectors, metric calculations, and visualization components as directed by the L11.

Collaboration, Integration & Continuous Learning (5%):
  • Collaborate closely with AI platform engineers, AI Solution Engineers, SRE, and product teams to gather requirements, align on telemetry standards, and resolve integration friction.
  • Participate in agile ceremonies (sprint planning, stand-ups, retrospectives), contributing to estimation, dependency identification, and delivery transparency.
  • Stay current with emerging observability frameworks, OTEL specifications, agent communication protocols, and AI safety research, sharing learnings with the team regularly.
  • Contribute to internal documentation, engineering wikis, and onboarding guides for the observability platform.

Compensation and Benefits
The expected compensation range for this position is between $110,700 and $185,250. Location, confirmed job-related skills, experience, and education will be considered in setting the actual starting salary. Bonus is based on performance and eligibility; the target payout is 12% of annual salary, paid out annually. Paid time off is subject to eligibility and includes paid parental leave, vacation, sick, and bereavement leave. In addition to salary, PepsiCo offers a comprehensive benefits package to support employees and their families, subject to elections and eligibility: Medical, Dental, Vision, Disability, Health, and Dependent Care Reimbursement Accounts, Employee Assistance Program, Insurance (Accident, Group Legal, Life), Defined Contribution Retirement Plan.

Qualifications
  • Bachelor's or Master's degree in Computer Science, Software Engineering, AI/ML, Data Science, or a related technical field.
  • 6–8 years of experience in software engineering, platform engineering, or data engineering, with at least 2–3 years of hands-on work in observability, monitoring, or distributed systems.
  • Demonstrated ability to deliver production-grade software in a team environment; track record of completing complex technical features end-to-end.
  • Python Proficiency: Strong Python engineering skills, including writing clean, testable, maintainable production code; familiarity with async patterns, type hints, and modern Python tooling (Poetry, Ruff, pytest).
  • Observability Fundamentals: Solid working knowledge of the three pillars of observability (metrics, logs, traces); ability to instrument services with OpenTelemetry (OTEL) SDKs; understanding of trace context propagation and semantic conventions.
  • Distributed Systems: Working knowledge of microservices, event streaming (Kafka or equivalent), REST/gRPC APIs, and containerized deployment (Docker, Kubernetes).
  • Cloud Platforms: Hands-on experience with at least one major cloud provider (Azure, AWS, or GCP), including managed services, IAM basics, and cost awareness.
  • CI/CD & DevOps: Experience building or contributing to CI/CD pipelines; familiarity with GitOps, infrastructure-as-code concepts, and automated testing frameworks.
  • Data Fundamentals: Ability to query, analyze, and visualize time-series and log data using tools such as Grafana, Datadog, Splunk, Prometheus, or equivalent.
  • Hands-on experience with agentic AI frameworks (LangChain, LangGraph, AutoGen, Semantic Kernel, CrewAI, or equivalent).
  • Contributions to open-source observability projects or the OTEL community.
  • Familiarity with reinforcement learning concepts, self-supervised learning, or model fine-tuning workflows.
  • Experience with security tooling relevant to AI (adversarial robustness libraries, LLM safety frameworks, or red-team toolkits).
  • Exposure to Responsible AI frameworks, fairness evaluation libraries (Arize, Fairlearn, AI Fairness 360), or explainability tools (SHAP, LIME).
  • Experience in a fast-paced AI platform, MLOps, or LLMOps role with production deployment responsibilities.

Equal Employment Opportunity Statements
All qualified applicants will receive consideration for employment without regard to age, race, color, religion, sex, sexual orientation, gender identity, national origin, protected veteran status, or disability status. PepsiCo is an Equal Opportunity Employer: Female / Minority / Disability / Protected Veteran / Sexual Orientation / Gender Identity / Age.
Our Company will consider for employment qualified applicants with criminal histories in a manner consistent with the requirements of the Fair Credit Reporting Act, and all other applicable laws, including but not limited to, San Francisco Police Code Sections 4901-4919, commonly referred to as the San Francisco Fair Chance Ordinance; and Chapter XVII, Article 9 of the Los Angeles Municipal Code, commonly referred to as the Fair Chance Initiative for Hiring Ordinance.

Vacancy posted 2 days ago
