Sign up to access all features of our service.
  • Job search
  • Favorites
  • Create a CV
    New
  • Salaries
  • Subscriptions

Senior Solutions Architect, AI Factory Observability and Visualization - NVIS

$184k - $287.5k
Full-time

NVIDIA

NVIDIA's Infrastructure Specialists team is hiring a Senior Solutions Architect - AI Factory Observability & Visualization! This remote role develops full-spectrum visibility that supports the smooth functioning of HPC systems and AI factories, transforming intricate telemetry across network and compute into straightforward, actionable perspectives. The role has a complete, end-to-end understanding of the HPC/AI system, running and interpreting microbenchmarks and workloads to confirm system readiness, then establishing the observability that maintains this state. The work involves collaborating across NVIDIA teams to help partners see, understand, and respond to HPC system and AI factory performance, from hardware to workload. What You Will be Doing: Run AI factory validation tools, microbenchmarks, and workloads provided by the team, and interpret results to assess system health and performance. Gain a comprehensive understanding of the system from start to finish, including network topology, interconnects, and compute. Establish what "healthy" represents across the stack — the metrics, logs, and signals that confirm a system is functioning well, and the thresholds that show it isn't. Build and extend the telemetry surface across hardware, fabric, and workload, crafting how data is collected, transformed, stored, and surfaced. Serve as the observability expert, investigating gaps in visibility to ensure it reflects true system behavior. Develop automation (Python, Shell) for collecting, transforming, and presenting system and network data. Recommend improvements to system visibility, data sources, and reporting that give teams clearer insight. Collaborate with hardware, software, networking, datacenter, and product groups to ready HPC systems and AI factories for customer deployment, contributing documentation and readiness materials throughout the process. What We Need to See: Bachelor's degree or equivalent experience in Computer Science, Mathematics, Engineering, Physics, or related field. 6+ years of experience managing Linux-based systems in HPC, distributed systems, or large AI/ML settings. Hands-on experience with the architecture of multi-GPU and/or multi-node clusters, including networking and interconnects. Solid grasp of how HPC and AI factory systems fit together end to end, from network fabric through compute. Proficiency with Python and Shell/Bash for scripting, automation, and tooling. Practical experience working with observability systems (e.g., Prometheus, Grafana, Loki, or similar), including building custom exporters or collectors, setting up alerts, and handling metric cardinality and retention on a large scale. Experience transforming metrics, logs, and traces into clear, actionable insight for complex distributed environments. Familiarity with GPU and fabric telemetry (e.g., DCGM, NVLink, InfiniBand/Ethernet fabric counters) and using it to diagnose performance regressions. Strong communication skills and the ability to work effectively with cross-functional teams. Ways to Stand Out From the Crowd: Experience with AI factory or large-scale AI infrastructure build, deployment, or operations. Background in HPC systems engineering, SRE, or systems analysis for GPU-accelerated environments. Experience building automation and data pipelines that feed dashboards and reporting at scale. Demonstrated desire to use AI to solve practical problems, improve workflows, and guide data-driven decisions. Your base salary will be determined based on your location, experience, and the pay of employees in similar positions. The base salary range is 184,000 USD - 287,500 USD for Level 4, and 224,000 USD - 356,500 USD for Level 5. You will also be eligible for equity and benefits. Applications for this job will be accepted at least until June 28, 2026. This posting is for an existing vacancy. NVIDIA uses AI tools in its recruiting processes. NVIDIA is committed to fostering an inclusive work environment and proud to be an equal opportunity employer. As we highly value diversity in our current and future employees, we do not discriminate (including in our hiring and promotion practices) on the basis of race, religion, color, national origin, gender, gender expression, sexual orientation, age, marital status, veteran status, disability status or any other characteristic protected by law. NVIDIA pioneered accelerated computing. Today, our AI infrastructure powers global intelligence, transforming every industry. Learn more about NVIDIA.

Vacancy posted 4 days ago
Similar jobs that could be interesting for youBased on the Senior Solutions Architect, AI Factory Observability and Visualization - NVIS in California vacancy
  • $184k - $287.5k

     ...redefining what is possible in AI, accelerated computing, and...  ...NVIDIA Infrastructure Services (NVIS), we help customers, OEMs,...  ...with project delivery managers, Solution Architects, Sales, OEM/SI partners,...  ...customer work, building trust with senior audiences, and translating... 
    Senior
    Full time
    Casual work

    NVIDIA

    California
    2 days ago
  • $146k - $194k

     ...powered by Lattice OS, an AI-powered operating...  ...of delivering software solutions that integrate seamlessly...  ...ABOUT THE JOB Solutions Architects play a pivotal role in...  ...of the possible, with senior government customers...  ...our candidates. We've observed a rise in sophisticated... 
    Senior
    Full time
    Work experience placement
    Immediate start
    Worldwide

    Anduril Industries

    Costa Mesa, CA
    1 day ago
  • $206.43k - $330.26k

     ...The Red Hat North America Technology Sales team is looking for an AI Platform Specialist Solution Architect (SSA) to join our team. This position assumes a crucial role in providing expert deep technical sales support for the seamless execution of Go-To-Market (GTM) strategies... 
    Senior
    Permanent employment
    Full time
    Contract work
    Work experience placement
    Remote work
    Flexible hours

    Jobleads-US

    Sacramento, CA
    2 days ago
  •  ...Inclusion. We weave AI into the fabric of...  ...for the joint solution with the partner;...  ...partner side Influences senior leaders at Palo...  ...clear, compelling visual artifacts that GSI...  ...enough for a GSI architect to present directly...  ...: Translate field observations from partner deployments... 
    Suggested
    Full time
    Immediate start
    Remote work

    Palo Alto Networks

    California
    3 days ago
  • $184k - $287.5k

    NVIDIA is seeking outstanding AI Solutions Architects to assist and support customers that are building solutions with our newest AI technology. At NVIDIA, our solutions architects work across different teams and enjoy helping customers with the latest Accelerated Data... 
    Senior
    Full time

    NVIDIA

    Santa Clara, CA
    4 days ago
  • $216k - $345k

    ## Principal Solutions Architect - Semiconductor TestApplylocations: US, CA, Santa...  ...operations from start to finish. This senior individual contributor role is...  ...reviews to ensure testability, observability, and debug-ability* Drive adoption of AI/ML techniques for yield... 

    Jobleads-US

    Santa Clara, CA
    5 days ago
  • $152k - $241.5k

     ...unlimited potential of AI to define the...  ...looking for a Senior Full-stack web applications...  ...software architect to join our...  ...-driven data and visualization platform to power...  ...robust, scalable solutions. Establish best practices...  ...and platform observability (monitoring, logging... 
    Senior
    Full time

    NVIDIA

    Santa Clara, CA
    4 days ago
  • $124.74k - $254.5k

     ...currently seeking a Lead Specialist, AI Solution Architect to join our KPMG Managed Services practice...  ..., orchestration, state management, observability, and interoperable workflows that...  ...mentoring technical teams, influencing senior stakeholders, and serving as a... 
    Full time
    H1b
    Local area

    KPMG

    San Francisco, CA
    4 days ago
  • $250k - $290k

     ...finance delivers real-world value - solutions that work in practice, not...  ..., paired with innovative AI-powered technology and an investor...  ...key responsibilities As a Senior Director with EY-Parthenon's AI...  ...professionals with the ability to visualize our clients' goals and think... 
    Senior
    Full time
    Work experience placement
    Summer holiday
    Work at office
    Flexible hours

    EY

    Los Angeles, CA
    2 days ago
  • $152k - $241.5k

     ...people. Today, we’re tapping into the unlimited potential of AI to define the next era of computing. An era in which our GPU...  ...make a lasting impact on the world. We are looking for a Senior Solutions Architect to support our Industrial Engineering accounts — the CAE, CFD... 
    Senior

    NVIDIA

    Santa Clara, CA
    2 days ago
  • $131k - $180.5k

     ...solved before, helping develop a rocket, a factory, and a business from the ground up....  ...analytics infrastructure, and next-generation AI to solve real problems and accelerate...  ...the Role: We're seeking an HRIS Solutions Architect to own the design and delivery of... 
    Senior

    Relativity Space

    Long Beach, CA
    25 days ago
  • $146k - $194k

     ...of systems is powered by Lattice OS, an AI-powered operating system that turns...  ...THE ROLE We are seeking a Oracle Fusion Solution Architect with deep Oracle Fusion experience to join...  ...the security of our candidates. We've observed a rise in sophisticated phishing and... 
    Full time
    Work experience placement
    Work at office
    Immediate start

    Anduril Industries

    Costa Mesa, CA
    1 day ago
  • $163.4k - $322.1k

     ...Senior Manager, Functional Transformation Throughout the health ecosystem, you'll find courageous and inspiring people who are committed...  ...by translating business needs into scalable technology solutions Recruiting, mentoring, and leading teams while managing engagement... 
    Senior
    Contract work
    Start working today
    Flexible hours

    Deloitte

    San Jose, CA
    3 days ago
  • WinsAbove is seeking a Sr. Solutions Engineer based in Mexico City to partner with customers, driving complex technology discussions while establishing a trusted advisor role. You will design scalable data architectures using Databricks technology, engage with technical... 
    Senior

    Jobleads-US

    San Francisco, CA
    1 day ago
  • $168k - $264.5k

     ...Senior It Auditor NVIDIA is the pioneer of GPU-accelerated computing...  ...computing. Our work in AI, deep learning, and self-driving...  ...on-site IT contact for factory users and stakeholders within...  ...experience with data analytics and visualization tools (e.g., SQL, Python,... 
    Senior
    Contract work
    For subcontractor
    Remote work

    NVIDIA

    Santa Clara, CA
    2 days ago
  •  ...Principal Workday Solutions Architect (HCM/Compensation) We are seeking a Principal Workday Solutions Architect...  ...Familiarity with Artificial Intelligence (AI) and its subsets will be an added advantage Good exposure of senior stakeholder management skills. Salary... 

    Western Digital

    San Jose, CA
    3 days ago
  • $250k - $350k

     ...Company Overview TENEX is an AI-native, automation-first,...  ...cybersecurity, automation, and AI-driven solutions. Backed by leading investors,...  .... As a Staff AI Solutions Architect at TENEX, you will be a...  ...Solutions Architect or equivalent senior technical role for an AI/ML... 

    TENEX.AI

    San Jose, CA
    3 days ago
  •  ...The AI Orchestration of Your Wildest Imagination n8n is the open workflow orchestration...  ...with us. About the Role As a Solutions Engineer, you will be the primary...  ...fundamentals, security best practices, and observability concepts Cloud & Infrastructure Awareness... 
    Senior
    Remote job
    Temporary work
    Local area
    Visa sponsorship

    n8n

    San Francisco, CA
    1 day ago
  • $132k - $165k

     ...edge, delivering modular AI infrastructure from first deployment to AI factory with speed, scale and...  ...Disruptor 50, Armada’s solutions are deployed in over 60...  ...ROLE OVERVIEW The Senior Technical Account Manager...  ...management platforms, or observability tools * Familiarity... 
    Senior
    Full time
    Temporary work
    Remote work
    Flexible hours

    Armada

    California
    5 days ago
  • $94.43k - $202.75k

     ...breakthroughs to create solutions and products that...  ...of Cloud, AI, ML, IoT, 5G, and...  ...currently seeking a Senior Associate, Data Engineer...  ...of insightful visualizations, reports and presentations...  ...lake/warehouse architect, designer,...  ...of holidays to be observed during the year and... 
    Senior
    Full time
    Local area
    Visa sponsorship

    KPMG

    Los Angeles, CA
    4 days ago
  • $166k - $220k

     ...powered by Lattice OS, an AI-powered operating system that...  ...of automated testing solutions across Anduril factories. As a member of this team,...  ...comms to large groups and senior leadership. * Must be a U....  ...security of our candidates. We've observed a rise in sophisticated... 
    Senior
    Full time
    Work experience placement
    Immediate start

    Anduril Industries

    Costa Mesa, CA
    1 day ago
  • Fractal is seeking an EdTech Sales Consultant & Solutioning Expert to drive enterprise growth across AI-led learning solutions in San Francisco. This role involves consultative selling, Solution design, and pursuing large deals. The ideal candidate will have 15-20 years... 
    Flexible hours

    Jobleads-US

    San Francisco, CA
    5 days ago
  • $166k - $220k

     ...is powered by Lattice OS, an AI-powered operating system that...  ...and execute work, but also the factory-facing infrastructure that connects...  ...base. ABOUT THE JOB: The Senior Software Product Manager for...  ...security of our candidates. We've observed a rise in sophisticated... 
    Senior
    Full time
    Work experience placement
    For subcontractor
    Work at office
    Immediate start

    Anduril Industries

    Costa Mesa, CA
    1 day ago
  • $178.42k - $230.5k

    Job Description About Us The AI Cloud and Developer Infrastructure organization is responsible...  ...domain. The Role We are looking for a Senior Engineer with an extensive engineering...  ...will start delivering impact through observability frameworks and will evolve depending on business... 
    Senior
    Full time
    Work experience placement
    Local area
    Work from home
    Flexible hours

    General Motors

    Sunnyvale, CA
    2 days ago
  • $150k - $220k

     ...of systems is powered by Lattice OS, an AI-powered operating system that turns thousands...  ...Industries Network Team with a focus on factory systems and operational technology (OT)...  ...the security of our candidates. We've observed a rise in sophisticated phishing and fraudulent... 
    Senior
    Full time
    Work experience placement
    For subcontractor
    Immediate start
    Remote work

    Anduril Industries

    Costa Mesa, CA
    1 day ago
  • $146k - $194k

     ...by Lattice OS, an AI-powered operating...  ...operations. As a Senior Client Platform Engineer...  ...office and factory floor environments...  ...endpoint management solutions across Windows,...  ...mobile platforms. * Architect and maintain...  ...candidates. We've observed a rise in sophisticated... 
    Senior
    Full time
    Work experience placement
    Work at office
    Immediate start

    Anduril Industries

    Costa Mesa, CA
    1 day ago
  • $140k

     ...Pre-Sales Solutions Architect II New York, New York, United States As a Pre-Sales Solutions Architect on Everlaw's commercial team, you...  ...your expertise across our platform, including our generative AI capabilities (Deep Dive, Storybuilder, Writing Assistant, Review... 
    Full time
    Work at office
    Work from home
    Flexible hours

    Everlaw

    Oakland, CA
    15 hours ago
  • $260k - $275k

    Saviynt, located in San Francisco, is hiring a Senior Principal Software Engineer to lead the development of our AI security products. You will design and implement secure and scalable workflows, work across various cloud platforms, and contribute to product direction... 
    Senior

    Jobleads-US

    San Francisco, CA
    5 days ago
  •  ...building the infrastructure behind the AI-driven data economy.As AI scales, so does...  ...DescriptionWe are seeking a Principal Workday Solutions Architect who is highly analytical, possess strong...  ...be an added advantageGood exposure of senior stakeholder management skills.Salary... 
    Full time
    Temporary work
    Work at office
    Immediate start
    Remote work
    Worldwide
    Flexible hours
    Shift work

    Western Digital

    San Jose, CA
    5 days ago
  • $125.9k - $231.1k

     ...working world. Microsoft 365 AI Solution Architect (Manager) EY advises...  ...agents with Copilot Studio and Visual Studio. The successful...  ...days of vacation plus twelve observed holidays and 10 personal care...  ...training  An excellent team of senior colleagues, dedicated to... 
    Summer holiday
    Flexible hours

    Ernst & Young

    Los Angeles, CA
    5 days ago

Do you want to receive more vacancies?

Subscribe and receive similar vacancies to Senior Solutions Architect, AI Factory Observability and Visualization - NVIS. Be the first to apply!