Sign up to access all features of our service.
  • Job search
  • Favorites
  • Create a CV
    New
  • Salaries
  • Subscriptions

Performance Modeling Architect - AI Systems

Velaura

Location: Bay Area (Onsite or Hybrid) About Velaura AI Velaura AI is building the next generation of compute platforms for artificial intelligence. As AI expands beyond the cloud into edge systems and physical environments , the constraints of energy efficiency, latency, and system architecture are becoming fundamental. A particularly exciting frontier is physical and embodied AI —systems that perceive, reason, and act in the real world. Applications such as robotics, autonomous mobile robots (AMRs), drones, and other intelligent machines demand new approaches to compute that combine real-time responsiveness, energy efficiency, and advanced AI capabilities. Our mission is to design silicon and systems from first principles to enable these emerging classes of AI applications. Velaura combines deep semiconductor expertise with hardware–software co-design to deliver breakthrough performance per watt for AI workloads across cloud, edge, and embodied intelligence. We are assembling a world‑class team of architects and engineers to tackle some of the most ambitious challenges in computing. Role Overview We are looking for a Performance Modeling Architect who will develop the modeling and simulation frameworks used to guide architectural decisions across our platform. In this role, you will build models that capture the interaction between AI workloads, software stacks, and hardware architectures , enabling rigorous evaluation of system‑level tradeoffs. Your models will represent not only the SoC architecture and its components , but also key elements of the software stack and system behavior that ultimately determine real‑world performance. These frameworks will enable hardware–software co‑design, allowing us to explore design alternatives and quantify their impact on performance, latency, scalability, and energy efficiency. Your work will directly influence the architecture of next‑generation AI compute platforms. Responsibilities Develop system‑level performance models for AI workloads and compute architectures. Build simulation frameworks that capture interactions between AI models, software runtimes, and hardware systems . Model key components of SoC architecture including compute units, memory hierarchies, and interconnects. Incorporate relevant software and runtime behavior into modeling frameworks to reflect realistic system execution. Analyze system bottlenecks and evaluate architectural tradeoffs across the platform. Work closely with hardware architects to guide microarchitectural and system‑level design decisions. Collaborate with software and ML teams to incorporate realistic workloads into modeling frameworks. Use modeling and simulation to evaluate performance, latency, throughput, and energy efficiency across target applications. Required Qualifications Strong background in computer architecture, system architecture, or performance modeling. Experience building simulation or analytical models of complex hardware systems. Strong programming skills (Python, C++, or similar) for modeling and simulation frameworks. Understanding of modern compute architectures such as CPUs, GPUs, or AI accelerators. Ability to analyze complex systems and identify performance bottlenecks. Strong quantitative reasoning and ability to translate models into architectural insights. Preferred Qualifications Experience modeling AI workloads or machine learning systems. Familiarity with architectural simulators or performance modeling tools.Experience with memory systems, interconnect architectures, or accelerator design. Exposure to hardware–software co‑design methodologies. Experience modeling performance across multiple layers of the stack , including software runtimes and hardware platforms. Why Join Velaura You will help shape how architectural decisions are made across the entire compute stack. Rather than optimizing individual components in isolation, you will build the frameworks that allow us to reason about the complete system —from AI workloads and software runtimes to silicon architecture. If you enjoy understanding complex systems and using modeling to guide major architectural decisions, we would love to hear from you. Velaura is an Equal Opportunity Employer that is committed to inclusion and diversity. Qualified applicants will receive consideration for employment without regard to race, color, religion, national origin, gender, sexual orientation, gender identity, disability or protected veteran status. We also take affirmative action to offer employment opportunities to minorities, women, individuals with disabilities, and protected veterans. Velaura is committed to working with qualified individuals with physical or mental disabilities. Applicants who would like to contact us regarding the accessibility of our website or who need special assistance or a reasonable accommodation for any part of the application or hiring process may contact us at: View email address on click.appcast.io. This contact information is for accommodation requests only. Evaluation of requests for reasonable accommodation will be determined on a case‑by‑case basis. #J-18808-Ljbffr Velaura

Vacancy posted 5 days ago
Similar jobs that could be interesting for youBased on the Performance Modeling Architect - AI Systems in Santa Clara, CA vacancy
  •  ...Hybrid) About Velaura AI Velaura AI is building...  ...beyond the cloud into edge systems and physical...  ...deliver breakthrough performance per watt for AI workloads...  ...a world‑class team of architects and engineers to tackle...  ...looking for a Performance Modeling Architect who will... 
    Performance

    Velaura

    Santa Clara, CA
    3 days ago
  • $254k - $349.25k

     ...how people, data, and AI agents connect across email...  ...a Principal ML Architect to lead the design and...  ...of next-generation AI systems for cybersecurity , leveraging...  ...deep expertise in model architecture, training,...  ...continuously improve model performance and reliability... 
    Performance
    Flexible hours

    Proofpoint

    Sunnyvale, CA
    8 hours ago
  • $254k - $349.25k

     ...how people, data, and AI agents connect across email...  ...seeking a Principal ML Architect to lead the design and...  ...of next-generation AI systems for cybersecurity ,...  ...requires deep expertise in model architecture, training,...  ...improve model performance and reliability Productionization... 
    Performance
    Flexible hours

    Proofpoint

    Sunnyvale, CA
    2 days ago
  •  ...computing experiences-from AI and data centers, to PCs, gaming and embedded systems. Grounded in a culture...  ...that enable massive model training at scale. Your...  ...expertise will drive 2-3x performance gains in both training...  ...parallel dimensions Architect memory-efficient... 
    Performance
    Remote work

    Advanced Micro Devices , Inc.

    Santa Clara, CA
    4 days ago
  •  ...is seeking a Vector Compute Architect Intern to join our advanced...  ...compute architectures for AI and high-performance computing platforms. The...  ...architecture trade-offs, developing system specifications, and...  ...skills in performance modeling and computer architecture.... 
    Performance
    Internship

    Jobleads-US

    Sunnyvale, CA
    5 days ago
  • Netpreme is seeking an ASIC Architect to define the architecture and microarchitecture for their silicon product with offices in Boston, MA. Responsibilities include performance modeling for GPU memory systems, architecting data planes, and owning SoC architecture specs... 
    Performance

    Netpreme

    Santa Clara, CA
    4 days ago
  •  ...focused on measuring and visualizing AV model performance. This role will involve scaling state-of...  ...cross-functional teams within the Embodied AI group. The ideal candidate will have...  ...degree, and experience with large-scale systems. Exceptional communication and mentoring... 
    Performance

    General Motors

    Sunnyvale, CA
    2 days ago
  • $208k - $327.75k

     ...accelerated computing, AI, and autonomous machines...  ...-of-the-art AI, high-performance compute, and scalable software...  ...for a Senior AI Architect to help define the next generation of AI model paradigms for autonomous...  ...hardware architecture, systems optimization, and autonomous... 
    Performance
    Worldwide

    NVIDIA Corporation

    Santa Clara, CA
    5 days ago
  •  ...NVIDIA Gruppe seeks a Principal Architect to drive the architectural vision for AI communication systems. This role involves setting the technical direction, conducting...  ...and systems software, particularly in high-performance environments, as well as a degree in Computer... 
    Performance

    NVIDIA Gruppe

    Santa Clara, CA
    4 days ago
  •  ...Building the world's leading AI-powered, cloud-native...  ...Overview: The Business Systems team manages the...  ...looking for a hands‑on AI Architect to deliver technical designs...  ...Scalability: Design a model‑agnostic inference...  ...between models based on performance and cost requirements.... 
    Performance
    Remote work

    IBM Computing

    San Jose, CA
    3 days ago
  • $224k - $431.25k

    NVIDIA Gruppe is seeking an experienced AI Platform Architect to build and enhance internal AI solutions used globally. The role requires a strong AI-first approach, with a focus on developing systems software and deploying AI solutions. The candidate should have extensive... 
    Performance

    NVIDIA Gruppe

    Santa Clara, CA
    2 days ago
  •  ...globally for innovation, performance and quality. Sandisk...  ...Job Description In this AI/ML ASIC Architecture...  ...Solutions based advanced system architectures and AI/ML...  ...product. As an AI/ML ASIC Architect you will help drive new...  ...mechanisms, and model parallelism techniques... 
    Performance
    Temporary work
    Remote work
    Flexible hours
    Shift work
    Night shift

    Sandisk

    Milpitas, CA
    2 days ago
  • d-Matrix inc. is seeking a Principal Architect specializing in Performance Analysis and Modeling to join our Santa Clara, CA team. The role involves analyzing ML workloads, developing hardware/software features, and creating analytical models that enhance performance across... 
    Performance

    d-Matrix inc.

    Santa Clara, CA
    3 days ago
  • $272k - $431.25k

     ...within NVIDIA’s Networking Systems & Software Architecture...  ...is solving some of AI’s hardest infrastructure...  .... This Principal Architect role leads the research...  ...disaggregated prefill/decode, model parallelism)....  ...deep expertise in high-performance networking (InfiniBand,... 
    Performance

    NVIDIA Gruppe

    Santa Clara, CA
    3 days ago
  •  ...Principal Architect- Performance Analysis and Modeling At d-Matrix, we are focused on unleashing the potential of generative AI to power the transformation of technology. We are at the forefront...  ...philosophy cuts through the system ranging from efficient tensor cores... 
    Performance
    3 days per week

    d-Matrix

    Santa Clara, CA
    4 days ago
  • $224k - $356.5k

     ...NVIDIA Gruppe is seeking a Senior Systems Software Engineer in Santa Clara, California...  ...candidate will have significant experience in model optimization and possess a strong...  ...directly with automotive OEMs and pushing performance benchmarks across various platforms. A salary... 
    Performance

    NVIDIA Gruppe

    Santa Clara, CA
    3 days ago
  •  ...Hybrid) About Velaura AI Velaura AI is building...  ...beyond the cloud into edge systems and physical...  ...to deliver breakthrough performance per watt across cloud,...  ...class team of researchers, architects, and engineers to rethink...  ...the intersection of AI model design and next‑generation... 
    Performance

    Velaura

    Santa Clara, CA
    4 days ago
  •  ...Job Description: Edge AI Architect - CUDA / C++ / Computer Vision...  ...Department: Edge AI & Embedded Systems About the Role: We are...  ...vision algorithms, and AI model deployment in the MedTech domain...  ...at the edge, ensuring performance, reliability, and compliance... 
    Performance
    Contract work

    Codvo.ai

    Santa Clara, CA
    5 days ago
  • $152k - $241.5k

     ...dedicated towards a computing model focused on visual and AI computing. For two decades...  ...group is looking for Architects, Functional Modeling Engineers...  ...different functional and performance simulation models across...  ...with experience in modeling (System C & TLM preferred)... 
    Performance

    NVIDIA Gruppe

    Santa Clara, CA
    3 days ago
  •  ...Senior AI Architect Nexxa.ai is building artificial super intelligence...  ...— enabling machines, systems and operations to think, decide...  ...engineering to bring cutting-edge models into reliable, scalable, and...  ...user behavior, system performance, and model impact Build and... 
    Performance

    Nexxa.ai

    Sunnyvale, CA
    1 day ago
  •  ...Onsite in Cupertino. We’re looking for an AI Model Architect & Creative Developer (Technical...  ...quality, bias, and compliance, and monitor performance metrics. Maintain clear, accessible...  ...translate creative vision into technical systems. Work with internal AI models and... 
    Performance
    Contract work

    Salt Digital Recruitment

    Cupertino, CA
    4 days ago
  • Cerebras Systems is seeking an experienced engineer to translate high level architecture specifications...  ...feature requirements. The role involves performance analysis, modeling, and working with a breakthrough architecture that supports AI applications at scale. Ideal candidates... 
    Performance

    Cerebras Systems

    Sunnyvale, CA
    1 day ago
  •  ...Clara is seeking a Deep Learning Communication Architect to optimize DNN models and enhance communication performance during distributed training. This role requires...  ...protocols, ensuring seamless operation across large systems. The ideal candidate holds an advanced degree... 
    Performance

    NVIDIA Gruppe

    Santa Clara, CA
    2 days ago
  • $190k - $280k

    MixMode is seeking a Principal Architect in Santa Clara to accelerate AI application performance through innovative hardware and software solutions. The role involves analyzing ML workloads and collaborating with product and hardware teams to enhance inference accelerators... 
    Performance

    MixMode

    Santa Clara, CA
    2 days ago
  • $184k - $287.5k

    NVIDIA is hiring an AI Hardware Architect to analyze and architect the next...  ...groundbreaking GPU and CPU systems. Understand them across a range...  ...Study the applications and models running on Nvidia hardware...  ..., analyze, and explain the performance and power advantages of... 
    Performance

    NVIDIA Gruppe

    Santa Clara, CA
    2 days ago
  • $152k - $241.5k

    Senior Performance Modeling Architect, CPU Fabric and LLC page is loaded## Senior Performance Modeling Architect...  ..., high-density demands of Data Center systems. You will build the "source of truth"...  ...for an existing vacancy.NVIDIA uses AI tools in its recruiting processes.... 
    Performance
    Night shift

    NVIDIA Corporation

    Santa Clara, CA
    5 days ago
  •  ...Proofpoint is seeking a Principal ML Architect to lead the design and development of next-generation AI systems focused on cybersecurity. In this role, you will leverage...  ..., with significant expertise in deep learning, model architecture, and security. The position offers... 

    Proofpoint

    Sunnyvale, CA
    4 days ago
  • Distinguished Technologist - AI Model Performance Architect Description - This role is responsible for bridging system memory architecture with AI model behavior ensuring optimal performance through HW/SW co-design and workload aware tuning. This role develops AI workload... 
    Performance
    Full time
    Temporary work
    Work experience placement
    Local area
    Relocation
    Flexible hours
    Shift work

    HP

    Palo Alto, CA
    1 day ago
  •  ...seeking a highly accomplished GPU Architect in Milpitas, California, to define next-gen AI accelerators and optimize multi-...  ...technology characterization and performance strategies to enhance...  ...of experience in GPUs or similar systems, strong knowledge of manufacturing... 
    Performance

    graphcore

    Milpitas, CA
    1 day ago
  • Cerebras Systems Inc. is seeking a highly skilled professional to translate architecture specifications...  .... The role involves engaging with performance models and developing insights for power efficiency within a groundbreaking AI platform. Ideal candidates should have a... 
    Performance

    Cerebras Systems Inc.

    Sunnyvale, CA
    3 days ago

Do you want to receive more vacancies?

Subscribe and receive similar vacancies to Performance Modeling Architect - AI Systems. Be the first to apply!