Sign up to access all features of our service.
  • Job search
  • Favorites
  • Create a CV
    New
  • Salaries
  • Subscriptions

ML Model Architect for Efficient In-Memory AI

$174.3k

TETRAMEM INC

TETRAMEM INC in San Jose, California, is seeking a qualified candidate to develop and engineer cutting-edge machine learning models for advanced memory technology. The role requires a Master’s degree in Computer Science and a minimum of one year of relevant experience. The successful applicant will work on model compression and optimization, contributing to high-performance AI computations. The salary range for this position is $174,304.00 to $175,000.00 per year, highlighting TetraMem’s commitment to top talent. #J-18808-Ljbffr

Vacancy posted 4 days ago
Similar jobs that could be interesting for youBased on the ML Model Architect for Efficient In-Memory AI in San Jose, CA vacancy
  • $254k - $349.25k

     ...how people, data, and AI agents connect across email...  ...seeking a Principal ML Architect to lead the design and...  ...requires deep expertise in model architecture, training,...  ...and compression for efficient deployment in enterprise...  ...tool use, reasoning, memory, and policy‑aware decision... 
    Suggested
    Flexible hours

    Proofpoint

    Sunnyvale, CA
    5 days ago
  • $254k - $349.25k

     ...how people, data, and AI agents connect across email...  ...seeking a Principal ML Architect to lead the design and...  ...deep expertise in model architecture, training,...  ...distillation and compression for efficient deployment in...  ...tool use, reasoning, memory, and policy-aware decision... 
    Suggested
    Flexible hours

    Proofpoint

    Sunnyvale, CA
    1 day ago
  • NVIDIA is hiring a Generative AI Technical Engagement Lead in Santa Clara, CA to support...  ...involves engaging with senior leaders at AI model builders, driving integration of NVIDIA technologies...  ...possess 7+ years of experience, strong AI/ML competencies, and hands-on knowledge of... 
    Suggested

    NVIDIA

    Santa Clara, CA
    5 days ago
  •  ...computing experiences-from AI and data centers,...  ...enable massive model training at scale....  ...organization, from efficient attention...  ...practices for distributed ML systems, you will create...  ...microarchitecture, memory hierarchies, and...  ...parallel dimensions Architect memory-efficient... 
    Suggested
    Remote work

    Advanced Micro Devices , Inc.

    Santa Clara, CA
    5 days ago
  • $206.4k - $379.1k

     ...impressive content. The AI Foundations team...  ...for a Principal Architect to build and implement...  ..., merging strong ML skills with proficiency...  ...to life - spanning model orchestration,...  ...tactics to enhance efficiency and cost-effectiveness...  ...- reasoning loops, memory persistence, task... 
    Suggested
    Temporary work
    Local area
    Worldwide
    Flexible hours

    Adobe

    San Jose, CA
    3 days ago
  •  ...Hybrid) About Velaura AI Velaura AI is...  ...constraints of energy efficiency, latency, and...  ...world‑class team of architects and engineers to tackle...  ...for a Performance Modeling Architect who will...  ...compute units, memory hierarchies, and...  ...with software and ML teams to incorporate... 

    Velaura

    Santa Clara, CA
    4 days ago
  •  ...Hybrid) About Velaura AI Velaura AI is...  ...constraints of energy efficiency, latency, and system...  ...of researchers, architects, and engineers to rethink...  ...intersection of AI model design and next‑...  ...compute and memory architectures. Investigate...  ...working with ML frameworks such as... 

    Velaura

    Santa Clara, CA
    5 days ago
  • $126.7k - $217.9k

     ...AI Accelerator Architecture Engineer...  ...widespread deployment of ML accelerators....  ...commercializing power-efficient on-device AI, edge...  ...largest of today's models. The AI Architecture...  ...architectures Architect enhancements required...  ...GEMM, GEMV Memory Technology and subsystems... 
    Work experience placement
    Flexible hours
    Night shift

    Qualcomm

    Santa Clara, CA
    2 days ago
  •  ...their hybrid cloud and AI journeys. With support...  ...seeking a Senior AI Architect - Enterprise Integrations...  ...fit-for-purpose model selection, efficient prompting & caching, and...  ...orchestration, tool calling, memory management, and human-...  ...+ years focused on AI/ML systems ~ Hands-on... 
    Worldwide

    IBM

    San Jose, CA
    1 day ago
  •  ...will design and ship AI systems that make...  ...writes the code they architect, and takes personal...  ..., secure, and efficient AI systems that reason...  ...covering data flows, model lifecycle, runtime...  ..., agentic memory. Ensure AI solutions...  ...building and deploying AI/ML solutions in... 
    Relocation

    Apple Inc.

    Cupertino, CA
    2 days ago
  • $208k - $327.75k

     ...accelerated computing, AI, and autonomous...  ...looking for a Senior AI Architect to help define the...  ...generation of AI model paradigms for...  ...including GPU, CPU, DLA, memory hierarchy,...  ...validate scalability, efficiency, and deployment feasibility...  ...experience in AI/ML systems, deep... 
    Worldwide

    NVIDIA Corporation

    Santa Clara, CA
    6 days ago
  •  ...Velaura is hiring an AI Research Architect to explore the intersection of AI model design and next-generation compute architectures...  ...to optimize system-level efficiency, especially for applications in...  ...transformers, and experience with ML frameworks such as PyTorch or TensorFlow... 

    Velaura

    Santa Clara, CA
    5 days ago
  •  ...Job Summary T he AI Interconnect Architect designs and engineers high-speed...  ...bandwidth, power efficiency, scalability, and optimized...  ...switched fabrics tailored for AI/ML scale-out. # Performance...  ...Ability to develop performance models through advanced modeling and... 

    Compunnel

    Milpitas, CA
    2 days ago
  • $272k - $431.25k

     ...group is solving some of AI’s hardest infrastructure...  .... This Principal Architect role leads the research...  ...disaggregated prefill/decode, model parallelism). Integrating...  ...computer architecture, memory hierarchies, DMA engines...  ...networking. Understanding of ML systems concepts—... 

    Jobleads-US

    Santa Clara, CA
    4 days ago
  •  ...Location: Hybrid - Onsite in Cupertino. We’re looking for an AI Model Architect & Creative Developer (Technical Production Designer), a detail...  ...and multi-agent patterns (planner‑critic‑actor, tool use, memory/knowledge retrieval, safety gates). Integrate AI models, APIs... 
    Contract work

    Salt Digital Recruitment

    Cupertino, CA
    5 days ago
  • $138k - $206k

     ...a Senior Engineer in Performance Architecture at our San Jose, CA headquarters. You will model and optimize AI systems, working within a research lab focused on advancing Memory Centric Computing. Candidates with relevant degrees and strong backgrounds in computer architecture... 
    Flexible hours

    Samsung Semiconductor

    San Jose, CA
    4 days ago
  •  ...Engineer for DRAM. The engineer will analyze system architectures and mentor junior engineers while driving innovation in memory solutions for AI workloads. This role requires a Bachelor's degree in Electrical Engineering and at least 12 years of engineering experience... 

    Micron Technology

    San Jose, CA
    1 day ago
  • $260k - $280k

     ...Skills We are looking for an AI Architect to design and lead the development...  ...‑tune and optimize frontier models, and design the guardrails,...  ...specialized agents, and shared memory. Define agent interfaces, policies...  ...5+ years building ML or LLM‑powered products (multi... 
    Worldwide

    Lattice Semiconductor

    San Jose, CA
    4 days ago
  •  ...IT Consulting is seeking a Senior Account Manager in San Jose, California. The candidate will manage high-value accounts and evaluate AI-generated content for accuracy in account management. Responsibilities include challenging AI with relevant scenarios and providing... 
    Contract work

    YO IT Consulting

    San Jose, CA
    4 days ago
  • $150k - $200k

     ...Principal AI/ML Architect Job in USA 2025 (USD 150,000 to 200,000) Are you ready to elevate your career in artificial...  ...AWS, Azure, or GCP. Manage MLOps Pipelines : Ensure efficient deployment and monitoring of AI models using tools like Docker and Kubernetes. Innovate and... 
    Full time

    NewsNowGh

    San Jose, CA
    4 days ago
  • $212k - $386.3k

     ...United States Machine Learning and AI Apple is where individual...  ...also develop generative AI‑based models and systems to power Apple...  ...devices more effectively and efficiently with less effort. Description...  ...engineering/applied research/ML experiences in search, natural... 
    Worldwide
    Relocation

    Apple Inc.

    Santa Clara, CA
    3 days ago
  • $200k - $250k

     ...Involved Responsibilities As an AI Architect, you will be responsible for...  ...opportunities for AI-driven efficiency, develop prototypes and bring...  .... Integrate large language models (LLMs) and other AI technologies...  ...building and deploying AI/ML or automation solutions in production... 
    Full time
    Contract work
    Local area
    Remote work

    PTR Global

    Cupertino, CA
    6 days ago
  • $272k - $431.25k

     ...NVIDIA Gruppe in Santa Clara is seeking a seasoned architect specializing in CPU/GPU systems. The role involves developing...  ...high-performance architectures and enhancing memory systems with a focus on energy efficiency. Candidates should have 18+ years of experience,... 

    NVIDIA Gruppe

    Santa Clara, CA
    5 days ago
  • $192k - $279k

     ...Staff Co-Design Engineer to drive innovations in AI/ML hardware acceleration. In this pivotal role, you...  ...to enable seamless integration of hardware and model architecture, focusing on high-performance and power-efficient solutions. Your responsibilities include optimizing... 

    Google Inc.

    Sunnyvale, CA
    6 days ago
  •  ...in Flash and advanced memory technologies, our solutions...  ...for breakthroughs in efficient operations. With our...  ...Description In this AI/ML ASIC Architecture position...  ....  As an AI/ML ASIC Architect you will help drive new...  ...attention mechanisms, and model parallelism techniques... 
    Temporary work
    Remote work
    Flexible hours
    Shift work
    Night shift

    Sandisk

    Milpitas, CA
    29 days ago
  •  ...in Flash and advanced memory technologies, our...  ...for breakthroughs in efficient operations. With our...  ...Description An AI Interconnect Architect defines and engineers...  ...fabrics tailored for AI/ML scale-out.  Performance...  ...serial links. ~ Modeling and SimulationL Ability... 
    Temporary work
    Remote work
    Flexible hours
    Shift work

    Sandisk

    Milpitas, CA
    26 days ago
  • $156k - $387.6k

     ...for ultra-large-scale language models, vision-language models, and...  ...systems, from multimodal data efficiency to convergence optimization for...  ...TP, PP, EP, operator fusion, memory optimization, and cluster-level...  ...of experience in large-scale ML systems, training infrastructure... 
    Temporary work
    Local area

    Tik Tok

    San Jose, CA
    1 day ago
  •  ...Applied Scientist to join the Delivery Foundation Model team in Santa Clara, CA. This role involves...  ...innovative foundation models to enhance logistics efficiency across Amazon's delivery network, leveraging cutting-edge AI and machine learning techniques. The ideal candidate... 

    Itlearn360

    Santa Clara, CA
    2 days ago
  • $212.3k - $275.8k

     ...Meet the Team Join Cisco's CX AI Incubation Team as an AI...  ...customer engagement and operational efficiency. You will focus on end-to-end...  ..., deployment automation, and model/service observability. This...  ...building production services for ML/AI workloads. Experience... 
    Full time
    Temporary work
    Local area
    Flexible hours
    3 days per week

    Cisco

    San Jose, CA
    6 days ago
  •  ...Vings Technologies is seeking an experienced AI/ML Architect to join our team in San Jose, CA. You will play a critical role in building and...  ...learning systems, overseeing all aspects from data ingestion to model serving. The ideal candidate will have over 8 years of hands-... 

    Vings Technologies

    San Jose, CA
    1 day ago

Do you want to receive more vacancies?

Subscribe and receive similar vacancies to ML Model Architect for Efficient In-Memory AI. Be the first to apply!