ML Model Architect for Efficient In-Memory AI
$174.3kTETRAMEM INC
TETRAMEM INC in San Jose, California, is seeking a qualified candidate to develop and engineer cutting-edge machine learning models for advanced memory technology. The role requires a Master’s degree in Computer Science and a minimum of one year of relevant experience. The successful applicant will work on model compression and optimization, contributing to high-performance AI computations. The salary range for this position is $174,304.00 to $175,000.00 per year, highlighting TetraMem’s commitment to top talent. #J-18808-Ljbffr
$254k - $349.25k
...how people, data, and AI agents connect across email... ...seeking a Principal ML Architect to lead the design and... ...requires deep expertise in model architecture, training,... ...and compression for efficient deployment in enterprise... ...tool use, reasoning, memory, and policy‑aware decision...SuggestedFlexible hours$254k - $349.25k
...how people, data, and AI agents connect across email... ...seeking a Principal ML Architect to lead the design and... ...deep expertise in model architecture, training,... ...distillation and compression for efficient deployment in... ...tool use, reasoning, memory, and policy-aware decision...SuggestedFlexible hours- NVIDIA is hiring a Generative AI Technical Engagement Lead in Santa Clara, CA to support... ...involves engaging with senior leaders at AI model builders, driving integration of NVIDIA technologies... ...possess 7+ years of experience, strong AI/ML competencies, and hands-on knowledge of...Suggested
- ...computing experiences-from AI and data centers,... ...enable massive model training at scale.... ...organization, from efficient attention... ...practices for distributed ML systems, you will create... ...microarchitecture, memory hierarchies, and... ...parallel dimensions Architect memory-efficient...SuggestedRemote work
$206.4k - $379.1k
...impressive content. The AI Foundations team... ...for a Principal Architect to build and implement... ..., merging strong ML skills with proficiency... ...to life - spanning model orchestration,... ...tactics to enhance efficiency and cost-effectiveness... ...- reasoning loops, memory persistence, task...SuggestedTemporary workLocal areaWorldwideFlexible hours- ...Hybrid) About Velaura AI Velaura AI is... ...constraints of energy efficiency, latency, and... ...world‑class team of architects and engineers to tackle... ...for a Performance Modeling Architect who will... ...compute units, memory hierarchies, and... ...with software and ML teams to incorporate...
- ...Hybrid) About Velaura AI Velaura AI is... ...constraints of energy efficiency, latency, and system... ...of researchers, architects, and engineers to rethink... ...intersection of AI model design and next‑... ...compute and memory architectures. Investigate... ...working with ML frameworks such as...
$126.7k - $217.9k
...AI Accelerator Architecture Engineer... ...widespread deployment of ML accelerators.... ...commercializing power-efficient on-device AI, edge... ...largest of today's models. The AI Architecture... ...architectures Architect enhancements required... ...GEMM, GEMV Memory Technology and subsystems...Work experience placementFlexible hoursNight shift- ...their hybrid cloud and AI journeys. With support... ...seeking a Senior AI Architect - Enterprise Integrations... ...fit-for-purpose model selection, efficient prompting & caching, and... ...orchestration, tool calling, memory management, and human-... ...+ years focused on AI/ML systems ~ Hands-on...Worldwide
- ...will design and ship AI systems that make... ...writes the code they architect, and takes personal... ..., secure, and efficient AI systems that reason... ...covering data flows, model lifecycle, runtime... ..., agentic memory. Ensure AI solutions... ...building and deploying AI/ML solutions in...Relocation
$208k - $327.75k
...accelerated computing, AI, and autonomous... ...looking for a Senior AI Architect to help define the... ...generation of AI model paradigms for... ...including GPU, CPU, DLA, memory hierarchy,... ...validate scalability, efficiency, and deployment feasibility... ...experience in AI/ML systems, deep...Worldwide- ...Velaura is hiring an AI Research Architect to explore the intersection of AI model design and next-generation compute architectures... ...to optimize system-level efficiency, especially for applications in... ...transformers, and experience with ML frameworks such as PyTorch or TensorFlow...
- ...Job Summary T he AI Interconnect Architect designs and engineers high-speed... ...bandwidth, power efficiency, scalability, and optimized... ...switched fabrics tailored for AI/ML scale-out. # Performance... ...Ability to develop performance models through advanced modeling and...
$272k - $431.25k
...group is solving some of AI’s hardest infrastructure... .... This Principal Architect role leads the research... ...disaggregated prefill/decode, model parallelism). Integrating... ...computer architecture, memory hierarchies, DMA engines... ...networking. Understanding of ML systems concepts—...- ...Location: Hybrid - Onsite in Cupertino. We’re looking for an AI Model Architect & Creative Developer (Technical Production Designer), a detail... ...and multi-agent patterns (planner‑critic‑actor, tool use, memory/knowledge retrieval, safety gates). Integrate AI models, APIs...Contract work
$138k - $206k
...a Senior Engineer in Performance Architecture at our San Jose, CA headquarters. You will model and optimize AI systems, working within a research lab focused on advancing Memory Centric Computing. Candidates with relevant degrees and strong backgrounds in computer architecture...Flexible hours- ...Engineer for DRAM. The engineer will analyze system architectures and mentor junior engineers while driving innovation in memory solutions for AI workloads. This role requires a Bachelor's degree in Electrical Engineering and at least 12 years of engineering experience...
$260k - $280k
...Skills We are looking for an AI Architect to design and lead the development... ...‑tune and optimize frontier models, and design the guardrails,... ...specialized agents, and shared memory. Define agent interfaces, policies... ...5+ years building ML or LLM‑powered products (multi...Worldwide- ...IT Consulting is seeking a Senior Account Manager in San Jose, California. The candidate will manage high-value accounts and evaluate AI-generated content for accuracy in account management. Responsibilities include challenging AI with relevant scenarios and providing...Contract work
$150k - $200k
...Principal AI/ML Architect Job in USA 2025 (USD 150,000 to 200,000) Are you ready to elevate your career in artificial... ...AWS, Azure, or GCP. Manage MLOps Pipelines : Ensure efficient deployment and monitoring of AI models using tools like Docker and Kubernetes. Innovate and...Full time$212k - $386.3k
...United States Machine Learning and AI Apple is where individual... ...also develop generative AI‑based models and systems to power Apple... ...devices more effectively and efficiently with less effort. Description... ...engineering/applied research/ML experiences in search, natural...WorldwideRelocation$200k - $250k
...Involved Responsibilities As an AI Architect, you will be responsible for... ...opportunities for AI-driven efficiency, develop prototypes and bring... .... Integrate large language models (LLMs) and other AI technologies... ...building and deploying AI/ML or automation solutions in production...Full timeContract workLocal areaRemote work$272k - $431.25k
...NVIDIA Gruppe in Santa Clara is seeking a seasoned architect specializing in CPU/GPU systems. The role involves developing... ...high-performance architectures and enhancing memory systems with a focus on energy efficiency. Candidates should have 18+ years of experience,...$192k - $279k
...Staff Co-Design Engineer to drive innovations in AI/ML hardware acceleration. In this pivotal role, you... ...to enable seamless integration of hardware and model architecture, focusing on high-performance and power-efficient solutions. Your responsibilities include optimizing...- ...in Flash and advanced memory technologies, our solutions... ...for breakthroughs in efficient operations. With our... ...Description In this AI/ML ASIC Architecture position... .... As an AI/ML ASIC Architect you will help drive new... ...attention mechanisms, and model parallelism techniques...Temporary workRemote workFlexible hoursShift workNight shift
- ...in Flash and advanced memory technologies, our... ...for breakthroughs in efficient operations. With our... ...Description An AI Interconnect Architect defines and engineers... ...fabrics tailored for AI/ML scale-out. Performance... ...serial links. ~ Modeling and SimulationL Ability...Temporary workRemote workFlexible hoursShift work
$156k - $387.6k
...for ultra-large-scale language models, vision-language models, and... ...systems, from multimodal data efficiency to convergence optimization for... ...TP, PP, EP, operator fusion, memory optimization, and cluster-level... ...of experience in large-scale ML systems, training infrastructure...Temporary workLocal area- ...Applied Scientist to join the Delivery Foundation Model team in Santa Clara, CA. This role involves... ...innovative foundation models to enhance logistics efficiency across Amazon's delivery network, leveraging cutting-edge AI and machine learning techniques. The ideal candidate...
$212.3k - $275.8k
...Meet the Team Join Cisco's CX AI Incubation Team as an AI... ...customer engagement and operational efficiency. You will focus on end-to-end... ..., deployment automation, and model/service observability. This... ...building production services for ML/AI workloads. Experience...Full timeTemporary workLocal areaFlexible hours3 days per week- ...Vings Technologies is seeking an experienced AI/ML Architect to join our team in San Jose, CA. You will play a critical role in building and... ...learning systems, overseeing all aspects from data ingestion to model serving. The ideal candidate will have over 8 years of hands-...
Do you want to receive more vacancies?
Subscribe and receive similar vacancies to ML Model Architect for Efficient In-Memory AI. Be the first to apply!
- machine learning scientist San Jose, CA
- machine learning remote San Jose, CA
- machine learning San Jose, CA
- machine learning researcher San Jose, CA
- artificial intelligence - machine learning intern San Jose, CA
- machine learning part time San Jose, CA
- internship machine learning San Jose, CA
- data engineer machine learning San Jose, CA
- machine learning intern San Jose, CA
- machine learning research scientist San Jose, CA

