Staff Machine Learning Architect

Neurophos Inc

About Neurophos

The demand for new datacenters and AI compute is rapidly outpacing the planet's energy capacity. Digital solutions are hitting a power wall as we approach the physical limits of traditional silicon. Conquering this bottleneck isn't about bigger chips or more of them; it means rethinking the fundamental architecture. The industry's current path isn't going to meet the need, so we took a different approach.

Instead of traditional electronic circuits, we use silicon photonics and an active, programmable metasurface to perform matrix multiplications at the speed of light. Our optical cells are 10,000x smaller than traditional photonic components, enabling unprecedented density. By using photonics instead of electricity, our chips become more efficient as they scale. This architecture will deliver up to 100 times the energy efficiency of existing solutions while significantly improving performance for large-scale AI inference.

We've assembled a world-class team of industry veterans and recently raised a $110M Series A led by Gates Frontier. Participants include M12 (Microsoft's Venture Fund), Carbon Direct Capital, Aramco Ventures, Bosch Ventures, Tectonic Ventures, Space Capital, and others. We have also been recognized on the EE Times Silicon 100 list for several consecutive years.

Join us and shape the future of computing!

Position Overview:

We are seeking an experienced machine learning architect to lead the porting and optimization of large language models (LLMs), diffusion models, and other ML applications to our revolutionary optical inference engines. This role is critical to demonstrating the full potential of our metamaterial-based optical processing units (OPUs) by adapting state-of-the-art AI models to leverage our ultra-high-throughput, low-precision compute architecture. The ideal candidate will bridge the gap between cutting-edge ML research and novel hardware capabilities, ensuring customers can seamlessly deploy their AI workloads on Neurophos hardware.

Location: Austin, TX or San Jose, CA. Full-time onsite position.

Key Responsibilities:

Lead the porting of LLM applications, diffusion models, and visual ML applications to Neurophos optical inference engines
Adapt models from diverse sources, including GitHub, Hugging Face, other open-source repositories, and customer private models
Work with models written in various frameworks and languages, including PyTorch, Triton, JAX, and emerging frameworks
Develop and implement quantization strategies to migrate models from higher precision formats (FP8, INT8, and above) to our optimized 4-bit precision (FP4/INT4) for weights and activations
Design and execute re-quantization, retraining, and other model adaptation techniques to minimize accuracy loss during precision reduction
Create tools and workflows, or integrate third-party ones, for efficient model porting and optimization
Optimize GEMM operations for high-throughput execution
Develop benchmarking methodologies to measure and validate model quality post-porting, including perplexity metrics and other quality indicators
Collaborate with hardware and software teams to co-optimize model architectures for optical compute characteristics
Publish research papers on novel optimization techniques and methodologies (with appropriate IP protection)

Qualifications:

MS or PhD in Computer Science, Data Science, Machine Learning, Mathematics, or related field
7+ years of experience in machine learning engineering with at least 3 years focused on model optimization and deployment
Deep expertise in neural network quantization techniques, including post-training quantization (PTQ) and quantization-aware training (QAT)
Strong proficiency in PyTorch and familiarity with other ML frameworks and tools (JAX, Triton, TensorFlow)
Hands-on experience with transformer architectures, LLMs, and diffusion models
Experience with low-precision inference optimization (INT8, FP8, or lower)
Strong understanding of GEMM operations and linear algebra optimizations for deep learning
Experience with model evaluation metrics, including perplexity, accuracy, and benchmark suites
Track record of successfully deploying ML models on specialized hardware accelerators
Excellent communication skills with the ability to collaborate across hardware and software teams

Preferred Skills:

Experience with sub-8-bit quantization (INT4, FP4) and mixed-precision inference
Familiarity with Hugging Face Transformers library and model hub ecosystem
Experience with ONNX, TensorRT, or other model optimization frameworks
Background in analog or optical computing architectures
Knowledge of in-memory computing paradigms and matrix-vector multiplication acceleration
Published research in model compression, quantization, or efficient inference
Experience with large-scale batch inference optimization
Familiarity with prefill vs. decode optimization strategies in LLM inference

What We Offer

This is an opportunity to play a pivotal role in an innovative startup redefining the future of AI hardware. Work on a game-changing technology at the intersection of photonics and AI as part of a collaborative and brilliant team. You'll contribute to a platform that redefines computational performance and accelerates the future of artificial intelligence. Come help us bring this transformative technology to the world.

Benefits

Join a team that invests in your future and your well-being. At Neurophos, we offer:

100% coverage of base health plan premiums for you and your dependents, plus HSA contributions.
Unlimited PTO. No rigid vacation banks, just a focus on delivery.
401(k) matching and stock option opportunities to ensure our success is your success.
Full suite of voluntary benefits, including Dental, Vision, Life, Hospital, Critical Illness, and Accident insurance.
Personalized Benefits. Choose the plans that fit your life and take the cash back for those that don't.

Vacancy posted 7 hours ago
