Staff Machine Learning Architect
Neurophos Inc
About Neurophos The demand for new datacenters and AI compute is rapidly outpacing the planet's energy capacity. Digital solutions are hitting a power wall as we approach the physical limits of traditional silicon. Conquering this bottleneck isn't about bigger chips or more of them; it means rethinking the fundamental architecture. The industry's current path isn't going to meet the need, so we took a different approach. Instead of traditional electronic circuits, we use silicon photonics and an active, programmable metasurface to perform matrix multiplications at the speed of light. Our optical cells are 10,000x smaller than traditional photonic components, enabling unprecedented density. By using photonics instead of electricity, our chips become more efficient as they scale. This architecture will deliver up to 100 times the energy efficiency of existing solutions while significantly improving performance for large-scale AI inference. We've assembled a world-class team of industry veterans and recently raised a $110M Series A led by Gates Frontier. Participants include M12 (Microsoft's Venture Fund), Carbon Direct Capital, Aramco Ventures, Bosch Ventures, Tectonic Ventures, Space Capital, and others. We have also been recognized on the EE Times Silicon 100 list for several consecutive years. Join us and shape the future of computing! Position Overview: We are seeking an experienced machine learning architect to lead the porting and optimization of large language models (LLMs), diffusion models, and other ML applications to our revolutionary optical inference engines. This role is critical to demonstrating the full potential of our metamaterial-based optical processing units (OPUs) by adapting state-of-the-art AI models to leverage our ultra-high-throughput, low-precision compute architecture. The ideal candidate will bridge the gap between cutting-edge ML research and novel hardware capabilities, ensuring customers can seamlessly deploy their AI workloads on Neurophos hardware. Location: Austin, TX or San Jose, CA. Full-time onsite position. Key Responsibilities:
- Lead the porting of LLM applications, diffusion models, and visual ML applications to Neurophos optical inference engines
- Adapt models from diverse sources, including GitHub, Hugging Face, other open-source repositories, and customer private models
- Work with models in various formats, including PyTorch, Triton, JAX, and emerging frameworks
- Develop and implement quantization strategies to migrate models from higher precision formats (FP8, INT8, and above) to our optimized 4-bit precision (FP4/INT4) for weights and activations
- Design and execute re-quantization, retraining, and other model adaptation techniques to minimize accuracy loss during precision reduction
- Create or integrate third-party tools and workflows for efficient model porting and optimization
- Optimize GEMM operations for high-throughput execution
- Develop benchmarking methodologies to measure and validate model quality post-porting, including perplexity metrics and other quality indicators
- Collaborate with hardware and software teams to co-optimize model architectures for optical compute characteristics
- Publish research papers on novel optimization techniques and methodologies (with appropriate IP protection)
- MS or PhD in Computer Science, Data Science, Machine Learning, Mathematics, or related field
- 7+ years of experience in machine learning engineering with at least 3 years focused on model optimization and deployment
- Deep expertise in neural network quantization techniques, including post-training quantization (PTQ) and quantization-aware training (QAT)
- Strong proficiency in PyTorch and familiarity with other ML frameworks (JAX, Triton, TensorFlow)
- Hands-on experience with transformer architectures, LLMs, and diffusion models
- Experience with low-precision inference optimization (INT8, FP8, or lower)
- Strong understanding of GEMM operations and linear algebra optimizations for deep learning
- Experience with model evaluation metrics, including perplexity, accuracy, and benchmark suites
- Track record of successfully deploying ML models on specialized hardware accelerators
- Excellent communication skills with the ability to collaborate across hardware and software teams
- Experience with sub-8-bit quantization (INT4, FP4) and mixed-precision inference
- Familiarity with Hugging Face Transformers library and model hub ecosystem
- Experience with ONNX, TensorRT, or other model optimization frameworks
- Background in analog or optical computing architectures
- Knowledge of in-memory computing paradigms and matrix-vector multiplication acceleration
- Published research in model compression, quantization, or efficient inference
- Experience with large-scale batch inference optimization
- Familiarity with prefill vs. decode optimization strategies in LLM inference
- 100% coverage of base health plan premiums for you and your dependents, plus HSA contributions.
- Unlimited PTO. No rigid vacation banks, just a focus on delivery.
- 401(k) matching and stock option opportunities to ensure our success is your success.
- Full suite of voluntary benefits, including Dental, Vision, Life, Hospital, Critical Illness, and Accident insurance.
- Personalized Benefits. Choose the plans that fit your life and take the cash back for those that don't.
Vacancy posted 7 hours ago
Similar jobs that could be interesting for youBased on the Staff Machine Learning Architect in San Jose, CA vacancy
- ...industry-leading training and inference speeds and empowers machine learning users to effortlessly run large-scale ML applications, without... .... About The Role As a Compute / Server Platform Architect on the Cluster Architecture Team, you will own the server-side...SuggestedLocal area
$227k - $320k
A leading technology company is seeking a Senior Staff Architect in Sunnyvale, CA. In this role, you will advance TPU technology and contribute to silicon design for Google's AI/ML applications. With over 10 years of technical leadership in silicon design and a Bachelor...Suggested$192k - $278k
Google Inc. in Sunnyvale, CA is seeking a Staff Architect for Digital Signal Processing in Google Cloud. This role involves architecting core algorithms for next-gen data center interconnects, focusing on communications and forward error correction. The ideal candidate...Suggested$192k - $278k
A leading tech company is seeking a Staff Design Engineer in Sunnyvale, CA to drive DSP technology in AI/ML applications. This role requires expertise in Digital Signal Processing and high-speed logic design, along with experience in MATLAB, Python, or C++. You'll contribute...Suggested$155k - $225k
...users that is easy to set up and interact with every day. Staff SaaS Platform Architect Location: Milpitas, CA (hybrid) We are looking... ...growth in areas essential to the role. Interested in learning more about our workplace? Visit and follow our LinkedIn,...Suggested$232k - $258k
Uber is seeking a Staff Backend Engineer for the Enterprise Identity Platform team in Sunnyvale, CA. You will architect identity services for scalability while mentoring senior engineers and collaborating across teams to align solutions with business objectives. The position...$196.8k - $283.2k
Intuitive is seeking a Staff Power Electronics Design Engineer for its advanced R&D group in Sunnyvale, CA. In this role, you'll develop high-voltage, high-frequency switch mode power electronics systems for medical applications, and be responsible for defining architectures...$227k - $320k
Senior Staff Architect, Silicon, Google Cloud Sunnyvale, CA, USA Apply Bachelor’s degree in Electrical Engineering, Computer Science, a related... ...salary only, and do not include bonus, equity, or benefits. Learn more about benefits at Google. Responsibilities Build...Full timeWorldwide- ...bandwidth, gate counts, compute/memory blocks. Work with frontend architects and backend design to compile performance monitor availability,... ...expertise, kindness, dedication and a willingness to embrace challenges and learn together every day. #J-18808-Ljbffr d-Matrix inc.3 days per week
$192k - $278k
Staff Architect, Digital Signal Processing, Google Cloud Google Sunnyvale, CA, USA Bachelor's degree in Electrical Engineering, Computer Engineering... ...salary only, and do not include bonus, equity, or benefits. Learn more about benefits at Google. Google is a global company and...Full timeWorldwide- NVIDIA Corporation is seeking a Senior Staff Engineer for Enterprise Messaging Platforms to manage and enhance their global email and messaging infrastructure. This role involves architecting solutions with Microsoft Exchange and Azure services, ensuring high availability...
- Google Inc. is seeking a Staff Software Engineer, Firmware for ARM SoCs in Sunnyvale, CA. This vital role will focus on leading the architecture, design, and development of firmware for next-generation systems. Candidates should possess a bachelor's degree and extensive...
$240k
...Opportunity Are you a highly skilled NLP Architect with a strong background in natural language processing and machine learning, looking to push the boundaries of AI-driven... ...innovative ways. About the Team The senior staff engineer will join the Unified Storage team...Work at officeRemote workRelocation package3 days per week$206.4k - $379.1k
...motion, and personalization. We're looking for a Principal Architect to build and implement the AI framework for Adobe Express,... ...or equivalent experience in Computer Science, Data Science, Machine Learning, or a related technical field. Experience architecting AI...Temporary workLocal areaWorldwideFlexible hours- ...hybrid retrieval. Proven track record architecting and shipping multi-agent systems,... ...Strong ML fundamentals: XGBoost, deep learning, NLP, time-series forecasting, propensity... ...related quantitative field AWS Certified Machine Learning Engineer or GCP Professional ML...
- ...Mandatory Skills Agentic AI/ADK/Python We are looking for a skilled MLOps Architect to join our team and help us build, deploy, and maintain robust and scalable machine learning systems. You will be responsible for the full lifecycle of our ML pipelines, from data ingestion...
- ...across AMD.Applied Research and Engineering team is looking for the following KEY RESPONSIBILITIES: Proficiency in machine learning in (neural networks) and Artificial Intelligence Ability to program in low level languages (x86 asm, SSE, ISA) General software...
$254.34k - $310.86k
...segment of chip design, including artificial intelligence, machine learning, automotive, data center, mobile, and consumer. With SiFive,... ...research and develop next generation high performance SoC. The SoC architect will guide the team to develop system IPs such as IOMMU,...Work experience placement- ...AI/Gen AI Architect Location: Sunnyvale, CA (3x/ week onsite) Duration: 6 months... ..., including supervised and unsupervised learning, deep learning, anomaly detection, and large... ...NLU (Natural Language Understanding), ML (Machine Learning), Conversational AI •...
- ...Senior AI Architect Nexxa.ai is building artificial super intelligence for heavy industries — enabling machines, systems and operations to think, decide and act autonomously across... ...experience in software engineering, machine learning, data science, or closely related...
- ...create engaging, intelligent, and personalized conversational experiences for millions of Apple users. We are seeking a Machine Learning Architect to serve as a senior technical leader spanning the full Speech organization. You will set the future modeling direction...
- ...ML Architect With Data Bricks And Azure ADF Location: Santa Clara, CA - onsite Duration... ...areas: supervised & unsupervised learning, deep learning, reinforcement learning,... ...areas: LLM, NLP, DL (Deep Learning), ML (Machine Learning), object detection/classification...
$172.5k - $306.63k
...next big idea can come from anywhere-and it might come from you. The Opportunity Adobe is seeking a Senior Machine Learning Architect to help define and deliver the next generation of AI-powered user experiences across Adobe Experience Cloud. This role...Temporary workLocal areaWorldwide$254.34k - $310.86k
...segment of chip design, including artificial intelligence, machine learning, automotive, data center, mobile, and consumer. With SiFive,... ...next generation high performance processors. The performance architect will guide the performance team to work closely with the architect...Work experience placement$172.5k - $306.63k
...believe the next big idea can come from anywhere-and it might come from you. The Opportunity Adobe is seeking a Senior Machine Learning Architect to help define and deliver the next generation of AI-powered user experiences across Adobe Experience Cloud. This role...Temporary workLocal area- ...cloud architecture optimization for client solutions. Architect innovative AI solutions from ideation to MVP, delivering business... ...Proficient in Python, Java, or Go languages. ~ Advanced machine learning and GenAI models (GPT, BERT, etc.). ~ Familiarity with...Work at office
$190.61k - $361.48k
...alternatives that meet constraints related to performance, power, area, and timing. Collaborate with cross-functional teams including architects, design, verification, and validation engineers to execute project requirements seamlessly. Deliver new microarchitecture...Local areaImmediate startShift work$168k - $264.5k
...tools. Our team develops these tools by fusing advances in parallel computing, machine learning, and specialized algorithms for VLSI design. We are seeking a Senior P&R Methodology Architect to define and own the next generation RTL2GDS flow for advanced nodes (3nm and...$240k - $334k
Google Inc. in Sunnyvale, CA is seeking a Power and Performance Architect to drive innovative TPU technology. This role entails... ...for next-gen TPU SOCs, optimizing performance-per-watt across machine learning workloads, and collaborating with various teams for effective...- Velaura is hiring an AI Research Architect to explore the intersection of AI model design and next-generation compute architectures... .... Ideal candidates should have a strong background in machine learning, a deep understanding of model architectures like transformers...
Do you want to receive more vacancies?
Subscribe and receive similar vacancies to Staff Machine Learning Architect. Be the first to apply!
Related searches
- machine learning research scientist San Jose, CA
- machine learning part time San Jose, CA
- artificial intelligence - machine learning intern San Jose, CA
- machine learning San Jose, CA
- machine learning researcher San Jose, CA
- machine learning intern San Jose, CA
- data engineer machine learning San Jose, CA
- machine learning scientist San Jose, CA
- internship machine learning San Jose, CA
- machine learning remote San Jose, CA

