Staff Machine Learning Architect
Neurophos
About Neurophos We are developing an ultra-high-performance, energy-efficient photonic AI inference system. We’re transforming AI computation with the first-ever metamaterial-based optical processing unit (OPU). As AI adoption accelerates, data centers face significant power and scalability challenges. Traditional solutions are struggling to keep up, leading to rapidly rising energy consumption and costs. We’re solving both problems with an OPU that integrates over one million micron-scale optical processing components on a single chip. This architecture will deliver up to 100 times the energy efficiency of existing solutions while significantly improving large-scale AI inference performance. We’ve assembled a world‑class team of industry veterans and recently raised a $110M Series A led by Gates Frontier. Participants include M12 (Microsoft’s Venture Fund), Carbon Direct Capital, Aramco Ventures, Bosch Ventures, Tectonic Ventures, Space Capital, and others. We have also been recognized on the EE Times Silicon 100 list for several consecutive years. Join us and shape the future of optical computing! Location Austin, TX or San Francisco, CA. Full-time onsite position. Position Overview We are seeking an experienced machine learning architect to lead the porting and optimization of large language models (LLMs), diffusion models, and other ML applications to our revolutionary optical inference engines. This role is critical to demonstrating the full potential of our metamaterial‑based optical processing units (OPUs) by adapting state‑of‑the‑art AI models to leverage our ultra‑high‑throughput, low‑precision compute architecture. The ideal candidate will bridge the gap between cutting‑edge ML research and novel hardware capabilities, ensuring customers can seamlessly deploy their AI workloads on Neurophos hardware. Key Responsibilities Lead the porting of LLM applications, diffusion models, and visual ML applications to Neurophos optical inference engines Adapt models from diverse sources, including GitHub, Hugging Face, other open‑source repositories, and customer private models Work with models in various formats, including PyTorch, Triton, JAX, and emerging frameworks Develop and implement quantization strategies to migrate models from higher precision formats (FP8, INT8, and above) to our optimized 4‑bit precision (FP4/INT4) for weights and activations Design and execute re‑quantization, retraining, and other model adaptation techniques to minimize accuracy loss during precision reduction Create or integrate third‑party tools and workflows for efficient model porting and optimization Optimize GEMM operations for high‑throughput execution Develop benchmarking methodologies to measure and validate model quality post‑porting, including perplexity metrics and other quality indicators Collaborate with hardware and software teams to co‑optimize model architectures for optical compute characteristics Publish research papers on novel optimization techniques and methodologies (with appropriate IP protection) Qualifications MS or PhD in Computer Science, Data Science, Machine Learning, Mathematics, or related field 7+ years of experience in machine learning engineering with at least 3 years focused on model optimization and deployment Deep expertise in neural network quantization techniques, including post‑training quantization (PTQ) and quantization‑aware training (QAT) Strong proficiency in PyTorch and familiarity with other ML frameworks (JAX, Triton, TensorFlow) Hands‑on experience with transformer architectures, LLMs, and diffusion models Experience with low‑precision inference optimization (INT8, FP8, or lower) Strong understanding of GEMM operations and linear algebra optimizations for deep learning Experience with model evaluation metrics, including perplexity, accuracy, and benchmark suites Track record of successfully deploying ML models on specialized hardware accelerators Excellent communication skills with the ability to collaborate across hardware and software teams Preferred Skills Experience with sub‑8‑bit quantization (INT4, FP4) and mixed‑precision inference Familiarity with Hugging Face Transformers library and model hub ecosystem Experience with ONNX, TensorRT, or other model optimization frameworks Background in analog or optical computing architectures Knowledge of in‑memory computing paradigms and matrix‑vector multiplication acceleration Published research in model compression, quantization, or efficient inference Experience with large‑scale batch inference optimization Familiarity with prefill vs. decode optimization strategies in LLM inference What We Offer A pivotal role in an innovative startup redefining the future of AI hardware. A collaborative and intellectually stimulating work environment. Competitive compensation, including salary and equity options. Good benefits – health, vision, dental, 401(k), etc. Opportunities for career growth and future team leadership. Access to cutting‑edge technology and state‑of‑the‑art facilities. Opportunity to publish research and contribute to the field of efficient AI inference. If you are passionate about pushing the boundaries of model optimization and driving impact in the semiconductor industry, we want to hear from you! This is a rare opportunity to work on a game‑changing technology at the intersection of photonics and AI. As part of our elite team, you’ll contribute to a platform that redefines computational performance and accelerates the future of artificial intelligence. Be a key player in bringing this transformative innovation to the world. #J-18808-Ljbffr
$200k - $250k
A robotics and AI company in San Carlos, California is searching for a Firmware or Embedded Engineer to develop robust firmware for their humanoid robots. The candidate will work across teams to ensure hardware capabilities align with higher-level system requirements, ...Suggested- ...Zoox is seeking a strategic and execution-focused Sr/Staff Business Enablement Architect to lead the optimization and scaling of our core business... ...to market. Sitting at the intersection of robotics, machine learning, and design, Zoox aims to provide the next generation...SuggestedTemporary workImmediate startRelocation package
- ...A leading AI hardware company in Austin is looking for a Machine Learning Architect to lead the optimization of large language models and other ML applications for groundbreaking optical inference engines. Candidates should have extensive experience in neural network...Suggested
$295.25k - $345.04k
...I did my part and supported the Regular Toilet in San Mateo, CA is seeking a Principal Machine Learning Engineer to shape the future of machine learning systems for reliability. In this role, you will define the technical strategy and drive cross-functional collaboration...SuggestedFlexible hours$110k - $270k
...the industry today that can only accelerate a portion of a machine learning graph, the Quadric GPNPU executes both NN graph code and conventional... ...C++ DSP and control code. As a Senior Performance Architect, you will be the critical link between software and hardware...SuggestedWork at officeLocal areaImmediate startFlexible hours2 days per week3 days per week$90 - $100 per hour
...differentiated career for our employees. Role: Senior Manager – AI Architect Location: West Coast - US Job Description: We are... ...Technical Skills • Expertise in: o Python, R, SQL o Machine Learning & Deep Learning frameworks • Experience with: o Cloud...- ...FRG Technology Consulting is looking for an LLM Architect to elevate your career with an exciting contract opportunity. This remote... ...requires expertise in AWS and a strong understanding of machine learning and language model architectures. The ideal candidate will design...Contract workRemote work
- ...vehicle systems. Unlike other NPUs or neural network accelerators in the industry today that can only accelerate a portion of a machine learning graph, the Quadric GPNPU executes both NN graph code and conventional C++ DSP and control code. Role: This is a rare...Immediate start
$287k - $410k
...you will work within a multi-disciplinary engineering team to architect our next-generation compute platforms, both on-vehicle and... ...technology to market. Sitting at the intersection of robotics, machine learning, and design, Zoox aims to provide the next generation of...Temporary workWork experience placementRelocation package- ...As a Principal Architect, you'll be the driving force behind the next generation of our autonomous fulfillment solutions, leading the definition and design of cutting-edge machine learning versions for our core algorithms. You will be instrumental in evolving our product...
$185k - $220k
Join Our Team Come work at a place where innovation and teamwork come together to support the most exciting missions in the world! About the Role You will define and lead the reference architecture for third-party AI products within IT, enabling secure, scalable...- ...AI-First QE Architect Luminary helps engineering companies be more competitive by getting to market faster, creating new, better products, and reducing development risk. We do this with our Physics AI platform, the fastest and easiest way to build and deploy models...
- ...Quadric, Inc in Burlingame, CA is seeking a Senior Performance Architect to enhance software and hardware performance. You will be responsible for analyzing and optimizing workloads written in C++ and Python, pinpointing performance bottlenecks, and developing proof-of...2 days per week3 days per week
$200k
...United Cerebral Palsy of Georgia is seeking a leader to guide a talented team in engineering simulations using AI and machine learning. This role offers a competitive starting salary of $200,000 and the opportunity to shape systems and establish engineering standards....$231.28k - $429.52k
...F. Hoffmann-La Roche AG seeks a Principal Machine Learning Engineer in South San Francisco to lead the design and development of advanced machine learning models. You will oversee the lifecycle of models while ensuring compliance and collaborating with diverse teams....$210k - $250k
...to stand still. Required Qualifications ~5+ years of professional experience in software engineering with a focus on Machine Learning or AI. ~ Strong proficiency in Python ~ Strong familiarity with libraries such as NumPy, Pandas, and Scikit-learn....Full timeTemporary work- A leading AI research company in California is seeking an experienced Machine Learning Engineer to develop Vision-Language-Action models for robotics. The ideal candidate has over 5 years of expertise in machine learning, a commitment to data quality, and strong production...
- ...explore for critical resources. We are looking for a talented deep learning engineer or scientist to lead the development of this model... ..., cleaning, and maintaining high‑quality datasets tailored for machine learning applications Strong Software Engineering and Design Experience...
- ...Role Title: Workday Finance Architect Job Description :Key Responsibilities (Strategy & Configuration ~)Record to Report (R2R): Own the Financial Data Model (FDM). Configure Business Processes for Journals, Fixed Assets, and Allocations. Design the period-close...
$90k - $110k
...IA Interior Architects translates client goals, brand and culture into powerful environments built around people, processes, technologies... ...their enterprise forward, support their culture, engage their staff, integrate technology and drive efficiencies. As architects, designers...Contract workWork experience placementImmediate startWorldwide$185k - $225k
...help solve this problem by redefining how critical resources are discovered, evaluated, and developed. By combining advanced machine learning, probabilistic modeling, and deep geoscience expertise, Terra AI helps exploration and mining companies make faster, more informed...- Google Inc. is seeking a Senior Staff Tech Lead for YouTube Shorts Quality in San Bruno, CA. The role focuses on defining technical... ...expertise in software development, project leadership, and Machine Learning. This is a critical role in pushing Google's innovative...
- ...engineering. The ideal candidate is hands-on, pragmatic, and comfortable working in early-stage, exploratory AI efforts where speed and learning matter most. Key Responsibilities Rapid PoCs & Pilots: Build and iterate on quick-turn AI PoCs, pilots, and demos to...Work experience placementRemote work
- A healthcare staffing firm is seeking an experienced Epic Hospital Billing Analyst to join their team on a 3+ month contract in San Mateo, CA. The ideal candidate must have an active Epic Certification in Hospital Billing and at least 2 years of experience in Epic model...Contract workRemote work
$185k - $220k
QLYS_US Qualys, Inc. is looking for an expert to define and lead the reference architecture for AI products. You'll drive the design and implementation of complex AI workflows as you enable enterprise-wide AI adoption. The ideal candidate has 15+ years in data engineering...- A pioneering autonomous vehicle company is seeking a Software Systems Engineer to enhance the middleware components of its innovative robotaxi. You will drive best practices in software development, ensuring the safety and robustness of the system. Ideal candidates will...
- ...traditional barriers to application creation. We're seeking a RevOps Architect leader to define and drive the strategy, architecture, and... ...Gatherings In Office Amenities (In-Office Only) Want to learn more about what we are up to? Meet the Replit Agent...Full timeContract workTemporary workWork at officeWorldwideMonday to FridayFlexible hours
- ...Data Science, or Solution Architecture ~ Proven experience as an AI Architect / AI Consultant / ML Architect ~ Strong understanding of AI/ML frameworks (TensorFlow, PyTorch, Scikit-learn, etc.) ~ Experience with cloud platforms (AWS, Azure, or GCP AI...
- ...Copilot Architect Full-Time / Contract Foster City, CA 94404 We are seeking an experienced Copilot Architect to design, architect, and implement Microsoft Copilot solutions across enterprise environments. The ideal candidate will have strong expertise in Microsoft...Full timeContract work
$185k - $250k
...help solve this problem by redefining how critical resources are discovered, evaluated, and developed. By combining advanced machine learning, probabilistic modeling, and deep geoscience expertise, Terra AI helps exploration and mining companies make faster, more informed...
Do you want to receive more vacancies?
Subscribe and receive similar vacancies to Staff Machine Learning Architect. Be the first to apply!
- machine learning San Mateo, CA
- artificial intelligence - machine learning intern San Mateo, CA
- machine learning research scientist San Mateo, CA
- data engineer machine learning San Mateo, CA
- machine learning scientist San Mateo, CA
- machine learning remote San Mateo, CA
- amd machine learning
- ibm machine learning
- applied machine learning
- mls soccer



