Staff Machine Learning Architect

Neurophos

About Neurophos

We are developing an ultra-high-performance, energy-efficient photonic AI inference system. We’re transforming AI computation with the first-ever metamaterial-based optical processing unit (OPU). As AI adoption accelerates, data centers face significant power and scalability challenges. Traditional solutions are struggling to keep up, leading to rapidly rising energy consumption and costs. We’re solving both problems with an OPU that integrates over one million micron-scale optical processing components on a single chip. This architecture will deliver up to 100 times the energy efficiency of existing solutions while significantly improving large-scale AI inference performance.

We’ve assembled a world‑class team of industry veterans and recently raised a $110M Series A led by Gates Frontier. Participants include M12 (Microsoft’s Venture Fund), Carbon Direct Capital, Aramco Ventures, Bosch Ventures, Tectonic Ventures, Space Capital, and others. We have also been recognized on the EE Times Silicon 100 list for several consecutive years. Join us and shape the future of optical computing!

Location

Austin, TX or San Francisco, CA. Full-time onsite position.

Position Overview

We are seeking an experienced machine learning architect to lead the porting and optimization of large language models (LLMs), diffusion models, and other ML applications to our revolutionary optical inference engines. This role is critical to demonstrating the full potential of our metamaterial‑based optical processing units (OPUs) by adapting state‑of‑the‑art AI models to leverage our ultra‑high‑throughput, low‑precision compute architecture. The ideal candidate will bridge the gap between cutting‑edge ML research and novel hardware capabilities, ensuring customers can seamlessly deploy their AI workloads on Neurophos hardware.

Key Responsibilities

  • Lead the porting of LLM applications, diffusion models, and visual ML applications to Neurophos optical inference engines
  • Adapt models from diverse sources, including GitHub, Hugging Face, other open‑source repositories, and customer private models
  • Work with models in various formats, including PyTorch, Triton, JAX, and emerging frameworks
  • Develop and implement quantization strategies to migrate models from higher precision formats (FP8, INT8, and above) to our optimized 4‑bit precision (FP4/INT4) for weights and activations
  • Design and execute re‑quantization, retraining, and other model adaptation techniques to minimize accuracy loss during precision reduction
  • Create or integrate third‑party tools and workflows for efficient model porting and optimization
  • Optimize GEMM operations for high‑throughput execution
  • Develop benchmarking methodologies to measure and validate model quality post‑porting, including perplexity metrics and other quality indicators
  • Collaborate with hardware and software teams to co‑optimize model architectures for optical compute characteristics
  • Publish research papers on novel optimization techniques and methodologies (with appropriate IP protection)

Qualifications

  • MS or PhD in Computer Science, Data Science, Machine Learning, Mathematics, or related field
  • 7+ years of experience in machine learning engineering with at least 3 years focused on model optimization and deployment
  • Deep expertise in neural network quantization techniques, including post‑training quantization (PTQ) and quantization‑aware training (QAT)
  • Strong proficiency in PyTorch and familiarity with other ML frameworks (JAX, Triton, TensorFlow)
  • Hands‑on experience with transformer architectures, LLMs, and diffusion models
  • Experience with low‑precision inference optimization (INT8, FP8, or lower)
  • Strong understanding of GEMM operations and linear algebra optimizations for deep learning
  • Experience with model evaluation metrics, including perplexity, accuracy, and benchmark suites
  • Track record of successfully deploying ML models on specialized hardware accelerators
  • Excellent communication skills with the ability to collaborate across hardware and software teams

Preferred Skills

  • Experience with sub‑8‑bit quantization (INT4, FP4) and mixed‑precision inference
  • Familiarity with Hugging Face Transformers library and model hub ecosystem
  • Experience with ONNX, TensorRT, or other model optimization frameworks
  • Background in analog or optical computing architectures
  • Knowledge of in‑memory computing paradigms and matrix‑vector multiplication acceleration
  • Published research in model compression, quantization, or efficient inference
  • Experience with large‑scale batch inference optimization
  • Familiarity with prefill vs. decode optimization strategies in LLM inference

What We Offer

  • A pivotal role in an innovative startup redefining the future of AI hardware.
  • A collaborative and intellectually stimulating work environment.
  • Competitive compensation, including salary and equity options.
  • Good benefits – health, vision, dental, 401(k), etc.
  • Opportunities for career growth and future team leadership.
  • Access to cutting‑edge technology and state‑of‑the‑art facilities.
  • Opportunity to publish research and contribute to the field of efficient AI inference.

If you are passionate about pushing the boundaries of model optimization and driving impact in the semiconductor industry, we want to hear from you! This is a rare opportunity to work on a game‑changing technology at the intersection of photonics and AI. As part of our elite team, you’ll contribute to a platform that redefines computational performance and accelerates the future of artificial intelligence. Be a key player in bringing this transformative innovation to the world.

Vacancy posted 1 day ago