Staff Machine Learning Architect

Neurophos Inc

About Neurophos

The demand for new datacenters and AI compute is rapidly outpacing the planet's energy capacity. Digital solutions are hitting a power wall as we approach the physical limits of traditional silicon. Conquering this bottleneck isn't about bigger chips or more of them; it means rethinking the fundamental architecture. The industry's current path isn't going to meet the need, so we took a different approach.

Instead of traditional electronic circuits, we use silicon photonics and an active, programmable metasurface to perform matrix multiplications at the speed of light. Our optical cells are 10,000x smaller than traditional photonic components, enabling unprecedented density. By using photonics instead of electricity, our chips become more efficient as they scale. This architecture will deliver up to 100 times the energy efficiency of existing solutions while significantly improving performance for large-scale AI inference.

We've assembled a world-class team of industry veterans and recently raised a $110M Series A led by Gates Frontier. Participants include M12 (Microsoft's Venture Fund), Carbon Direct Capital, Aramco Ventures, Bosch Ventures, Tectonic Ventures, Space Capital, and others. We have also been recognized on the EE Times Silicon 100 list for several consecutive years.

Join us and shape the future of computing!

Position Overview:

We are seeking an experienced machine learning architect to lead the porting and optimization of large language models (LLMs), diffusion models, and other ML applications to our revolutionary optical inference engines. This role is critical to demonstrating the full potential of our metamaterial-based optical processing units (OPUs) by adapting state-of-the-art AI models to leverage our ultra-high-throughput, low-precision compute architecture. The ideal candidate will bridge the gap between cutting-edge ML research and novel hardware capabilities, ensuring customers can seamlessly deploy their AI workloads on Neurophos hardware.

Location: Austin, TX or San Jose, CA. Full-time onsite position.

Key Responsibilities:
  • Lead the porting of LLM applications, diffusion models, and visual ML applications to Neurophos optical inference engines
  • Adapt models from diverse sources, including GitHub, Hugging Face, other open-source repositories, and customer private models
  • Work with models in various formats, including PyTorch, Triton, JAX, and emerging frameworks
  • Develop and implement quantization strategies to migrate models from higher-precision formats (FP8, INT8, and above) to our optimized 4-bit precision (FP4/INT4) for weights and activations
  • Design and execute re-quantization, retraining, and other model adaptation techniques to minimize accuracy loss during precision reduction
  • Create or integrate third-party tools and workflows for efficient model porting and optimization
  • Optimize GEMM operations for high-throughput execution
  • Develop benchmarking methodologies to measure and validate model quality post-porting, including perplexity metrics and other quality indicators
  • Collaborate with hardware and software teams to co-optimize model architectures for optical compute characteristics
  • Publish research papers on novel optimization techniques and methodologies (with appropriate IP protection)

Qualifications:
  • MS or PhD in Computer Science, Data Science, Machine Learning, Mathematics, or a related field
  • 7+ years of experience in machine learning engineering with at least 3 years focused on model optimization and deployment
  • Deep expertise in neural network quantization techniques, including post-training quantization (PTQ) and quantization-aware training (QAT)
  • Strong proficiency in PyTorch and familiarity with other ML frameworks (JAX, Triton, TensorFlow)
  • Hands-on experience with transformer architectures, LLMs, and diffusion models
  • Experience with low-precision inference optimization (INT8, FP8, or lower)
  • Strong understanding of GEMM operations and linear algebra optimizations for deep learning
  • Experience with model evaluation metrics, including perplexity, accuracy, and benchmark suites
  • Track record of successfully deploying ML models on specialized hardware accelerators
  • Excellent communication skills with the ability to collaborate across hardware and software teams

Preferred Skills:
  • Experience with sub-8-bit quantization (INT4, FP4) and mixed-precision inference
  • Familiarity with Hugging Face Transformers library and model hub ecosystem
  • Experience with ONNX, TensorRT, or other model optimization frameworks
  • Background in analog or optical computing architectures
  • Knowledge of in-memory computing paradigms and matrix-vector multiplication acceleration
  • Published research in model compression, quantization, or efficient inference
  • Experience with large-scale batch inference optimization
  • Familiarity with prefill vs. decode optimization strategies in LLM inference

What We Offer

This is an opportunity to play a pivotal role in an innovative startup redefining the future of AI hardware. Work on a game-changing technology at the intersection of photonics and AI as part of a collaborative and brilliant team. You'll contribute to a platform that redefines computational performance and accelerates the future of artificial intelligence. Come help us bring this transformative technology to the world.

Benefits

Join a team that invests in your future and your well-being. At Neurophos, we offer:
  • 100% coverage of base health plan premiums for you and your dependents, plus HSA contributions.
  • Unlimited PTO. No rigid vacation banks, just a focus on delivery.
  • 401(k) matching and stock option opportunities to ensure our success is your success.
  • Full suite of voluntary benefits, including Dental, Vision, Life, Hospital, Critical Illness, and Accident insurance.
  • Personalized Benefits. Choose the plans that fit your life and take the cash back for those that don't.

Vacancy posted 7 hours ago