ML Systems Engineer

$150k - $350k

Full-time

ChipAgents

About ChipAgents ChipAgents is redefining the future of chip design and verification with agentic AI workflows. Our platform leverages cutting-edge generative AI to assist engineers in RTL design, simulation, and verification, dramatically accelerating chip development. Founded by experts in AI and semiconductor engineering, we partner with top semiconductor firms, cloud providers, and innovative startups to build intelligent AI agents. The company is a Series A company backed by tier-1 VC firms. ChipAgents is deployed in production to companies that have shipped 16B chips. Position Overview We are seeking an ML Systems Engineer to optimize the performance and efficiency of large language model inference powering our agentic AI platform. This is a technical role focused on low-level systems optimization. You will implement performance optimizations, build evaluation harnesses, and architect multi-node clusters for training and inference that push the limits of LLM throughput and latency. Your work will directly impact the responsiveness and cost-efficiency of AI agents used by leading semiconductor companies to design chips. Key Responsibilities Design, deploy, and optimize LLM inference systems across multi-node clusters, maximizing throughput and minimizing latency for production workloads. Implement and benchmark concrete inference optimizations. Profile and analyze inference bottlenecks at the systems level—from GPU kernel execution to memory bandwidth constraints. Build robust evaluation harnesses and benchmarking frameworks that measure accuracy, throughput, latency, and resource utilization across various parallelism strategies. Collaborate with research scientists to integrate new model architectures and optimizations into production inference infrastructure. Investigate and apply emerging techniques from research papers and open-source projects to continuously improve inference performance. Qualifications B.S., M.S., or PhD in Computer Science, Electrical Engineering, or related field (or equivalent experience). Experience with large-scale ML systems, GPU computing, or high-performance inference optimization. Strong proficiency in Python and C++/CUDA; hands-on experience with SGLang, vLLM, PyTorch, or similar inference frameworks. Deep understanding of GPU architecture, memory hierarchies, and parallel computing paradigms. Experience deploying and optimizing LLMs in production: model serving, batching strategies, distributed inference, or quantization. Strong systems-level debugging and profiling skills; comfort working at multiple layers of the stack from CUDA kernels to application logic. Familiarity with distributed computing frameworks (Ray, multi-node training/inference) is a plus. Self-directed problem solver who is interested in working on ambitious optimization challenges. Why Join Us Work on cutting-edge LLM inference optimization problems with real-world production impact. Access to substantial GPU compute resources for experimentation and benchmarking. Collaborate with a world-class team spanning AI research, systems engineering, and EDA. Shape the performance characteristics of AI systems used by leading semiconductor companies. What we offer $150K/yr – $350K/yr + Offers Equity. We are open to discuss above-scale compensation with exceptional candidates on a case-by-case basis. Unlimited PTO and full benefits (medical, vision, dental, 401k). Two engineering-centric offices with free parking, private gym, and free lunch, drinks and snacks.

Apply

Vacancy posted 4 days ago

Similar jobs that could be interesting for youBased on the ML Systems Engineer in San Jose, CA vacancy

Staff ML Engineer — Production ML & Mentorship (Equity)
...financial technology firm in San Jose seeks a Staff Machine Learning Engineer to develop and implement production-scale machine learning... ...machine learning, strong Python skills, and proficiency with major ML libraries. Responsibilities include designing models for...
Suggested
Venmo
San Jose, CA
1 hour ago
Senior ML Performance Engineer
...: a hardware-agnostic platform that makes every system — from a laptop to a supercomputer — feel like one seamless engine. Developers can write once, run anywhere, and get... .... About the Role We're looking for a Senior ML Performance Engineer to architect and lead our...
Suggested
Lemurian Labs
Santa Clara, CA
25 days ago
Sr Machine Learning Engineer 5 - AEP, Agentic System
$172.5k - $306.63k
...Senior Machine Learning Engineer At Adobe's Experience Platform, we are looking for a Senior... ..., and operate scalable intelligent AI systems that power end-user AI products. You will... .... Establish safety, governance, and ML Ops guidelines (guardrails, observability...
Suggested
Temporary work
Local area
Worldwide
Adobe
San Jose, CA
3 days ago
Staff ML Engineer, Autonomy & Planning
$240k - $275k
...layer. We are looking for a Staff Autonomy Engineer to own thislayerend to end. You will... ...shipping autonomy, planning, or decision-making systems to production in robotics, autonomous... ...learned autonomy systems, or large-scale ML for real-time decision making. ~ Hands...
Suggested
Full time
Knightscope
Sunnyvale, CA
4 days ago
ML Engineer / Generalist
...ML Engineer / Generalist HypeLab is a small, profitable ad network operating at real marketplace scale. We process more than 1B ad requests... ...with ML and data at the center of it. You will work on the systems that decide which ads we show, how we bid, how we predict...
Suggested
Hypelab
San Jose, CA
2 days ago
Senior ML Engineer - Embodied AI Scaling Foundations
$159.3k - $230.7k
...-from breakthrough hardware and battery systems to intuitive design, intelligent software... ...The team directly works on and delivers ML models to the product that successively go... ...collaborative, high-impact team of AI/ML engineers, data scientists and engineers who are passionate...
Local area
Remote work
Work from home
Relocation package
Flexible hours
General Motors
Sunnyvale, CA
2 days ago
ML Engineers - with LLM GenAI
...Job Title: ML Engineers - with LLM GenAI (3 Resources) Responsibilities Write efficient machine learning workflows and pipelines Training pipeline - Ingest/Preprocess/Vectorize and index data Inference pipeline - AI Guided workflow to respond to user...
Work experience placement
Sparktek
San Jose, CA
5 days ago
Senior ML Accelerator Engineer - GPU
$128.7k - $261.3k
...including Level 4–capable fully self-driving systems, to move us toward safer, more... ...export, kernel development, and performance engineering so that every cycle on our accelerators translates... ...that sit at the heart of our on‑vehicle ML inference for ADAS and autonomous...
Local area
Work from home
Relocation package
Flexible hours
General Motors
Sunnyvale, CA
4 days ago
ML Engineer
...ML Engineer Santa Clara, California, United States About the Job Our client is a rapidly growing Tier 1 VC backed startup based... ...long-term growth trajectory in the evolving world of intelligent systems. Location New York, NY Work Type Full Time...
Full time
Catalyst Labs, LLC
Santa Clara, CA
3 days ago
ML Runtime Optimization Engineer
...Software Engineer Applied Intuition, Inc. is powering the future of physical AI. Founded... ...areas: tools and infrastructure, operating systems, and autonomy. Eighteen of the top 20... ...engineer with deep experience in optimizing ML models and deploying them on production-...
For contractors
For subcontractor
Casual work
Work at office
Remote work
Day shift
Applied Intuition
Sunnyvale, CA
1 day ago
Senior ML Compiler Engineer
$128.7k - $261.3k
...including Level 4-capable fully self-driving systems, to move us toward safer, more... ...export, kernel development, and performance engineering so that every cycle on our accelerators translates... ...path fast, reliable, and effortless for ML engineers across the AV organization to...
Local area
Work from home
Relocation package
Flexible hours
General Motors
Sunnyvale, CA
5 days ago
Matterport - Senior ML Ops Engineer
$173k - $253k
...Senior MLOps Engineer Matterport is leading the digital transformation of the built world... ...production. You will work closely with ML R&D Engineers and other engineering teams... ...environments. ~ Familiarity with version control systems (e.g., Git) and agile development...
Work at office
Work from home
CoStar Group
Sunnyvale, CA
5 days ago
Data Scientist + ML Engineer - Gen AI
...Job Title : Data Scientist + ML Engineer (Gen AI) Location : Cupertino, CA - hybrid Duration : 12+ Months with Possible Extension Pay Range : $70 - $74.63/hr on W2 Job Summary Description: We are looking for a highly skilled Data Scientist...
Pride Global
Cupertino, CA
2 days ago
Senior ML Data Infrastructure Engineer
...TITLE: ML Data Infrastructure Engineer LOCATION: Sunnyvale CA or Remote Duration: 12+ Months Rate: DOE Key skills - GCP ML Infrastructure... ...ML data infrastructure platform . You'll create the systems and tools that enable efficient data preparation,...
Remote work
Redolent
Sunnyvale, CA
3 days ago
Director ML Engineering
$206.4k - $384.68k
...expanding into adjacent verticals. We are hiring a Director, ML Engineering to own the engineering function behind Firefly Foundry ’s... ...creative visions into reliable, high-performance ML systems that transform how content is conceived, produced, and delivered...
Temporary work
Local area
Worldwide
Adobe
San Jose, CA
2 days ago
Senior ML Infrastructure Engineer - Embodied AI Scaling Foundations
$153.2k - $234.1k
...-from breakthrough hardware and battery systems to intuitive design, intelligent software... ...infrastructure that powers every machine learning engineer working on our cutting-edge Autonomous... ...driverless vehicles. As a Senior ML Infra Engineer, you will build critical infrastructure...
Work at office
Local area
Remote work
Work from home
Relocation
Relocation package
Flexible hours
General Motors
Sunnyvale, CA
2 days ago
ML Data Infrastructure Engineer, Sunnyvale
...Job Title 7+ years of software engineering experience, with 3+ years in ML serving/infrastructure. Strong expertise in container orchestration (Kubernetes... ..., Triton, KServe). Deep knowledge of distributed systems and microservices architecture. Proficiency in Python...
Tranzeal
Sunnyvale, CA
2 days ago
Senior ML Infrastructure Engineer (Compute)
...development by prioritizing high-impact, ML-centric use cases. About the Role:... ...are seeking a Senior ML Infrastructure engineer to help build and scale robust Compute platforms... ...and running scalable distributed systems. They will rapidly test and promote ideas...
Local area
Work from home
General Motors
Sunnyvale, CA
3 days ago
Senior ML Infrastructure Engineer - Embodied AI
$153.2k - $234.1k
...tomorrow's standard -from breakthrough hardware and battery systems to intuitive design, intelligent software, and next-generation... ...vehicle behavior across real-world scenarios. As a Senior ML Infra Engineer, you will work on the core systems that enable rapid dataset...
Local area
Remote work
Work from home
Relocation package
Flexible hours
General Motors
Sunnyvale, CA
3 days ago
Staff ML Engineer - Embodied AI Offboard Perception
$189.3k - $320.7k
...into tomorrow's standard -from breakthrough hardware and battery systems to intuitive design, intelligent software, and next-generation... ...vehicle behavior across real-world scenarios. As a Staff ML Engineer on the Prometheus team within the Embodied AI organization,...
Local area
Remote work
Work from home
Relocation
Relocation package
Flexible hours
General Motors
Sunnyvale, CA
5 days ago
Staff Inference ML Runtime Engineer
...Cerebras Systems builds the world's largest AI chip, 56 times larger than GPUs. Our novel... ...learning users to effortlessly run large-scale ML applications, without the hassle of... ...About The Role The Inference ML Engineering team at Cerebras Systems is dedicated to...
CEREBRAS SYSTEMS INC.
Sunnyvale, CA
3 days ago
Senior ML Engineer - Model Compression
$128.7k - $261.3k
...partner closely with model developers and deployment and infra engineers to ship numerically robust, low-latency models to the car, blending... ..., Electrical Engineering, Physics, Mathematics, Data Science / ML, or a closely related quantitative field (or equivalent...
Local area
Remote work
Work from home
Relocation package
Flexible hours
General Motors
Sunnyvale, CA
5 days ago
SerDes IP Systems Engineering Director
$178.5k - $331.5k
...At Cadence, we hire and develop leaders and innovators who want to make an impact on the world of technology. SerDes IP Systems Engineering Director Description:???? This is a unique opportunity to join the rapidly growing Die-to-Die Interface IP team in the Silicon...
Cadence Design Systems
San Jose, CA
3 days ago
Systems Engineer IV, Optical Hardware
$161k - $221k
Who We Are Applied Materials is a global leader in materials engineering solutions used to produce virtually every new chip and advanced... ...or wherever you may go. Learn more about our benefits . As a Systems Engineer, you’ll design, integrate, and optimize complex systems...
Full time
Relocation
APPLIED MATERIALS
Santa Clara, CA
19 hours ago
Machine Learning Engineer
$150k
...-class researchers, data scientists, and engineers, tackling the most fundamental and impactful... ...into codebase. Develop and implement systems to support the lifecycle of machine... ...including experience with Machine Learning (ML) models, ML infrastructure, Natural Language...
Worldwide
Visa sponsorship
Institute of Foundation Models
Sunnyvale, CA
1 day ago
Distributed Machine Learning Engineer
$150k
...‑class researchers, data scientists, and engineers, tackling the most fundamental and impactful... ...pioneers. The Role The Distributed ML Engineer will play a role at the... ...the team to develop new and cutting‑edge systems. The ideal candidate will have a strong background...
Work experience placement
Visa sponsorship
Institute of Foundation Models
Sunnyvale, CA
1 day ago
Matterport - Senior ML Ops Engineer
$173k - $253k
Matterport - Senior ML Ops Engineer Job Description CoStar Group is a leading global provider of commercial and residential real estate... ...in cloud environments. Familiarity with version control systems (e.g., Git) and agile development methodologies. Excellent problem...
Full time
Work at office
Work from home
CoStar Group
Sunnyvale, CA
4 days ago
Senior AI/ML Engineer
$140k - $175k
...About the Role Knightscope is seeking two Senior AI/ML Engineers to own the machine learning detection pipelines running on the... ...Signals platform: a prioritization engine, pattern detection system, recommendation scorer, explain ability module, and continuous...
Full time
Relocation
Knightscope
Sunnyvale, CA
5 days ago
AI Engineer/ML Engineer - Senior Developers - AI Training - San Jose, USA
$80 per hour
...Job Description AI & Machine Learning Engineer - AI Training About Prolific Prolific is... ...experience building, deploying, or fine-tuning ML models in a production environment.... ...developing more accurate, nuanced, and aligned AI systems. We believe that the next leap in AI...
Hourly pay
Work from home
Flexible hours
Prolific Academic Ltd
San Jose, CA
24 days ago
Systems Sales Engineering Manager
...Systems Sales Engineering Manager Santa Clara, CA Forward Networks is transforming how the world's most complex networks are managed and secured. Founded in 2013 by four Stanford Ph.D.s, we built the industry's first network digital twin—a mathematically precise...
Work experience placement
Forward Networks Inc
Santa Clara, CA
4 days ago

Do you want to receive more vacancies?

Subscribe and receive similar vacancies to ML Systems Engineer. Be the first to apply!