ML Systems Engineer

$150k - $350k
Full-time

ChipAgents

About ChipAgents

ChipAgents is redefining the future of chip design and verification with agentic AI workflows. Our platform leverages cutting-edge generative AI to assist engineers in RTL design, simulation, and verification, dramatically accelerating chip development. Founded by experts in AI and semiconductor engineering, we partner with top semiconductor firms, cloud providers, and innovative startups to build intelligent AI agents. The company is a Series A company backed by tier-1 VC firms. ChipAgents is deployed in production to companies that have shipped 16B chips.

Position Overview

We are seeking an ML Systems Engineer to optimize the performance and efficiency of large language model inference powering our agentic AI platform. This is a technical role focused on low-level systems optimization. You will implement performance optimizations, build evaluation harnesses, and architect multi-node clusters for training and inference that push the limits of LLM throughput and latency. Your work will directly impact the responsiveness and cost-efficiency of AI agents used by leading semiconductor companies to design chips.

Key Responsibilities
  • Design, deploy, and optimize LLM inference systems across multi-node clusters, maximizing throughput and minimizing latency for production workloads.
  • Implement and benchmark concrete inference optimizations.
  • Profile and analyze inference bottlenecks at the systems level, from GPU kernel execution to memory bandwidth constraints.
  • Build robust evaluation harnesses and benchmarking frameworks that measure accuracy, throughput, latency, and resource utilization across various parallelism strategies.
  • Collaborate with research scientists to integrate new model architectures and optimizations into production inference infrastructure.
  • Investigate and apply emerging techniques from research papers and open-source projects to continuously improve inference performance.

Qualifications
  • B.S., M.S., or PhD in Computer Science, Electrical Engineering, or related field (or equivalent experience).
  • Experience with large-scale ML systems, GPU computing, or high-performance inference optimization.
  • Strong proficiency in Python and C++/CUDA; hands-on experience with SGLang, vLLM, PyTorch, or similar inference frameworks.
  • Deep understanding of GPU architecture, memory hierarchies, and parallel computing paradigms.
  • Experience deploying and optimizing LLMs in production: model serving, batching strategies, distributed inference, or quantization.
  • Strong systems-level debugging and profiling skills; comfort working at multiple layers of the stack from CUDA kernels to application logic.
  • Familiarity with distributed computing frameworks (Ray, multi-node training/inference) is a plus.
  • Self-directed problem solver who is interested in working on ambitious optimization challenges.

Why Join Us
  • Work on cutting-edge LLM inference optimization problems with real-world production impact.
  • Access to substantial GPU compute resources for experimentation and benchmarking.
  • Collaborate with a world-class team spanning AI research, systems engineering, and EDA.
  • Shape the performance characteristics of AI systems used by leading semiconductor companies.

What we offer
  • $150K/yr – $350K/yr + Offers Equity. We are open to discuss above-scale compensation with exceptional candidates on a case-by-case basis.
  • Unlimited PTO and full benefits (medical, vision, dental, 401k).
  • Two engineering-centric offices with free parking, private gym, and free lunch, drinks and snacks.

Vacancy posted 4 days ago
Similar jobs that could be interesting for you
Based on the ML Systems Engineer in San Jose, CA vacancy
  •  ...financial technology firm in San Jose seeks a Staff Machine Learning Engineer to develop and implement production-scale machine learning...  ...machine learning, strong Python skills, and proficiency with major ML libraries. Responsibilities include designing models for... 
    Suggested

    Venmo

    San Jose, CA
    1 hour ago
  •  ...: a hardware-agnostic platform that makes every system — from a laptop to a supercomputer — feel like one seamless engine. Developers can write once, run anywhere, and get...  .... About the Role We're looking for a Senior ML Performance Engineer to architect and lead our... 
    Suggested

    Lemurian Labs

    Santa Clara, CA
    25 days ago
  • $172.5k - $306.63k

     ...Senior Machine Learning Engineer At Adobe's Experience Platform, we are looking for a Senior...  ..., and operate scalable intelligent AI systems that power end-user AI products. You will...  .... Establish safety, governance, and ML Ops guidelines (guardrails, observability... 
    Suggested
    Temporary work
    Local area
    Worldwide

    Adobe

    San Jose, CA
    3 days ago
  • $240k - $275k

     ...layer. We are looking for a Staff Autonomy Engineer to own this layer end to end. You will...  ...shipping autonomy, planning, or decision-making systems to production in robotics, autonomous...  ...learned autonomy systems, or large-scale ML for real-time decision making. ~ Hands... 
    Suggested
    Full time

    Knightscope

    Sunnyvale, CA
    4 days ago
  •  ...ML Engineer / Generalist HypeLab is a small, profitable ad network operating at real marketplace scale. We process more than 1B ad requests...  ...with ML and data at the center of it. You will work on the systems that decide which ads we show, how we bid, how we predict... 
    Suggested

    Hypelab

    San Jose, CA
    2 days ago
  • $159.3k - $230.7k

     ...-from breakthrough hardware and battery systems to intuitive design, intelligent software...  ...The team directly works on and delivers ML models to the product that successively go...  ...collaborative, high-impact team of AI/ML engineers, data scientists and engineers who are passionate... 
    Local area
    Remote work
    Work from home
    Relocation package
    Flexible hours

    General Motors

    Sunnyvale, CA
    2 days ago
  •  ...Job Title: ML Engineers - with LLM GenAI (3 Resources) Responsibilities Write efficient machine learning workflows and pipelines Training pipeline - Ingest/Preprocess/Vectorize and index data Inference pipeline - AI Guided workflow to respond to user... 
    Work experience placement

    Sparktek

    San Jose, CA
    5 days ago
  • $128.7k - $261.3k

     ...including Level 4–capable fully self-driving systems, to move us toward safer, more...  ...export, kernel development, and performance engineering so that every cycle on our accelerators translates...  ...that sit at the heart of our on‑vehicle ML inference for ADAS and autonomous... 
    Local area
    Work from home
    Relocation package
    Flexible hours

    General Motors

    Sunnyvale, CA
    4 days ago
  •  ...ML Engineer Santa Clara, California, United States About the Job Our client is a rapidly growing Tier 1 VC backed startup based...  ...long-term growth trajectory in the evolving world of intelligent systems. Location New York, NY Work Type Full Time... 
    Full time

    Catalyst Labs, LLC

    Santa Clara, CA
    3 days ago
  •  ...Software Engineer Applied Intuition, Inc. is powering the future of physical AI. Founded...  ...areas: tools and infrastructure, operating systems, and autonomy. Eighteen of the top 20...  ...engineer with deep experience in optimizing ML models and deploying them on production-... 
    For contractors
    For subcontractor
    Casual work
    Work at office
    Remote work
    Day shift

    Applied Intuition

    Sunnyvale, CA
    1 day ago
  • $128.7k - $261.3k

     ...including Level 4-capable fully self-driving systems, to move us toward safer, more...  ...export, kernel development, and performance engineering so that every cycle on our accelerators translates...  ...path fast, reliable, and effortless for ML engineers across the AV organization to... 
    Local area
    Work from home
    Relocation package
    Flexible hours

    General Motors

    Sunnyvale, CA
    5 days ago
  • $173k - $253k

     ...Senior MLOps Engineer Matterport is leading the digital transformation of the built world...  ...production. You will work closely with ML R&D Engineers and other engineering teams...  ...environments. ~ Familiarity with version control systems (e.g., Git) and agile development... 
    Work at office
    Work from home

    CoStar Group

    Sunnyvale, CA
    5 days ago
  •  ...Job Title : Data Scientist + ML Engineer (Gen AI) Location : Cupertino, CA - hybrid Duration : 12+ Months with Possible Extension Pay Range : $70 - $74.63/hr on W2 Job Summary Description: We are looking for a highly skilled Data Scientist... 

    Pride Global

    Cupertino, CA
    2 days ago
  •  ...TITLE: ML Data Infrastructure Engineer LOCATION: Sunnyvale CA or Remote Duration: 12+ Months Rate: DOE Key skills - GCP ML Infrastructure...  ...ML data infrastructure platform . You'll create the systems and tools that enable efficient data preparation,... 
    Remote work

    Redolent

    Sunnyvale, CA
    3 days ago
  • $206.4k - $384.68k

     ...expanding into adjacent verticals. We are hiring a Director, ML Engineering to own the engineering function behind Firefly Foundry ’s...  ...creative visions into reliable, high-performance ML systems that transform how content is conceived, produced, and delivered... 
    Temporary work
    Local area
    Worldwide

    Adobe

    San Jose, CA
    2 days ago
  • $153.2k - $234.1k

     ...-from breakthrough hardware and battery systems to intuitive design, intelligent software...  ...infrastructure that powers every machine learning engineer working on our cutting-edge Autonomous...  ...driverless vehicles. As a Senior ML Infra Engineer, you will build critical infrastructure... 
    Work at office
    Local area
    Remote work
    Work from home
    Relocation
    Relocation package
    Flexible hours

    General Motors

    Sunnyvale, CA
    2 days ago
  •  ...Job Title 7+ years of software engineering experience, with 3+ years in ML serving/infrastructure. Strong expertise in container orchestration (Kubernetes...  ..., Triton, KServe). Deep knowledge of distributed systems and microservices architecture. Proficiency in Python... 

    Tranzeal

    Sunnyvale, CA
    2 days ago
  •  ...development by prioritizing high-impact, ML-centric use cases. About the Role:...  ...are seeking a Senior ML Infrastructure engineer to help build and scale robust Compute platforms...  ...and running scalable distributed systems. They will rapidly test and promote ideas... 
    Local area
    Work from home

    General Motors

    Sunnyvale, CA
    3 days ago
  • $153.2k - $234.1k

     ...tomorrow's standard -from breakthrough hardware and battery systems to intuitive design, intelligent software, and next-generation...  ...vehicle behavior across real-world scenarios. As a Senior ML Infra Engineer, you will work on the core systems that enable rapid dataset... 
    Local area
    Remote work
    Work from home
    Relocation package
    Flexible hours

    General Motors

    Sunnyvale, CA
    3 days ago
  • $189.3k - $320.7k

     ...into tomorrow's standard -from breakthrough hardware and battery systems to intuitive design, intelligent software, and next-generation...  ...vehicle behavior across real-world scenarios. As a Staff ML Engineer on the Prometheus team within the Embodied AI organization,... 
    Local area
    Remote work
    Work from home
    Relocation
    Relocation package
    Flexible hours

    General Motors

    Sunnyvale, CA
    5 days ago
  •  ...Cerebras Systems builds the world's largest AI chip, 56 times larger than GPUs. Our novel...  ...learning users to effortlessly run large-scale ML applications, without the hassle of...  ...About The Role The Inference ML Engineering team at Cerebras Systems is dedicated to... 

    CEREBRAS SYSTEMS INC.

    Sunnyvale, CA
    3 days ago
  • $128.7k - $261.3k

     ...partner closely with model developers and deployment and infra engineers to ship numerically robust, low-latency models to the car, blending...  ..., Electrical Engineering, Physics, Mathematics, Data Science / ML, or a closely related quantitative field (or equivalent... 
    Local area
    Remote work
    Work from home
    Relocation package
    Flexible hours

    General Motors

    Sunnyvale, CA
    5 days ago
  • $178.5k - $331.5k

     ...At Cadence, we hire and develop leaders and innovators who want to make an impact on the world of technology. SerDes IP Systems Engineering Director Description: This is a unique opportunity to join the rapidly growing Die-to-Die Interface IP team in the Silicon... 

    Cadence Design Systems

    San Jose, CA
    3 days ago
  • $161k - $221k

    Who We Are Applied Materials is a global leader in materials engineering solutions used to produce virtually every new chip and advanced...  ...or wherever you may go. Learn more about our benefits . As a Systems Engineer, you’ll design, integrate, and optimize complex systems... 
    Full time
    Relocation

    APPLIED MATERIALS

    Santa Clara, CA
    19 hours ago
  • $150k

     ...-class researchers, data scientists, and engineers, tackling the most fundamental and impactful...  ...into codebase. Develop and implement systems to support the lifecycle of machine...  ...including experience with Machine Learning (ML) models, ML infrastructure, Natural Language... 
    Worldwide
    Visa sponsorship

    Institute of Foundation Models

    Sunnyvale, CA
    1 day ago
  • $150k

     ...‑class researchers, data scientists, and engineers, tackling the most fundamental and impactful...  ...pioneers. The Role The Distributed ML Engineer will play a role at the...  ...the team to develop new and cutting‑edge systems. The ideal candidate will have a strong background... 
    Work experience placement
    Visa sponsorship

    Institute of Foundation Models

    Sunnyvale, CA
    1 day ago
  • $173k - $253k

    Matterport - Senior ML Ops Engineer Job Description CoStar Group is a leading global provider of commercial and residential real estate...  ...in cloud environments. Familiarity with version control systems (e.g., Git) and agile development methodologies. Excellent problem... 
    Full time
    Work at office
    Work from home

    CoStar Group

    Sunnyvale, CA
    4 days ago
  • $140k - $175k

     ...About the Role Knightscope is seeking two Senior AI/ML Engineers to own the machine learning detection pipelines running on the...  ...Signals platform: a prioritization engine, pattern detection system, recommendation scorer, explain ability module, and continuous... 
    Full time
    Relocation

    Knightscope

    Sunnyvale, CA
    5 days ago
  • $80 per hour

     ...Job Description AI & Machine Learning Engineer - AI Training About Prolific Prolific is...  ...experience building, deploying, or fine-tuning ML models in a production environment....  ...developing more accurate, nuanced, and aligned AI systems. We believe that the next leap in AI... 
    Hourly pay
    Work from home
    Flexible hours

    Prolific Academic Ltd

    San Jose, CA
    24 days ago
  •  ...Systems Sales Engineering Manager Santa Clara, CA Forward Networks is transforming how the world's most complex networks are managed and secured. Founded in 2013 by four Stanford Ph.D.s, we built the industry's first network digital twin—a mathematically precise... 
    Work experience placement

    Forward Networks Inc

    Santa Clara, CA
    4 days ago
