ML Systems Engineer: Production-Scale LLM Inference

ScOp Venture Capital

ScOp Venture Capital is looking for an ML Systems Engineer to optimize the LLM inference systems at the core of their AI platform. The role focuses on improving performance and efficiency through low-level systems optimization, directly impacting the processes of industry leaders in semiconductor design. A successful candidate will have a strong background in ML systems and GPU optimization, along with programming skills in Python and C++. The position offers competitive compensation and professional growth within a leading AI-focused environment.

Vacancy posted 3 days ago

Similar jobs that could be interesting for you
Based on the ML Systems Engineer: Production-Scale LLM Inference in Santa Clara, CA vacancy

ML Systems Engineer
...generative AI to assist engineers in RTL design,... ...is deployed in production to companies that... ...We are seeking an ML Systems Engineer to optimize... ...large language model inference powering our... ...push the limits of LLM throughput and latency... ...with large‑scale ML systems, GPU computing...
Suggested
ScOp Venture Capital
Santa Clara, CA
3 days ago
Senior ML Systems Engineer — Production-Scale AI
...Learning to lead the development of critical ML subsystems for their AI product in Palo Alto, California. In this role,... ...end-to-end, collaborating closely with engineers and product teams to deliver high reliability in production systems. The ideal candidate has experience...
Suggested
A1
Palo Alto, CA
3 days ago
AIML - Full Stack ML Engineer, LLM Optimization
$170.7k - $300.2k
As a Full Stack ML Engineer on our team, you will leverage your background to: Design and... ...Innovate across the entire end-to-end ML production pipeline, including dataset creation,... ...optimization, on-device and server side inference Quickly prototype and iterate to achieve...
Suggested
Relocation
Apple Inc.
Cupertino, CA
3 days ago
ML and Agentic Systems Engineer
...108 Job Category: Engineering Time Type: Full time... ...building agentic systems that can reason... ...meta‑layer of modern ML: the agents,... ...developer and researcher productivity. Create self‑... ...and evolve large‑scale Python and PyTorch... ...Strong agency in LLM‑based systems, such...
Suggested
Full time
NVIDIA AI
Santa Clara, CA
3 days ago
AI/ML Engineer - Model Inference
$117.7k - $221.4k
...high-value data from large-scale real-world sensor streams.... ..., and developer productivity, building systems that make it easier to search... ...operating model reflects how Cola engineers think: build durable... ...processing, featurization, and inference foundations that power...
Suggested
Full time
Local area
Remote work
Work from home
Relocation package
Flexible hours
General Motors
Sunnyvale, CA
4 days ago
Senior Machine Learning Engineer, LLM
$180k - $339.25k
...Senior Machine Learning Engineer (LLM)! NVIDIA is seeking... ...) run experiments at scale, (3) generate... ...compression‑aware training and inference seamless. We own the... ...recipes Work with production SW teams to realize recipes... ...and ability to debug ML systems Experience with...
Work experience placement
NVIDIA
Santa Clara, CA
3 days ago
Inference Optimization ML Engineer
...hardware and robot systems to the... ...manufacturing scale‑up to make generalist... ...looking for an Inference Optimization MLE... ...efficiently in production. You'll be... ...with research engineers to translate model... ...optimization, ML systems, or a closely... ..., or other LLM serving optimizations...
Rhoda AI
Mountain View, CA
5 days ago
ML Engineer: LLM Research & Production
$181.1k - $272.1k
...ideal candidate will develop and fine-tune domain-specific LLMs for various NLP tasks and ensure the translation of product requirements into engineering tasks. This role offers a competitive salary between $181,100 and $272,100, comprehensive benefits, and opportunities...
Apple Inc.
Cupertino, CA
5 days ago
Real-Time ML Systems Engineer — Anomaly Detection & Streaming
NVIDIA seeks an ML Engineer to design and implement real-time ML algorithms that process millions of telemetry streams for an AI Data Center AIOps platform. You will build production-grade models and pipelines that detect anomalies and surface insights under tight CPU...
NVIDIA
Santa Clara, CA
3 days ago
Senior Inference Systems Engineer — Large-Scale GPUs
...infrastructure company in California is seeking a Member of Technical Staff — Inference to design and optimize large-scale AI inference systems. The role demands 5+ years in systems engineering and expertise in large-scale inference systems. Successful candidates will...
Flexible hours
RadixArk
Palo Alto, CA
1 day ago
RL Systems Engineer: Inference & Training at Scale
xAI is seeking an engineer for the RL infrastructure team to help with low precision RL training and inference. You will design and optimize the inference stack for RL... ...requires building and optimizing large-scale distributed systems and proficiency in Python, C++, or...
xAI
Palo Alto, CA
2 days ago
Staff ML Systems Engineer — Distributed Training at Scale
...California seeks a Member of Technical Staff — Training to design and optimize large-scale distributed training systems for frontier AI models. Candidates should have 5+ years of experience in ML systems and be proficient in Python along with another systems language, such...
RadixArk
Palo Alto, CA
4 days ago
Senior ML Training Systems Engineer, Large-Scale Robotics
Rhoda AI in Mountain View is seeking a Staff / Principal ML Training Systems Engineer to lead the performance of large-scale multimodal training systems. This role involves improving training efficiency and collaborating closely with research teams to accelerate model iteration...
Rhoda AI
Mountain View, CA
21 days ago
Full-Stack ML Systems Engineer
...infrastructure gap required to scale them efficiently,... ...cost-effectively in production. We bridge this exact gap by applying deep systems programming, software-... ...Austin and world-renowned ML systems researcher with... ...production experience engineering ML systems, OR a PhD...
Shift work
Success Matcher Recruitment
Sunnyvale, CA
11 days ago
Full-Stack ML Engineer for LLM Optimization & On-Device Inference
Apple Inc. is seeking a Full Stack ML Engineer to innovate and enhance machine learning solutions for its applications. The role involves designing and implementing solutions to optimize performance metrics such as runtime latency and decoding speed. Candidates should possess...
Apple
Cupertino, CA
3 days ago
PhD AI/ML Systems Engineer - Full-Stack ML at Scale
Gravity Engineering Services Pvt Ltd. is seeking a PhD Software Engineer... ...creativity are key. You will develop systems that impact billions of users, contributing to various product areas like AI, Cloud, YouTube,... ...skills, and experience in ML or AI technologies....
Gravity Engineering Services Pvt Ltd.
Sunnyvale, CA
3 days ago
Systems Engineer
$120k - $160k
...powered analytics to locate every product in a store, continuously,... ...environments - at fleet scale. RADAR is one of the best-funded... ...ABOUT THE JOB Radar is hiring a Systems Engineer who will take end-to-end... ...artificial intelligence or an LLM such as ChatGPT during the interview...
Radar
Sunnyvale, CA
4 days ago
ML Platform Engineer - Petabyte-Scale Data Systems
...the infrastructure for search features, impacting millions of users. You will handle large-scale data and be part of a collaborative team dedicated to creating innovative products. The ideal candidate has a computer science background, strong coding skills, and...
Apple
Cupertino, CA
3 days ago
Staff ML Software Engineer L6 Platform Systems AIMS Engineering
...AI for Member Systems (AIMS) runs the... ...doing so. But AI/ML is moving fast,... ...AI at this scale. Migrating to a... ...Systems is the engineering foundation of AIMS... ...with dozens of production models in flight... ..., applying LLM-based systems to... ...serving and online inference at high-traffic...
Hourly pay
Full time
Immediate start
Flexible hours
Netflix
Los Gatos, CA
3 days ago
Senior ML Engineer, Production Ads & Knowledge Graph
...reliable on-demand, logistics engine for delivery. We are... ...build and maintain a large scale 24x7 global infrastructure system that powers DoorDash's 3-sided... ...catalog system and our product knowledge graph at the heart... ...to implement new ML solutions to make our product...
Full time
Temporary work
Work at office
Local area
Remote work
DoorDash
Sunnyvale, CA
3 days ago
ML Engineer, Inference & Optimization
About the Role We are seeking Inference Engineers to accelerate the performance of Pika's AI-driven products. In this highly technical role, you... ...efficiency, ensuring our creative AI systems deliver industry-leading user experiences at scale. You will design and optimize...
Work at office
3 days per week
Pika
Palo Alto, CA
3 days ago
Founding ML Infra Engineer — Production-Grade LLMs
An innovative AI startup is seeking a Founding ML Infrastructure Engineer to take charge of deploying and optimizing production-grade LLM systems. In this core role, you will be responsible for building and managing a full ML serving stack, working closely with product...
Realmlabs
Sunnyvale, CA
3 days ago
Senior Machine Learning Engineer
$194k - $214k
...attempts—before mass production. This lack of... ...customer‑centric Senior ML Engineer who will join our... ...including DL‑ and LLM‑based approaches,... ...reliably work at scale. What You'll Do... ...of large‑scale ML systems, all the way to surfacing... ..., deployment, and inference at scale with...
Instrumental Inc.
Palo Alto, CA
3 days ago
ML Engineer
Job Title: ML Engineer What You Will Own End‑to‑End ML Lifecycle across real products: data ingestion, feature design... ...Production‑grade ML systems built with PyTorch or... .... Applied GenAI and LLM work that creates measurable... ..., maintaining and scaling ML models in...
MetAntz
Palo Alto, CA
4 days ago
ML Systems Engineer, Physical AI
Scale our Ray + PyTorch infrastructure for the multimodal... .... 3+ yrs distributed systems / ML infra. About Orbifold... ...a Machine Learning Engineer to scale and optimize... ...training jobs and inference deployments to maximize... ...let our researchers and product engineers deploy...
Orbifold AI
Palo Alto, CA
5 days ago
3D ML Systems Engineer: Low-Level Training & Quantization
...distributed training focussed Machine Learning Engineer to train, accelerate and deploy... ...foundation models for a $50m+ funded later stage scale up, building the world's leading 3D... ...optimize the world's largest 3D native ML systems, working from lower levels, building an...
Relocation package
Acceler8 Talent
Santa Clara, CA
5 days ago
Senior ML Engineer - Simulation & Production Systems
$204k - $259k
...seeking researchers and software engineers to develop production-grade machine learning systems for autonomous vehicles. The role involves working with large scale data systems and innovative algorithms... ...2 years of experience in building ML models. The expected salary for...
Full time
Waymo
Mountain View, CA
3 days ago
ML Systems Engineer: Production-Ready AI
...focuses on developing core machine learning components in real production systems, assisting with data pipelines, and implementing evaluation... ...in machine learning, some hands-on experience with deploying ML models, and a bias toward shipping and continuous improvement...
A1 Services
Palo Alto, CA
5 days ago
Senior ML Platform Engineer - Scale GPU Inference
A leading AI company in California seeks a skilled engineer to develop foundational infrastructure for large-scale multimodal AI models. You will architect model serving pipelines, manage scheduling systems for GPU resources, and own CI/CD pipelines for checkpoints. Ideal...
Luma AI
Palo Alto, CA
3 days ago
Global Factory Systems Engineering Manager - Diagnostics
$224k - $356.5k
.... We are the Datacenter System Software team, and we are... ...motivated, creative Engineering Manager to drive Factory... ...NVIDIA's GPU- and DPU-based products. This includes tightly coupled rack-scale systems such as GB200/GB... ...record of integrating AI/LLM tooling into engineering...
Shift work
NVIDIA
Santa Clara, CA
3 days ago
