Senior ML Systems Engineer - LLM Infra & Governance
TRM Labs
A tech-driven company focused on blockchain solutions is seeking a Senior ML Systems Engineer. In this role, you will build reusable workflows, automate model versioning, and deploy scalable AI systems. Candidates should have strong programming skills, experience with scalable infrastructure, and a deep understanding of ML Ops best practices. The position offers a competitive salary range with the opportunity to contribute to significant national security initiatives in a fast-paced environment.
- ...company in San Francisco is looking for a Senior Software Engineer to build scalable infrastructure for... .... You will design distributed training systems and optimize GPU utilization while collaborating... ...have over 5 years of experience in ML infrastructure and a strong background... (Senior)
$295k - $380k
OpenAI is searching for a Senior Software Engineer to join their Robotics team in San Francisco. The role focuses on maintaining and improving... ...while actively reviewing and debugging code within ML systems. The ideal candidate should thrive in hands-on settings, possess... (Senior)
- ...the deterministic governance layer that... ..., model-agnostic system that enforces policy... ...the gap between LLM capabilities and... ...fundamentals?" CTGT's Senior Machine Learning Engineer will operate deep... ...databases Infra: Docker, Kubernetes... ...customer VPCs ML: Self hosted... (Suggested)
- ...Francisco is seeking an experienced Software Engineer to develop machine learning infrastructure for monetization and ads systems. The role involves building data pipelines, creating... ..., particularly in distributed systems and ML workflows. Join us in shaping the future of... (Senior)
- HopHR is building its Silicon Valley engineering team in San Francisco and seeks a skilled AI/ML Engineer. This role focuses on developing production-grade AI systems and involves significant responsibilities in building LLM pipelines and optimizing RAG architectures. The... (Senior, Work at office)
- A leading AI research company in San Francisco seeks Senior/Staff Engineers skilled in distributed systems and large-scale ML training. Responsibilities include designing systems optimized for low-bandwidth conditions and implementing robust training strategies. Ideal... (Senior, Remote job)
- MakerMaker.AI is seeking a Senior ML Engineer in San Francisco. In this role, you will build and maintain machine learning systems and pipelines for research purposes, ensuring accurate and reliable results. You will lead the development from prototype to production, collaborating... (Senior)
- ...Member of Technical Staff to design and optimize inference systems. The role involves managing KV cache allocation and... ...components. Ideal candidates should have strong software engineering skills and experience with ML inference systems, particularly in Python and C++. This... (Senior)
- ...Francisco is seeking a specialist to design and operate large-scale GPU infrastructure. This role requires expertise in deploying GPU systems for high-throughput inference and model performance optimization. The ideal candidate will have hands-on experience with modern... (Senior)
- A leading AI infrastructure company is seeking a Senior ML Performance Engineer to design a comprehensive performance testing platform for large language models. This role requires a minimum of 7 years in performance engineering and strong experience with GPU programming... (Senior)
- ...on cutting-edge AI research and development. The role involves building and scaling training and inference infrastructure, designing ML kernels, and optimizing performance. Ideal candidates should have a passion for addressing ambitious challenges at the intersection of...
- ...Member of Technical Staff focused on building and optimizing ML inference systems in San Francisco. The role involves designing end-to-end... ...real-world workloads. Candidates should have strong software engineering skills, experience with ML inference systems, and...
- A pioneering AI company in the San Francisco Bay Area is seeking an ML Ops Engineer to automate model training, deployment, and governance processes. The ideal candidate will have extensive MLOps experience and be proficient in tools like Kubernetes and Terraform. This...
$172.5k - $210k
A cutting-edge AI infrastructure firm located in San Francisco is seeking a Senior Systems Performance Engineer. This role involves leading hardware evaluations and optimizing AI systems for performance. Candidates should have over 5 years of experience, proficiency in... (Senior)
$200k
A tech startup in San Francisco is seeking senior or staff-level engineers to design and maintain their global cloud infrastructure. Ideal candidates... ...engineering experience, a strong understanding of systems at scale, and a passion for optimizing performance and reliability... (Senior, Full time)
- ...Walrus, a decentralized storage network, ensuring high standards of reliability and performance. Candidates should have over 5 years of systems/network programming experience, with skills in Rust, C, or C++. A Bachelor’s degree in Computer Science or related field is... (Senior)
- MakerMaker.AI is looking for a Senior Machine Learning Systems Engineer in San Francisco. In this role, you will build and operate production inference systems, optimizing for performance and reliability. The ideal candidate will have 3+ years of experience in production... (Senior)
- A pioneering tech startup in neurotechnology is seeking a Senior Machine Learning Infrastructure Engineer to design and scale critical infrastructure powering ML applications. This role involves creating robust data pipelines and optimizing modeling processes, essential... (Senior)
- ...and real-world responsibility in mind. Our ML team comes from a culture of academic... ...frontier research for their next generation of LLM products. Join us if you: Wish to... ...attacks. Collaborate with our engineering team to deliver real-world applications of... (Local area, Shift work)
- ...integration into real-world systems, with the observability, manageability... ...Gentoro helps teams enforce governance, maintain auditability, and... ...are looking for a visionary Senior ML Engineer who will bridge the gap... ...training or fine-tuning LLM models, embeddings; building... (Senior, Shift work)
- ...Senior ML Engineer Highlight is building a shared intelligence layer for the modern workforce... ...Senior ML Engineer to help build the AI systems that power Highlight. You will work across... ...the engineering org Stay current on LLM advances, retrieval techniques, and ML... (Senior, Work at office, Relocation, Relocation package, Flexible hours)
$240k
...The Team: Convex has assembled a team of engineers who have built and designed some of the... ...team has a lot of experience running large systems at scale, and as our customers and... ...experience designing and operating large-scale infra, we’d love to talk! This is a hands-on role... (Senior, Full time, Work at office, Remote work, Shift work, Night shift)
$170k - $220k
...significant ownership in designing and developing high-performance systems for LLMs, focusing on distributed systems and multi-GPU workloads. The ideal candidate should have 2+ years of backend engineering experience and strong Python skills. This full-time position... (Full time)
- A leading tech company in San Francisco seeks a Machine Learning Engineer to build and maintain infrastructure for large-scale model training. In this hands-on role, you will design systems, work closely with researchers, and optimize training processes. Candidates should...
- ...foundation in low-level operating systems concepts including multi-... ...systems like TGI, vLLM, TensorRT-LLM, and Optimum, and... ...contributions and staying current with ML infrastructure developments... ...this usually requires a large engineering effort dedicated to building specialized... (Work at office)
- ...growing AI company in San Francisco, is hiring a Machine Learning Infra Engineer. This role involves building and maintaining the training and... ...should possess strong Python skills, have a background in systems engineering, and experience with Kubernetes. The position...
- Position: Senior ML Performance Engineer Location: SF Bay Area (US) or Toronto (Canada) - Hybrid Employment... ...: AI Infrastructure / Compiler Systems Overview A venture-backed AI infrastructure... ...performance testing platform for LLM inference workloads across GPU clusters... (Senior, Full time)
$189.72k - $332.01k
A leading social media platform based in Palo Alto is seeking a Machine Learning Engineer. The role involves building innovative systems using deep learning and machine learning, improving their models across various product areas, and utilizing data-driven methods for... (Senior)
- A government services organization based in San Francisco is seeking a Senior Systems Administrator to support network systems and implement Automated Litigation Support software solutions. The ideal candidate will have extensive experience in implementing litigation support... (Senior)
- A leading streaming service is seeking a Staff Software Engineer to enhance ML infrastructure. The role involves designing scalable systems, mentoring engineers, and collaborating with cross-functional teams. Candidates should have over 8 years of experience in building...

