Machine Learning Engineer: LLM Interpretability & Systems

CTGT

About CTGT & The Mission

Despite massive investment in commercial AI, organizations often struggle to demonstrate value, primarily because of the risk inherent in the non-deterministic behavior of generative models. CTGT is the deterministic governance layer that enables the most important global institutions to deploy AI workflows with confidence.

Born out of Stanford University research, we provide the control plane that makes this possible: a lightweight, model-agnostic system that enforces policy, prevents drift, and produces auditable decisions in real time.

CTGT sits at the edge of AI research while bringing frontier intelligence into real-world environments. We apply cutting-edge theory directly in production to make large language models more reliable, controllable, and performant in practice.

Our mission is to bring models to the level of performance and accountability required by the Fortune 500. By bridging the gap between LLM capabilities and domain-specific requirements, we unlock the true potential of generative AI to solve the most pressing problems in our world today.

The Role

A new open-source model is released and you are compelled to reach inside and understand how it actually works. You instinctively try to push it beyond what most people say is already impressive. You observe model behavior and think not "What's a better prompt?" but "How do I improve its fundamentals?"

CTGT's Senior Machine Learning Engineer will operate deep within the model stack, working directly with weights, activations, and architectures to build the systems that make AI governance deterministic. Your work powers the Policy Engine, the core technology that gives enterprises real-time, auditable control over model behavior in production. Your mandate is simple to state but difficult to execute: determine how a model can be improved for a specific purpose and build the systems that operationalize that improvement within our platform.

Rather than simply using models, you will probe the mechanics of their cognition.

What You Will Do

Take ideas from mechanistic interpretability and related work and turn them into code that runs in production.
Work directly with model internals to improve behavior and performance across commercial and open-source models.
Leverage techniques like activation patching, control vectors, and feature extraction to achieve targeted, repeatable improvements in model output.
Build the evaluation and deployment loops needed to ship changes reliably into enterprise environments.
Design and optimize the feature-level intervention systems that enable deterministic policy enforcement at inference time.

Who You Are

Have a strong understanding of Transformer architectures, PyTorch internals, and the mathematical foundations of deep learning.
Have trained, fine-tuned, or optimized models beyond superficial augmentation.
Can read a paper, decide what matters, and implement it.
Notice when something is not working and take ownership of fixing it.
Are motivated by the challenge of making large language models reliable and controllable enough for the highest-stakes enterprise applications.

What We Offer

Compensation & Equity: Competitive base compensation, plus significant equity in a venture-backed company with institutional investors including Google's Gradient Ventures, General Catalyst, and Y Combinator. We want people who think and act like owners.

Real Impact: You will work directly on the core systems that determine how models perform in the wild. Your work ships into real, high-stakes environments where governance, auditability, and performance are non-negotiable.

Autonomy & Trust: We operate with a high degree of trust. You are expected to form strong technical opinions and execute on them.

Vacancy posted 2 days ago
