Senior or Staff AI Infrastructure Engineer
$200k - $240kTRM Labs
Build a Safer World. TRM Labs provides blockchain analytics and AI solutions to help law enforcement and national security agencies, financial institutions, and cryptocurrency businesses detect, investigate, and disrupt crypto-related fraud and financial crime. TRM’s blockchain intelligence and AI platforms include solutions to trace the source and destination of funds, identify illicit activity, build cases, and construct an operating picture of threats. TRM is trusted by leading agencies and businesses worldwide who rely on TRM to enable a safer, more secure world for all. The AI Engineering Team is chartered with enabling next-generation AI applications , with a special focus on Large Language Models (LLMs) and agentic systems. Our mission is to build robust pipelines, high-performance infrastructure, and operational tooling that allow AI systems to be deployed with speed, safety, and scale. We manage petabyte-scale pipelines, serve models with millisecond-level latency, and provide the observability and governance needed to make AI production-ready. We’re also deeply involved in evaluating and integrating cutting-edge tools in the LLM and agent space — including open-source stacks, vector databases, evaluation frameworks, and orchestration tools that unlock TRM’s ability to innovate faster than the market. As a Senior or Staff AI Infrastructure Engineer , you’ll be at the core of building and scaling the technical infrastructure for AI/ML systems. You will: Build reusable CI/CD workflows for model training, evaluation, and deployment — integrating Langfuse, GitHub Actions, and experiment tracking, etc. Automate model versioning, approval workflows, and compliance checks across environments. Build out a modular and scalable AI infrastructure stack — including vector databases, feature stores, model registries, and observability tooling. Partner with engineering and data science to embed AI models and agents into real-time applications and workflows. Continuously evaluate and integrate state-of-the-art AI tools (e.g. LangChain, LlamaIndex, vLLM, MLflow, BentoML, etc.). Drive AI reliability and governance, enabling experimentation while ensuring compliance, security, and uptime. Build and enhance AI/ML Model Performance Ensure data accuracy, consistency and reliability, leading to better model training and inferencing Deploy infrastructure to support offline and online evaluation of LLMs and agents — including regression testing, cost monitoring, and human-in-the-loop workflows. Enable researchers to iterate quickly by providing sandboxes, dashboards, and reproducible environments. What We’re Looking For Write high-quality, maintainable software — primarily in Python, but we value engineering ability over language familiarity. Have a strong background in scalable infrastructure , including: Containerization and orchestration (e.g. Docker, Kubernetes) Infrastructure-as-code and deployment (e.g. Terraform, CI/CD pipelines) Monitoring and logging frameworks (e.g. Datadog, Prometheus, OpenTelemetry) Understand and implement ML Ops best practices , including: Model versioning and rollback strategies Automated evaluation and drift detection Scalable model and agent serving infrastructure (e.g. vLLM, Triton, BentoML) Deploy and maintain LLM and agentic workflows in production, including: Monitoring cost, latency, and performance Capturing traces for analysis and debugging Optimizing prompt/response flows with real-time data access Demonstrate strong ownership and pragmatism , balancing infrastructure elegance with iterative delivery and measurable impact. Learn about TRM Speed in this position: Rapid Issue Resolution. TRM Engineers identify and resolve critical onsite issues in minutes to hours, not weeks. We create virtual war rooms, implement fixes, and share lessons with both customer stakeholders and internal teams within 48 hours. Navigating Bureaucracy. We anticipate and address procedural hurdles, build trust with key stakeholders, and find alternative pathways to approvals. This keeps projects moving even in complex environments. Efficient Knowledge Transfer. Engineers document and share updates in real time, ensuring the entire team—onsite and remote—has full visibility into plans, blockers, and resolutions. Knowledge sharing sessions and clear documentation reduce friction and accelerate delivery. About TRM's Engineering Levels: Engineer: Responsible for helping to define project milestones and executing small decision decisions independently with the appropriate tradeoffs between simplicity, readability, and performance. Provides mentorship to junior engineers, and enhances operational excellence through tech debt reduction and knowledge sharing. Senior Engineer: Successfully designs and documents system improvements and features for an OKR/project from the ground up. Consistently delivers efficient and reusable systems, optimizes team throughput with appropriate tradeoffs, mentors team members, and enhances cross-team collaboration through documentation and knowledge sharing. Staff Engineer: Drives scoping and execution of one or more OKRs/projects that impact multiple teams. Partners with stakeholders to set the team vision and technical roadmaps for one or more products. Is a role model and mentor to the entire engineering organization. Ensures system health and quality with operational reviews, testing strategies, and monitoring rigor. The following represents the expected range of compensation for this role: Individual pay is determined by skills, qualifications, experience, and location. The compensation details listed in this posting reflect the US base salary only. The estimated base salary range for this role is $200,000 - $240,000. Additionally, this role may be eligible to participate in TRM’s equity plan. Please note – we factor in the different costs for geographies outside the United States. Life at TRM We are building a safer world. That promise shows up in how we work every day. TRM moves quickly. We are a high velocity, high ownership team that expects clarity, follow-through, and impact. People who thrive here are energized by hard problems, experimentation, and continuous feedback. If something takes months elsewhere, it will ship here in days. Our work sits at the intersection of AI, national security, and fighting financial crime. The problems are complex, the stakes are real, and the environment evolves quickly. The pace and intensity of the work reflect the importance of the mission. As a result, the way we operate requires a high level of ownership, adaptability, collaboration, and creative problem-solving. At TRM, you should expect: Priorities and targets to change quickly as we experiment and iterate Work that often requires operating with a high degree of ambiguity A high level of personal ownership and accountability Close collaboration across teams and functions Frequent, high-touch communication • Creative problem solving and out-of-the-box thinking A pace that rewards urgency, adaptability, and outcomes This environment is energizing for people who enjoy building, solving hard problems, and making progress in situations that are not always fully defined. It also requires comfort navigating ambiguity, adjusting course as new information emerges, and maintaining focus and positivity in a fast-moving and intense environment. We also recognize that this style of operating is not for everyone. If you are primarily optimizing for predictability or a consistently balanced workload, we encourage you to use the interview process to pressure test whether this environment is truly the right fit. We want teammates who thrive here, not just survive here. At the same time, many people find this work deeply rewarding. If you are excited by meaningful problems, motivated by ambitious goals, and energized by working alongside mission-driven colleagues, there is a good chance you will find TRM to be an exceptional place to grow and contribute. Learn more: Interviewing at TRM: How We Hire and What Success Looks Like AI Fluency at TRM AI fluency is a baseline expectation at TRM. We believe AI meaningfully changes how top performers operate. We expect every team member to use AI to accelerate and reimagine their craft, not just automate surface tasks. At TRM, AI fluency means you are among the top 10 percent of operators in your function in how you apply AI to: Accelerate repeatable workflows Structure and solve problems Improve output quality Increase speed and leverage You will be evaluated on applied AI fluency during the interview process. Leadership Principles We hire and grow against three leadership principles. They’re the standards for how we operate, treat each other, and make decisions. Impact-Oriented Trailblazer: We put customers first and move with speed, focus, and adaptability. We treat every plan like an experiment – test, ship, measure, and iterate quickly. Master Craftsperson: We care deeply about our craft. We balance speed with high standards, own outcomes end‑to‑end, and invest in getting better everyday. Inspiring Colleague: We add clarity and energy, not noise. We bring humility, candor, and a one‑team mindset — giving and receiving feedback to make the team stronger. The impact you will have This work has real stakes. Depending on your role at TRM, your week might look like: Driving critical investigations that can’t wait for typical business hours. Shipping products in days when others would schedule quarters. Partnering with teams across time zones to deliver insights while the story is still unfolding. Building new solutions from first principles when the playbook doesn’t yet exist. Protecting victims and customers by tracing illicit activity and disrupting criminal networks. Join our Mission At TRM we care deeply about our craft. We are looking for individuals who want their work to matter, who experiment with speed and rigor, and who take pride in building a safer world for billions of people. If you’re excited by TRM’s mission but don’t check every box, we encourage you to apply — we hire for slope, judgment, and the will to learn fast. TRM is a Series C company with $220M in total funding, backed by Blockchain Capital, Goldman Sachs, Bessemer, Y Combinator, Thoma Bravo, and others. Headquartered in San Francisco, TRM operates as a distributed-first company with hubs in Los Angeles, San Francisco, New York, Washington D.C., London, and Singapore. Privacy Policy and Additional Information By submitting your application, you are agreeing to allow TRM to process your personal information in accordance with the TRM Privacy Policy. Our typical hiring cycles for specialized roles span 24 to 36 months. Accordingly, we retain your personal information for up to 36 months to evaluate your application and to consider you for current and future employment opportunities, unless you request earlier deletion or a different retention period is required or permitted by law. To notify TRM Labs that you believe this job posting is non-compliant, please submit a report through this form. No response will be provided to inquiries unrelated to job posting compliance. We are committed to providing reasonable accommodations to applicants with disabilities, and requests can be made via this form. Recruitment agencies TRM Labs does not accept unsolicited agency resumes. Please do not forward resumes to TRM employees. TRM Labs is not responsible for any fees related to unsolicited resumes and will not pay fees to any third-party agency or company without a signed agreement. Learn More : Company Values | Interviewing | FAQs #J-18808-Ljbffr
$190k - $270k
...AI Chopping Block, Inc. in San Francisco is seeking an AI Infrastructure Engineer to maintain user-facing services and production systems. The role involves building and managing infrastructure with tools like Ansible and Kubernetes, ensuring reliability and scalability...Senior- ...A leading AI research firm in San Francisco seeks a Staff Infrastructure Engineer to identify and resolve infrastructure bottlenecks and design large-scale systems for AI training. The ideal candidate has over 3 years of experience in infrastructure engineering and strong...Senior
- ...An innovative AI lab is seeking an experienced engineer to manage and optimize large-scale training infrastructure. You will build core systems that support researchers, focusing on distributed training, performance optimization, and data pipelines. Ideal candidates should...Senior
$190k - $270k
...AI Chopping Block, Inc. is hiring an AI Infrastructure Engineer in San Francisco, California. This full-time role involves ensuring smooth operation of user-facing services and production systems, alongside building and running infrastructure with Ansible, Terraform, and...SeniorFull time- ...AI Chopping Block, Inc. seeks a Senior Software Engineer for their Agentic Infrastructure team in San Francisco. This role involves architecting and building AI systems that enable autonomous planning and execution across the platform. Ideal candidates have 4-7 years of...SeniorRemote workFlexible hours
- An innovative AI infrastructure startup is seeking a Sales Engineer to lead technical discovery and drive successful evaluations with clients. The ideal candidate will have significant experience in customer-facing technical roles focused on AI and machine learning infrastructure...SeniorRemote work
- ...Handshake is seeking a Senior Software Engineer for its Agentic Infrastructure team in San Francisco. You will build the backbone for AI agents, designing key systems that ensure functionality and safety across Handshake's platform. The ideal candidate has 4-7 years of...SeniorRemote workFlexible hours
$190k - $270k
...AI Chopping Block, Inc. seeks an AI Infrastructure Engineer in San Francisco to manage user-facing services and production systems. The role requires 5+ years of experience, a Bachelor's in Computer Science (or equivalent), and proficiency in Ansible, Terraform, and Kubernetes...Senior- ...Refinitiv is seeking a Senior Engineer for the CoCounsel Audit team in San Francisco. This role is for experienced engineers or engineering... ...multi-team initiatives, mentoring engineers, and integrating AI systems into workflows. The position offers a flexible hybrid...SeniorFlexible hours
- ...seeking a skilled individual to join their AI Platform team in San Francisco. You'll take charge of the development of infrastructure that powers AI features within the... ...will have extensive experience in software engineering and AI infrastructure support. The position...Senior
- ...Granica, based in San Francisco, is seeking an expert in distributed systems to enhance their data infrastructure. This role involves architecting a global metadata substrate, developing intelligent data layouts, and implementing algorithms for efficient data representation...SeniorFlexible hours
- ...A fast-growing AI startup is seeking a Senior Infrastructure Engineer in San Francisco. In this role, you will architect and scale distributed systems that handle AI-driven phone conversations for major brands. You will contribute to optimizing ML infrastructure and integrating...Senior
$200k - $240k
...ZipHQ, Inc. is looking for an Application Engineer to serve as the engineering anchor of their Internal AI team in San Francisco. The successful candidate will... ...experience in backend applications, a solid grasp on infrastructure, and the ability to communicate effectively...SeniorFlexible hours- ...Drata is seeking a Senior Platform AI Engineer in San Francisco to develop our AI infrastructure, responsible for building and managing the systems that support AI features across compliance platforms. You'll collaborate with cross-functional teams to enhance production...Senior
- ...Capital One National Association is seeking a skilled AI Engineer to develop and optimize AI systems. You will collaborate with cross-functional teams to implement AI-powered products, enhancing customer interactions and internal workflows. Candidates should have a strong...Senior
- ...Drata is seeking a Senior Platform Engineer II, AI Tooling in San Francisco. This role focuses on building internal AI platforms to enhance engineering productivity, including the design and delivery of systems like MCP servers and agentic workflows. The ideal candidate...SeniorFlexible hours
- ...Poggio Labs, Inc. is seeking a Software Engineer to build scalable platforms and lead product initiatives. You will collaborate closely with engineers and leadership to incorporate AI capabilities and improve our web application. The ideal candidate has over 4 years of...SeniorRemote workFlexible hours
- ...A leading tech company is seeking an Infrastructure Engineer to build and scale its core platform powering AI systems. The role involves designing Kubernetes and Terraform-based infrastructures, defining standards for security and performance, and ensuring reliability...Senior
- ...Marble is looking for a Senior Software Engineer in San Francisco, California. This role involves building the core product experience of an AI-powered tax platform. As an early hire, you will contribute to shaping the architecture and features of the platform. Ideal candidates...Senior
- ...B Capital is looking for a Senior Software Development Engineer to build our AI Governance platform from the ground up in San Francisco. In this senior... ...experienced in full stack development, AWS cloud infrastructure, and has a strong understanding of AI systems. Join...Senior
$180k
...A leading software platform in San Francisco is seeking a Senior Software Engineer to develop intelligent services that enhance the software buying... ...–7 years of experience in software engineering, focusing on AI and machine learning. Responsibilities include building...Senior- ...B Capital is seeking a highly skilled AI Platform Engineer to enhance their ML/AI platform that powers autonomous AI agents at scale. This... ..., AWS, and CI/CD practices. You'll design agent harness infrastructure, implement evaluation frameworks, and ensure a seamless journey...Senior
- ...MaintainX is seeking a Senior AI Platform Developer to build scalable backend services for their AI-powered products. In this remote... ...Python development experience and the ability to work with cloud infrastructure. MaintainX offers a competitive salary, health benefits, and...SeniorRemote workFlexible hours
- ...jobr.pro is seeking a Senior Software Engineer to join its AI Platform team in San Francisco. In this role, you will help design and build scalable infrastructure to transform AI product development and enhance agent performance. The ideal candidate will possess a strong...Senior
- ...A leading AI startup in San Francisco seeks a Senior Platform and Infrastructure Engineer. This role involves designing multi-cloud systems and developing secure deployment tooling for Enterprise Agent OS. The ideal candidate will have 3-7 years of experience in cloud...Senior
- ...Salesforce is seeking a highly skilled Software Development Engineer to lead the development of their AI Governance platform. This senior role focuses on both front-end and back-end engineering, ensuring trusted and scalable AI deployment across the enterprise. The ideal...Senior
- ...ManpowerGroup Global, Inc. is seeking a highly skilled Software Engineer to join its expanding team. This role focuses on building AI-enabled enterprise solutions, leveraging Large Language Models and cloud technologies. Ideal candidates will have a strong foundation in...Senior
$167.2k - $209k
...AppFolio, Inc is looking for a Software Engineer specializing in AI to define and drive the technical vision and architecture within the Realm-X platform. This role requires extensive experience in developing and deploying ML/AI systems and a Master's or Ph.D. in a relevant...Senior- ...A leading financial institution in California seeks a Senior Lead AI Engineer to design and develop innovative AI products. The successful candidate will collaborate with diverse teams to optimize AI systems and solutions, contributing significantly to modern banking....Senior
$225k - $300k
...David Joseph & Company is seeking a Staff/Senior Engineer for Varick Agents in San Francisco. This role encompasses owning the technical architecture for AI systems and ensuring their production deployment. Candidates should possess extensive software engineering experience...Senior
Do you want to receive more vacancies?
Subscribe and receive similar vacancies to Senior or Staff AI Infrastructure Engineer. Be the first to apply!
- software engineer staff San Francisco, CA
- assistant engineer San Francisco, CA
- assistant engineering manager San Francisco, CA
- staff design engineer San Francisco, CA
- project engineer assistant project manager San Francisco, CA
- technology administrator San Francisco, CA
- staff data engineer San Francisco, CA
- assistant chief engineer San Francisco, CA
- senior staff systems engineer San Francisco, CA
- staff engineer San Francisco, CA

