ML Ops Engineer — Agentic AI Lab (Founding Team)
Fabrion
About the Role ML Ops Engineer — Agentic AI Lab (Founding Team) — Location: San Francisco Bay Area — Type: Full-Time — Compensation: Competitive salary + meaningful equity (founding tier) Backed by 8VC, we're building a world-class team to tackle one of the industry’s most critical infrastructure problems. Our AI Lab is pioneering the future of intelligent infrastructure through open-source LLMs, agent-native pipelines, retrieval-augmented generation (RAG), and knowledge-graph-grounded models. We’re hiring an ML Ops Engineer to be the glue between ML research and production systems — responsible for automating the model training, deployment, versioning, and observability pipelines that power our agents and AI data fabric. You’ll work across compute orchestration, GPU infrastructure, fine-tuned model lifecycle management, model governance, and security. Responsibilities Build and maintain secure, scalable, and automated pipelines for: LLM fine-tuning, SFT, LoRA, RLHF, DPO training RAG embedding pipelines with dynamic updates Model conversion, quantization, and inference rollout Manage hybrid compute infrastructure (cloud, on-prem, GPU clusters) for training and inference workloads using Kubernetes, Ray, and Terraform Containerize models and agents using Docker, with reproducible builds and CI/CD via GitHub Actions or ArgoCD Implement and enforce model governance: versioning, metadata, lineage, reproducibility, and evaluation capture Create and manage evaluation and benchmarking frameworks (e.g. OpenLLM-Evals, RAGAS, LangSmith) Integrate with security and access control layers (OPA, ABAC, Keycloak) to enforce model policies per tenant Instrument observability for model latency, token usage, performance metrics, error tracing, and drift detection Support deployment of agentic apps with LangGraph, LangChain, and custom inference backends (e.g. vLLM, TGI, Triton) Desired Experience Model Infrastructure: 4+ years in MLOps, ML platform engineering, or infra-focused ML roles Deep familiarity with model lifecycle management tools: MLflow, Weights & Biases, DVC, HuggingFace Hub Experience with large model deployments (open-source LLMs preferred): LLaMA, Mistral, Falcon, Mixtral Comfortable with tuning libraries (HuggingFace Trainer, DeepSpeed, FSDP, QLoRA) Familiarity with inference serving: vLLM, TGI, Ray Serve, Triton Inference Server Automation + Infra Proficient with Terraform, Helm, K8s, and container orchestration Experience with CI/CD for ML (e.g. GitHub Actions + model checkpoints) Managed hybrid workloads across GPU cloud (Lambda, Modal, HuggingFace Inference, Sagemaker) Familiar with cost optimization (spot instance scaling, batch prioritization, model sharding) Agent + Data Pipeline Support Familiarity with LangChain, LangGraph, LlamaIndex or similar RAG/agent orchestration tools Built embedding pipelines for multi-source documents (PDF, JSON, CSV, HTML) Integrated with vector databases (Weaviate, Qdrant, FAISS, Chroma) Security & Governance Implemented model-level RBAC, usage tracking, audit trails Integrated with API rate limits, tenant billing, and SLA observability Experience with policy-as-code systems (OPA, Rego) and access layers Preferred Stack LLM Ops : HuggingFace, DeepSpeed, MLflow, Weights & Biases, DVC Infra : Kubernetes (GKE/EKS), Ray, Terraform, Helm, GitHub Actions, ArgoCD Serving : vLLM, TGI, Triton, Ray Serve Pipelines : Prefect, Airflow, Dagster Monitoring : Prometheus, Grafana, OpenTelemetry, LangSmith Security : OPA (Rego), Keycloak, Vault Languages : Python (primary), Bash, optionally Rust or Go for tooling Mindset & Culture Fit Builder's mindset with startup autonomy: you automate what slows you down Obsessive about reproducibility, observability, and traceability Comfortable with a hybrid team of AI researchers, DevOps, and backend engineers Interested in aligning ML systems to product delivery, not just papers Bonus: experience with SOC2, HIPAA, or GovCloud-grade model operations What We’re Looking For Experience: 5+ years as a full stack or backend engineer Experience owning and delivering production systems end-to-end Prior experience with modern frontend frameworks (React, Next.js) Familiarity with building APIs, databases, cloud infrastructure, or deployment workflows at scale Comfortable working in early-stage startups or autonomous roles, prior experience as a founder, founding engineer, or a 0-1 pre-seed startup is a big plus Mindset: Comfortable with ambiguity, eager to prototype and iterate quickly Strong sense of ownership — prefers to build systems rather than wait for tickets Enjoys thinking about architecture, performance, and tradeoffs at every level Clear communicator and pragmatic team player Values equity and impact over prestige or hierarchy Prior startup or founding team experience Why This Role Matters Your work will enable models and agents to be trained, evaluated, deployed, and governed at scale — across many tenants, models, and tasks. This is the backbone of a secure, reliable, and scalable AI-native enterprise system. If you dream about using AI to solve some really hard real world problems – we would love to hear from you. #J-18808-Ljbffr Fabrion
- ML/AI Research Engineer — Agentic AI Lab (Founding Team) Location: San Francisco Bay Area Type: Full-Time Compensation: Competitive salary + meaningful equity (... ...self‑correction, multi‑agent communication, and agent ops logging Optimization Strong background in token...SuggestedFull time
- ...Data/ETL Engineer (Founding Team) Location: San Francisco Bay Area Type: Full-Time... ...’re building a multi-tenant, AI-native platform where enterprise... ...knowledge models that fuel agentic applications. If you\u2019ve... ...semantic data Collaborate with ML/LLM teams to feed high-...SuggestedFull time
$150k - $225k
...modern families. Most AI tools are designed for... ...simple. Our strength is our team — driven, thoughtful... ...We’re looking for an Founding AI Engineer to design, build, and... ...building and scaling data or ML pipelines in production... ..., tool calling, and agentic workflows Experience...SuggestedFull timeWork from homeFlexible hours$150k - $300k
...An early-stage AI data company that went from zero... ...web. You will own the ML systems that turn that... ...including reporting lines and team shapes. Detect... ...the full ML research and engineering cycle, from prototype to... ...with a direct path toward founding something of your own....Suggested- ...About the Role Our company is hiring a Founding AI / ML Engineer to help architect and ship the next generation... ...that solve real customer problems in lean teams. You will work across multimodal AI, document intelligence, agentic workflows, and evaluation systems, turning...SuggestedH1bVisa sponsorship
- ...building the gateway to the internet for AI agents. Our APIs already power... ...push the boundaries of what our ML systems can do. We're hiring a Founding ML Engineer to own the research and... ...chart — who reports to whom, what the team structure looks like, how the engineering...
- ...startup in San Francisco seeks founding Machine Learning Engineers (MLEs) to enhance core... ...You will work on low-latency AI solutions in browser... .... This role demands strong ML skills and experience with... ...and character fit on a small team. Ideal candidates must have...
- Composite is seeking founding Machine Learning Engineers to enhance our proactive automation platform. In this... ...unique position offers a chance to work with cutting-edge technology in a small team, directly influencing the work lives of millions. #J-18808-Ljbffr Composite.ai
- Icehouseventures is looking for founding Machine Learning Engineers (MLEs) to enhance their core action models for a proactive automation platform. This... ...instant response times without IT friction. Join a small team where your contributions improve the work lives of...
$200k
Founding ML Engineer San Francisco, on-site, full-time - $200,000 - $500,000 per... ...been crunched. The house AI parsed the PK curves, flagged... ...head of science from a major AI lab walks in. He's got access to... ...is dosed in mice. By 8 PM the team hands you the data. It holds...Full timeNight shiftDay shiftAfternoon shift$172.5k - $260.1k
...Category Software Engineering About Salesforce Salesforce... ...is the #1 AI CRM, where humans with... ...in the agentic era? You’re in the... ...decision‑making. Our team is composed of Architects... ...company benefits can be found at the following... ...as applicable. #J-18808-Ljbffr Centaur LabsShift work- A leading AI infrastructure firm based in San Francisco is looking for engineers to join their founding core team. You will work directly with the founders to develop AI models that optimize network operations and anticipate failures. This unique position offers the opportunity...
- About David AI David AI is the first audio... ...with the same rigor AI labs bring to models. Our... ...David AI excels. Founded in 2024 by former Scale AI engineers and operators, David... ...Round Capital. Our team is sharp, humble, ambitious... ...manage the complete ML lifecycle, from...
- ...the Role We're looking for founding Machine Learning Engineers (MLEs) to own and improve our... ..., or consumer-focused “AI browsers,” we run AI directly... ...architecture creates unique ML challenges. This is a high-... ...on our small, exceptional team where your work ships directly...Sleeping nights
- ...known as Femtosense—was founded in 2018 by researchers from... ...the Brains in Silicon Lab at Stanford University.... ...pioneered a high-performance AI accelerator integrated... ...will work on a custom ML compiler that transforms... ...with hardware and ML teams to improve system performance...
$175k - $300k
...Trove Trove is developing an AI associate for financial firms -... ...density: Be a part of a great founding team in SF. Shivaal Roy (CTO) was a founding engineer at Glean ($100M+ ARR today) and... ...Collaborate with the top AI labs and elite financial firms to design...- ...Join us Human Computer Lab is building robots that... ...fast-paced and focused team, with the goal of pushing... ...for a machine learning engineer who wants to work on the... ...the frontier of embodied AI, where the problems are... ...learning models Integrate ML systems with robotic hardware...Immediate start
$200k - $400k
...data platform to train AI video models. Troveo... ..., and AI research labs, enabling scalable,... ...innovative strategic engineer to help us scale.... ...work across the full ML lifecycle, from structuring... ...and Operations teams to translate ML... .... Proficiency in ML ops tools (e.g., MLflow,...Work experience placement- ...About the Role As a MLOps Engineer within the Perception Deep Learning team, you will lead the design and evolution of... ...and drive best practices across the ML lifecycle. You will play a key role... ...efficient deployment and management of AI models. Collaborate with cross-...Work at office3 days per week
- ...revolutionizing how outside sales and service teams work. Their AI technology captures and analyzes real-... ..., build, and deploy production‑grade ML systems with end‑to‑end ownership of... ...years of professional experience in ML engineering. Strong programming skills in Python (...Full time
- ...Andalusia Labs is building foundational economic... ...GDP. Our team has shipped massively... ...driven Machine Learning Engineer who is passionate... ...Build best‑in‑class AI chatbots that guide... ...models with ML frameworks such as... ...finance primitives Founded a company Experience...
$200k - $300k
...& skill-level) | San Francisco, CA | On-site Most ML roles ask you to fine-tune someone else's idea. This... ...entirely. Backed by Anthropic. The problem is hard, the team is exceptional, and the window to be a founding engineer on something like this doesn't stay open long. What...InternshipWork at office$200k - $250k
Job Title: Founding AI/ML Engineer Salary: $200-250K + Equity Company Description: Generalcatalyst.com - AI startup pioneered by Princeton researchers... ...and a GPT co‑creator. Job Description: Join the founding team at Curium to build Generative Engine Optimization (GEO), the...$250k
...Our client, a venture-backed AI Startup, is hiring a talented ML/AI Research Engineer to join their team in San Francisco. The successful candidate will act as the key... ...HIPAA or GovCloud environments, or prior startup/founding team experience is a bonus. Benefits Salary:...- About AI systems are getting better on benchmarks... ...world use. At Arcada Labs, we build products... ...small, deeply technical team with people from Harvard... ...We’re looking for an ML Research Engineer to help us build... ...systems, model evaluation, agentic workflows, and frontier...
- Open Select, an early-stage AI data company in San Francisco, is seeking a Machine Learning Engineer. You will be responsible for owning ML systems that convert raw data into structured intelligence... ...retrieval systems and contribute to a team that has already achieved $7M ARR. #J...
- A leading AI technology firm in San Francisco seeks a Founding ML Engineer to lead the research and engineering of their core AI systems. You will convert messy, multilingual web-scale data into structured intelligence, resolve data discrepancies, and train ML models from...
- We are seeking an agentic focussed ML Engineer to build the core agent intelligence layer for a heavily funded team of Google Brain alumni building the Claude code of industrial engineering. Whilst early stage, the team have already built a customer base including Tesla...
- Gravity Engineering Services Pvt Ltd. is looking for a Founding ML Engineer to lead the development of core ML intelligence systems. You will dive into the complexities... ..., making significant contributions to the way AI agents access real-time information. Experience in NLP...
$147.4k - $272.1k
Machine Learning Engineer, ASE Search Team San Francisco, California, United States Machine Learning and AI The Apple Services Engineering AI/ML organization is hiring a Machine Learning Engineer... ..., context management, RAG and Agentic architectures and solutions. Exposure...Relocation
Do you want to receive more vacancies?
Subscribe and receive similar vacancies to ML Ops Engineer — Agentic AI Lab (Founding Team). Be the first to apply!
- graduate machine learning engineer San Francisco, CA
- machine learning engineer San Francisco, CA
- data scientist machine learning engineer San Francisco, CA
- junior machine learning research engineer San Francisco, CA
- senior ml engineer San Francisco, CA
- computer vision machine learning engineer San Francisco, CA
- ai ml engineer San Francisco, CA
- machine learning software engineer San Francisco, CA
- machine learning ai engineer San Francisco, CA
- intern - quantum machine learning for quantum computing San Francisco, CA

