Sign up to access all features of our service.
  • Job search
  • Favorites
  • Create a CV
    New
  • Salaries
  • Subscriptions

ML Ops Engineer — Agentic AI Lab (Founding Team)

Fabrion

About the Role ML Ops Engineer — Agentic AI Lab (Founding Team) — Location: San Francisco Bay Area — Type: Full-Time — Compensation: Competitive salary + meaningful equity (founding tier) Backed by 8VC, we're building a world-class team to tackle one of the industry’s most critical infrastructure problems. Our AI Lab is pioneering the future of intelligent infrastructure through open-source LLMs, agent-native pipelines, retrieval-augmented generation (RAG), and knowledge-graph-grounded models. We’re hiring an ML Ops Engineer to be the glue between ML research and production systems — responsible for automating the model training, deployment, versioning, and observability pipelines that power our agents and AI data fabric. You’ll work across compute orchestration, GPU infrastructure, fine-tuned model lifecycle management, model governance, and security. Responsibilities Build and maintain secure, scalable, and automated pipelines for: LLM fine-tuning, SFT, LoRA, RLHF, DPO training RAG embedding pipelines with dynamic updates Model conversion, quantization, and inference rollout Manage hybrid compute infrastructure (cloud, on-prem, GPU clusters) for training and inference workloads using Kubernetes, Ray, and Terraform Containerize models and agents using Docker, with reproducible builds and CI/CD via GitHub Actions or ArgoCD Implement and enforce model governance: versioning, metadata, lineage, reproducibility, and evaluation capture Create and manage evaluation and benchmarking frameworks (e.g. OpenLLM-Evals, RAGAS, LangSmith) Integrate with security and access control layers (OPA, ABAC, Keycloak) to enforce model policies per tenant Instrument observability for model latency, token usage, performance metrics, error tracing, and drift detection Support deployment of agentic apps with LangGraph, LangChain, and custom inference backends (e.g. vLLM, TGI, Triton) Desired Experience Model Infrastructure: 4+ years in MLOps, ML platform engineering, or infra-focused ML roles Deep familiarity with model lifecycle management tools: MLflow, Weights & Biases, DVC, HuggingFace Hub Experience with large model deployments (open-source LLMs preferred): LLaMA, Mistral, Falcon, Mixtral Comfortable with tuning libraries (HuggingFace Trainer, DeepSpeed, FSDP, QLoRA) Familiarity with inference serving: vLLM, TGI, Ray Serve, Triton Inference Server Automation + Infra Proficient with Terraform, Helm, K8s, and container orchestration Experience with CI/CD for ML (e.g. GitHub Actions + model checkpoints) Managed hybrid workloads across GPU cloud (Lambda, Modal, HuggingFace Inference, Sagemaker) Familiar with cost optimization (spot instance scaling, batch prioritization, model sharding) Agent + Data Pipeline Support Familiarity with LangChain, LangGraph, LlamaIndex or similar RAG/agent orchestration tools Built embedding pipelines for multi-source documents (PDF, JSON, CSV, HTML) Integrated with vector databases (Weaviate, Qdrant, FAISS, Chroma) Security & Governance Implemented model-level RBAC, usage tracking, audit trails Integrated with API rate limits, tenant billing, and SLA observability Experience with policy-as-code systems (OPA, Rego) and access layers Preferred Stack LLM Ops : HuggingFace, DeepSpeed, MLflow, Weights & Biases, DVC Infra : Kubernetes (GKE/EKS), Ray, Terraform, Helm, GitHub Actions, ArgoCD Serving : vLLM, TGI, Triton, Ray Serve Pipelines : Prefect, Airflow, Dagster Monitoring : Prometheus, Grafana, OpenTelemetry, LangSmith Security : OPA (Rego), Keycloak, Vault Languages : Python (primary), Bash, optionally Rust or Go for tooling Mindset & Culture Fit Builder's mindset with startup autonomy: you automate what slows you down Obsessive about reproducibility, observability, and traceability Comfortable with a hybrid team of AI researchers, DevOps, and backend engineers Interested in aligning ML systems to product delivery, not just papers Bonus: experience with SOC2, HIPAA, or GovCloud-grade model operations What We’re Looking For Experience: 5+ years as a full stack or backend engineer Experience owning and delivering production systems end-to-end Prior experience with modern frontend frameworks (React, Next.js) Familiarity with building APIs, databases, cloud infrastructure, or deployment workflows at scale Comfortable working in early-stage startups or autonomous roles, prior experience as a founder, founding engineer, or a 0-1 pre-seed startup is a big plus Mindset: Comfortable with ambiguity, eager to prototype and iterate quickly Strong sense of ownership — prefers to build systems rather than wait for tickets Enjoys thinking about architecture, performance, and tradeoffs at every level Clear communicator and pragmatic team player Values equity and impact over prestige or hierarchy Prior startup or founding team experience Why This Role Matters Your work will enable models and agents to be trained, evaluated, deployed, and governed at scale — across many tenants, models, and tasks. This is the backbone of a secure, reliable, and scalable AI-native enterprise system. If you dream about using AI to solve some really hard real world problems – we would love to hear from you. #J-18808-Ljbffr Fabrion

Vacancy posted 1 day ago
Similar jobs that could be interesting for youBased on the ML Ops Engineer — Agentic AI Lab (Founding Team) in San Francisco, CA vacancy
  • ML/AI Research Engineer — Agentic AI Lab (Founding Team) Location: San Francisco Bay Area Type: Full-Time Compensation: Competitive salary + meaningful equity (...  ...self‑correction, multi‑agent communication, and agent ops logging Optimization Strong background in token... 
    Suggested
    Full time

    Fabrion

    San Francisco, CA
    13 hours ago
  •  ...Data/ETL Engineer (Founding Team) Location: San Francisco Bay Area Type: Full-Time...  ...’re building a multi-tenant, AI-native platform where enterprise...  ...knowledge models that fuel agentic applications. If you\u2019ve...  ...semantic data Collaborate with ML/LLM teams to feed high-... 
    Suggested
    Full time

    Fabrion

    San Francisco, CA
    3 days ago
  • $150k - $225k

     ...modern families. Most AI tools are designed for...  ...simple. Our strength is our team — driven, thoughtful...  ...We’re looking for an Founding AI Engineer to design, build, and...  ...building and scaling data or ML pipelines in production...  ..., tool calling, and agentic workflows Experience... 
    Suggested
    Full time
    Work from home
    Flexible hours

    Hellobabs

    San Francisco, CA
    3 days ago
  • $150k - $300k

     ...An early-stage AI data company that went from zero...  ...web. You will own the ML systems that turn that...  ...including reporting lines and team shapes. Detect...  ...the full ML research and engineering cycle, from prototype to...  ...with a direct path toward founding something of your own.... 
    Suggested

    Open Select

    San Francisco, CA
    5 days ago
  •  ...About the Role Our company is hiring a Founding AI / ML Engineer to help architect and ship the next generation...  ...that solve real customer problems in lean teams. You will work across multimodal AI, document intelligence, agentic workflows, and evaluation systems, turning... 
    Suggested
    H1b
    Visa sponsorship

    Ersilia

    San Francisco, CA
    5 days ago
  •  ...building the gateway to the internet for AI agents. Our APIs already power...  ...push the boundaries of what our ML systems can do. We're hiring a Founding ML Engineer to own the research and...  ...chart — who reports to whom, what the team structure looks like, how the engineering... 

    Crustdata (YC F24)

    San Francisco, CA
    4 days ago
  •  ...startup in San Francisco seeks founding Machine Learning Engineers (MLEs) to enhance core...  ...You will work on low-latency AI solutions in browser...  .... This role demands strong ML skills and experience with...  ...and character fit on a small team. Ideal candidates must have... 

    Composite.ai

    San Francisco, CA
    4 days ago
  • Composite is seeking founding Machine Learning Engineers to enhance our proactive automation platform. In this...  ...unique position offers a chance to work with cutting-edge technology in a small team, directly influencing the work lives of millions. #J-18808-Ljbffr Composite.ai

    Composite.ai

    San Francisco, CA
    2 days ago
  • Icehouseventures is looking for founding Machine Learning Engineers (MLEs) to enhance their core action models for a proactive automation platform. This...  ...instant response times without IT friction. Join a small team where your contributions improve the work lives of... 

    Icehouseventures

    San Francisco, CA
    2 days ago
  • $200k

    Founding ML Engineer San Francisco, on-site, full-time - $200,000 - $500,000 per...  ...been crunched. The house AI parsed the PK curves, flagged...  ...head of science from a major AI lab walks in. He's got access to...  ...is dosed in mice. By 8 PM the team hands you the data. It holds... 
    Full time
    Night shift
    Day shift
    Afternoon shift

    Stealth Deep Tech

    San Francisco, CA
    13 hours ago
  • $172.5k - $260.1k

     ...Category Software Engineering About Salesforce Salesforce...  ...is the #1 AI CRM, where humans with...  ...in the agentic era? You’re in the...  ...decision‑making. Our team is composed of Architects...  ...company benefits can be found at the following...  ...as applicable. #J-18808-Ljbffr Centaur Labs
    Shift work

    Centaur Labs

    San Francisco, CA
    13 hours ago
  • A leading AI infrastructure firm based in San Francisco is looking for engineers to join their founding core team. You will work directly with the founders to develop AI models that optimize network operations and anticipate failures. This unique position offers the opportunity... 

    Meter Service

    San Francisco, CA
    4 days ago
  • About David AI David AI is the first audio...  ...with the same rigor AI labs bring to models. Our...  ...David AI excels. Founded in 2024 by former Scale AI engineers and operators, David...  ...Round Capital. Our team is sharp, humble, ambitious...  ...manage the complete ML lifecycle, from... 

    Gravity Engineering Services Pvt Ltd.

    San Francisco, CA
    13 hours ago
  •  ...the Role We're looking for founding Machine Learning Engineers (MLEs) to own and improve our...  ..., or consumer-focused “AI browsers,” we run AI directly...  ...architecture creates unique ML challenges. This is a high-...  ...on our small, exceptional team where your work ships directly... 
    Sleeping nights

    Composite.ai

    San Francisco, CA
    4 days ago
  •  ...known as Femtosense—was founded in 2018 by researchers from...  ...the Brains in Silicon Lab at Stanford University....  ...pioneered a high-performance AI accelerator integrated...  ...will work on a custom ML compiler that transforms...  ...with hardware and ML teams to improve system performance... 

    Femtosense

    San Bruno, CA
    3 days ago
  • $175k - $300k

     ...Trove Trove is developing an AI associate for financial firms -...  ...density: Be a part of a great founding team in SF. Shivaal Roy (CTO) was a founding engineer at Glean ($100M+ ARR today) and...  ...Collaborate with the top AI labs and elite financial firms to design... 

    TROVE

    San Francisco, CA
    3 days ago
  •  ...Join us Human Computer Lab is building robots that...  ...fast-paced and focused team, with the goal of pushing...  ...for a machine learning engineer who wants to work on the...  ...the frontier of embodied AI, where the problems are...  ...learning models Integrate ML systems with robotic hardware... 
    Immediate start

    Human Computer Lab

    San Francisco, CA
    4 days ago
  • $200k - $400k

     ...data platform to train AI video models. Troveo...  ..., and AI research labs, enabling scalable,...  ...innovative strategic engineer to help us scale....  ...work across the full ML lifecycle, from structuring...  ...and Operations teams to translate ML...  .... Proficiency in ML ops tools (e.g., MLflow,... 
    Work experience placement

    Troveo AI

    San Francisco, CA
    4 days ago
  •  ...About the Role As a MLOps Engineer within the Perception Deep Learning team, you will lead the design and evolution of...  ...and drive best practices across the ML lifecycle. You will play a key role...  ...efficient deployment and management of AI models. Collaborate with cross-... 
    Work at office
    3 days per week

    Hayden AI Technologies, Inc.

    San Francisco, CA
    4 days ago
  •  ...revolutionizing how outside sales and service teams work. Their AI technology captures and analyzes real-...  ..., build, and deploy production‑grade ML systems with end‑to‑end ownership of...  ...years of professional experience in ML engineering. Strong programming skills in Python (... 
    Full time

    Catalyst Labs, LLC

    San Francisco, CA
    3 days ago
  •  ...Andalusia Labs is building foundational economic...  ...GDP. Our team has shipped massively...  ...driven Machine Learning Engineer who is passionate...  ...Build best‑in‑class AI chatbots that guide...  ...models with ML frameworks such as...  ...finance primitives Founded a company Experience... 

    Framework Ventures

    San Francisco, CA
    3 days ago
  • $200k - $300k

     ...& skill-level) | San Francisco, CA | On-site Most ML roles ask you to fine-tune someone else's idea. This...  ...entirely. Backed by Anthropic. The problem is hard, the team is exceptional, and the window to be a founding engineer on something like this doesn't stay open long. What... 
    Internship
    Work at office

    Strive - GTM Talent Partner

    San Francisco, CA
    4 days ago
  • $200k - $250k

    Job Title: Founding AI/ML Engineer Salary: $200-250K + Equity Company Description: Generalcatalyst.com - AI startup pioneered by Princeton researchers...  ...and a GPT co‑creator. Job Description: Join the founding team at Curium to build Generative Engine Optimization (GEO), the... 

    Jack & Jill

    San Francisco, CA
    4 days ago
  • $250k

     ...Our client, a venture-backed AI Startup, is hiring a talented ML/AI Research Engineer to join their team in San Francisco. The successful candidate will act as the key...  ...HIPAA or GovCloud environments, or prior startup/founding team experience is a bonus. Benefits Salary:... 

    Alldus International Consulting Ltd

    San Francisco, CA
    4 days ago
  • About AI systems are getting better on benchmarks...  ...world use. At Arcada Labs, we build products...  ...small, deeply technical team with people from Harvard...  ...We’re looking for an ML Research Engineer to help us build...  ...systems, model evaluation, agentic workflows, and frontier... 

    Arcada Labs Incorporated

    San Francisco, CA
    1 day ago
  • Open Select, an early-stage AI data company in San Francisco, is seeking a Machine Learning Engineer. You will be responsible for owning ML systems that convert raw data into structured intelligence...  ...retrieval systems and contribute to a team that has already achieved $7M ARR. #J... 

    Open Select

    San Francisco, CA
    13 hours ago
  • A leading AI technology firm in San Francisco seeks a Founding ML Engineer to lead the research and engineering of their core AI systems. You will convert messy, multilingual web-scale data into structured intelligence, resolve data discrepancies, and train ML models from... 

    Crustdata (YC F24)

    San Francisco, CA
    2 days ago
  • We are seeking an agentic focussed ML Engineer to build the core agent intelligence layer for a heavily funded team of Google Brain alumni building the Claude code of industrial engineering. Whilst early stage, the team have already built a customer base including Tesla... 

    Acceler8 Talent

    San Francisco, CA
    2 days ago
  • Gravity Engineering Services Pvt Ltd. is looking for a Founding ML Engineer to lead the development of core ML intelligence systems. You will dive into the complexities...  ..., making significant contributions to the way AI agents access real-time information. Experience in NLP... 

    Gravity Engineering Services Pvt Ltd.

    San Francisco, CA
    2 days ago
  • $147.4k - $272.1k

    Machine Learning Engineer, ASE Search Team San Francisco, California, United States Machine Learning and AI The Apple Services Engineering AI/ML organization is hiring a Machine Learning Engineer...  ..., context management, RAG and Agentic architectures and solutions. Exposure... 
    Relocation

    Apple

    San Francisco, CA
    3 days ago

Do you want to receive more vacancies?

Subscribe and receive similar vacancies to ML Ops Engineer — Agentic AI Lab (Founding Team). Be the first to apply!