AI Engineer - Core
Hilbert's AI
AI Engineer Opportunity at Hilbert
Hilbert is building a reasoning engine that must navigate non-deterministic user behavior across data silos — turning months-long decision cycles into minutes. Fully agentic by design, our demand intelligence platform doesn't just call APIs; it solves the hard problem of orchestrating multi-step inference over messy, high-stakes enterprise data where deterministic answers don't exist.
From Fortune 500 enterprises to beloved brands like FreshDirect, Blank Street, and Levain Bakery, operators run their growth on Hilbert. We're also co-building alongside leading AI companies.
We're looking for an AI Engineer who can build production-grade AI systems end-to-end — from prototype to pipeline to product — with the ownership and urgency of a startup culture.
This is not a "wire up a prompt chain and move on" role. You'll own core pieces of the AI stack that power Hilbert's demand intelligence platform — designing agent architectures, building evaluation systems, and making hard tradeoffs between accuracy, latency, and cost in production. You'll ship fast in conditions where the spec is evolving, and communicate what you're building (and why) with clarity to the rest of the team. If you think in systems, have opinions about how agentic workflows should actually work, and want to build AI products that drive real enterprise outcomes, we want to meet you.
The Role
You'll work directly with the founding team and across product, data, and GTM to design, build, and improve the AI systems at the heart of Hilbert. The environment is high-autonomy and high-ambiguity — the nature of building AI-native products means requirements shift, approaches evolve, and the person closest to the problem often makes the call.
What You'll Do:
Design, build, and maintain AI-driven features and pipelines that serve enterprise customers at scale
Architect and implement agent-based workflows using LangChain, LangGraph, or equivalent orchestration frameworks
Own systems end-to-end — from experimentation through production deployment and monitoring
Build and improve evaluation pipelines to measure, validate, and iterate on AI system performance
Collaborate closely with the founding team and cross-functional partners — communicating tradeoffs, progress, and technical decisions with clarity
Make pragmatic engineering decisions under ambiguity — ship, learn, iterate
Shape the technical direction of the AI stack as the company scales
Our Current Hurdles
These are the kinds of problems you'll walk into on day one:
Intelligent retrieval across heterogeneous approaches — our agents need the right information at exactly the right moment. The challenge isn't picking one retrieval method; it's combining RAG, graph-based retrieval, and other approaches into a unified strategy that fetches the most relevant content precisely when the agent needs it — no more, no less.
Agentic workflows that solve real-world problems — it's building workflows robust enough to handle the unexpected. When an agent hits an edge case, missing data, or a situation it wasn't explicitly designed for, it needs to reason through it — leveraging available context, escalating to a human when it can't, and never silently failing.
Evaluation beyond vibes — we need systematic, reproducible evals that actually predict real-world performance. If you've built custom evaluators for RAG or agent workflows, we want to talk.
Execution and real-world integration — an agent that only surfaces insights isn't enough. We're building systems where agents take action — integrating with external platforms, executing workflows, and doing real work with the information they have, combined with human-in-the-loop checkpoints that keep enterprise trust intact.
Who Thrives in This Role
We care about how you think and how you ship - not how many years are on your resume.
The Profile:
You're a strong Software engineer. Your code is clean, testable, and production-ready.
You have real experience with LangChain, LangGraph, or equivalent agent/orchestration frameworks. You've built with them, hit their limits, and worked around them - not just followed tutorials
You communicate with clarity and conviction. You can explain a technical decision to a non-technical founder and debate architecture tradeoffs with a senior engineer. Communication is not a nice-to-have here - it's core to the role
You take ownership. You don't wait for tickets. You see what needs to be built, raise your hand, and ship it
You thrive in ambiguity. AI products evolve fast. Requirements change. You're energized by figuring it out.
You move at startup speed. You understand what it means to be available, responsive, and biased toward action in a fast-moving, early-stage environment
Strong Pluses:
Experience building evals pipelines — designing metrics, running systematic evaluations, and using results to drive iteration on AI systems
Backend software engineering experience — building APIs, services, data infrastructure, or production systems
Exposure to retrieval-augmented generation (RAG), vector databases, or LLM-powered search and recommendation systems
Experience at early-stage startups or high-growth environments where you wore multiple hats
You Might Be:
A backend engineer who went deep on LLMs and never looked back. An ML engineer who realized they love building products, not just models. A startup CTO who wants to go deep on AI at a company where the stack is the product. Someone who's been hacking on agents and pipelines nights and weekends and wants to do it full-time with real enterprise stakes. What matters: you ship, you own it, and you communicate like a teammate — not a silo.
Location
San Francisco, with occasional travel for team meets, offsites or customer engagements.
Compensation
Competitive salary + equity package, commensurate with experience. Performance-based bonuses tied to project milestones and customer impact.
The Hiring Journey
Short form → Intro call → Technical working session → Team conversations → Offer
- Hilbert is building a reasoning engine that must navigate non-deterministic user behavior across... ...We're also co-building alongside leading AI companies. We're looking for an AI... ...prompt chain and move on" role. You'll own core pieces of the AI stack that power Hilbert'...SuggestedFull timeShift workNight shiftWeekend work
- ...Senior AI Engineer Disney Entertainment and ESPN Product & Technology Technology is at the heart of Disney's past, present, and future... .... We're hiring a Senior AI Engineer to build the AI core capabilities and tooling that accelerate teams across Ad Technology...Suggested
$197.3k - $225.1k
...Lead AI Engineer (AI Foundations, LLM Core and Agentic AI) Overview At Capital One, we are creating responsible and reliable AI systems, changing banking for good. For years, Capital One has been an industry leader in using machine learning to create real-time, personalized...SuggestedFull timePart timeLocal area- About Build Build is creating the agentic AI stack for the built world. We help institutional real estate teams automate complex... ...and the physical world. About the role We are looking for an AI engineer, core to build the infrastructure, systems, and quality loops behind...Suggested
$230k - $385k
...About the Team The Codex Core Agent team builds the kernel of Codex. We own making the agent better, accelerating research, and... ...over time. About the Role We're looking for applied AI engineers to help bring Codex agents from impressive demos to dependable...Suggested- A leading AI research organization in San Francisco seeks applied AI engineers to enhance Codex agents for software engineering tasks. This role involves designing agent behaviors, analyzing production failures, and collaborating with research teams on performance evaluation...
- ...Core Ai Platform Engineering Manager Front is the customer operations platform built for B2B complexity, keeping every team, tool, and customer conversation in sync so companies can scale without losing connection. Others handle simple interactions. Front handles the...Work at officeImmediate startRemote workWork from homeMonday to FridayFlexible hours
$230k - $385k
Slope is hiring an AI Systems Engineer in San Francisco. This position focuses on designing and building core agent systems for Codex, improving observability and efficiency in operations. The role demands expertise in production systems and a solid grasp of Rust and Python...$230k - $385k
OpenAI in San Francisco is seeking an AI Systems Engineer for Codex Core Agents. The role focuses on building reliable AI systems for production environments. Candidates should have experience with distributed systems, ML workflows, and debugging skills across diverse...- A leading AI-driven startup in San Francisco is looking for a key engineer with significant AI/ML and LLM experience. This role involves developing core tools for agent-driven chip design workflows and requires expertise in Python, C++, and ML models. Candidates should...
- ...and Forbes Best Startup Employers 2022 List . The Core Product Experience (CPX) team owns and elevates the heart... ...and contacts. CPX partners closely with Client Platform, AI, and other pods across Engineering to deliver cohesive, high-impact experiences. As Front...Work at officeImmediate startRemote workWork from homeWorldwideMonday to FridayFlexible hours
- ...Software Engineer, Core Services Our mission is to automate coding. The first step in our journey is to build the best tool for professional... ...response, closing the loop on one of the hardest problems in AI-powered development. You May Be A Fit If You have experience...Work at office
- ...tech stacks to accelerate the progress of AI applications out into the real world.... ...performance and reliability. We're looking for engineers with systems software experience that are... ...contributing to the Ray backend. About the Ray Core Team The Ray Core team develops and...Work experience placement
$140k - $200k
...around the globe work on Speechify in a 100% distributed setting - Speechify has no office. These include frontend and backend engineers, AI research scientists, and others from Amazon, Microsoft, and Google, leading PhD programs like Stanford, high growth startups like...Work at officeRemote work- ...Runtime Engineer – AI Runtime & Execution About the Role We're looking for a Runtime Engineer to help build and optimise the execution layer that powers next-generation AI workloads. Working at the intersection of systems software, compiler technology, and hardware...
$150k - $230k
...customer survey platform, rebuilt around AI agents. For two decades, teams have relied... ...category is being rebuilt, with AI at the core. If you're excited by hard problems and the... ...client-side SDK. Our full-stack engineers are responsible for architecting and building...Full timeWork at officeWorldwide$175k - $215k
...autonomously driving over 100 million miles on public roads and tens of billions in simulation across 15+ U.S. states. Software Engineering builds the brains of Waymo's fully autonomous driving technology. Our software allows the Waymo Driver to perceive the world around...Full timeRemote work- ...Hello, We have 5 urgent openings for an " AI Software Engineer ". These are hybrid roles. Only looking for candidates who can work on W2 Strictly no C2C or third-party vendors Location: San Francisco, CA (Hybrid) Duration: Long-term Contract We...Long term contract
- Job Title Disabled veteran A veteran who served on active duty in the U.S. military and is entitled to disability compensation (or who but for the receipt of military retired pay would be entitled to disability compensation) under laws administered by the Secretary of...
$190k - $260k
...About Gem Gem is the only AI-first all-in-one recruiting platform. It brings together your ATS, CRM, sourcing, scheduling, and... ...3 The Team and Role We're looking for a Senior Software Engineer to join our ATS team, one of the most strategically important areas...Full timeWork at officeLocal areaImmediate startRemote workFlexible hours3 days per week$140k - $200k
...working. We’re #1 in our category, and experiencing exponential growth. Overview We're looking for a Senior Software Engineer to join our Core Experiences Team. This team builds and maintains the foundational services and SDKs that power Speechify’s product...Remote work$160k - $180k
...belonging by enabling meaningful connections through voice, video, and text. We're looking for a mission-driven Software Engineer to join our Core Product team. This team is responsible for shaping the heart of the Discord experience: how people chat, hang out, and...Full time- ...Protocol: Go (Cosmos SDK, go-ethereum, btcsuite, Tendermint Core) Smart contracts: Solidity, Rust, Hardhat Frontend: Typescript... ...experience Base experience ~+4 years working as a software engineer ~ Full-stack engineering experience with focus on the backend...Remote work
$160k - $250k
...Title: Founding AI Engineer (Research & Systems) Target: PhDs & Research Masters from Stanford, MIT, Berkeley, CMU focused on AI, ML, NLP... ...first AI hire to own the research and implementation of our core agentic models. You will be responsible for turning groundbreaking...H1bImmediate startVisa sponsorship- ...building the performance management layer for enterprise AI systems. As companies deploy AI across customer support, operations... .... The role We’re looking for a founding AI engineer to help build the core system. This is a true 0→1 role, working directly with the...Immediate start
$150k - $250k
...AI Engineer At Distyl, AI Engineers build and operate AI systems that deliver real business value inside customer environments. This role... ...building and operating real systems. You understand core engineering concepts like versioning, debugging, testing, and performance...Work at office3 days per week- ...AI Engineer Conduit is the platform for building conversational AI agents focused on hospitality. Our AI agents automate inbound and outbound conversational workflows for to increase conversions, reduce costs, and improve customer satisfaction. To maximize conversational...Flexible hours
$175k - $250k
...AI Engineer (Hybrid - San Francisco, CA) We are currently supporting a new client based in San Francisco that is building next generation AI powered healthcare workflow solutions. They are looking for a AI Engineer to join their early engineering team and help build...Full time- ...Cooperidge Consulting Firm is seeking an AI Engineer for a top Financial Technology (FinTech) client. This role focuses on building and deploying AI agent systems that automate complex financial audit workflows, including autonomous planning, memory, and tool execution...Hourly payFull timeWeekend work
$150k - $180k
...AI Evaluations Engineer – HealthcareLocation: Remote, located in the USType: Full-timeDepartment: EngineeringReports to: Director Of EngineeringResponsibilitiesBuild and maintain infrastructure and tooling for the AI evaluations platform used by internal teams, including...Remote workFlexible hours
Do you want to receive more vacancies?
Subscribe and receive similar vacancies to AI Engineer - Core. Be the first to apply!
- ai research engineer San Francisco, CA
- machine learning ai engineer San Francisco, CA
- ai engineer remote San Francisco, CA
- ai prompt engineer San Francisco, CA
- ai developer San Francisco, CA
- ai engineer San Francisco, CA
- ai ml engineer San Francisco, CA
- senior ai engineer San Francisco, CA
- ai network engineer
- azure ai engineer


