Tech Lead, Data & Inference Engineer

Catalyst Labs, LLC

Our Client

A fast-moving, venture-backed advertising technology startup based in San Francisco. They have raised $12 million in funding and are transforming how business-to-business (B2B) marketers reach their ideal customers. Their identity resolution technology blends business and consumer signals to convert static audience lists into high-match, cross-channel segments without the use of cookies. By transforming first-party and third-party data into precisely targetable audiences across platforms such as Meta, Google, and YouTube, they enable marketing teams to achieve higher match rates, reduce wasted advertising spend, and accelerate pipeline growth. With a strong understanding of how business buyers behave in channels that have traditionally focused on business-to-consumer (B2C) activity, they are redefining how business brands scale demand generation and account-based efforts.

About Us

Catalyst Labs is a leading talent agency with a specialized vertical in Applied AI, Machine Learning, and Data Science. We stand out as an agency that's deeply embedded in our clients' recruitment operations.

We collaborate directly with Founders, CTOs, and Heads of AI in these fields who are driving the next wave of applied intelligence, from model optimization to productized AI workflows. We take pride in facilitating conversations that align with your technical expertise, creative problem-solving mindset, and long-term growth trajectory in the evolving world of intelligent systems.

Location

San Francisco

Work type

Full-time

Compensation

Above-market base + bonus + equity

Roles & Responsibilities

Lead the design, development, and scaling of an end-to-end data platform, from ingestion to insights, ensuring that data is fast, reliable, and ready for business use.
Build and maintain scalable batch and streaming pipelines that turn diverse data sources and third-party APIs into trusted, low-latency systems.
Take full ownership of reliability, cost, and service level objectives (SLOs), including 99.9% uptime, minute-level latency, and optimized cost per terabyte. Conduct root cause analysis and deliver durable fixes.
Operate inference pipelines that enrich data, covering enrichment, scoring, and quality assurance using large language models (LLMs) and retrieval-augmented generation (RAG). Manage version control, caching, and evaluation loops.
Work across teams to deliver data as a product by creating clear data contracts, ownership models, lifecycle processes, and usage-based decision making.
Guide architectural decisions across the data lake and the entire pipeline stack. Document lineage, trade-offs, and reversibility while making practical build-versus-buy decisions.
Scale integrations with APIs and internal services while ensuring data consistency, high data quality, and support for both real-time and batch use cases.
Mentor engineers, review code, and raise the overall technical standard across teams. Promote data-driven best practices throughout the organization.

Qualifications

Bachelor's or Master's degree in Computer Science, Computer Engineering, Electrical Engineering, or Mathematics.
Excellent written and verbal communication; proactive and collaborative mindset.
Comfortable in hybrid or distributed environments with strong ownership and accountability.
A founder-level bias for action: able to identify bottlenecks, automate workflows, and iterate rapidly based on measurable outcomes.
Demonstrated ability to teach, mentor, and document technical decisions and schemas clearly.

Core Experience

6 to 12 years of experience building and scaling production-grade data systems, with deep expertise in data architecture, modeling, and pipeline design.
Expert SQL (query optimization on large datasets) and Python skills.
Hands-on experience with distributed data technologies (Spark, Flink, Kafka) and modern orchestration tools (Airflow, Dagster, Prefect).
Familiarity with dbt, DuckDB, and the modern data stack; experience with IaC, CI/CD, and observability.
Exposure to Kubernetes and cloud infrastructure (AWS, GCP, or Azure).
Bonus: Strong Node.js skills for faster onboarding and system integration.
Previous experience at a high-growth startup (10 to 200 people) or early-stage environment with a strong product mindset.


Vacancy posted 3 days ago
