Tech Lead, Data & Inference Engineer

Catalyst Labs, LLC

Our Client

A fast-moving, venture-backed advertising technology startup based in San Francisco. They have raised twelve million dollars in funding and are transforming how business-to-business marketers reach their ideal customers. Their identity resolution technology blends business and consumer signals to convert static audience lists into high-match, cross-channel segments without the use of cookies. By transforming first-party and third-party data into precisely targetable audiences across platforms such as Meta, Google and YouTube, they enable marketing teams to achieve higher match rates, reduce wasted advertising spend and accelerate pipeline growth. With a strong understanding of how business buyers behave in channels that have traditionally focused on business-to-consumer activity, they are redefining how business brands scale demand generation and account-based efforts.

About Us

Catalyst Labs is a leading talent agency with a specialized vertical in Applied AI, Machine Learning, and Data Science. We stand out as an agency that is deeply embedded in our clients' recruitment operations.

We collaborate directly with Founders, CTOs, and Heads of AI in these fields who are driving the next wave of applied intelligence, from model optimization to productized AI workflows. We take pride in facilitating conversations that align with your technical expertise, creative problem-solving mindset, and long-term growth trajectory in the evolving world of intelligent systems.

Location: San Francisco

Work type: Full time

Compensation: Above-market base + bonus + equity

Roles & Responsibilities

Lead the design, development and scaling of an end-to-end data platform from ingestion to insights, ensuring that data is fast, reliable and ready for business use.
Build and maintain scalable batch and streaming pipelines, transforming diverse data sources and third-party application programming interfaces into trusted, low-latency systems.
Take full ownership of reliability, cost and service level objectives. This includes achieving 99.9 percent uptime, maintaining minutes-level latency and optimizing cost per terabyte. Conduct root cause analysis and provide long-lasting solutions.
Operate inference pipelines that enrich data. This includes enrichment, scoring and quality assurance using large language models and retrieval-augmented generation. Manage version control, caching and evaluation loops.
Work across teams to deliver data as a product through the creation of clear data contracts, ownership models, lifecycle processes and usage-based decision making.
Guide architectural decisions across the data lake and the entire pipeline stack. Document lineage, trade-offs and reversibility while making practical decisions on whether to build internally or buy externally.
Scale integration with application programming interfaces and internal services while ensuring data consistency, high data quality and support for both real-time and batch-oriented use cases.
Mentor engineers, review code and raise the overall technical standard across teams. Promote data-driven best practices throughout the organization.

Qualifications

Bachelor's or Master's degree in Computer Science, Computer Engineering, Electrical Engineering, or Mathematics.
Excellent written and verbal communication; proactive and collaborative mindset.
Comfortable in hybrid or distributed environments with strong ownership and accountability.
A founder-level bias for action: identify bottlenecks, automate workflows, and iterate rapidly based on measurable outcomes.
Demonstrated ability to teach, mentor, and document technical decisions and schemas clearly.

Core Experience

6 to 12 years of experience building and scaling production-grade data systems, with deep expertise in data architecture, modeling, and pipeline design.
Expert SQL (query optimization on large datasets) and Python skills.
Hands-on experience with distributed data technologies (Spark, Flink, Kafka) and modern orchestration tools (Airflow, Dagster, Prefect).
Familiarity with dbt, DuckDB, and the modern data stack; experience with IaC, CI/CD, and observability.
Exposure to Kubernetes and cloud infrastructure (AWS, GCP, or Azure).
Bonus: Strong Node.js skills for faster onboarding and system integration.
Previous experience at a high-growth startup (10 to 200 people) or early-stage environment with a strong product mindset.

Vacancy posted 3 days ago
