
Tech Lead, Data & Inference Engineer

Catalyst Labs, LLC

About the job: Tech Lead, Data & Inference Engineer


Our Client

A fast-moving, venture-backed advertising technology startup based in San Francisco. They have raised $12 million in funding and are transforming how business-to-business (B2B) marketers reach their ideal customers. Their identity resolution technology blends business and consumer signals to convert static audience lists into high-match, cross-channel segments without the use of cookies. By transforming first-party and third-party data into precision-targetable audiences across platforms such as Meta, Google and YouTube, they enable marketing teams to achieve higher match rates, reduce wasted advertising spend and accelerate pipeline growth. With a strong understanding of how business buyers behave in channels that have traditionally focused on business-to-consumer activity, they are redefining how B2B brands scale demand generation and account-based efforts.

About Us

Catalyst Labs is a leading talent agency with a specialized vertical in Applied AI, Machine Learning, and Data Science. We stand out as an agency that's deeply embedded in our clients' recruitment operations.

We collaborate directly with Founders, CTOs, and Heads of AI who are driving the next wave of applied intelligence, from model optimization to productized AI workflows. We take pride in facilitating conversations that align with your technical expertise, creative problem-solving mindset, and long-term growth trajectory in the evolving world of intelligent systems.

Location: San Francisco

Work type: Full time

Compensation: above-market base + bonus + equity

Roles & Responsibilities
  • Lead the design, development and scaling of an end-to-end data platform, from ingestion to insights, ensuring that data is fast, reliable and ready for business use.
  • Build and maintain scalable batch and streaming pipelines, transforming diverse data sources and third-party APIs into trusted, low-latency systems.
  • Take full ownership of reliability, cost and service level objectives (SLOs). This includes achieving 99.9% uptime, maintaining minute-level latency and optimizing cost per terabyte. Conduct root cause analysis and deliver long-lasting solutions.
  • Operate inference pipelines that enrich data, including scoring and quality assurance using large language models (LLMs) and retrieval-augmented generation (RAG). Manage version control, caching and evaluation loops.
  • Work across teams to deliver data as a product by creating clear data contracts, ownership models, lifecycle processes and usage-based decision making.
  • Guide architectural decisions across the data lake and the entire pipeline stack. Document lineage, trade-offs and reversibility while making practical build-versus-buy decisions.
  • Scale integration with APIs and internal services while ensuring data consistency, high data quality and support for both real-time and batch-oriented use cases.
  • Mentor engineers, review code and raise the overall technical standard across teams. Promote data-driven best practices throughout the organization.
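
For a sense of scale, the 99.9% uptime target above can be turned into a downtime budget. The sketch below is illustrative only: the posting does not say over what window uptime is measured, so the 30-day and 365-day windows are assumptions, and `downtime_budget_minutes` is a hypothetical helper name.

```python
def downtime_budget_minutes(slo: float, period_days: float) -> float:
    """Return the minutes of downtime an availability SLO allows in a period.

    slo is a fraction (0.999 means 99.9%); period_days is the assumed
    measurement window, which the posting does not specify.
    """
    if not 0.0 < slo <= 1.0:
        raise ValueError("slo must be in (0, 1]")
    total_minutes = period_days * 24 * 60
    return total_minutes * (1.0 - slo)


# Assumed windows: a 30-day month and a 365-day year.
monthly = downtime_budget_minutes(0.999, 30)
yearly = downtime_budget_minutes(0.999, 365)

print(f"{monthly:.1f} minutes per 30-day month")
print(f"{yearly / 60:.2f} hours per 365-day year")
```

Under these assumptions, 99.9% uptime leaves roughly 43 minutes of downtime per month, or just under 9 hours per year.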
Qualifications
  • Bachelor's or Master's degree in Computer Science, Computer Engineering, Electrical Engineering, or Mathematics.
  • Excellent written and verbal communication; proactive and collaborative mindset.
  • Comfortable in hybrid or distributed environments with strong ownership and accountability.
  • A founder-level bias for action: able to identify bottlenecks, automate workflows, and iterate rapidly based on measurable outcomes.
  • Demonstrated ability to teach, mentor, and document technical decisions and schemas clearly.
Core Experience
  • 6 to 12 years of experience building and scaling production-grade data systems, with deep expertise in data architecture, modeling, and pipeline design.
  • Expert SQL (query optimization on large datasets) and Python skills.
  • Hands-on experience with distributed data technologies (Spark, Flink, Kafka) and modern orchestration tools (Airflow, Dagster, Prefect).
  • Familiarity with dbt, DuckDB, and the modern data stack; experience with IaC, CI/CD, and observability.
  • Exposure to Kubernetes and cloud infrastructure (AWS, GCP, or Azure).
  • Bonus: Strong Node.js skills for faster onboarding and system integration.
  • Previous experience at a high-growth startup (10 to 200 people) or early-stage environment with a strong product mindset.
Vacancy posted 3 days ago