Sign up to access all features of our service.
  • Job search
  • Favorites
  • Create a CV
    New
  • Salaries
  • Subscriptions

Tech Lead, Data & Inference Engineer

Catalyst Labs, LLC

Our Client

A fast moving and venture backed advertising technology startup based in San Francisco. They have raised twelve million dollars in funding and are transforming how business to business marketers reach their ideal customers. Their identity resolution technology blends business and consumer signals to convert static audience lists into high match and cross channel segments without the use of cookies. By transforming first party and third party data into precision targetable audiences across platforms such as Meta, Google and YouTube, they enable marketing teams to reach higher match rates, reduce wasted advertising spend and accelerate pipeline growth. With a strong understanding of how business buyers behave in channels that have traditionally been focused on business to consumer activity, they are redefining how business brands scale demand generation and account based efforts.

About Us

Catalyst Labs is a leading talent agency with a specialized vertical in Applied AI, Machine Learning, and Data Science. We stand out as an agency thats deeply embedded in our clients recruitment operations.

We collaborate directly with Founders, CTOs, and Heads of AI in those themes who are driving the next wave of applied intelligence from model optimization to productized AI workflows. We take pride in facilitating conversations that align with your technical expertise, creative problem-solving mindset, and long-term growth trajectory in the evolving world of intelligent systems.

Location

San Francisco

Work type

Full Time,

Compensation

above market base + bonus + equity

Roles & Responsibilities
  • Lead the design, development and scaling of an end to end data platform from ingestion to insights, ensuring that data is fast, reliable and ready for business use.
  • Build and maintain scalable batch and streaming pipelines, transforming diverse data sources and third party application programming interfaces into trusted and low latency systems.
  • Take full ownership of reliability, cost and service level objectives. This includes achieving ninety nine point nine percent uptime, maintaining minutes level latency and optimizing cost per terabyte. Conduct root cause analysis and provide long lasting solutions.
  • Operate inference pipelines that enhance and enrich data. This includes enrichment, scoring and quality assurance using large language models and retrieval augmented generation. Manage version control, caching and evaluation loops.
  • Work across teams to deliver data as a product through the creation of clear data contracts, ownership models, lifecycle processes and usage based decision making.
  • Guide architectural decisions across the data lake and the entire pipeline stack. Document lineage, trade offs and reversibility while making practical decisions on whether to build internally or buy externally.
  • Scale integration with application programming interfaces and internal services while ensuring data consistency, high data quality and support for both real time and batch oriented use cases.
  • Mentor engineers, review code and raise the overall technical standard across teams. Promote data driven best practices throughout the organization.
Qualifications
  • Bachelors or Masters degree in Computer Science, Computer Engineering, Electrical Engineering, or Mathematics.
  • Excellent written and verbal communication; proactive and collaborative mindset.
  • Comfortable in hybrid or distributed environments with strong ownership and accountability.
  • A founder-level bias for actionable to identify bottlenecks, automate workflows, and iterate rapidly based on measurable outcomes.
  • Demonstrated ability to teach, mentor, and document technical decisions and schemas clearly.
Core Experience
  • 6 to 12 years of experience building and scaling production-grade data systems, with deep expertise in data architecture, modeling, and pipeline design.
  • Expert SQL (query optimization on large datasets) and Python skills.
  • Hands-on experience with distributed data technologies (Spark, Flink, Kafka) and modern orchestration tools (Airflow, Dagster, Prefect).
  • Familiarity with dbt, DuckDB, and the modern data stack; experience with IaC, CI/CD, and observability.
  • Exposure to Kubernetes and cloud infrastructure (AWS, GCP, or Azure).
  • Bonus: Strong Node.js skills for faster onboarding and system integration.
  • Previous experience at a high-growth startup (10 to 200 people) or early-stage environment with a strong product mindset.
#J-18808-Ljbffr
Vacancy posted 3 days ago
Similar jobs that could be interesting for youBased on the Tech Lead, Data & Inference Engineer in San Francisco, CA vacancy
  • $380k

     ...societal benefit. About the Role We're looking for a GPU Inference Engineer to contribute to improvements in model serving efficiency for...  ...system efficiency Drive optimizations from a kernel and data movement perspective to improve system throughput and... 
    Suggested
    Work at office
    Relocation package

    OpenAI

    San Francisco, CA
    1 day ago
  •  ...About the Role As a Technical Lead on the Future of Computing...  .... Build and lead a team of engineers responsible for implementing the low-level inference stack, including kernel development...  ...your possession (including the data contained therein) upon termination... 
    Suggested
    Work at office
    Relocation package

    OpenAI

    San Francisco, CA
    4 days ago
  • $167.2k - $209k

    A leading cloud service provider is seeking a Senior Engineer 2 for their AI Inference Data Plane team. This remote role focuses on designing and developing high-scale, resilient data plane services that enhance AI-driven applications. The ideal candidate will have strong... 
    Suggested
    Remote job

    DigitalOcean

    San Francisco, CA
    3 days ago
  •  ...Staff Technical Lead for Inference & ML Performance San Francisco fal is the generative media ecosystem powering the next generation of...  ...This Role Matters You'll shape the future of fal's inference engine and ensure our generative models achieve best-in-class... 
    Suggested

    Fal

    San Francisco, CA
    3 days ago
  • A tech company specializing in AI infrastructure is seeking a skilled professional to build scalable infrastructure for AI model training and inference. You will lead architectural decisions and work with core systems that power their GPU optimization platform. Candidates... 
    Suggested

    Wafer

    San Francisco, CA
    2 days ago
  • $139.2k - $174k

     ...applications. We are seeking a Senior Engineer 2 to play a key role in our AI...  ...for running AI workloads— inference, training, fine‑tuning— at...  .... Operational Excellence: Lead the operational strategy for critical...  ...position is based on market data, relevant years of experience,... 
    Local area
    Remote work
    Worldwide
    Flexible hours

    DigitalOcean

    San Francisco, CA
    3 days ago
  • $175k - $225k

     ...led by veteran operators and engineers, alumni of Sonos, Paypal, Tesla...  ...participation from other leading venture capital firms. The...  ...We're looking for an AI Inference Engineer who lives at the boundary...  ...software synergy. Work with raw data from cameras and LiDAR to... 
    Local area
    Remote work

    Sauron

    San Francisco, CA
    5 days ago
  •  ...that turn raw compute into useful intelligence - the inference services that serve LLMs at scale and the data pipelines that feed them. One week you're hunting...  ...keeps you honest about both. Researchers and ML engineers will hand you workloads that barely run; you'll hand... 
    Flexible hours

    Adaption

    San Francisco, CA
    14 days ago
  • $350k

     ...A leading AI research organization seeks an Infrastructure Research Engineer in San Francisco to optimize and scale systems powering large AI models. This role emphasizes enhancing inference speed, reliability, and cost-effectiveness. Ideal candidates possess a Bachelor... 
    Visa sponsorship

    Thinking Machines Lab Inc.

    San Francisco, CA
    3 days ago
  •  ...An innovative studio is seeking an AI Infrastructure Engineer to enhance their ML infrastructure for groundbreaking anime games. This role involves designing and implementing cutting-edge inference architectures to support various platforms. As part of a small, agile... 
    Worldwide

    Spellbrush

    San Francisco, CA
    3 days ago
  • $220k

    Perplexity is looking for an engineer to join their team in San Francisco. You will work on building and operating the inference engine, supporting new models, migrating GPU kernels, and developing a Rust-based serving runtime. The ideal candidate has 3+ years of experience... 

    Perplexity

    San Francisco, CA
    3 days ago
  • A leading cloud infrastructure company is seeking a Senior Engineer 2 to join their AI Inference Optimization team. The role involves leading the technical strategy for performance architecture and addressing complex performance issues ensuring industry-leading service... 
    Remote job

    DigitalOcean

    San Francisco, CA
    4 days ago
  • Fathom is seeking a Model Performance Engineer in San Francisco to optimize the speed, cost, and reliability of its model inference stack while building fine-tuning infrastructure. The ideal candidate will have extensive experience with LLM frameworks, quantization techniques... 

    Fathom

    San Francisco, CA
    1 day ago
  • $100k - $300k

     ...failing. We believe massive scale through data-driven machine learning is the key to...  ...Overview We are looking for a Software Engineer to work at the forefront of deploying our...  ...You will be responsible for optimizing AI inference processes from lightweight to billion-... 
    Work at office

    Skild

    San Francisco, CA
    5 days ago
  •  ...Skild AI is searching for a passionate Software Engineer to enhance AI models and ensure optimal performance of robotic systems. In this role, you will develop cutting-edge AI inference processes, tackling challenges of efficiency in diverse real-world scenarios. Ideal... 

    Skild

    San Francisco, CA
    3 days ago
  • $172.5k - $260.1k

     ...Here, ambition meets action. Tech meets trust. And innovation isn...  ...your career at the company leading workforce transformation in the...  ....Salesforce is looking for a Data Engineer to join the Data & Analytics...  ...Customer Success data.Build Inference Infrastructure — Partner with... 

    Salesforce

    San Francisco, CA
    3 days ago
  •  ...About the Job We are seeking a highly technical Inference Engine Engineer to optimize the performance and efficiency of our core inference engine. In this role, you will focus on designing, implementing, and optimizing GPU kernels and supporting infrastructure for... 
    Worldwide
    Flexible hours

    FriendliAI Corp

    San Francisco, CA
    2 days ago
  • $142.2k - $204.6k

     ...P-1284 About This Role As a software engineer for GenAI inference, you will help design, develop, and optimize the inference engine that powers...  ...00-$204,600 USD About Databricks Databricks is the data and AI company. More than 10,000 organizations worldwide -... 
    Local area
    Worldwide

    Databricks

    San Francisco, CA
    3 days ago
  •  ...most persistent challenges in data infrastructure: extracting accurate...  ...a small, fast-growing team of engineers in San Francisco powering...  ...growing quickly. What makes our tech special is our multi-stage...  ...low-latency, high-throughput inference for OCR and multimodal models.... 
    Work at office
    Visa sponsorship
    Relocation package

    PULSE

    San Francisco, CA
    3 days ago
  • $187.5k - $395k

     ...Software Engineer, Inference Luma's mission is to build multimodal AI to expand human imagination and capabilities. We believe that multimodality...  ...trace the end-to-end lifetime of any inference workload Tech stack Must have Python Redis S3-compatible... 

    Luma AI

    San Francisco, CA
    1 day ago
  • $325k

     ...About the Team Our Inference team brings OpenAI's most capable research and technology...  ...inference. About the Role We're hiring engineers to scale and optimize OpenAI's inference...  ...in your possession (including the data contained therein) upon termination of employment... 

    OpenAI

    San Francisco, CA
    3 days ago
  • An innovative AI company is seeking a Software Engineer to develop infrastructure that supports AI training and inference workflows. This role requires strong object-oriented...  ...programming skills and a solid foundation in data structures and algorithms. The ideal candidate... 

    SpreeAI

    San Francisco, CA
    4 days ago
  • ABOUT BASETEN Baseten powers mission‑critical inference for the world's most dynamic AI companies, like Cursor, Notion, OpenEvidence,...  ...Greylock, and Conviction. Join us and help build the platform engineers turn to to ship AI products. THE ROLE As an Applied AI Inference... 
    Work experience placement
    Flexible hours

    Baseten

    San Francisco, CA
    4 days ago
  •  ...Inference Engine Engineer We build and run the inference engine behind every Perplexity query and deploy dozens of model architectures at scale with tight latency and cost budgets. Our stack is Rust, Python, CUDA, and CuTe DSL - and we need another engineer to join... 

    Perplexity AI

    San Francisco, CA
    10 days ago
  • $230k - $265k

     ...Parafin is seeking a Software Engineer to lead the evolution of their ML Platform, ensuring robust and scalable systems for data scientists. The role requires 5+ years of software...  ...platform functionalities, enhance real-time inference processes, and collaborate across teams... 
    Remote work

    Parafin Inc

    San Francisco, CA
    4 days ago
  • $170k - $216k

     ...services and tools for a broad range of customers Software Engineers, Product, Data Science, System Engineering, and more. So if you want to...  ...Engineering Manager. You will: Build and evolve ML inference infrastructure for simulations. Be responsible for the... 
    Full time
    Remote work

    Waymo

    San Francisco, CA
    4 days ago
  • $295k

     ...About the Team Our Inference team brings OpenAI's most capable research and technology...  ...About the Role We are looking for an engineer who wants to take the world's largest and...  ...hardware in your possession (including the data contained therein) upon termination of... 

    OpenAI

    San Francisco, CA
    3 days ago
  •  ...Role We are hiring Software Engineers focused on AI Infrastructure...  ...orchestration, large-scale inference systems, performance optimization...  ..., Go, or similar). Strong data structures and algorithms foundations...  ...at the forefront of fashion-tech innovation. Your design work... 
    Internship
    Immediate start

    SpreeAI

    San Francisco, CA
    4 days ago
  • A dynamic AI company in San Francisco is looking for an Applied AI Inference Engineer to develop and deploy high-scale production AI applications. You will partner with customers to transform business goals into reliable services while engaging in software development... 
    Flexible hours

    Baseten

    San Francisco, CA
    4 days ago
  • A leading AI research firm in San Francisco is seeking a Technical Lead to join its Future of Computing Research team. This role involves evaluating silicon platforms and optimizing model architectures while working in a hybrid model. Ideal candidates have expertise in... 
    Relocation package

    OpenAI

    San Francisco, CA
    3 days ago

Do you want to receive more vacancies?

Subscribe and receive similar vacancies to Tech Lead, Data & Inference Engineer. Be the first to apply!