Tech Lead, Data & Inference Engineer

Catalyst Labs, LLC

Sunnyvale, California, United States

About the Job

Our client is a fast-moving, venture-backed advertising technology startup based in San Francisco. They have raised $12 million in funding and are transforming how business-to-business marketers reach their ideal customers. Their identity resolution technology blends business and consumer signals to convert static audience lists into high-match, cross-channel segments without the use of cookies. By transforming first-party and third-party data into precisely targetable audiences across platforms such as Meta, Google and YouTube, they enable marketing teams to achieve higher match rates, reduce wasted advertising spend and accelerate pipeline growth. With a strong understanding of how business buyers behave in channels that have traditionally focused on business-to-consumer activity, they are redefining how business brands scale demand generation and account-based efforts.

About Us

Catalyst Labs is a leading talent agency with a specialized vertical in Applied AI, Machine Learning, and Data Science. We stand out as an agency that’s deeply embedded in our clients’ recruitment operations.

We collaborate directly with Founders, CTOs, and Heads of AI who are driving the next wave of applied intelligence, from model optimization to productized AI workflows. We take pride in facilitating conversations that align with your technical expertise, creative problem-solving mindset, and long-term growth trajectory in the evolving world of intelligent systems.

Location

San Francisco

Work Type

Full Time

Compensation

Above market base + bonus + equity

Roles & Responsibilities
  • Lead the design, development and scaling of an end-to-end data platform, from ingestion to insights, ensuring that data is fast, reliable and ready for business use.
  • Build and maintain scalable batch and streaming pipelines, transforming diverse data sources and third-party APIs into trusted, low-latency systems.
  • Take full ownership of reliability, cost and service level objectives. This includes achieving 99.9% uptime, maintaining minute-level latency and optimizing cost per terabyte. Conduct root cause analysis and provide long-lasting solutions.
  • Operate inference pipelines that enrich data, covering enrichment, scoring and quality assurance with large language models and retrieval-augmented generation. Manage version control, caching and evaluation loops.
  • Work across teams to deliver data as a product through the creation of clear data contracts, ownership models, lifecycle processes and usage-based decision making.
  • Guide architectural decisions across the data lake and the entire pipeline stack. Document lineage, trade-offs and reversibility while making practical decisions on whether to build internally or buy externally.
  • Scale integration with APIs and internal services while ensuring data consistency, high data quality and support for both real-time and batch-oriented use cases.
  • Mentor engineers, review code and raise the overall technical standard across teams. Promote data-driven best practices throughout the organization.
Qualifications
  • Bachelor's or Master's degree in Computer Science, Computer Engineering, Electrical Engineering, or Mathematics.
  • Excellent written and verbal communication; proactive and collaborative mindset.
  • Comfortable in hybrid or distributed environments with strong ownership and accountability.
  • A founder-level bias for action: able to identify bottlenecks, automate workflows, and iterate rapidly based on measurable outcomes.
  • Demonstrated ability to teach, mentor, and document technical decisions and schemas clearly.
Core Experience
  • 6 to 12 years of experience building and scaling production-grade data systems, with deep expertise in data architecture, modeling, and pipeline design.
  • Expert SQL (query optimization on large datasets) and Python skills.
  • Hands-on experience with distributed data technologies (Spark, Flink, Kafka) and modern orchestration tools (Airflow, Dagster, Prefect).
  • Familiarity with dbt, DuckDB, and the modern data stack; experience with IaC, CI/CD, and observability.
  • Exposure to Kubernetes and cloud infrastructure (AWS, GCP, or Azure).
  • Bonus: Strong Node.js skills for faster onboarding and system integration.
  • Previous experience at a high-growth startup (10 to 200 people) or early-stage environment with a strong product mindset.
Vacancy posted 2 days ago