Tech Lead, Data & Inference Engineer
Catalyst Labs, LLC
Our Client
A fast-moving, venture-backed advertising technology startup based in San Francisco. They have raised twelve million dollars in funding and are transforming how business-to-business marketers reach their ideal customers. Their identity resolution technology blends business and consumer signals to convert static audience lists into high-match, cross-channel segments without the use of cookies. By transforming first-party and third-party data into precisely targetable audiences across platforms such as Meta, Google and YouTube, they enable marketing teams to achieve higher match rates, reduce wasted advertising spend and accelerate pipeline growth. With a strong understanding of how business buyers behave in channels that have traditionally focused on business-to-consumer activity, they are redefining how business brands scale demand generation and account-based efforts.
About Us
Catalyst Labs is a leading talent agency with a specialized vertical in Applied AI, Machine Learning, and Data Science. We stand out as an agency that is deeply embedded in our clients' recruitment operations.
We collaborate directly with Founders, CTOs, and Heads of AI in these fields who are driving the next wave of applied intelligence, from model optimization to productized AI workflows. We take pride in facilitating conversations that align with your technical expertise, creative problem-solving mindset, and long-term growth trajectory in the evolving world of intelligent systems.
Location
San Francisco
Work type
Full Time
Compensation
Above-market base + bonus + equity
Roles & Responsibilities
- Lead the design, development and scaling of an end-to-end data platform from ingestion to insights, ensuring that data is fast, reliable and ready for business use.
- Build and maintain scalable batch and streaming pipelines, transforming diverse data sources and third-party application programming interfaces into trusted, low-latency systems.
- Take full ownership of reliability, cost and service level objectives. This includes achieving 99.9% uptime, maintaining minute-level latency and optimizing cost per terabyte. Conduct root cause analysis and deliver lasting fixes.
- Operate inference pipelines that enrich data, covering enrichment, scoring and quality assurance with large language models and retrieval-augmented generation. Manage version control, caching and evaluation loops.
- Work across teams to deliver data as a product through clear data contracts, ownership models, lifecycle processes and usage-based decision making.
- Guide architectural decisions across the data lake and the entire pipeline stack. Document lineage, trade-offs and reversibility while making practical build-versus-buy decisions.
- Scale integration with application programming interfaces and internal services while ensuring data consistency, high data quality and support for both real-time and batch-oriented use cases.
- Mentor engineers, review code and raise the overall technical standard across teams. Promote data-driven best practices throughout the organization.
Qualifications
- Bachelor's or Master's degree in Computer Science, Computer Engineering, Electrical Engineering, or Mathematics.
- Excellent written and verbal communication; proactive and collaborative mindset.
- Comfortable in hybrid or distributed environments with strong ownership and accountability.
- A founder-level bias for action: able to identify bottlenecks, automate workflows, and iterate rapidly based on measurable outcomes.
- Demonstrated ability to teach, mentor, and document technical decisions and schemas clearly.
Core Experience
- 6 to 12 years of experience building and scaling production-grade data systems, with deep expertise in data architecture, modeling, and pipeline design.
- Expert SQL (query optimization on large datasets) and Python skills.
- Hands-on experience with distributed data technologies (Spark, Flink, Kafka) and modern orchestration tools (Airflow, Dagster, Prefect).
- Familiarity with dbt, DuckDB, and the modern data stack; experience with IaC, CI/CD, and observability.
- Exposure to Kubernetes and cloud infrastructure (AWS, GCP, or Azure).
- Bonus: Strong Node.js skills for faster onboarding and system integration.
- Previous experience at a high-growth startup (10 to 200 people) or early-stage environment with a strong product mindset.

