Tech Lead, Data & Inference Engineer
Catalyst Labs, LLC
About the job Tech Lead, Data & Inference Engineer
Our Client
- Lead the design, development and scaling of an end to end data platform from ingestion to insights, ensuring that data is fast, reliable and ready for business use.
- Build and maintain scalable batch and streaming pipelines, transforming diverse data sources and third party application programming interfaces into trusted and low latency systems.
- Take full ownership of reliability, cost and service level objectives. This includes achieving ninety nine point nine percent uptime, maintaining minutes level latency and optimizing cost per terabyte. Conduct root cause analysis and provide long lasting solutions.
- Operate inference pipelines that enhance and enrich data. This includes enrichment, scoring and quality assurance using large language models and retrieval augmented generation. Manage version control, caching and evaluation loops.
- Work across teams to deliver data as a product through the creation of clear data contracts, ownership models, lifecycle processes and usage based decision making.
- Guide architectural decisions across the data lake and the entire pipeline stack. Document lineage, trade offs and reversibility while making practical decisions on whether to build internally or buy externally.
- Scale integration with application programming interfaces and internal services while ensuring data consistency, high data quality and support for both real time and batch oriented use cases.
- Mentor engineers, review code and raise the overall technical standard across teams. Promote data driven best practices throughout the organization.
- Bachelors or Masters degree in Computer Science, Computer Engineering, Electrical Engineering, or Mathematics.
- Excellent written and verbal communication; proactive and collaborative mindset.
- Comfortable in hybrid or distributed environments with strong ownership and accountability.
- A founder-level bias for actionable to identify bottlenecks, automate workflows, and iterate rapidly based on measurable outcomes.
- Demonstrated ability to teach, mentor, and document technical decisions and schemas clearly.
- 6 to 12 years of experience building and scaling production-grade data systems, with deep expertise in data architecture, modeling, and pipeline design.
- Expert SQL (query optimization on large datasets) and Python skills.
- Hands-on experience with distributed data technologies (Spark, Flink, Kafka) and modern orchestration tools (Airflow, Dagster, Prefect).
- Familiarity with dbt, DuckDB, and the modern data stack; experience with IaC, CI/CD, and observability.
- Exposure to Kubernetes and cloud infrastructure (AWS, GCP, or Azure).
- Bonus: Strong Node.js skills for faster onboarding and system integration.
- Previous experience at a high-growth startup (10 to 200 people) or early-stage environment with a strong product mindset.
Vacancy posted 3 days ago
Similar jobs that could be interesting for youBased on the Tech Lead, Data & Inference Engineer in Palo Alto, CA vacancy
$190k - $250k
A leading financial technology firm in California seeks an AI Inference engineer to join its team. The role involves developing APIs for AI inference, improving system reliability, and optimizing LLM performance. Required qualifications include experience with ML systems...Suggested$120.1k - $225.7k
...Business Unit What the Role Entails End-to-End Inference Optimization: Lead the optimization of the full inference pipeline for Large... ...: Master's or Ph.D. in Computer Science, Electronic Engineering, AI, or related fields; significant professional experience...SuggestedRelocation package$119.8k - $234.7k
..., where we are building theAI data-planethat powersall LLMinferencing... ...AI fabricdelivers inference capabilities for all LLMs inMicrosoft... ...and more. As a Senior Software Engineer , you will shape the future... ...cross-functional partners. Lead through architecture, code reviews...SuggestedOngoing contractLocal area$128.7k - $261.3k
...About the Team The Model Deployment & Inference Solutions team in GM AV deploys machine... ...workflows currently performed manually by engineers. Build the developer experience that... ...and we embrace the responsibility to lead the change that will make our world better...SuggestedLocal areaRemote workWork from homeRelocation packageFlexible hoursShift work$248.71k - $292.6k
About Groq Groq delivers fast, efficient AI inference. Our LPU-based system powers GroqCloud™, giving businesses and developers the speed... ...reach, anything is possible. Build fast. Sr. Staff Software Engineer - High Performance GPU Inference Systems Mission Push the...Suggested$251k - $310k
...Tech Lead Manager, Data Engineer Waymo is an autonomous driving technology company with the mission to be the world's most trusted driver. Since its start as the Google Self-Driving Car Project in 2009, Waymo has focused on building the Waymo Driver—The World's Most...Full timeRemote work$155.42k - $395.9k
...Description About the Team: The ML Inference Platform is part of the AV ML... ...a Senior ML Infrastructure engineer to help build and scale robust... ..., for workflows such as data mining, labeling, model distillation... ...deliver incremental value. Lead technical decision‑making on model...Local areaRemote workRelocationRelocation packageFlexible hours- Israelvcforum is looking for a Senior ML Infrastructure Engineer in Mountain View, California. This position aims to build and scale robust platforms for ML inference workflows supporting GM’s AI efforts. You will collaborate with ML engineers and researchers to implement...Remote job
- ...company in Mountain View is seeking a Machine Learning Engineer to build and optimize the infrastructure for its Intelligence... ...designing and deploying ML models for multimodal data understanding, optimizing inference pipelines, and collaborating with teams to productionize...
- A cutting-edge robotics company in California seeks an ML Infrastructure Engineer to build and operate inference systems for their automation stack. Responsibilities include maintaining infrastructure for model inference, optimizing performance, and collaborating with research...
- ...Inference Optimization MLE At Rhoda AI, we're building the next generation of generalist intelligent robots. We own the full robotics... ...across model versions Collaborate closely with research engineers to translate model innovations into optimized, deployment-ready...
$236k - $339.25k
...redefine the future of how work gets done. Build the future of data. Join the Snowflake team. The Snowflake Machine Learning... ...and/or platforms. Experience in serving LLMs using inference engines like vLLM, TensorRT-LLM, TEI, SGLang, and knowing tradeoffs between...Flexible hours$278.1k - $347.6k
...Mountain View, CA, USA Principal Machine Learning Engineer, Mobile AI Inference Optimization Location Mountain View, CA, USA Department... ...variants). Team & Cross-Functional Leadership: Lead and mentor a team of ML engineers; define engineering best...Work at officeWorldwideRelocation package$94.35 - $125.03 per hour
...and explore our current job openings. Your best is waiting to be discovered. Day - 08 Hour (United States of America) The Lead Data Engineer will be part of a team building Stanford Health Care's (SHC) solutions incorporating Artificial Intelligence including...Hourly payWork experience placement- We’re looking for a Senior Engineer to help build the next-generation inference platform that supports embedding models used for semantic search, retrieval, and... ...AI foundation of the world’s most popular developer data platform Collaborate with ML researchers from Voyage....Local areaWorldwide
- SambaNova Systems in Palo Alto, California is seeking a Software Engineer to build and scale the AI inference platform. This role involves working with a cloud-agnostic platform, ensuring system stability, and collaborating with cross-functional teams. Candidates should...
- A leading data platform company in Palo Alto seeks a Senior Engineer to develop a cutting-edge inference platform supporting semantic search and AI-native experiences. The ideal candidate will have over five years of experience in backend systems and proficiency in languages...
$124k - $195.5k
## AI Inference Performance Engineer - New College Grad 2026Applylocations: US, CA, Santa Claratime type: Full... ...GPU roadmaps based on real workload data.* Technical Leadership: Raise the... ...execution on tight benchmark timelines, and lead a world-class team.**What We Need To...- A leading technology company is seeking a Senior System Software Engineer to develop GPU-accelerated AI inference serving software. The ideal candidate will have over 5 years of experience with deep learning software, strong skills in Rust and C++, and a collaborative approach...
$124k - $195.5k
NVIDIA Corporation is seeking an AI Inference Performance Engineer - New College Grad 2026 in Santa Clara. This role involves optimizing AI inference benchmarks using NVIDIA’s accelerators and working with various teams on performance enhancements. Applicants should have...$152k - $241.5k
NVIDIA Gruppe is seeking a talented individual to optimize and benchmark GenAI inference using the latest acceleration technologies. The role involves driving industry benchmark results and architecting distributed inference systems. Required qualifications include a relevant...$152k - $241.5k
...for an AI & Deep Learning Compiler Engineer. NVIDIA is hiring software engineers... ...has been the backbone of NVIDIA’s inference engine, spanning across data centers, personal devices, automotive... .... The compiler must deliver leading inference performance, fast build time...$164k - $282k
Dormont Manufacturing Co is seeking a skilled Data Engineer to enhance data systems for optimal search experiences. You will manage analytics and ETL/ELT systems and work with data quality and scalability. The ideal candidate will have at least 6 years of industry experience...$187.5k - $395k
...Ship new model architectures by integrating them into our inference engine Collaborate closely across research, engineering and infrastructure... ...to trace the end-to-end lifetime of any inference workload Tech stack Must have Python Redis S3-compatible Storage...- ...generation computing experiences-from AI and data centers, to PCs, gaming and embedded... ...ROLE: As a senior member of the LLM inference framework team, you will be responsible for... ...sits at the intersection of inference engines, distributed systems, and GPU runtime and...
- ...generation computing experiences-from AI and data centers, to PCs, gaming and embedded... ...RL training and SOTA LLM and Multimodal inference at scale across multi-GPU and multi-node... ...THE PERSON: Skilled engineer with strong technical and analytical expertise...
$139k - $204k
...Senior Software Engineer I, Inference Sunnyvale, CA / Bellevue, WA CoreWeave is The Essential Cloud... ...scale AI with confidence. Trusted by leading AI labs, startups, and global enterprises... ...lunch each day in our office and data center locations ~ A casual work environment...Permanent employmentTemporary workCasual workWork at officeRemote workFlexible hoursShift work$152k - $204k
...Senior Software Engineer, Inference Sunnyvale, CA / Bellevue, WA CoreWeave is The Essential Cloud... ...scale AI with confidence. Trusted by leading AI labs, startups, and global enterprises... ...lunch each day in our office and data center locations ~ A casual work environment...Permanent employmentTemporary workCasual workWork at officeFlexible hoursShift work$92k - $135k
...with confidence. Trusted by leading AI labs, startups, and global... ...What You'll Do: Join the Inference team to ship production features... ...mentorship from experienced engineers. About the role: Implement... .... Foundations in data structures, algorithms, and networked...Permanent employmentTemporary workCasual workInternshipWork at officeFlexible hours- ...NU), we combine proprietary technology, data intelligence, and an efficient operating... ...financial products. As a Machine Learning Engineer in AI Core, Data Intelligence, you’ll work... ...infrastructure, from ingestion to model deployment. Lead technical initiatives that improve our...Work from homeRelocation packageFlexible hours
Do you want to receive more vacancies?
Subscribe and receive similar vacancies to Tech Lead, Data & Inference Engineer. Be the first to apply!
Related searches


