Lead Software Engineer, Model Serving Platform

Sciforium

Sciforium's Next-Generation Model Serving Platform Architect

Sciforium is an AI infrastructure company developing next-generation multimodal AI models and a proprietary, high-efficiency serving platform. Backed by multi-million-dollar funding and direct sponsorship from AMD with hands-on support from AMD engineers the team is scaling rapidly to build the full stack powering frontier AI models and real-time applications.

About the Role

This is a rare chance to help architect and lead the development of Sciforium's next-generation model serving platform, the high-performance engine that will bring a multimodal, highly efficient foundation model to market. As a senior technical leader, you'll not only build core components yourself but also guide and mentor other engineers, influencing engineering direction, standards, and execution quality.

You will learn and shape the full AI stack: from GPU kernels and quantized execution paths to distributed serving, scheduling, and the APIs that power real-time AI applications. If you enjoy deep systems work, thrive on ownership, and want to lead engineers in building foundational AI infrastructure, this role puts you at the center of SciForium's mission and growth.

What You'll Do

Lead the technical direction of the model serving platform, owning architecture decisions and guiding engineering execution.
Build core serving components including execution runtimes, batching, scheduling, and distributed inference systems.
Develop high-performance C++ and CUDA/HIP modules, including custom GPU kernels and memory-optimized runtimes.
Collaborate with ML researchers to productionize new multimodal models and ensure low-latency, scalable inference.
Build Python APIs and services that expose model capabilities to downstream applications.
Mentor and support other engineers through code reviews, design discussions, and hands-on technical guidance.
Drive performance profiling, benchmarking, and observability across the inference stack.
Ensure high reliability and maintainability through testing, monitoring, and engineering best practices.
Troubleshoot and resolve complex issues across GPU, runtime, and service layers.

Ideal Candidate Profile

Bachelor's degree in Computer Science, Computer Engineering, Electrical Engineering, or equivalent practical experience
5+ years of experience designing and building scalable, reliable backend systems or distributed infrastructure.
Strong understanding of LLM inference mechanics (prefill vs decode, batching, KV cache)
Experience with Kubernetes/Ray, Containerization
Strong proficiency in C++, Python.
Strong debugging, profiling, and performance optimization skills at the system level.
Ability to collaborate closely with ML researchers and translate model or runtime requirements into production-grade systems.
Effective communication skills and the ability to lead technical discussions, mentor engineers, and drive engineering quality.
Comfortable working from the office and contributing to a fast-moving, high-ownership team culture.

Nice-to-Have

Experience with ML systems engineering, distributed GPU scheduling, open source inference engine like vLLM, Sglang, or TRT-LLM
Experience in building large scale ML/MLOps infrastructure
Proficiency in CUDA or ROCm and experience with GPU profiling tools
Experience at an AI/ML startup, research lab, or Big Tech infrastructure/ML team.
Familiarity with multimodal model architectures, raw-byte models, or efficient inference techniques.
Contributions to open-source ML or HPC infrastructure

Benefits Include

Medical, dental, and vision insurance
401k plan
Daily lunch, snacks, and beverages
Flexible time off
Competitive salary and equity

Equal Opportunity

Sciforium is an equal opportunity employer. All applicants will be considered for employment without attention to race, color, religion, sex, sexual orientation, gender identity, national origin, veteran or disability status.

Apply

Vacancy posted 4 days ago

Similar jobs that could be interesting for youBased on the Lead Software Engineer, Model Serving Platform in San Francisco, CA vacancy

Model Implementation Engineer
...Model Implementation Engineer Sciforium is an AI infrastructure company developing next-generation multimodal AI models and a proprietary, high-efficiency serving platform. Backed by multi-million-dollar funding and direct sponsorship from AMD with hands-on support...
Platform
Flexible hours
Sciforium
San Francisco, CA
9 days ago
Engineering Manager, Model Library
...frontier of AI to bring cutting‑edge models into production. We’re growing... .... Join us and help build the platform engineers turn to ship AI products. THE ROLE You’ll lead the Model Library team at... ...Experience building or supporting self‑serve workflows. NICE TO HAVE...
Platform
Flexible hours
Baseten
San Francisco, CA
5 days ago
Engineering Manager, Model Routing & Inference Engineering · · San Francisco Apply →
...research, design, and engineering. Our organization is... ...About the Role You will lead the Model Routing & Inference... ...owning the inference platform that powers every AI... ...especially in inference serving, traffic routing, or... .... You have strong software engineering fundamentals...
Platform
Anysphere
San Francisco, CA
5 days ago
Model Behavior Engineer
$98k - $140k
...with product and engineering teams to build systems... ...'t a traditional software engineering role.... ...ll shape Notion’s model strategy and work... ...launch new models with leading research labs —... ...and eval platforms (e.g., Braintrust)... ...data — You can self‑serve insights from large...
Platform
Live in
Work at office
Local area
Notion
San Francisco, CA
2 days ago
ML Model Serving Engineer
...spanning hardware and software. Join us in... ...Turbocharge our serving layer, consisting... ...speech, and vision models. Partner with ML... ...infrastructure and training engineers to build a fast,... ...a public cloud platform such as GCP, AWS,... ...a track record of leading complex multi-month...
Platform
Full time
Contract work
Flexible hours
SESAME
San Francisco, CA
4 days ago
Engineering Manager, Model Inference
...world-class. We’re looking for an Engineering Manager to lead and grow our Model Inference team. The Inference team... ...direction of how our models are served: from architecting low-latency, high... ...ML Research and the broader AI Platform, and ensure the systems underpinning...
Platform
Hourly pay
Full time
Flexible hours
AI Chopping Block, Inc.
San Francisco, CA
2 days ago
Manager, Product Management, Gen AI Model Gateway
$144k - $164k
...Product Management, Gen AI Model Gateway At Capital One... ...Generative AI Platform team is at the forefront... ...prototyping, development, and serving). The FM Gateway... ...to influence and lead. Basic Qualifications:... ...analysis, data science, or software engineering. Preferred...
Platform
Full time
Part time
Local area
Capital One National Association
San Francisco, CA
2 days ago
Senior Manager, Engineering - Model Serving
$217k - $312.2k
...'s best data and AI infrastructure platform so our customers can use deep data... ...improve their business. Databricks’ Model Serving product provides enterprises with a... ...and cost efficiency. As a Senior Engineering Manager, you will lead the team owning both the product experience...
Platform
Local area
I did my part and supported the Regular Toilet
San Francisco, CA
5 days ago
Model Validation Manager - Balance Sheet Modeling
$170.26k - $200.3k
...Helping the customers and businesses we serve to make better and smarter financial... ...U.S. Bank is seeking an experienced Model Validation Manager to lead validation efforts for our Balance... .... ~ Familiarity with vendor platforms such as: QRM, Polypaths, Yield Book,...
Platform
Temporary work
Local area
3 days per week
U.S. Bank
San Francisco, CA
2 days ago
Senior Software Engineer - Model Performance
$220k - $320k
...specialized language models for companies that need... ...up to 90% cheaper. Our platform handles everything end... ...ten-person team of engineers who work in-person in... ...founded and run their own software companies. We are high... ...with the goal of serving models faster and cheaper...
Platform
Work at office
Inference
San Francisco, CA
3 days ago
Model Performance Software Engineer, Claude Code
$405k
...committed researchers, engineers, policy experts,... ...for a Staff Software Engineer to set technical... ...that measure model capabilities across... ...roadmap * Lead the design of infrastructure... ...to completion * Serve as a senior... ...infrastructure, or platforms that orchestrate many...
Platform
Work at office
Visa sponsorship
Flexible hours
Anthropic
San Francisco, CA
3 days ago
Software Engineer - Model APIs
...of AI to bring cutting-edge models into production. With our recent... ...the models running on our platform are fast, reliable, and cost‑... ...spans distributed systems, model serving, and developer experience.... ...contributions to open-source inference engines (vLLM, TensorRT-LLM, SGLang,...
Platform
Flexible hours
Baseten
San Francisco, CA
4 days ago
Senior Global AI Large-Model Sales Leader
1. Lead the development and strategic management of... ...sales of Keling’s large model offerings and deepen... ...integrators (SIs), independent software vendors (ISVs), and... ...3 years dedicated to serving domestic Strategic Key... ...large models, big data platforms, or enterprise digital...
Platform
Contract work
Overseas
Kuaishou Technology
San Francisco, CA
1 day ago
Staff Software Engineer, Model Serving
$192k - $260k
...best data and AI infrastructure platform so our customers can use deep... .... Databricks' Model Serving product provides enterprises... ...cost efficiency. As a Staff Engineer, you'll play a critical role... ...reliable and performant systems. Lead technical initiatives that...
Platform
Local area
Worldwide
Databricks
San Francisco, CA
2 days ago
Lead Software Engineer - Apple
$164.64k - $246.96k
...Lead Software Engineer - Apple Welcome to Paramount Streaming — the team behind... ...video products. From platform innovation to global distribution... ...Experience integrating ad-serving products into Apple applications... ...whether issues are a data, model, or rendering problem....
Platform
Paramount Global Services
San Francisco, CA
4 days ago
Staff Product Manager, Model Lifecycle & Management
$164.7k - $339.08k
...product strategy for the ML platform that powers how Pinterest... ...and measures content safety models at scale. Lead the development of ML Signal... ...access. Partner with ML Engineering to reduce model iteration time... ...proficiency - able to self‑serve data investigation and...
Platform
Work at office
Local area
Relocation package
Pinterest
San Francisco, CA
5 days ago
Backend Integration Engineer (AI/Model Services)
$74.38 - $83.8 per hour
...Specialty Software Engineer - API Developer - GenAI Charlotte... ...Our client is a leading financial services organization... ...data science and AI platforms. Based out of... ...Rather than building models, you'll be responsible... ...Exposure to model serving or inference gateways...
Platform
Full time
Contract work
Temporary work
Flexible hours
Motion Recruitment
San Francisco, CA
3 days ago
Staff Software Engineer, Model Serving
$192k - $260k
...best data and AI infrastructure platform so our customers can use deep... ...their business. Databricks’ Model Serving product provides enterprises... ...cost efficiency. As a Staff Engineer, you’ll play a critical role... ...and performant systems. Lead technical initiatives that improve...
Platform
Local area
Worldwide
Cacheflow
San Francisco, CA
4 days ago
AI Engineer - Model Performance
Role Overview We’re hiring a Model Performance Engineer to own the speed, cost, and... ...be optimizing real systems serving millions of meetings —... ...or similar serverless GPU platforms. Understanding of audio processing... ...to shape the foundational software services of a growing...
Platform
Fathom
San Francisco, CA
4 days ago
Senior Model Serving Engineer - Low-Latency AI Platform
A leading data and AI company in San Francisco is seeking a Staff Engineer to design and implement systems for their AI/ML Model Serving platform. You will collaborate with product, infrastructure, and research teams to ensure high-performance system delivery. The ideal...
Platform
Menlo Ventures
San Francisco, CA
5 days ago
Senior/Lead/Principal Software Engineer, Agentforce Operations
$13 per hour
...Job Category Software Engineering Job Details About... ...career at the company leading workforce transformation... ...chain with an AI-powered platform for designing,... ...also powering the self-serve data synchronization... ...backend engineering: data modeling, DB performance...
Platform
Salesforce.Com Inc
San Francisco, CA
4 days ago
Lead Sr Staff Software Engineer - Front End Fundamentals
$166.2k - $304.7k
...technology company and the world's leading independent platform for digital advertising, with nearly... ...we do Our Lead Senior Staff Software Engineers are end-to-end owners who will participate... ...system performs every day, 24/7, serving global traffic. It's important that...
Platform
Full time
Temporary work
Local area
Worldwide
The Trade Desk
San Francisco, CA
2 days ago
Lead Product Software Engineer
...media, and build products and platforms that enable the connection... ...and Builders. Entertainers and Engineers. The Walt Disney Company,... ...subsidiaries and affiliates, is a leading diversified international... ...account management. You will serve as a high level technical...
Platform
Work experience placement
Local area
Worldwide
The Walt Disney Studios
San Francisco, CA
4 days ago
Lead Product Software Engineer
$170.5k - $228.6k
...Lead Product Software Engineer On any given day at Disney Entertainment & ESPN Technology, we're reimagining... .... Reach & Scale: The products and platforms this group builds and operates... ...effortless account management. You will serve as a high level technical resource...
Platform
Work experience placement
Worldwide
Disney
San Francisco, CA
4 days ago
Lead Backend Software Engineer (Product API)
$500 per month
...web, mobile, and connected TV platforms. We are seeking an enthusiastic, experienced backend engineer with a deep technical... ...Qualifications 8+ years of software development experience. 2+ years... ...machine learning systems and model serving infrastructure. Benefits Full...
Platform
Full time
Work at office
Immediate start
Remote work
Home office
3 days per week
Philo
San Francisco, CA
2 days ago
Senior AI Infrastructure Engineer, Model Serving Platform
$216k - $270k
...As a Software Engineer on the ML Infrastructure team, you will design and build platforms for scalable, reliable, and efficient serving of LLMs. Our platform powers cutting... ...integrate and optimize models for production and... ...and performance. Lead projects end-to-end,...
Platform
Full time
Scale AI
San Francisco, CA
18 days ago
Principal Software Engineer, ML Flywheel Technical Lead
$332k - $421k
...Principal Software Engineer, ML Flywheel Technical Lead Waymo is an autonomous driving technology... ...to a range of vehicle platforms and product use cases.... ...improving our machine learning models, and ultimately the... ...Enable a data flywheel to serve the demands of scalable pre...
Platform
Full time
Remote work
Waymo
San Francisco, CA
6 days ago
Technical Lead - Software Developer, Data Foundry
$151.5k - $244.2k
...Scientific Software Developer At Lilly, we... ...Discovery Technology and Platforms (DTP) accelerates... ...capabilities—serving both human... ...standards. MLOps & Model Operationalization... ...Biology, Biomedical Engineering, or related STEM... ...'s Initiative for Leading at Lilly (WILL), enAble...
Platform
Full time
Flexible hours
Eli Lilly
San Francisco, CA
1 day ago
Staff Software Engineer / Tech Lead, ML Infrastructure
$190k - $250k
...Staff Software Engineer / Tech Lead, ML Infrastructure Heartflow is a medical technology... ...provides a color-coded, 3D model of a patient's coronary... ...position gives you the platform to lead technically. This... ...We are proponents of self-serve interfaces and robust user...
Platform
Full time
Work at office
Local area
Worldwide
Relocation
HeartFlow
San Francisco, CA
3 days ago
Lead Security Engineer
...Lead Security Engineer (Series A Fintech) We are a $20M Series... ...Security Engineer to serve as an autonomous... ...infrastructure and ML models remain resilient as we... ...penetration testing of our software. Security Culture:... ...GCP (Google Cloud Platform) . Technical Depth...
Platform
Brahma Consulting Group
San Francisco, CA
5 days ago

Do you want to receive more vacancies?

Subscribe and receive similar vacancies to Lead Software Engineer, Model Serving Platform. Be the first to apply!