Sign up to access all features of our service.
  • Job search
  • Favorites
  • Create a CV
    New
  • Salaries
  • Subscriptions

Lead Software Engineer, Model Serving Platform

Sciforium

Sciforium's Next-Generation Model Serving Platform Architect

Sciforium is an AI infrastructure company developing next-generation multimodal AI models and a proprietary, high-efficiency serving platform. Backed by multi-million-dollar funding and direct sponsorship from AMD with hands-on support from AMD engineers the team is scaling rapidly to build the full stack powering frontier AI models and real-time applications.

About the Role

This is a rare chance to help architect and lead the development of Sciforium's next-generation model serving platform, the high-performance engine that will bring a multimodal, highly efficient foundation model to market. As a senior technical leader, you'll not only build core components yourself but also guide and mentor other engineers, influencing engineering direction, standards, and execution quality.

You will learn and shape the full AI stack: from GPU kernels and quantized execution paths to distributed serving, scheduling, and the APIs that power real-time AI applications. If you enjoy deep systems work, thrive on ownership, and want to lead engineers in building foundational AI infrastructure, this role puts you at the center of SciForium's mission and growth.

What You'll Do
  • Lead the technical direction of the model serving platform, owning architecture decisions and guiding engineering execution.
  • Build core serving components including execution runtimes, batching, scheduling, and distributed inference systems.
  • Develop high-performance C++ and CUDA/HIP modules, including custom GPU kernels and memory-optimized runtimes.
  • Collaborate with ML researchers to productionize new multimodal models and ensure low-latency, scalable inference.
  • Build Python APIs and services that expose model capabilities to downstream applications.
  • Mentor and support other engineers through code reviews, design discussions, and hands-on technical guidance.
  • Drive performance profiling, benchmarking, and observability across the inference stack.
  • Ensure high reliability and maintainability through testing, monitoring, and engineering best practices.
  • Troubleshoot and resolve complex issues across GPU, runtime, and service layers.
Ideal Candidate Profile
  • Bachelor's degree in Computer Science, Computer Engineering, Electrical Engineering, or equivalent practical experience
  • 5+ years of experience designing and building scalable, reliable backend systems or distributed infrastructure.
  • Strong understanding of LLM inference mechanics (prefill vs decode, batching, KV cache)
  • Experience with Kubernetes/Ray, Containerization
  • Strong proficiency in C++, Python.
  • Strong debugging, profiling, and performance optimization skills at the system level.
  • Ability to collaborate closely with ML researchers and translate model or runtime requirements into production-grade systems.
  • Effective communication skills and the ability to lead technical discussions, mentor engineers, and drive engineering quality.
  • Comfortable working from the office and contributing to a fast-moving, high-ownership team culture.
Nice-to-Have
  • Experience with ML systems engineering, distributed GPU scheduling, open source inference engine like vLLM, Sglang, or TRT-LLM
  • Experience in building large scale ML/MLOps infrastructure
  • Proficiency in CUDA or ROCm and experience with GPU profiling tools
  • Experience at an AI/ML startup, research lab, or Big Tech infrastructure/ML team.
  • Familiarity with multimodal model architectures, raw-byte models, or efficient inference techniques.
  • Contributions to open-source ML or HPC infrastructure
Benefits Include
  • Medical, dental, and vision insurance
  • 401k plan
  • Daily lunch, snacks, and beverages
  • Flexible time off
  • Competitive salary and equity
Equal Opportunity

Sciforium is an equal opportunity employer. All applicants will be considered for employment without attention to race, color, religion, sex, sexual orientation, gender identity, national origin, veteran or disability status.

Vacancy posted 1 day ago
Similar jobs that could be interesting for youBased on the Lead Software Engineer, Model Serving Platform in San Francisco, CA vacancy
  •  ...Model Implementation Engineer Sciforium is an AI infrastructure company developing next-generation multimodal AI models and a proprietary, high-efficiency serving platform. Backed by multi-million-dollar funding and direct sponsorship from AMD with hands-on support... 
    Platform
    Flexible hours

    Sciforium

    San Francisco, CA
    1 day ago
  •  ...spanning hardware and software. Join us in...  ...Turbocharge our serving layer, consisting...  ...speech, and vision models. Partner with ML...  ...infrastructure and training engineers to build a fast,...  ...a public cloud platform such as GCP, AWS,...  ...a track record of leading complex multi-month... 
    Platform
    Full time
    Contract work
    Flexible hours

    SESAME

    San Francisco, CA
    1 day ago
  •  ...frontier of AI to bring cutting‑edge models into production. We’re growing...  .... Join us and help build the platform engineers turn to ship AI products. THE ROLE You’ll lead the Model Library team at...  ...Experience building or supporting self‑serve workflows. NICE TO HAVE... 
    Platform
    Flexible hours

    Baseten

    San Francisco, CA
    2 days ago
  • $98k - $140k

     ...with product and engineering teams to build systems...  ...'t a traditional software engineering role....  ...ll shape Notion’s model strategy and work...  ...launch new models with leading research labs —...  ...and eval platforms (e.g., Braintrust)...  ...data — You can self‑serve insights from large... 
    Platform
    Live in
    Work at office
    Local area

    Notion

    San Francisco, CA
    4 days ago
  • $144k - $164k

     ...Product Management, Gen AI Model Gateway At Capital One...  ...Generative AI Platform team is at the forefront...  ...prototyping, development, and serving). The FM Gateway...  ...to influence and lead. Basic Qualifications:...  ...analysis, data science, or software engineering. Preferred... 
    Platform
    Full time
    Part time
    Local area

    Capital One National Association

    San Francisco, CA
    4 days ago
  • $220k - $320k

     ...specialized language models for companies that need...  ...up to 90% cheaper. Our platform handles everything end...  ...ten-person team of engineers who work in-person in...  ...founded and run their own software companies. We are high...  ...with the goal of serving models faster and cheaper... 
    Platform
    Work at office

    Inference

    San Francisco, CA
    5 days ago
  • $405k

     ...committed researchers, engineers, policy experts,...  ...for a Staff Software Engineer to set technical...  ...that measure model capabilities across...  ...roadmap * Lead the design of infrastructure...  ...to completion * Serve as a senior...  ...infrastructure, or platforms that orchestrate many... 
    Platform
    Work at office
    Visa sponsorship
    Flexible hours

    Anthropic

    San Francisco, CA
    5 days ago
  • $192k - $260k

     ...best data and AI infrastructure platform so our customers can use deep...  .... Databricks' Model Serving product provides enterprises...  ...cost efficiency. As a Staff Engineer, you'll play a critical role...  ...reliable and performant systems. Lead technical initiatives that... 
    Platform
    Local area
    Worldwide

    Databricks

    San Francisco, CA
    4 days ago
  • $192k - $260k

     ...data and AI infrastructure platform so our customers can use deep...  ...business. Foundation Model Serving is the API Product for...  ...necessary. We're looking for engineers who have owned high scale operational...  ...systems. ~ Experience leading high-scale operationally... 
    Platform
    Local area
    Worldwide

    Databricks

    San Francisco, CA
    5 days ago
  •  ...of AI to bring cutting-edge models into production. With our recent...  ...the models running on our platform are fast, reliable, and cost‑...  ...spans distributed systems, model serving, and developer experience....  ...contributions to open-source inference engines (vLLM, TensorRT-LLM, SGLang,... 
    Platform
    Flexible hours

    Baseten

    San Francisco, CA
    1 day ago
  • $148.5k - $266.2k

     ...planet. By creating software tools for making buildings...  ...a Machine Learning Engineering Manager on the Model Delivery team within...  ...Research, you will lead production ML...  ...and new research or platform advancements Lead...  ...improvements for inference and serving, including capacity... 
    Platform
    For contractors
    Remote work

    Autodesk

    San Francisco, CA
    2 days ago
  • $111.8k

     ...NASDAQ: SFIX) is the leading online personal...  ...Fix’s CRM & MarTech engineering team is looking for a Lead Software Engineer to help shape...  ..., collect data to serve clients throughout...  ...capabilities for the platforms we support as well...  ...project execution. Model consistently... 
    Platform

    GrabJobs

    San Francisco, CA
    2 days ago
  •  ...in healthcare. Our AI-powered platform was purpose-built for medical...  ..., technologists, and engineers working together to empower people...  ...ML Infrastructure Engineer, Model Inference at Abridge, you’ll...  ...optimize, and maintain ML model serving infrastructure, ensuring high... 
    Platform
    Hourly pay
    Full time
    Flexible hours

    Abridge

    San Francisco, CA
    2 days ago
  • A leading data and AI company in San Francisco is seeking a Senior Engineer to enhance their Model Serving platform. This role requires expertise in building large-scale distributed systems and collaboration across teams to optimize performance and reliability. Ideal candidates... 
    Platform

    Jobleads-US

    San Francisco, CA
    2 days ago
  • $110k - $205k

     ...is the place for you.As a Lead Infrastructure Engineer at Commerce, you will be an...  ...systems administration and software engineering. We are charged...  ...architectures.* Collaborate on platform-level initiatives that...  ...description is intended to serve as a summary of key duties... 
    Platform
    Full time
    Remote work

    BigCommerce

    San Francisco, CA
    4 days ago
  • $157k - $200k

     ...Lead Software Engineer - Apple 45623 San Francisco, CA, US, 94107 Technology...  ...of the iOS and tvOS platforms and deep knowledge with Xcode...  ...Experience integrating ad-serving products into Apple applications...  ...whether issues are a data, model, or rendering problem. Excellent... 
    Platform
    Full time

    Paramount Global Services

    San Francisco, CA
    1 day ago
  • $143k - $224k

     ...Wells Fargo is seeking a Lead Software Engineer to join The Digital Technology...  ...Lead projects, teams, or serve as a peer mentor Required...  ...Kubernetes, OpenShift Container Platform ~4+ years of Kafka/MQ Series...  ...~1+ year experience with Model Context Protocol (MCP), enterprise... 
    Platform
    Work experience placement
    Relocation package

    Wells Fargo

    San Francisco, CA
    4 days ago
  • $190k - $245k

     ...Lead Software Engineer Altana is the network for trusted trade. Our AI-powered product network...  ...team is responsible for building out the platforms, API's, services, workflows, and...  ...efficient, scalable, and maintainable. Serve as the first point of escalation for a... 
    Platform
    Full time
    Temporary work
    Work experience placement
    Flexible hours

    Altana Technologies

    San Francisco, CA
    1 day ago
  •  ...Senior Engineer Opportunity We're building the all-in-one B2B post-sales support platform powered by conversational data and layered with...  ...months, then transition into leading a small team while continuing...  ...leadership skills that will serve you for the rest of your career... 
    Platform
    Work at office
    Relocation

    Pylon

    San Francisco, CA
    1 day ago
  • $192k - $260k

     ...best data and AI infrastructure platform so our customers can use deep...  ...their business. Databricks’ Model Serving product provides enterprises...  ...cost efficiency. As a Staff Engineer, you’ll play a critical role...  ...and performant systems. Lead technical initiatives that improve... 
    Platform
    Local area
    Worldwide

    Cacheflow

    San Francisco, CA
    1 day ago
  • Role Overview We’re hiring a Model Performance Engineer to own the speed, cost, and...  ...be optimizing real systems serving millions of meetings —...  ...or similar serverless GPU platforms. Understanding of audio processing...  ...to shape the foundational software services of a growing... 
    Platform

    AI Chopping Block, Inc.

    San Francisco, CA
    1 day ago
  • $13 per hour

     ...Job Category Software Engineering Job Details About...  ...career at the company leading workforce transformation...  ...chain with an AI-powered platform for designing,...  ...also powering the self-serve data synchronization...  ...backend engineering: data modeling, DB performance... 
    Platform

    Salesforce.Com Inc

    San Francisco, CA
    4 days ago
  • $166.2k - $304.7k

     ...technology company and the world's leading independent platform for digital advertising, with nearly...  ...we do Our Lead Senior Staff Software Engineers are end-to-end owners who will participate...  ...system performs every day, 24/7, serving global traffic. It's important that... 
    Platform
    Full time
    Temporary work
    Local area
    Worldwide

    The Trade Desk

    San Francisco, CA
    5 days ago
  •  ...measures physical spaces. Our AI-powered platform transforms standard photos into fully...  ...for our users. The Role: Technical Lead & Engineering Coach We are seeking a Senior Lead...  ...services. Engineering Mentorship: Serve as a dedicated coach for the engineering... 
    Platform

    GrabJobs

    San Francisco, CA
    4 days ago
  • A leading data and AI company in San Francisco is seeking a Staff Engineer to design and implement systems for their AI/ML Model Serving platform. You will collaborate with product, infrastructure, and research teams to ensure high-performance system delivery. The ideal... 
    Platform

    Menlo Ventures

    San Francisco, CA
    2 days ago
  • $216k - $270k

     ...As a Software Engineer on the ML Infrastructure team, you will design and build platforms for scalable, reliable, and efficient serving of LLMs. Our platform powers cutting...  ...integrate and optimize models for production and...  ...and performance. Lead projects end-to-end,... 
    Platform
    Full time

    Scale AI

    San Francisco, CA
    1 day ago
  • $172.5k - $260.1k

     ...efforts. Job Category Software Engineering Job Details About...  ...your career at the company leading workforce transformation in...  ...Knowledge product at Salesforce, serving over 12 million monthly active...  ...technologies, such as large language models (LLMs), recommendation... 
    Platform
    Immediate start

    Salesforce.Com Inc

    San Francisco, CA
    3 days ago
  • $13 per hour

     ...chain with an AI-powered platform for designing,...  ...the same founders and engineers who built the original...  ...Do****## As a Senior/Lead AI Software Engineer, for the Agentforce...  ...for distributed systems serving millions of users with...  ...-making for AI model deployment, safety constraints... 
    Platform
    Immediate start

    Salesforce, Inc.

    San Francisco, CA
    2 days ago
  • $280k - $308k

     ...leader will structure, lead, support, drive, and grow...  ...lead on IT Operating Model and Outsourcing Advisory...  ...Support and lead teams serving clients across...  ...Sciences, High Tech & Software, Utilities, Insurance,...  ...transformation and related platforms, productivity, and transformation... 
    Platform
    Full time
    Contract work
    Work at office
    Local area
    Immediate start
    Flexible hours

    West Monroe

    San Francisco, CA
    21 hours ago
  • $332k - $421k

     ...Principal Software Engineer, ML Flywheel Technical Lead Waymo is an autonomous driving technology...  ...to a range of vehicle platforms and product use cases....  ...improving our machine learning models, and ultimately the...  ...Enable a data flywheel to serve the demands of scalable pre... 
    Platform
    Full time
    Remote work

    Waymo

    San Francisco, CA
    3 days ago

Do you want to receive more vacancies?

Subscribe and receive similar vacancies to Lead Software Engineer, Model Serving Platform. Be the first to apply!