Sign up to access all features of our service.
  • Job search
  • Favorites
  • Create a CV
    New
  • Salaries
  • Subscriptions

Staff Software Engineer, Model Serving

$192k - $260k

Cacheflow

At Databricks, we are passionate about enabling data teams to solve the world's toughest problems — from making the next mode of transportation a reality to accelerating the development of medical breakthroughs. We do this by building and running the world's best data and AI infrastructure platform so our customers can use deep data insights to improve their business. Databricks’ Model Serving product provides enterprises with a unified, scalable, and governed platform to deploy and manage AI/ML models — from traditional ML to fine-tuned and proprietary large language models. It offers real-time, low-latency inference, governance, monitoring, and lineage. As AI adoption accelerates, Model Serving is a core pillar of the Databricks platform, enabling customers to operationalize models at scale with strong SLAs and cost efficiency. As a Staff Engineer, you’ll play a critical role in shaping both the product experience and the foundational infrastructure of Model Serving. You will design and build systems that enable high-throughput, low-latency inference across CPU and GPU workloads, influence architectural direction, and collaborate closely across platform, product, infrastructure, and research teams to deliver a world-class serving platform. The impact you will have: Design and implement core systems and APIs that power Databricks Model Serving, ensuring scalability, reliability, and operational excellence. Partner with product and engineering leadership to define the technical roadmap and long-term architecture for serving workloads. Drive architectural decisions and trade-offs to optimize performance, throughput, autoscaling, and operational efficiency for CPU and GPU serving workloads. Contribute directly to key components across the serving infrastructure — from model container builds and deployment workflows to runtime systems like routing, caching, observability, and intelligent autoscaling — ensuring smooth and efficient operations at scale. Collaborate cross-functionally with product, platform, and research teams to translate customer needs into reliable and performant systems. Lead technical initiatives that improve latency, availability, and cost-effectiveness across both customer-facing and foundational serving layers. Establish best practices for code quality, testing, and operational readiness, and mentor other engineers through design reviews and technical guidance. Represent the team in cross-organizational technical discussions and influence Databricks’ broader AI platform strategy. What we look for: 10+ years of experience building and operating large-scale distributed systems. Deep expertise in model serving, inference systems, and related infrastructure (e.g., routing, scheduling, autoscaling, and observability). Strong foundation in algorithms, data structures, and system design as applied to large-scale, low-latency serving systems. Proven ability to deliver technically complex, high-impact initiatives that create measurable customer or business value. Experience leading architecture for large-scale, performance-sensitive CPU/GPU inference systems. Strong communication skills and ability to collaborate across teams in fast-moving environments. Strategic and product-oriented mindset with the ability to align technical execution with long-term vision. Passion for mentoring, growing engineers, and fostering technical excellence. Pay Range Transparency Databricks is committed to fair and equitable compensation practices. The pay range(s) for this role is listed below and represents the expected salary range for non-commissionable roles or on-target earnings for commissionable roles. Actual compensation packages are based on several factors that are unique to each candidate, including but not limited to job-related skills, depth of experience, relevant certifications and training, and specific work location. Based on the factors above, Databricks anticipates utilizing the full width of the range. The total compensation package for this position may also include eligibility for annual performance bonus, equity, and the benefits listed above. For more information regarding which range your location is in visit our page here. Local Pay Range

$192,000 — $260,000 USD

About Databricks Databricks is the data and AI company. More than 10,000 organizations worldwide — including Comcast, Condé Nast, Grammarly, and over 50% of the Fortune 500 — rely on the Databricks Data Intelligence Platform to unify and democratize data, analytics and AI. Databricks is headquartered in San Francisco, with offices around the globe and was founded by the original creators of Lakehouse, Apache Spark™, Delta Lake and MLflow. To learn more, follow Databricks on Twitter, LinkedIn and Facebook. Benefits At Databricks, we strive to provide comprehensive benefits and perks that meet the needs of all of our employees. For specific details on the benefits offered in your region, please visit Our Commitment to Diversity and Inclusion At Databricks, we are committed to fostering a diverse and inclusive culture where everyone can excel. We take great care to ensure that our hiring practices are inclusive and meet equal employment opportunity standards. Individuals looking for employment at Databricks are considered without regard to age, color, disability, ethnicity, family or marital status, gender identity or expression, language, national origin, physical and mental ability, political affiliation, race, religion, sexual orientation, socio-economic status, veteran status, and other protected characteristics. Compliance If access to export-controlled technology or source code is required for performance of job duties, it is within Employer's discretion whether to apply for a U.S. government license for such positions, and Employer may decline to proceed with an applicant on this basis alone. #J-18808-Ljbffr Cacheflow

Vacancy posted 4 days ago
Similar jobs that could be interesting for youBased on the Staff Software Engineer, Model Serving in San Francisco, CA vacancy
  • $192k - $260k

     ...data insights to improve their business. Databricks' Model Serving product provides enterprises with a unified, scalable, and...  ...models at scale with strong SLAs and cost efficiency. As a Staff Engineer, you'll play a critical role in shaping both the product... 
    Suggested
    Local area
    Worldwide

    Databricks

    San Francisco, CA
    2 days ago
  • $208.73k - $279.57k

     ...Staff Software Engineer For The Ai Model Lifecycle Team Crusoe is on a mission to accelerate the abundance of energy and intelligence. As the only vertically integrated AI infrastructure company built from the ground up, we own and operate each layer of the stack —... 
    Suggested
    Temporary work

    G2 Venture Partners

    San Francisco, CA
    1 day ago
  • $300 per month

     ...abundance of energy and intelligence. We’re crafting the engine that powers a world where people can create ambitiously...  ...cloud infrastructure. About this role About this role: The Staff Software Engineer for the Model LifeCycle team will play a key role in building a... 
    Suggested
    Temporary work

    Crusoe Energy Systems LLC

    San Francisco, CA
    5 days ago
  • $220k - $320k

     ...and hosts specialized language models for companies that need...  ...well-funded ten-person team of engineers who work in-person in downtown...  ...has founded and run their own software companies. We are high-agency...  ...approaches, always with the goal of serving models faster and cheaper at... 
    Suggested
    Work at office

    Inference

    San Francisco, CA
    3 days ago
  • $320k

     ...committed researchers, engineers, policy experts, and business...  ...'s research. As a Software Engineer on the...  ...a range of deployment models, and creating the reliable...  ...database infrastructure that serves both product and...  ...Currently, we expect all staff to be in one of our offices... 
    Suggested
    Work at office
    Visa sponsorship
    Flexible hours

    Anthropic

    San Francisco, CA
    3 days ago
  • $211k - $240k

     ...use. We're looking for a staff-level Technical Lead to drive...  ...frameworks, all while mentoring the engineers and developer advocates on...  ...team's Engineering Manager, serving as a public face for API...  ...the best elements of hybrid models to ensure that every one of our... 
    Work at office
    Local area
    Work from home
    Worldwide
    Flexible hours

    Asana

    San Francisco, CA
    4 days ago
  • $200k - $275k

     ...0+ states and two countries, serving more than 125 million people...  ...internationally.  Team As an engineering team, we believe strongly...  ...; machine learning models hosted in Bedrock and Sagemaker...  ...Passion for crafting and shipping software solutions that delight users... 
    Work at office
    Local area

    Peregrine Technologies

    San Francisco, CA
    2 days ago
  • $405k

     ...group of committed researchers, engineers, policy experts, and...  ...THE ROLE We're looking for a Staff Software Engineer to set technical direction...  ...eval frameworks that measure model capabilities across diverse...  ...drive them to completion * Serve as a senior technical bridge... 
    Work at office
    Visa sponsorship
    Flexible hours

    Anthropic

    San Francisco, CA
    3 days ago
  • $237.6k - $288k

     ...Senior Staff Software Engineer Crusoe is on a mission to accelerate the abundance of energy and intelligence...  ...for our cloud software team who will serve as a technical leader and strategic...  ...scaling our carbon-reducing operating model, as well as driving the long-term... 
    Temporary work

    G2 Venture Partners

    San Francisco, CA
    1 day ago
  • $189k - $236k

     ...Senior Staff Software Engineer - Pricing and Packaging San Francisco, CA At Gusto, we're on a mission to grow the small business economy....  ...nationwide and are building a workplace that reflects the people we serve. All full-time employees receive competitive base pay,... 
    Full time
    Work at office
    Local area
    Remote work
    2 days per week
    3 days per week

    Gusto

    San Francisco, CA
    3 days ago
  • $163k - $204k

     ...workplace that reflects the people we serve. All full-time employees receive competitive...  ...We're looking for seasoned full-stack software engineers to join the teams behind Gusto's...  ...small business owners every day. As a Staff Software Engineer, you'll operate... 
    Full time
    Work at office
    Local area
    Remote work
    2 days per week
    3 days per week

    Gusto

    San Francisco, CA
    4 days ago
  • $141k - $242k

     ...Waabi Senior Or Staff Software Engineer Waabi, founded by AI visionary Raquel Urtasun, is the leader...  ...from data ingestion to real-time serving. You will collaborate and exchange ideas...  ...Learning pipelines or integrating AI models into production engineering systems.... 

    G2 Venture Partners

    San Francisco, CA
    1 day ago
  •  ...make data-driven strategic decisions. We serve many of the world's largest accounting...  ...talent in AI, product development, and engineering-innovative, humble, and forward-thinking...  ...We are looking for a highly experienced Software Engineer to join our team. You'll have a... 
    Relocation package

    Laurel Property Services

    San Francisco, CA
    5 days ago
  •  ...About the Role As a Staff Engineer on our Builders, you'll be the...  ...entire classes of problems, and serve as a technical bridge between...  ...Looking For ~8+ years of software engineering experience, with...  ...they introduce (data quality, model drift, pipeline failures, etc... 
    Permanent employment
    Full time
    Temporary work
    Local area
    Home office
    Flexible hours

    EvenUp Inc.

    San Francisco, CA
    3 days ago
  • $180k - $315k

     ...the Team The Growth Engineering team builds world-class...  ...- from recommendation models and enrichment...  ...Role We're seeking a Staff AI/ML Engineer to architect...  ...evaluate models Deploy and serve models using FastAPI,...  ...need ~7+ years of software engineering experience,... 
    Work at office
    Immediate start
    3 days per week

    Rippling

    San Francisco, CA
    5 days ago
  • $241k - $284k

     ...Staff Software Engineer, Frontend Hybrid - SF About GlossGenius GlossGenius is the AI-powered system behind the world's most meaningful...  ...at GlossGenius leverage AI effectively and responsibly. Serve as a technical leader and go-to resource in code reviews, architectural... 
    Work at office
    Home office
    Flexible hours
    3 days per week

    GlossGenius

    San Francisco, CA
    4 days ago
  • $240k - $310k

     ...Staff Software Engineer Crusoe is on a mission to accelerate the abundance of energy and intelligence. As the only vertically integrated AI...  ...Storage team at Crusoe is seeking a Staff Software Engineer to serve as a primary architect and visionary for our storage... 
    Temporary work

    Crusoe

    San Francisco, CA
    5 days ago
  •  ...Staff+ Software Engineer, Inference Runtime Remote-Friendly (Travel-Required) | San Francisco, CA...  ...Anthropic's Inference organization serves Claude to millions of users and enterprise...  ...'s expansion cost low by ensuring new models and deployment targets pay only for their... 
    Work at office
    Remote work
    Visa sponsorship
    Flexible hours

    Anthropic

    San Francisco, CA
    7 days ago
  •  ...Customers The explosive growth in model intelligence and increasing relevance for...  ...systems to life. Work with other AI engineers, software engineers and machine learning...  ...Haves Experience training and/or serving ML models in production, or fine-tuning... 
    Work experience placement
    Local area
    Shift work

    Plaid

    San Francisco, CA
    1 day ago
  • $197k - $247k

     ...Software Engineer San Francisco, CA At Gusto, we're on a mission to grow the small business...  ...a workplace that reflects the people we serve. All full-time employees receive competitive...  ...knowledge of the Android security model, including the Android Keystore system,... 
    Full time
    Work at office
    Local area
    2 days per week
    3 days per week

    Gusto

    San Francisco, CA
    1 day ago
  • $230k - $285k

     ...Staff Software Engineer Hover helps people design, improve, and protect the properties they love...  ...measured, accurate, and interactive 3D models of any property — all from a smartphone...  ..., purpose, and a shared commitment to serving our customers, communities, and each... 
    Full time
    For contractors
    Work at office
    Local area
    Flexible hours

    Almaz Capital

    San Francisco, CA
    1 day ago
  • $200k - $400k

     ...maintain, and improve their AI agents. We transform complex engineering work into intuitive self-serve products, making sophisticated AI capabilities...  ...AI agents. This means working closely with the Agent Software Engineering and Agent PM teams to understand current workflows... 
    Full time
    Work at office
    Local area

    Decagon

    San Francisco, CA
    2 days ago
  • $233.5k - $350.5k

     ...us! GoFundMe is searching for a passionate and driven Senior Staff Software Engineer with a strong background in building scalable, high-...  ...systems, driving impact for both the company and the users we serve. Join us if you're excited to grow both personally and professionally... 
    Full time
    Work at office
    Local area
    Remote work
    Flexible hours

    GoFundMe

    San Francisco, CA
    1 day ago
  • $405k

     ...Senior Staff Software Engineer, API San Francisco, CA | New York City, NY About Anthropic...  ...the Claude Developer Platform team and serve as the senior-most individual contributor...  ...applications with our industry-leading models. The API serves as the primary channel... 
    Work at office
    Visa sponsorship
    Flexible hours

    Anthropic

    San Francisco, CA
    1 day ago
  • $300 per month

     ...Staff Software Engineer Crusoe is on a mission to accelerate the abundance of energy and intelligence. As the only vertically integrated AI...  ...Service), a physical infrastructure classification layer that serves as the single source of truth for every GPU node's state,... 
    Temporary work

    Crusoe

    San Francisco, CA
    19 days ago
  • $189k - $210k

     ...Staff Software Engineer, Developer Productivity Async Denver, CO;San Francisco, CA;New York, NY;Los Angeles, CA;Seattle, WA;Toronto, Ontario...  ...nationwide and are building a workplace that reflects the people we serve. All full-time employees receive competitive base pay,... 
    Full time
    Work at office
    Local area
    Remote work
    2 days per week
    3 days per week

    Gusto

    San Francisco, CA
    4 days ago
  • $245k - $280k

     ...Role Matters We're hiring a Staff Engineer to define and own the...  ...mission-critical infrastructure serving 5M+ users and 600K+ organizations...  ...hard. Scribe's pricing model is evolving from simple seat-...  ...You have 10+ years of software engineering experience, with... 
    Full time
    Live in
    Work at office
    Home office
    Flexible hours
    3 days per week

    Scribe

    San Francisco, CA
    2 days ago
  •  ...Staff Software Engineer, Listings & Host Tools and AI Airbnb was born in 2007 when two hosts welcomed three guests to their...  ...standards etc. We own data pipelines and ML models and will build services for serving that are used in the above areas. The Difference... 
    Work experience placement

    airbnb, Inc.

    San Francisco, CA
    3 hours ago
  •  ...the frontier of AI to bring cutting-edge models into production. With our recent $150M Series...  ...work spans distributed systems, model serving, and developer experience. You’ll join a...  ...or contributions to open-source inference engines (vLLM, TensorRT-LLM, SGLang, TGI)... 
    Flexible hours

    Baseten

    San Francisco, CA
    4 days ago
  • $200k - $300k

     ...F2 Staff Software Engineer, Infrastructure Location: San Francisco Employment Type: Full time...  ..., uptime guarantees, deployment models_ into pragmatic infrastructure solutions...  ..., and ideally vector databases or LLM serving infrastructure. Security-first mindset... 
    Full time

    F2

    San Francisco, CA
    4 days ago

Do you want to receive more vacancies?

Subscribe and receive similar vacancies to Staff Software Engineer, Model Serving. Be the first to apply!