Sign up to access all features of our service.
  • Job search
  • Favorites
  • Create a CV
    New
  • Salaries
  • Subscriptions

Model Serving Engineer On-Prem & Edge AI Infra

$145k - $200k

Palantir Technologies

Palantir is seeking an experienced software engineer for a role focused on enabling machine learning models in production. The successful candidate will build high-performance model serving infrastructure and design intelligent request handling. You should have 4+ years of experience, strong coding skills in languages like Java and Python, and familiarity with Kubernetes. This position offers a salary range of $145,000 to $200,000 per year, plus comprehensive benefits including medical insurance, generous paid time off, and 401(k) plans. #J-18808-Ljbffr

Vacancy posted 1 day ago
Similar jobs that could be interesting for youBased on the Model Serving Engineer On-Prem & Edge AI Infra in Palo Alto, CA vacancy
  • $155k - $235k

     ...Senior Model Based Systems Engineer Arlington, VA, Mountain View, CA, San Diego, CA We're a combat...  ...and entrepreneurs who believe America's edge depends on autonomous airpower that's...  ...you develop. What You'll Do Serve as system-level SE: derive and allocate... 
    Suggested
    Full time
    Work experience placement
    Local area
    Relocation package

    Atropos Inc

    Mountain View, CA
    4 days ago
  • $180k

     ...Technical Staff - Imagine Model Palo Alto, CA;...  ...s mission is to create AI systems that can accurately...  ..., and focused on engineering excellence. This organization...  ...will develop cutting-edge AI experiences beyond text...  ..., training, inference serving, and product... 
    Suggested
    Temporary work

    Xai

    Palo Alto, CA
    4 days ago
  • $220.2k - $330.4k

     ...Technologies, Inc. Job Area: Engineering Group, Engineering Group >...  ...leader in connected intelligent edge, focusing on AI, edge computing and...  ...verticals and industries. AI on‑prem Appliance is a new product line...  ...customer data, fine‑tuned models, and inference loads to remain... 
    Suggested
    Work experience placement
    Work at office

    Qualcomm

    Santa Clara, CA
    16 hours ago
  •  ...Technical Staff — Diffusion Model About the Role RadixArk...  ...will work on cutting-edge diffusion and flow-...  ...research thinking with strong engineering execution — from...  ...-generation generative AI systems used by researchers...  ..., the fastest open LLM serving engine), and developed... 
    Suggested
    Flexible hours

    RadixArk

    Palo Alto, CA
    1 day ago
  • $147k - $211k

    PMax and Automation Infra Software Engineer Google, Mountain View, CA, USA Bachelor’s degree or equivalent...  ...Max (PMax), Google's flagship AI-driven campaign type. We own the critical...  ...which automatically generates and manages serving campaigns across Search, YouTube,... 
    Suggested
    Full time

    Google Inc.

    Mountain View, CA
    4 days ago
  • $119.8k - $234.7k

     ...500 enterprises. Ourconverged AI fabricdelivers inference capabilities...  ...more. As a Senior Software Engineer , you will shape the future...  ...strategy. Our mission is to serve models at scale-reliably, efficiently...  ...experiences. Drive cutting-edge innovation in AI systems... 
    Ongoing contract
    Local area

    Microsoft Corporation

    Mountain View, CA
    3 days ago
  •  ...skilled professional to enhance the performance of large-scale models through advanced optimization techniques in Santa Clara, California...  .... The role offers an opportunity to work closely with cutting-edge technologies and a collaborative team. Benefits include a competitive... 

    NVIDIA Gruppe

    Santa Clara, CA
    2 days ago
  • $145k - $200k

     .... The Role We are a software engineering team with expertise in enabling ML models in production. We deploy AI models to run in variety of environments...  ...defense environments, edge nodes, and enterprises with...  ..., Rust, Python and Go Model serving engines for GPU-accelerated... 
    Work experience placement
    Work at office
    Remote work
    Work from home
    Relocation package

    Palantir Technologies

    Palo Alto, CA
    1 day ago
  •  ...Moveworks is the Agentic AI Assistant platform...  ..., our proprietary models, and a...  ...Moveworks' Reasoning Engine and natural language...  ...help build cutting edge ML infrastructure for building and serving LLM's at Moveworks....  ...learning systems. The ML infra team covers a variety... 
    Work at office
    Remote work
    Flexible hours

    ServiceNow

    Mountain View, CA
    3 days ago
  • $180k

    xAI is looking for a candidate passionate about truth-seeking AI and model building. You will tackle critical AI modeling challenges and...  ...in this hands-on role. Join a motivated team invested in engineering excellence and the pursuit of knowledge. #J-18808-Ljbffr xAI

    xAI

    Palo Alto, CA
    3 days ago
  •  ...a rapidly growing Industrial AI and Computer Vision startup that...  ...visible, customer-facing engineering role where you'll work directly...  ...customer infrastructure, and serving as the technical bridge between...  ...facilities Install and configure edge computing devices, including... 

    TRC Talent Solutions

    Palo Alto, CA
    8 days ago
  • $154.45k - $208.96k

    Groq is seeking a Software Engineer, Model Evaluation Systems, to build and optimize systems ensuring AI models achieve exceptional quality on our platform. This role involves developing benchmarking frameworks and integrating models into Groq’s infrastructure. Ideal candidates... 

    I did my part and supported the Regular Toilet

    Palo Alto, CA
    16 hours ago
  • $82k - $109k

     ...develop or restart their career in engineering. We fundamentally believe top talent...  ...at LinkedIn developing cutting-edge machine learning models that serve millions of members. You will learn...  ...your thoughts. If you choose to use AI while writing your application, please... 
    For contractors
    Apprenticeship
    Work experience placement

    NLP PEOPLE

    Mountain View, CA
    1 day ago
  •  ...leading tech company in Mountain View seeks a PMax and Automation Infra Software Engineer to develop features in Java and C++. The role requires a...  ...and benefits offered, with the chance to work at the forefront of AI-driven technologies. #J-18808-Ljbffr Google Inc.

    Google Inc.

    Mountain View, CA
    4 days ago
  • $181.1k - $318.4k

     ..., is looking for an experienced Machine Learning engineer to optimize and build production-grade solutions serving millions in real time. You will work closely with...  ...contributing directly to optimizing language and vision models. Applicants should have at least 5 years of... 

    Apple

    Santa Clara, CA
    1 day ago
  •  ...Cupertino, California, is seeking a Senior Research Engineer focused on training data infrastructure for advanced AI models. The ideal candidate will possess strong skills...  ...offers the opportunity to contribute to cutting-edge AI development in a collaborative and innovation... 

    Apple

    Cupertino, CA
    1 day ago
  • $240k - $280k

     ...revenue streams. As a machine learning model engineer of the Samsung Ads Platform Intelligence...  ...envisioning, designing, and implementing cutting-edge machine learning products with a growing...  ...work with machine learning platform and serving teams to deploy and streamline machine... 
    Worldwide

    Samsung Electronics America North America

    Mountain View, CA
    6 days ago
  • $181.1k - $318.4k

     ...United States Machine Learning and AI Do you feel you think...  ...to people’s face”. Foundation Model Services team, within Machine Learning...  ...upcoming ever exciting Apple products serving millions of queries every day...  ...develop inference for cutting‑edge model architectures. Build... 
    Relocation

    Apple

    Santa Clara, CA
    1 day ago
  •  ...About The Role Model Alignment And Deployment Is A Critical, Cross-Functional...  ...Our Post-Training, Safety Engineering, Trust & Safety, ML Infra/Model Serving, And User Experience Research (UXR...  ...And High-Stakes Work At Character.ai. This Is A Role For Someone Who Thrives... 

    Character

    Redwood City, CA
    3 days ago
  • $118k - $162k

     ...o j e c t Software Engineering Mountain View, CA (HQ)...  ...team We're X's AI for chemistry moonshot, applying...  ...-of-the-art multi-modal models. This role requires a...  ...development, you will also serve as a critical bridge between cutting-edge AI research and real-world... 
    Full time

    X: The Moonshot Factory

    Mountain View, CA
    2 days ago
  •  ...Moveworks is the Agentic AI Assistant platform that...  ...advanced LLMs, our proprietary models, and a sophisticated...  ...Moveworks' Reasoning Engine and natural language capabilities...  ...-tune, evaluate, and serve your own models in...  ...keeping our ML at the cutting edge of data privacy and... 
    Work at office
    Immediate start
    Remote work
    Flexible hours

    ServiceNow

    Mountain View, CA
    4 days ago
  • $150k - $350k

     ...Institute of Foundation Models We are a dedicated...  ...the next generation of AI builders, and drive transformative...  ...on the core of cutting‑edge foundation model...  ..., data scientists, and engineers, tackling the most fundamental...  ...Stability to serve as the backbone of our... 
    Live in
    Immediate start

    Institute of Foundation Models

    Sunnyvale, CA
    4 days ago
  • $184k - $287.5k

    Responsibilities Develop state‑of‑the‑art model optimization techniques—...  ...across diverse NVIDIA edge architectures, maximizing the...  ...in Computer Science, Computer Engineering, or a related technical field....  ...time robotic control, embodied AI, and autonomous decision‑making... 

    NVIDIA Gruppe

    Santa Clara, CA
    16 hours ago
  •  ...Harmonic in Palo Alto is seeking a pragmatic Software Engineer to lead the productionization of research pipelines within our advanced AI projects. You will engage in building robust ML pipelines as part of a cutting-edge team, ensuring efficient coding practices and... 

    Harmonic

    Palo Alto, CA
    1 day ago
  • $224k - $356.5k

     ...people. Today, we’re tapping into the unlimited potential of AI to define the next era of computing. An era in which our GPU acts...  ...-performance computing. As a Senior / Principal Deep Learning Engineer — Model Evaluation & AI Systems, you will play a meaningful role in... 

    NVIDIA Gruppe

    Santa Clara, CA
    1 day ago
  •  ...builds the world's largest AI chip, 56 times larger...  ...current customers include top model labs, global enterprises, and cutting-edge AI-native startups....  ...like across the models we serve, building AI-driven systems...  ...." You'll sit between engineering, product, and customer-facing... 

    CEREBRAS SYSTEMS INC.

    Sunnyvale, CA
    5 days ago
  • $207k - $300k

    Senior Research Engineer, On-Device Inference, Robotics,...  ...optimizing machine learning models for resource-constrained...  ...architectures with AI accelerators (e.g., distillation...  ...model architectures and edge device constraints. You...  ...of the users we serve, creating a culture of belonging... 
    Full time

    Google Inc.

    Mountain View, CA
    3 days ago
  •  ...turn Fish Audio's models into production voice...  ...a Forward Deployed Engineer to embed directly...  ...contact platforms, AI agents, and creator...  ...– the ones with on‑prem requirements, sub‑1...  ...back to research, infra, and product. Influence...  ...customers can self‑serve from. What You... 
    Contract work
    Work at office
    Remote work
    Visa sponsorship

    39 Ai, Inc.

    Mountain View, CA
    2 days ago
  • $150k - $300k

    ## Distinguished Engineer, Applied AIApplylocations: Palo Alto, CAtime...  ...and sales representatives who serve more than 80 million customers...  ...customers. We are building an AI-powered CRM platform using...  ...our CRM roadmap powered by LLM models and Agentic AI. The ideal candidate... 
    Hourly pay
    Work experience placement
    Local area
    Flexible hours
    Shift work

    GEICO

    Palo Alto, CA
    1 day ago
  •  ...A global technology delivery partner is hiring an AI Quality Infrastructure Engineer in Mountain View, California. This full-time role involves designing...  ...H-1B visa sponsorship and offers the opportunity to work on cutting-edge AI reliability systems. #J-18808-Ljbffr... 
    Full time
    H1b
    Visa sponsorship

    NewsNowGh

    Mountain View, CA
    2 days ago

Do you want to receive more vacancies?

Subscribe and receive similar vacancies to Model Serving Engineer On-Prem & Edge AI Infra. Be the first to apply!