Sign up to access all features of our service.
  • Job search
  • Favorites
  • Create a CV
    New
  • Salaries
  • Subscriptions

ML Model Serving Infra Engineer - On-Prem & Edge

Palantir Technologies

Palantir Technologies is looking for a Software Engineer to enhance its AI modeling capabilities. You will develop and maintain production systems, ensuring robust model serving infrastructure. The ideal candidate has 4+ years of software engineering experience, strong coding abilities in languages like Java and Python, and is well-versed in containers and Kubernetes. Benefits include comprehensive health insurance, paid time off, 401k plan, and more. #J-18808-Ljbffr Palantir Technologies

Vacancy posted 4 days ago
Similar jobs that could be interesting for youBased on the ML Model Serving Infra Engineer - On-Prem & Edge in Palo Alto, CA vacancy
  • $181.1k - $318.4k

     ...intersection of cutting-edge AI and human-...  ...These are multimodal models that power Siri on-...  ...and modeling engineers train models, iterate...  ..., observable, self-serve system. The work spans...  ...automation pipelines for ML models where agents...  ...-backed training infra) including... 
    Suggested
    Relocation

    Apple

    Cupertino, CA
    1 day ago
  • $220.2k - $330.4k

     ...Technologies, Inc.Job Area:Engineering Group, Engineering...  ...connected intelligent edge, focusing on AI, edge computing...  ...and industries.AI on‑prem Appliance is a new...  ...customer data, fine‑tuned models, and inference loads to...  ...10+ years in the AI/ML fields.Qualcomm is an equal... 
    Suggested
    Work experience placement
    Work at office
    Work from home

    Nutanix

    Santa Clara, CA
    2 days ago
  •  ...Technical Staff — Diffusion Model About the Role RadixArk...  ...will work on cutting-edge diffusion and flow-...  ...research thinking with strong engineering execution — from...  ...years of experience in ML research or applied ML...  ..., the fastest open LLM serving engine), and developed... 
    Suggested
    Flexible hours

    RadixArk

    Palo Alto, CA
    3 days ago
  •  ...skilled professional to enhance the performance of large-scale models through advanced optimization techniques in Santa Clara, California...  .... The role offers an opportunity to work closely with cutting-edge technologies and a collaborative team. Benefits include a competitive... 
    Suggested

    NVIDIA Gruppe

    Santa Clara, CA
    2 days ago
  •  ...looking for a Machine Learning Engineer to help build cutting edge ML infrastructure for building and serving LLM’s at Moveworks. This role...  ...learning systems. The ML infra team covers a variety of responsibilities...  ...pipeline for large language models (LLM), model evaluation and... 
    Suggested

    Moveworks

    Mountain View, CA
    3 days ago
  • Model AI is looking for a Founding Machine Learning Infrastructure Engineer in Palo Alto to help optimize infrastructure for AI systems....  ...will focus on enhancing model serving performance and cost efficiency...  ...have strong experience in ML and distributed systems, and troubleshooting... 

    Model AI

    Palo Alto, CA
    3 days ago
  • $145k - $200k

     ...more. The Role We are a software engineering team with expertise in enabling ML models in production. We deploy AI...  ...forward-deployed defense environments, edge nodes, and enterprises with...  ...Java, Rust, Python and Go Model serving engines for GPU-accelerated inference... 
    Work experience placement
    Work at office
    Remote work
    Work from home
    Relocation package

    Palantir Technologies

    Palo Alto, CA
    16 hours ago
  • $170.5k - $315.49k

    ## Inference Optimization Engineer (local / edge runtime)Applylocations: US, California...  ...by design. Small, efficient models run directly on the user's machine (AI PC, edge, on-prem, and beyond), keeping data...  ...belong to the people it serves# Role SummaryMake models fast... 
    Internship
    Local area
    Immediate start
    Shift work

    Intel

    Santa Clara, CA
    2 days ago
  • $155k - $207k

     ...and deploy cutting-edge AI technology to help...  ...responsible for ML and work alongside...  ...veteran scientists and engineers. As a Senior/Staff...  ...and speech models using PyTorch and/or...  ...training, inference, and serving infrastructure while...  ...-end with product, infra, research, and data... 
    Permanent employment

    Cacheflow

    Mountain View, CA
    3 days ago
  •  ...that a single foundation model works out of the box...  ...and evaluate the cutting edge solutions developed by...  ...mechanism. Work with software engineers to implement your...  ...Working knowledge of ML basics back prop, loss...  ...environments, already serving Tier-1 semiconductor and... 

    Getvinci

    Palo Alto, CA
    6 hours ago
  •  ...Engineering Manager, Agentic Systems - Moveworks Job Description...  ...of the cutting‑edge platform that powers Moveworks...  ...systems for the entire ML/LLM lifecycle. This...  ...and inference, model evaluation frameworks,...  ...frameworks your team builds serve as the foundation for all... 

    Moveworks.ai

    Mountain View, CA
    5 days ago
  • About the role Model Alignment and Deployment is a critical, cross-functional effort spanning our Post-Training, Safety Engineering, Trust & Safety, ML Infra/Model Serving, and User Experience Research (UXR) teams. Together, these groups are responsible for transforming... 

    Character.AI

    Redwood City, CA
    2 days ago
  •  ...Inc. is seeking a Senior Machine Learning Engineer in Cupertino, California, to evaluate and...  ...design and develop key infrastructures for model and agent evaluations, contribute to quality...  ...learning, and proficiency in Python and ML frameworks such as PyTorch. Join Apple to... 

    Apple Inc.

    Cupertino, CA
    2 days ago
  • Harmonic in Palo Alto is seeking a pragmatic Software Engineer to lead the productionization of research pipelines within...  ...AI projects. You will engage in building robust ML pipelines as part of a cutting-edge team, ensuring efficient coding practices and scalable... 

    Harmonic

    Palo Alto, CA
    2 days ago
  •  ...turn Fish Audio's models into production voice...  ...a Forward Deployed Engineer to embed directly...  ...– the ones with on‑prem requirements, sub‑1...  ...back to research, infra, and product. Influence...  ...customers can self‑serve from. What You...  ...Experience deploying ML/AI systems into production... 
    Contract work
    Work at office
    Remote work
    Visa sponsorship

    39 Ai, Inc.

    Mountain View, CA
    4 days ago
  • $100k - $150k

     ...their operations. We leverage cutting-edge technologies to create scalable, secure...  ...continue to grow, we’re looking for a skilled Model Serving Engineer to join our dynamic team and contribute...  ..., throughput, cost, and quality in ML serving. Key Responsibilities... 
    Full time
    H1b
    Local area
    Immediate start
    Remote work
    Visa sponsorship
    Work visa

    Bright Vision Technologies

    Newark, CA
    5 days ago
  • $224k - $356.5k

     ...-performance computing. As a Senior / Principal Deep Learning Engineer — Model Evaluation & AI Systems, you will play a meaningful role in crafting...  ...building or improving evaluation frameworks, benchmarks, or ML infrastructure used by other teams or external users.* A... 

    NVIDIA Corporation

    Santa Clara, CA
    3 days ago
  •  ...effortlessly run large‑scale ML applications, without the...  ...customers include top model labs, global enterprises, and cutting‑edge AI‑native startups....  ...hiring a Senior Performance Engineer to join our Product team....  ...inference performance and will serve as our resident expert on... 
    Contract work
    Shift work

    Cerebras

    Sunnyvale, CA
    16 hours ago
  • $174k - $253k

     ...leveraging AI solutions, ML APIs, prompting, agent...  ...with large language models (LLMs), retrieval‑augmented...  ...management and engineering teams to stay on top of...  ...leverage Google’s cutting‑edge technology, and tools that...  ...benefits Responsibilities Serve as a trusted advisor to... 
    Temporary work

    Google

    Sunnyvale, CA
    2 days ago
  • $82k - $109k

     ...and drive to either develop or restart their career in engineering. We fundamentally believe top talent can come from...  ...Learning engineering team at LinkedIn developing cutting-edge machine learning models that serve millions of members. You will learn from fellow engineers... 
    For contractors
    Apprenticeship
    Work experience placement

    NLP PEOPLE

    Mountain View, CA
    16 hours ago
  • $150k - $300k

    ## Distinguished Engineer, Applied AIApplylocations: Palo Alto, CAtime...  ...and sales representatives who serve more than 80 million customers...  ...CRM roadmap powered by LLM models and Agentic AI. The ideal candidate...  ...capabilities across AI/ML, distributed systems, and operational... 
    Hourly pay
    Work experience placement
    Local area
    Flexible hours
    Shift work

    GEICO

    Palo Alto, CA
    2 days ago
  •  ...advanced device intelligence, powerful decision engine, and investigation tools work together to...  ...Science. During the onboarding phase, serve as the technical escalation point for...  ...and machine learning, to provide cutting‑edge solutions to clients. Stay updated on product... 

    DataVisor

    Mountain View, CA
    3 days ago
  • $133k - $254k

     ...system. Our AI Data Pipeline Engineers build up the core data...  ...autonomous driving cutting edge algorithms. We develop the...  ...well as high‑performance data serving SDKs for ML model training / evaluation. The...  ...ML application, and Cloud Infra to align data pipelines with... 
    Work experience placement

    42dot

    Sunnyvale, CA
    4 days ago
  • $150k - $180k

     ...variable uptime workloads (e.g., AI/ML). Verrus builds and capitalizes...  ...is looking for candidates to serve as software-focused Senior Site Reliability Engineer at Verrus. This is a full‑time position...  ...at the intersection of leading‑edge electrical and mechanical... 
    Full time
    Work at office
    Local area
    Flexible hours

    Verrus, LLC

    Mountain View, CA
    3 days ago
  • MatX is seeking a skilled software engineer to build custom silicon for AI language models in Mountain View, California. The role requires strong programming...  ...memory management and developing an LLM inference serving stack. Benefits include generous salary, equity offerings... 
    Remote job

    MatX

    Mountain View, CA
    2 days ago
  • $175k - $275k

     ...users to run large-scale ML applications without managing...  ...customers include top model labs, global enterprises, and cutting-edge AI-native startups....  ...hands-on Senior Quality Engineer to drive Manufacturing Quality...  ...Manufacturing Quality Execution: Serve as the primary quality... 
    Contract work

    Cerebras

    Sunnyvale, CA
    4 days ago
  •  ...Cupertino, California, is seeking a Senior Research Engineer focused on training data infrastructure for advanced AI models. The ideal candidate will possess strong skills...  ...offers the opportunity to contribute to cutting-edge AI development in a collaborative and innovation... 

    Apple Inc.

    Cupertino, CA
    3 days ago
  •  ...building a 100x better job search engine: fast, comprehensive, honest,...  .... We are looking for a founding ML engineer who can help us turn powerful AI and ML models into fast, reliable production systems...  ...latency and throughput, scaling serving systems, and making sure our... 
    Relocation package

    HiringCafe

    Cupertino, CA
    4 days ago
  • $180k

     ...highly motivated, and focused on engineering excellence. This organization is for...  ...engineer on the Imagine Model Team, you will develop cutting-edge AI experiences beyond text, with a...  ...curation, modeling, training, inference serving, and product integration, covering... 
    Temporary work

    xAI

    Palo Alto, CA
    6 days ago
  •  ...motivated Senior Electro-Mechanical Engineer to join our team. In this position, you will serve as the subject matter expert for...  ...in the success of our cutting-edge eVTOL aircraft....  ...electrical system performance, thermal modeling, and stress analysis for enclosures... 
    Work at office

    Pivotal

    Palo Alto, CA
    5 days ago

Do you want to receive more vacancies?

Subscribe and receive similar vacancies to ML Model Serving Infra Engineer - On-Prem & Edge. Be the first to apply!