ML Model Serving Infra Engineer - On-Prem & Edge

Palantir Technologies

Palantir Technologies is looking for a Software Engineer to enhance its AI modeling capabilities. You will develop and maintain production systems, ensuring robust model serving infrastructure. The ideal candidate has 4+ years of software engineering experience, strong coding abilities in languages like Java and Python, and is well-versed in containers and Kubernetes. Benefits include comprehensive health insurance, paid time off, 401k plan, and more. #J-18808-Ljbffr Palantir Technologies

Apply

Vacancy posted 4 days ago

Similar jobs that could be interesting for youBased on the ML Model Serving Infra Engineer - On-Prem & Edge in Palo Alto, CA vacancy

Sr. ML Production Model Automation Engineer, Siri Speech
$181.1k - $318.4k
...intersection of cutting-edge AI and human-... ...These are multimodal models that power Siri on-... ...and modeling engineers train models, iterate... ..., observable, self-serve system. The work spans... ...automation pipelines for ML models where agents... ...-backed training infra) including...
Suggested
Relocation
Apple
Cupertino, CA
1 day ago
Principal Engineer, Solutions Architect Lead - Industrial & Embedded IoT, Edge AI On‑Prem Appliance
$220.2k - $330.4k
...Technologies, Inc.Job Area:Engineering Group, Engineering... ...connected intelligent edge, focusing on AI, edge computing... ...and industries.AI on‑prem Appliance is a new... ...customer data, fine‑tuned models, and inference loads to... ...10+ years in the AI/ML fields.Qualcomm is an equal...
Suggested
Work experience placement
Work at office
Work from home
Nutanix
Santa Clara, CA
2 days ago
Member of Technical Staff — Diffusion Model
...Technical Staff — Diffusion Model About the Role RadixArk... ...will work on cutting-edge diffusion and flow-... ...research thinking with strong engineering execution — from... ...years of experience in ML research or applied ML... ..., the fastest open LLM serving engine), and developed...
Suggested
Flexible hours
RadixArk
Palo Alto, CA
3 days ago
Senior DL Engineer: Edge Model Optimization & Inference
...skilled professional to enhance the performance of large-scale models through advanced optimization techniques in Santa Clara, California... .... The role offers an opportunity to work closely with cutting-edge technologies and a collaborative team. Benefits include a competitive...
Suggested
NVIDIA Gruppe
Santa Clara, CA
2 days ago
Senior Machine Learning Engineer, Agentic Systems - Moveworks
...looking for a Machine Learning Engineer to help build cutting edge ML infrastructure for building and serving LLM’s at Moveworks. This role... ...learning systems. The ML infra team covers a variety of responsibilities... ...pipeline for large language models (LLM), model evaluation and...
Suggested
Moveworks
Mountain View, CA
3 days ago
Founding ML Systems Engineer - Infra & Inference
Model AI is looking for a Founding Machine Learning Infrastructure Engineer in Palo Alto to help optimize infrastructure for AI systems.... ...will focus on enhancing model serving performance and cost efficiency... ...have strong experience in ML and distributed systems, and troubleshooting...
Model AI
Palo Alto, CA
3 days ago
Software Engineer - Hosted Model Infrastructure
$145k - $200k
...more. The Role We are a software engineering team with expertise in enabling ML models in production. We deploy AI... ...forward-deployed defense environments, edge nodes, and enterprises with... ...Java, Rust, Python and Go Model serving engines for GPU-accelerated inference...
Work experience placement
Work at office
Remote work
Work from home
Relocation package
Palantir Technologies
Palo Alto, CA
16 hours ago
Inference Optimization Engineer (local / edge runtime)
$170.5k - $315.49k
## Inference Optimization Engineer (local / edge runtime)Applylocations: US, California... ...by design. Small, efficient models run directly on the user's machine (AI PC, edge, on-prem, and beyond), keeping data... ...belong to the people it serves# Role SummaryMake models fast...
Internship
Local area
Immediate start
Shift work
Intel
Santa Clara, CA
2 days ago
Senior/Staff ML Engineer - Speech & NLP Systems
$155k - $207k
...and deploy cutting-edge AI technology to help... ...responsible for ML and work alongside... ...veteran scientists and engineers. As a Senior/Staff... ...and speech models using PyTorch and/or... ...training, inference, and serving infrastructure while... ...-end with product, infra, research, and data...
Permanent employment
Cacheflow
Mountain View, CA
3 days ago
Systems Engineer - Simulation Correctness
...that a single foundation model works out of the box... ...and evaluate the cutting edge solutions developed by... ...mechanism. Work with software engineers to implement your... ...Working knowledge of ML basics back prop, loss... ...environments, already serving Tier-1 semiconductor and...
Getvinci
Palo Alto, CA
6 hours ago
Engineering Manager, Agentic Systems - Moveworks
...Engineering Manager, Agentic Systems - Moveworks Job Description... ...of the cutting‑edge platform that powers Moveworks... ...systems for the entire ML/LLM lifecycle. This... ...and inference, model evaluation frameworks,... ...frameworks your team builds serve as the foundation for all...
Moveworks.ai
Mountain View, CA
5 days ago
Technical Program Manager, Model Alignment and Deployment
About the role Model Alignment and Deployment is a critical, cross-functional effort spanning our Post-Training, Safety Engineering, Trust & Safety, ML Infra/Model Serving, and User Experience Research (UXR) teams. Together, these groups are responsible for transforming...
Character.AI
Redwood City, CA
2 days ago
Senior AIML Engineer — AI Model Evaluation & Benchmarking
...Inc. is seeking a Senior Machine Learning Engineer in Cupertino, California, to evaluate and... ...design and develop key infrastructures for model and agent evaluations, contribute to quality... ...learning, and proficiency in Python and ML frameworks such as PyTorch. Join Apple to...
Apple Inc.
Cupertino, CA
2 days ago
ML Systems Engineer: Production Pipelines & Cloud Infra
Harmonic in Palo Alto is seeking a pragmatic Software Engineer to lead the productionization of research pipelines within... ...AI projects. You will engage in building robust ML pipelines as part of a cutting-edge team, ensuring efficient coding practices and scalable...
Harmonic
Palo Alto, CA
2 days ago
Forward Deployed Engineer
...turn Fish Audio's models into production voice... ...a Forward Deployed Engineer to embed directly... ...– the ones with on‑prem requirements, sub‑1... ...back to research, infra, and product. Influence... ...customers can self‑serve from. What You... ...Experience deploying ML/AI systems into production...
Contract work
Work at office
Remote work
Visa sponsorship
39 Ai, Inc.
Mountain View, CA
4 days ago
Model Serving Engineer
$100k - $150k
...their operations. We leverage cutting-edge technologies to create scalable, secure... ...continue to grow, we’re looking for a skilled Model Serving Engineer to join our dynamic team and contribute... ..., throughput, cost, and quality in ML serving. Key Responsibilities...
Full time
H1b
Local area
Immediate start
Remote work
Visa sponsorship
Work visa
Bright Vision Technologies
Newark, CA
5 days ago
Senior Deep Learning Engineer - Model Evaluation & AI Systems
$224k - $356.5k
...-performance computing. As a Senior / Principal Deep Learning Engineer — Model Evaluation & AI Systems, you will play a meaningful role in crafting... ...building or improving evaluation frameworks, benchmarks, or ML infrastructure used by other teams or external users.* A...
NVIDIA Corporation
Santa Clara, CA
3 days ago
Senior Performance Engineer, Inference
...effortlessly run large‑scale ML applications, without the... ...customers include top model labs, global enterprises, and cutting‑edge AI‑native startups.... ...hiring a Senior Performance Engineer to join our Product team.... ...inference performance and will serve as our resident expert on...
Contract work
Shift work
Cerebras
Sunnyvale, CA
16 hours ago
Customer Engineer III, Applied AI, Google Cloud
$174k - $253k
...leveraging AI solutions, ML APIs, prompting, agent... ...with large language models (LLMs), retrieval‑augmented... ...management and engineering teams to stay on top of... ...leverage Google’s cutting‑edge technology, and tools that... ...benefits Responsibilities Serve as a trusted advisor to...
Temporary work
Google
Sunnyvale, CA
2 days ago
Apprentice Engineer - Artificial Intelligence / Machine Learning
$82k - $109k
...and drive to either develop or restart their career in engineering. We fundamentally believe top talent can come from... ...Learning engineering team at LinkedIn developing cutting-edge machine learning models that serve millions of members. You will learn from fellow engineers...
For contractors
Apprenticeship
Work experience placement
NLP PEOPLE
Mountain View, CA
16 hours ago
Distinguished Engineer, Applied AI
$150k - $300k
## Distinguished Engineer, Applied AIApplylocations: Palo Alto, CAtime... ...and sales representatives who serve more than 80 million customers... ...CRM roadmap powered by LLM models and Agentic AI. The ideal candidate... ...capabilities across AI/ML, distributed systems, and operational...
Hourly pay
Work experience placement
Local area
Flexible hours
Shift work
GEICO
Palo Alto, CA
2 days ago
Senior Sales Engineering
...advanced device intelligence, powerful decision engine, and investigation tools work together to... ...Science. During the onboarding phase, serve as the technical escalation point for... ...and machine learning, to provide cutting‑edge solutions to clients. Stay updated on product...
DataVisor
Mountain View, CA
3 days ago
Senior AI Data Pipeline Engineer (Autonomous Driving)
$133k - $254k
...system. Our AI Data Pipeline Engineers build up the core data... ...autonomous driving cutting edge algorithms. We develop the... ...well as high‑performance data serving SDKs for ML model training / evaluation. The... ...ML application, and Cloud Infra to align data pipelines with...
Work experience placement
42dot
Sunnyvale, CA
4 days ago
Staff Site Reliability Engineer
$150k - $180k
...variable uptime workloads (e.g., AI/ML). Verrus builds and capitalizes... ...is looking for candidates to serve as software-focused Senior Site Reliability Engineer at Verrus. This is a full‑time position... ...at the intersection of leading‑edge electrical and mechanical...
Full time
Work at office
Local area
Flexible hours
Verrus, LLC
Mountain View, CA
3 days ago
Runtime Engineer — Remote ML Inference & Systems
MatX is seeking a skilled software engineer to build custom silicon for AI language models in Mountain View, California. The role requires strong programming... ...memory management and developing an LLM inference serving stack. Benefits include generous salary, equity offerings...
Remote job
MatX
Mountain View, CA
2 days ago
Quality Engineering Senior
$175k - $275k
...users to run large-scale ML applications without managing... ...customers include top model labs, global enterprises, and cutting-edge AI-native startups.... ...hands-on Senior Quality Engineer to drive Manufacturing Quality... ...Manufacturing Quality Execution: Serve as the primary quality...
Contract work
Cerebras
Sunnyvale, CA
4 days ago
Senior Data Systems Engineer for Foundation Model Training
...Cupertino, California, is seeking a Senior Research Engineer focused on training data infrastructure for advanced AI models. The ideal candidate will possess strong skills... ...offers the opportunity to contribute to cutting-edge AI development in a collaborative and innovation...
Apple Inc.
Cupertino, CA
3 days ago
ML Engineer - Inference & Model Deployment
...building a 100x better job search engine: fast, comprehensive, honest,... .... We are looking for a founding ML engineer who can help us turn powerful AI and ML models into fast, reliable production systems... ...latency and throughput, scaling serving systems, and making sure our...
Relocation package
HiringCafe
Cupertino, CA
4 days ago
Member of Technical Staff - Imagine Model
$180k
...highly motivated, and focused on engineering excellence. This organization is for... ...engineer on the Imagine Model Team, you will develop cutting-edge AI experiences beyond text, with a... ...curation, modeling, training, inference serving, and product integration, covering...
Temporary work
xAI
Palo Alto, CA
6 days ago
Senior Electro-Mechanical Engineer
...motivated Senior Electro-Mechanical Engineer to join our team. In this position, you will serve as the subject matter expert for... ...in the success of our cutting-edge eVTOL aircraft.... ...electrical system performance, thermal modeling, and stress analysis for enclosures...
Work at office
Pivotal
Palo Alto, CA
5 days ago

Do you want to receive more vacancies?

Subscribe and receive similar vacancies to ML Model Serving Infra Engineer - On-Prem & Edge. Be the first to apply!