ML Model Serving Infra Engineer - On-Prem & Edge
Palantir Technologies
Palantir Technologies is looking for a Software Engineer to enhance its AI modeling capabilities. You will develop and maintain production systems, ensuring robust model serving infrastructure. The ideal candidate has 4+ years of software engineering experience, strong coding abilities in languages like Java and Python, and is well-versed in containers and Kubernetes. Benefits include comprehensive health insurance, paid time off, 401k plan, and more.
$181.1k - $318.4k
...intersection of cutting-edge AI and human-... ...These are multimodal models that power Siri on-... ...and modeling engineers train models, iterate... ..., observable, self-serve system. The work spans... ...automation pipelines for ML models where agents... ...-backed training infra) including...
Relocation
$220.2k - $330.4k
...Technologies, Inc. Job Area: Engineering Group, Engineering... ...connected intelligent edge, focusing on AI, edge computing... ...and industries. AI on‑prem Appliance is a new... ...customer data, fine‑tuned models, and inference loads to... ...10+ years in the AI/ML fields. Qualcomm is an equal...
Work experience placement, Work at office, Work from home
- ...Technical Staff - Diffusion Model About the Role RadixArk... ...will work on cutting-edge diffusion and flow-... ...research thinking with strong engineering execution - from... ...years of experience in ML research or applied ML... ..., the fastest open LLM serving engine), and developed...
Flexible hours
- ...skilled professional to enhance the performance of large-scale models through advanced optimization techniques in Santa Clara, California... .... The role offers an opportunity to work closely with cutting-edge technologies and a collaborative team. Benefits include a competitive...
- ...looking for a Machine Learning Engineer to help build cutting edge ML infrastructure for building and serving LLMs at Moveworks. This role... ...learning systems. The ML infra team covers a variety of responsibilities... ...pipeline for large language models (LLM), model evaluation and...
- Model AI is looking for a Founding Machine Learning Infrastructure Engineer in Palo Alto to help optimize infrastructure for AI systems.... ...will focus on enhancing model serving performance and cost efficiency... ...have strong experience in ML and distributed systems, and troubleshooting...
$145k - $200k
...more. The Role We are a software engineering team with expertise in enabling ML models in production. We deploy AI... ...forward-deployed defense environments, edge nodes, and enterprises with... ...Java, Rust, Python and Go Model serving engines for GPU-accelerated inference...
Work experience placement, Work at office, Remote work, Work from home, Relocation package
$170.5k - $315.49k
Inference Optimization Engineer (local / edge runtime) locations: US, California... ...by design. Small, efficient models run directly on the user's machine (AI PC, edge, on-prem, and beyond), keeping data... ...belong to the people it serves. Role Summary: Make models fast...
Internship, Local area, Immediate start, Shift work
$155k - $207k
...and deploy cutting-edge AI technology to help... ...responsible for ML and work alongside... ...veteran scientists and engineers. As a Senior/Staff... ...and speech models using PyTorch and/or... ...training, inference, and serving infrastructure while... ...-end with product, infra, research, and data...
Permanent employment
- ...that a single foundation model works out of the box... ...and evaluate the cutting edge solutions developed by... ...mechanism. Work with software engineers to implement your... ...Working knowledge of ML basics back prop, loss... ...environments, already serving Tier-1 semiconductor and...
- ...Engineering Manager, Agentic Systems - Moveworks Job Description... ...of the cutting‑edge platform that powers Moveworks... ...systems for the entire ML/LLM lifecycle. This... ...and inference, model evaluation frameworks,... ...frameworks your team builds serve as the foundation for all...
- About the role Model Alignment and Deployment is a critical, cross-functional effort spanning our Post-Training, Safety Engineering, Trust & Safety, ML Infra/Model Serving, and User Experience Research (UXR) teams. Together, these groups are responsible for transforming...
- ...Inc. is seeking a Senior Machine Learning Engineer in Cupertino, California, to evaluate and... ...design and develop key infrastructures for model and agent evaluations, contribute to quality... ...learning, and proficiency in Python and ML frameworks such as PyTorch. Join Apple to...
- Harmonic in Palo Alto is seeking a pragmatic Software Engineer to lead the productionization of research pipelines within... ...AI projects. You will engage in building robust ML pipelines as part of a cutting-edge team, ensuring efficient coding practices and scalable...
- ...turn Fish Audio's models into production voice... ...a Forward Deployed Engineer to embed directly... ...– the ones with on‑prem requirements, sub‑1... ...back to research, infra, and product. Influence... ...customers can self‑serve from. What You... ...Experience deploying ML/AI systems into production...
Contract work, Work at office, Remote work, Visa sponsorship
$100k - $150k
...their operations. We leverage cutting-edge technologies to create scalable, secure... ...continue to grow, we’re looking for a skilled Model Serving Engineer to join our dynamic team and contribute... ..., throughput, cost, and quality in ML serving. Key Responsibilities...
Full time, H1b, Local area, Immediate start, Remote work, Visa sponsorship, Work visa
$224k - $356.5k
...-performance computing. As a Senior / Principal Deep Learning Engineer - Model Evaluation & AI Systems, you will play a meaningful role in crafting... ...building or improving evaluation frameworks, benchmarks, or ML infrastructure used by other teams or external users.* A...
- ...effortlessly run large‑scale ML applications, without the... ...customers include top model labs, global enterprises, and cutting‑edge AI‑native startups.... ...hiring a Senior Performance Engineer to join our Product team.... ...inference performance and will serve as our resident expert on...
Contract work, Shift work
$174k - $253k
...leveraging AI solutions, ML APIs, prompting, agent... ...with large language models (LLMs), retrieval‑augmented... ...management and engineering teams to stay on top of... ...leverage Google’s cutting‑edge technology, and tools that... ...benefits Responsibilities Serve as a trusted advisor to...
Temporary work
$82k - $109k
...and drive to either develop or restart their career in engineering. We fundamentally believe top talent can come from... ...Learning engineering team at LinkedIn developing cutting-edge machine learning models that serve millions of members. You will learn from fellow engineers...
For contractors, Apprenticeship, Work experience placement
$150k - $300k
Distinguished Engineer, Applied AI locations: Palo Alto, CA time... ...and sales representatives who serve more than 80 million customers... ...CRM roadmap powered by LLM models and Agentic AI. The ideal candidate... ...capabilities across AI/ML, distributed systems, and operational...
Hourly pay, Work experience placement, Local area, Flexible hours, Shift work
- ...advanced device intelligence, powerful decision engine, and investigation tools work together to... ...Science. During the onboarding phase, serve as the technical escalation point for... ...and machine learning, to provide cutting‑edge solutions to clients. Stay updated on product...
$133k - $254k
...system. Our AI Data Pipeline Engineers build up the core data... ...autonomous driving cutting edge algorithms. We develop the... ...well as high‑performance data serving SDKs for ML model training / evaluation. The... ...ML application, and Cloud Infra to align data pipelines with...
Work experience placement
$150k - $180k
...variable uptime workloads (e.g., AI/ML). Verrus builds and capitalizes... ...is looking for candidates to serve as software-focused Senior Site Reliability Engineer at Verrus. This is a full‑time position... ...at the intersection of leading‑edge electrical and mechanical...
Full time, Work at office, Local area, Flexible hours
- MatX is seeking a skilled software engineer to build custom silicon for AI language models in Mountain View, California. The role requires strong programming... ...memory management and developing an LLM inference serving stack. Benefits include generous salary, equity offerings...
Remote job
$175k - $275k
...users to run large-scale ML applications without managing... ...customers include top model labs, global enterprises, and cutting-edge AI-native startups.... ...hands-on Senior Quality Engineer to drive Manufacturing Quality... ...Manufacturing Quality Execution: Serve as the primary quality...
Contract work
- ...Cupertino, California, is seeking a Senior Research Engineer focused on training data infrastructure for advanced AI models. The ideal candidate will possess strong skills... ...offers the opportunity to contribute to cutting-edge AI development in a collaborative and innovation...
- ...building a 100x better job search engine: fast, comprehensive, honest,... .... We are looking for a founding ML engineer who can help us turn powerful AI and ML models into fast, reliable production systems... ...latency and throughput, scaling serving systems, and making sure our...
Relocation package
$180k
...highly motivated, and focused on engineering excellence. This organization is for... ...engineer on the Imagine Model Team, you will develop cutting-edge AI experiences beyond text, with a... ...curation, modeling, training, inference serving, and product integration, covering...
Temporary work
- ...motivated Senior Electro-Mechanical Engineer to join our team. In this position, you will serve as the subject matter expert for... ...in the success of our cutting-edge eVTOL aircraft.... ...electrical system performance, thermal modeling, and stress analysis for enclosures...
Work at office


