Staff ML Infra Engineer: Large-Scale Pretraining & MLOps
$181.1k - $318.4kApple Oakbrook
Apple Inc. is looking for a Staff ML Infrastructure Engineer in San Francisco to lead pre-training initiatives for cutting-edge foundation models in machine learning. The successful candidate will have over 6 years of experience in building scalable backend systems, be proficient in Python and Go, and possess strong knowledge of distributed systems and containerization. The role offers a competitive base pay range of $181,100 to $318,400, along with comprehensive benefits, employee stock options, and educational reimbursements. #J-18808-Ljbffr
$181.1k - $318.4k
Apple Inc. is looking for a Staff ML Infrastructure Engineer in San Francisco to lead pre-training initiatives for cutting-edge foundation models in machine learning. The successful candidate will have over 6 years of experience in building scalable backend systems, be...Suggested- Responsibilities Design, deploy, and maintain large distributed ML training and inference clusters Develop efficient, scalable end-to-end pipelines to manage petabyte-scale datasets and model training throughout the entire ML lifecycle Research and test various training...Suggested
$250k
...Consulting Ltd is looking for a talented ML/AI Research Engineer to join their San Francisco team. You will... ...powers training, deployment, and governance of large-scale AI systems. The ideal candidate has a strong background in MLOps, Kubernetes, and cloud infrastructure,...Suggested$250k - $350k
...This one builds what makes them actually work. We’re hiring ML Infrastructure Engineers to tackle a hard, real-world problem, understanding what’s happening on live job sites using wearable devices, large-scale video, and AI. This isn’t clean benchmark data. It’s messy,...Suggested- ...A leading AI technology company in San Francisco is seeking an engineering professional to develop and manage intelligent job scheduling systems for large-scale AI applications. This role focuses on ensuring efficient resource allocation across GPU and TPU clusters while...Suggested
- ...Hayden AI Technologies, Inc. is looking for an MLOps Engineer in San Francisco to design and implement cloud-based workflows for AI models. This role involves collaboration with cross-functional teams to enhance workflow efficiency and demands a Bachelor's degree in Computer...Work at office
- A leading AI company in San Francisco is seeking a skilled ML Infrastructure Engineer to manage and optimize large-scale training systems. In this role, you will design and maintain infrastructure for model training, ensuring efficient GPU/TPU utilization while working...
$131.4k - $235.95k
...Autodesk, Inc. is seeking a Senior Machine Learning Engineer for MLOps in San Francisco. You will ensure AI-powered experiences meet high standards for reliability and scalability. Key responsibilities include automating model testing, managing inference services, and...- ...Reducto, a fast-growing AI company in San Francisco, is hiring a Machine Learning Infra Engineer. This role involves building and maintaining the training and inference frameworks necessary for optimal performance. Ideal candidates should possess strong Python skills,...
- ...accordingly. That doesn't scale. We need a scheduling... ...tolerance, and making large-scale distributed training... .... The Team The ML Infrastructure team... ...will work closely with ML Infra (training systems),... ...have: - Strong software engineering fundamentals -...Flexible hours
$100k - $200k
...Voiceflow is seeking a skilled ML-Infrastructure Engineer in San Francisco to architect and operate auto-scaling systems for our voice AI simulation platform. The role includes optimizing GPU and compute infrastructure, ensuring high performance and reliability. Ideal...Work at office- ...this role you will help scale and optimize our... ...critical infrastructure for large-scale training, from managing... ...researchers and model engineers to translate ideas into... ...at the intersection of ML, software engineering,... ...research needs into infra capabilities and guide...
- ...URun in San Francisco is searching for an ML Infrastructure and Platform Engineer. In this role, you will lead the architecture and scaling of our GPU compute platform from the ground up, ensuring high availability and low-latency inference. This is a founding technical...
$155.58k - $320.32k
A leading social media platform is seeking a Senior MLOps Engineer to enhance their Connected TV ad-buying platform. The ideal candidate will scale machine learning practices, improve developer experiences, and provide technical leadership. Strong expertise in Linux, high...- ...A leading AI infrastructure company is seeking a Senior Staff Software Engineer for its Cloud Storage team. This role involves developing the... ...hardware and software solutions, and innovating on large-scale cloud infrastructures. Candidates should have over 12 years...
- ...A leading AI research organization in San Francisco seeks an Infrastructure Engineer to design and maintain large distributed ML training and inference clusters. The ideal candidate will have a strong grasp of optimizing training workloads and experience with distributed...
- ...A pioneering tech startup in neurotechnology is seeking a Senior Machine Learning Infrastructure Engineer to design and scale critical infrastructure powering ML applications. This role involves creating robust data pipelines and optimizing modeling processes, essential...
$150k - $350k
...ML Engineer — Tilde Research Location: San Francisco, CA (Onsite) Compensation... .... They work on reasoning, pretraining, scaling laws, and architecture-level... ...architectures, debug large-scale training issues, and... ...theory This Role Is NOT For MLOps or infrastructure-focused profiles...- ...A leading technology company is looking for an ML Infrastructure Engineer in San Francisco. The successful candidate will build and maintain ML training pipelines and ensure low-latency model serving. Candidates should have over 4 years of experience in ML engineering...Work at office
$147.6k - $274k
...Description: Machine Learning Engineer - Infra San Francisco, CA The Opportunity... ...seeking a highly motivated and skilled ML Engineer to join our growing team within... ...Preferred Extensive experience with large-scale ML model platforms and tools. Deep understanding...Relocation package- ...is looking for a Machine Learning Engineer in San Francisco, CA, who will drive the development of advanced ML and AI solutions. This role focuses on applying large language models and deep learning... ...model and the opportunity to scale impactful solutions. #J-18808-Ljbffr...
- ...models, and automate processes at scale. We’ve grown incredibly... ...are hiring a Machine Learning Engineer to help us train and deploy the... ...product. The Opportunity As an ML Infra Engineer , you’ll play a key role... ...inference workloads across large GPU clusters while improving utilization...Work at officeLocal area
- ...ML Systems Engineer – Robotics & AI We are building the full-stack foundation for the next generation... .... We work at the intersection of large-scale learning, robotics, and systems, with... ...manufacturing scale-up. We are hiring a Staff/Principal ML Systems Engineer to own...
- ...About the Role ML Ops Engineer — Agentic AI Lab (Founding Team)... ...Infrastructure: 4+ years in MLOps, ML platform engineering, or infra-focused ML roles Deep... ...HuggingFace Hub Experience with large model deployments (open-... ...(spot instance scaling, batch prioritization, model...Full time
$250k
...backed AI Startup, is hiring a talented ML/AI Research Engineer to join their team in San Francisco.... ...deployment, evaluation and governance of large‑scale AI systems. Build end‑to‑end pipelines... ...Minimum of 4 years of experience in MLOps, ML infrastructure or backend/platform...- A cutting-edge AI technology company based in San Francisco is seeking a specialist to design and operate large-scale GPU infrastructure. This role requires expertise in deploying GPU systems for high-throughput inference and model performance optimization. The ideal candidate...
- ...We don't believe culture can be engineered - but when it falls into place,... ...Position Overview We're looking for an ML infrastructure engineer to help design, build, and scale the foundational systems we need... ...to dataset curation to large-scale model training and deployment...Local area
- ...Senior Infrastructure Engineer – Bland As a Senior... ...and solving scaling challenges related... ...industries. Lead – AI/ML Stack Infrastructure... ...and roadmap for the MLOps team to support ML... ...Engineers to manage large datasets and artifacts... ...capabilities. Staff DevOps Engineer – AI...Temporary work
- ...A pioneering AI firm based in San Francisco is seeking a Research Engineer, Distributed Data Systems. In this role, you will design and maintain infrastructure for large-scale multimodal training, ensuring scalability and reliability of data systems. Candidates should...Work at officeRelocation package
- ...company in San Francisco is looking for a Senior Software Engineer to build scalable infrastructure for large‑scale training and fine-tuning of foundation models. You... ...Ideal candidates have over 5 years of experience in ML infrastructure and a strong background in...
Do you want to receive more vacancies?
Subscribe and receive similar vacancies to Staff ML Infra Engineer: Large-Scale Pretraining & MLOps. Be the first to apply!
- software engineer staff San Francisco, CA
- assistant engineer San Francisco, CA
- assistant engineering manager San Francisco, CA
- staff design engineer San Francisco, CA
- project engineer assistant project manager San Francisco, CA
- technology administrator San Francisco, CA
- staff data engineer San Francisco, CA
- assistant chief engineer San Francisco, CA
- senior staff systems engineer San Francisco, CA
- staff engineer San Francisco, CA

