Senior MLOps & AI Infrastructure Engineer

$149.1k - $215.93k

Full-time

Altera

Job Details: Job Description: About Altera At Altera™, our independence as the world’s largest pure‑play FPGA solutions provider gives us the focus, speed, and agility to innovate without compromise. With more than four decades of industry‑leading FPGA expertise, our singular mission is to deliver the programmable technologies that help customers differentiate, innovate, and scale across rapidly evolving markets like AI, cloud, networking, and edge. As an independent company, we move faster, invest deeper, and partner more closely—empowering our teams to drive breakthrough innovation and shape the future of the FPGA industry. About the Role We are looking for a Senior MLOps & AI Infrastructure Engineer to architect, build, and operationalize machine learning systems at scale. This role sits at the intersection of data science, software engineering, and infrastructure — combining deep ML expertise with the DevOps/MLOps discipline required to ship models reliably into production. You will partner closely with software, data, and infrastructure teams to design end-to-end ML pipelines, automate model lifecycle management, and deliver AI-powered capabilities across our EDA, HPC, and cloud environments. Key Responsibilities: ML Platform & Pipeline Engineering • Design, build, and maintain scalable ML pipelines for training, evaluation, and deployment across cloud and on-prem HPC environments • Build MLOps infrastructure including experiment tracking, model registry, feature stores, and automated retraining workflows • Implement CI/CD/CT (Continuous Training) pipelines for ML models using tools such as Kubeflow, MLflow, Airflow, or similar • Containerize ML workloads with Docker and orchestrate at scale using Kubernetes and GPU node pools Model Development & Optimization • Develop, fine-tune, and deploy large-scale models including LLMs, GNNs, and reinforcement learning agents for EDA and chip design applications • Apply advanced techniques: transfer learning, quantization, pruning, distillation, and RLHF for production-grade model efficiency • Implement A/B testing frameworks and shadow deployments for safe model rollout • Benchmark and optimize model inference performance on GPU/TPU clusters Data Engineering & Feature Management • Build and maintain data pipelines for large-scale structured and unstructured datasets (terabyte-scale) • Collaborate with data teams to design feature engineering systems and maintain data quality for ML training • Implement data versioning and lineage tracking (DVC, Delta Lake, or similar) Infrastructure & Operations • Manage cloud ML infrastructure on AWS (SageMaker), Azure (AML), or GCP (Vertex AI) with cost and performance optimization • Automate infrastructure provisioning using Terraform or CloudFormation for GPU-backed ML environments • Build monitoring, alerting, and observability systems for model performance drift, data quality, and system health • Support HPC schedulers (LSF, Slurm) for large-scale distributed training jobs Collaboration & Leadership • Partner with research scientists to productionize experimental models with engineering rigor • Mentor junior engineers and define ML engineering best practices across the organization • Drive adoption of AI/ML solutions within semiconductor, EDA, and simulation workflows Technology Stack ML Frameworks: PyTorch • TensorFlow • JAX • Hugging Face • scikit-learn • XGBoost MLOps & Pipelines: MLflow • Kubeflow • Airflow • Weights & Biases • DVC • Feast Infrastructure & Cloud: AWS SageMaker / GCP Vertex AI / Azure ML • Terraform • Docker • Kubernetes • Slurm / LSF Languages: Python • Bash • Go • SQL Monitoring & Observability: Prometheus • Grafana • ELK Stack • Evidently AI • Arize Key Competencies • Strong ownership mindset — you drive ML initiatives from prototype to production without being asked • Bias toward automation: if you do it twice, you automate it • Ability to bridge research and engineering — translating papers into production-grade systems • Thrives in fast-paced, ambiguous environments typical of deep-tech and semiconductor companies • Clear communicator who can explain complex ML concepts to non-technical stakeholders Salary Range The pay range below is for Bay Area California only. Actual salary may vary based on a number of factors including job location, job-related knowledge, skills, experiences, trainings, etc. We also offer incentive opportunities that reward employees based on individual and company performance. $149,100 - $215,925 USD We use artificial intelligence to screen, assess, or select applicants for the position. Applicants must be eligible for any required U.S. export authorizations. Qualifications: Required Qualifications Bachelor’s or Master’s degree in Computer Science, Machine Learning, Statistics, or related field and 10+ years of industry experience 10+ years of experience across ML engineering, data science, and MLOps — including frameworks (PyTorch, TensorFlow, JAX, Hugging Face) and production model deployment at scale 8+ years of experience experience with parallelism strategies (FSDP, DeepSpeed, data/model parallelism) 10+ years of experience and proficiency in Python programming 8+ years of experience in cloud ML platforms (AWS, GCP, Azure), Docker/Kubernetes, and CI/CD pipelines 5+ years of hands-on experience with MLflow, W&B, or Neptune for tracking and reproducibility Preferred Qualifications Phd in Computer Science, Machine Learning, Statistics, or related field Experience applying ML/AI to semiconductor, EDA, or chip design domains (e.g., timing prediction, place & route optimization, DRC closure) Familiarity with HPC schedulers such as LSF or Slurm and GPU cluster management for training workloads Knowledge of LLM fine-tuning, Retrieval-Augmented Generation (RAG) architectures, and AI agent frameworks such as LangChain or AutoGen Experience with graph neural networks (GNNs) or geometric deep learning for circuit and netlist analysis Background in reinforcement learning for optimization problems Exposure to zero-trust security, DevSecOps, and compliance automation for ML systems Experience working with large-scale simulation pipelines and synthetic data generation Experience at organizations such as NVIDIA, AMD, Intel, Google DeepMind, or similar AI/HPC-focused companies Published research or open-source contributions in ML, MLOps, or AI for EDA Experience building AI-powered developer tools or copilot-style products Familiarity with Synopsys, Cadence, or Siemens EDA toolchains and associated data formats Job Type: Regular Shift: Shift 1 (United States of America) Primary Location: San Jose, California, United States Additional Locations: Posting Statement: All qualified applicants will receive consideration for employment without regard to race, color, religion, religious creed, sex, national origin, ancestry, age, physical or mental disability, medical condition, genetic information, military and veteran status, marital status, pregnancy, gender, gender expression, gender identity, sexual orientation, or any other characteristic protected by local law, regulation, or ordinance. About Altera Altera: Accelerating Innovators Altera provides leadership programmable solutions that are easy-to-use and deploy in applications from cloud to edge, offering limitless AI possibilities. Our end-to-end broad portfolio of products including FPGAs, CPLDs, Intellectual Property, development tools, System on Modules, SmartNICs and IPUs provide the flexibility to accelerate innovation. Altera is helping to shape the future through pioneering innovation that unlocks extraordinary possibilities for everyone on the planet. Don't see the dream job you are looking for? Click "Get Started" below to drop off your contact information and resume and we will reach out to you if we find the perfect fit.

Apply

Vacancy posted 1 day ago

Similar jobs that could be interesting for youBased on the Senior MLOps & AI Infrastructure Engineer in San Jose, CA vacancy

Senior MLOps & AI Ops Engineer — Remote
A leading AI solutions provider is seeking an experienced MLOps / AI Ops Engineer for a remote 12-month contract. The role involves building and automating CI/CD pipelines for machine learning models, establishing monitoring frameworks, and managing deployment strategies...
Senior
Remote job
Contract work
DeWinter Group
Campbell, CA
3 days ago
Senior Staff AI Platform Engineer
...hire a deeply technical, creative, and Senior AI Platform Engineer to build, support, and maintain the... ...be doing: Define and lead AI-native infrastructure roadmaps and cross‑organizational... ...operating AI/ML platforms, including MLOps, model serving, and GPU‑accelerated...
Senior
NVIDIA Gruppe
Santa Clara, CA
3 days ago
Senior AI Infrastructure Engineer
$180k - $240k
...logistics operations. ABOUT THE ROLE We are seeking a Senior AI Infrastructure Engineer to design, build, and scale the high-performance AI... ...edge cases from raw data. * Model Management & Lifecycle (MLOps) * Automated Lifecycle Management: Design and maintain...
Senior
Odd job
Full time
Work at office
Gatik AI
Santa Clara, CA
7 hours ago
Senior AI/RL Infrastructure Engineer
$126k - $423k
Decisive Point is seeking a Research Engineer (AI/RL Infrastructure) in Sunnyvale, California to design and operate large-scale ML systems. You will collaborate with leading experts and contribute to next-generation physical AI, impacting self-driving technologies. This...
Senior
Decisive Point
Sunnyvale, CA
1 day ago
Senior AI Platform Engineer
$172.5k - $306.63k
...organizations to create exceptional content effortlessly. The AI for Engineering team builds a scalable, production‑grade AI platform that... ...engines, tools, and data streams into adaptive AI systems. Mentor senior engineers in modern AI system design, LLM orchestration...
Senior
Local area
Dormont Manufacturing Company
San Jose, CA
4 days ago
Senior AI Infrastructure Engineer, Large-Scale GPU Clusters
NVIDIA Corporation in Santa Clara is seeking a Senior Software Engineer to lead the optimization of large-scale AI systems. This role will involve profiling and... ...will have over 8 years of experience in software infrastructure for AI systems, with expert-level programming...
Senior
NVIDIA Corporation
Santa Clara, CA
1 day ago
Senior AI Training Infrastructure Engineer
$182k - $242k
CoreWeave is seeking an experienced professional to contribute to building distributed systems and ML infrastructure. The successful candidate will play a pivotal role in designing an optimal research cluster experience, including a Python SDK, while collaborating closely...
Senior
Jobr
Sunnyvale, CA
3 days ago
Senior AI Infrastructure Engineer (C++/GPU)
$283.4k
KLA is seeking a Sr. AI Infrastructure Software Engineer in Milpitas, California. This role focuses on C++ programming and involves designing core infrastructure for AI workloads. Join a top-notch team solving complex problems at the intersection of software and hardware...
Senior
Dormont Manufacturing Company
Milpitas, CA
4 days ago
Senior Lead AI Engineer (GenAI Platform Services)
$229.9k - $262.4k
...Senior Lead AI Engineer (GenAI Platform Services) Overview At Capital One, we are creating responsible and reliable AI systems, changing... ...personalized customer experiences. Our investments in technology infrastructure and world‑class talent — along with our deep experience...
Senior
Local area
Comfort Systems USA
San Jose, CA
2 days ago
Senior AI Infrastructure Support Engineer
Drive Capital is seeking a Senior Customer Support Engineer in Campbell, CA. This role involves responding to customer inquiries, managing technical operations, and building strong relationships with customers based on technical excellence. The ideal candidate will have...
Senior
Drive Capital
Campbell, CA
2 days ago
Senior AI Platform Engineer
$172.5k - $306.63k
...Staff Engineer - AI For Engineering Adobe empowers individuals and organizations to create exceptional content effortlessly. The AI for... ..., tools, and data streams into adaptive AI systems. Mentor senior engineers in modern AI system design, LLM orchestration patterns...
Senior
Temporary work
Local area
Worldwide
Adobe
San Jose, CA
2 days ago
Senior Staff AI Infrastructure Engineer
$262k - $365k
Google Inc. seeks a Senior Staff Software Engineer for AI Infrastructure within Google Cloud. This role involves architecting high-performance, distributed infrastructure for agentic AI workflows, with responsibilities including system reliability and transitioning experimental...
Senior
Google Inc.
Sunnyvale, CA
1 day ago
Senior AI Infra Engineer - Large-Scale DGX Cloud (Equity)
$356.5k
NVIDIA Gruppe is seeking an experienced AI infrastructure software engineer to join its DGX Cloud AI Efficiency Team in Santa Clara, California. This role focuses on developing the infrastructure for optimizing AI workloads and ensuring high availability and efficiency...
Senior
NVIDIA Gruppe
Santa Clara, CA
3 days ago
Senior Lead AI Engineer (Gen AI Platform Services)
$229.9k - $262.4k
...Senior Lead AI Engineer (Gen AI Platform Services) Overview: At Capital One, we are creating responsible and reliable AI systems, changing... ...personalized customer experiences. Our investments in technology infrastructure and world-class talent — along with our deep experience...
Senior
Full time
Part time
Local area
Capital One
San Jose, CA
1 day ago
Senior Lead AI Engineer (Gen AI Platform Services, Agentic AI)
$229.9k - $262.4k
...Senior Lead AI Engineer (Gen AI Platform Services, Agentic AI) Overview: At Capital One, we are creating responsible and reliable... ...personalized customer experiences. Our investments in technology infrastructure and world-class talent - along with our deep experience...
Senior
Full time
Part time
Local area
Capital One Financial Corp
San Jose, CA
5 days ago
Senior AI Infrastructure Engineer - Special Projects
$181.1k - $318.4k
...for its Special Projects team in Cupertino, California. The role focuses on building innovative applications and robust infrastructure to support AI research. Candidates should excel in programming languages like Go or Swift and have experience with web services and containers...
Senior
Apple Inc.
Cupertino, CA
1 day ago
Senior AI Platform Engineer - Scale LLM Infra
$168k - $322k
NVIDIA Gruppe is seeking a Senior AI Platform Engineer to improve engineering efficiency and data security through AI-powered products. The... ...involves working with Cloud and AI/ML teams to build and scale infrastructure and shape the technological future of the organization....
Senior
NVIDIA Gruppe
Santa Clara, CA
3 days ago
Senior Datacenter AI Platform Engineer - Equity Options
NVIDIA Corporation is seeking a Datacenter Product Engineer to join its Datacenter team in Santa Clara, California. This role focuses on launching AI supercomputing platforms and supporting GPU production. The ideal candidate will collaborate with NPI teams and implement...
Senior
NVIDIA Corporation
Santa Clara, CA
1 day ago
Senior PCIe Networking & AI Fabric Solutions Engineer
NVIDIA Gruppe in Santa Clara is looking for an experienced engineer to support our new supercomputers and AI technologies. You will lead collaboration across various teams and work closely with customers to understand their needs and develop tailored features. The ideal...
Senior
NVIDIA Gruppe
Santa Clara, CA
3 days ago
Senior Windows AI Platform Engineer — GPU-Driven AI Deployment
NVIDIA Gruppe is looking for an experienced GPU Deployment Engineer to tackle end-to-end AI deployment challenges on the NVIDIA RTX AI platform. The role involves analyzing GPU-accelerated applications, improving user experiences, and collaborating with teams to influence...
Senior
NVIDIA Gruppe
Santa Clara, CA
3 days ago
Senior AI-Driven Cloud Platform Engineer (Multi-Cloud)
...searching for a high-level DevOps Platform Engineer to enhance its Multi-Cloud Platform. In this role, you will architect AI-driven workflows and lead production environments... ..., and Azure. You will build self-healing infrastructure and develop advanced CI/CD pipelines while...
Senior
Palo Alto Networks
Santa Clara, CA
4 days ago
Senior AI Platform Engineer - Scale & Security
$210k - $295k
...EXPLORATION TECHNOLOGIES CORP in Sunnyvale, CA, is seeking a Principal Software Engineer for the Platform Team. This role focuses on building foundational AI tooling and security infrastructure to enhance engineering workflows at SpaceX. The ideal candidate will have...
Senior
SPACE EXPLORATION TECHNOLOGIES CORP
Sunnyvale, CA
5 days ago
Senior AI Inference Systems Engineer: GPU-Optimized, Cloud
$184k - $356.5k
NVIDIA Gruppe is looking for skilled software engineers to develop AI inference systems that operate with high efficiency. The role involves architecting high-performance inference frameworks and optimizing GPU processes. Ideal candidates should have extensive programming...
Senior
NVIDIA Gruppe
Santa Clara, CA
3 days ago
Senior Applied AI Cloud Solutions Engineer
$174k - $253k
Google is seeking an Applied AI Customer Engineer in Sunnyvale, CA, offering a competitive salary ranging from $174,000 to $253,000 plus a bonus and equity. In this role, you will leverage your technical expertise to assist customers in adopting Conversational AI solutions...
Senior
Google
Sunnyvale, CA
4 days ago
Senior AI Platform Engineer - Build Production-Grade Agents
A technology firm specializing in AI solutions is seeking a Senior AI Engineer in Santa Clara, California. You will design and implement AI-powered software, managing everything from backend to frontend interfaces. Responsibilities include developing production-grade AI...
Senior
Dexmate
Santa Clara, CA
5 days ago
Senior DGX Cloud AI Infrastructure Software Engineer
Joining NVIDIA's DGX Cloud AI Efficiency Team means contributing to the infrastructure that powers our innovative AI research... ...an AI infrastructure software engineer to join our team. You'll be instrumental... ...of AI systems. As a senior DGX Cloud AI Infrastructure software...
Senior
NVIDIA Gruppe
Santa Clara, CA
3 days ago
Senior AI/ML Software Engineer (Cloud) | Equity + Bonus
$174k - $253k
Google Inc. is seeking a Senior Software Engineer specialized in AI/ML for its Sunnyvale, CA location. The role requires expertise in developing and optimizing machine learning infrastructure, along with deep experience in programming with Python or C++. Candidates should...
Senior
Google Inc.
Sunnyvale, CA
3 days ago
Senior AI Engineer - Lead LLM Platforms & Agentic Workflows
Lendistry, LLC. is seeking a Senior AI Engineer to lead the delivery of AI solutions, including document intelligence and risk assessment tools. In this role, you will be responsible for mentoring junior engineers and shaping AI-driven workflows, improving the borrower...
Senior
Lendistry, LLC.
Santa Clara, CA
2 days ago
Senior/Staff AI & Knowledge Platforms Engineer
A leading technology company in Santa Clara is seeking a Senior/Staff Software Engineer specializing in AI and search technologies. Candidates must possess strong skills in Kotlin or Java, have a minimum of 5 years' experience, and be proficient in containerized environments...
Senior
Apple Inc.
Santa Clara, CA
2 days ago
Senior Applied AI Engineer, iCloud Data Platform
Apple Inc. is seeking an Applied AI Engineer based in Cupertino, California. In this role, you will build the AI foundation of the company's data platform, developing scalable and trustworthy AI products that enhance data analytics across iCloud. Ideal candidates have over...
Senior
Apple Inc.
Cupertino, CA
1 day ago

Do you want to receive more vacancies?

Subscribe and receive similar vacancies to Senior MLOps & AI Infrastructure Engineer. Be the first to apply!