Sign up to access all features of our service.
  • Job search
  • Favorites
  • Create a CV
    New
  • Salaries
  • Subscriptions

Senior MLOps & AI Infrastructure Engineer

$149.1k - $215.93k
Full-time

Altera

Job Details: Job Description: About Altera At Altera™, our independence as the world’s largest pure‑play FPGA solutions provider gives us the focus, speed, and agility to innovate without compromise. With more than four decades of industry‑leading FPGA expertise, our singular mission is to deliver the programmable technologies that help customers differentiate, innovate, and scale across rapidly evolving markets like AI, cloud, networking, and edge. As an independent company, we move faster, invest deeper, and partner more closely—empowering our teams to drive breakthrough innovation and shape the future of the FPGA industry. About the Role We are looking for a Senior MLOps & AI Infrastructure Engineer to architect, build, and operationalize machine learning systems at scale. This role sits at the intersection of data science, software engineering, and infrastructure — combining deep ML expertise with the DevOps/MLOps discipline required to ship models reliably into production. You will partner closely with software, data, and infrastructure teams to design end-to-end ML pipelines, automate model lifecycle management, and deliver AI-powered capabilities across our EDA, HPC, and cloud environments. Key Responsibilities: ML Platform & Pipeline Engineering • Design, build, and maintain scalable ML pipelines for training, evaluation, and deployment across cloud and on-prem HPC environments • Build MLOps infrastructure including experiment tracking, model registry, feature stores, and automated retraining workflows • Implement CI/CD/CT (Continuous Training) pipelines for ML models using tools such as Kubeflow, MLflow, Airflow, or similar • Containerize ML workloads with Docker and orchestrate at scale using Kubernetes and GPU node pools Model Development & Optimization • Develop, fine-tune, and deploy large-scale models including LLMs, GNNs, and reinforcement learning agents for EDA and chip design applications • Apply advanced techniques: transfer learning, quantization, pruning, distillation, and RLHF for production-grade model efficiency • Implement A/B testing frameworks and shadow deployments for safe model rollout • Benchmark and optimize model inference performance on GPU/TPU clusters Data Engineering & Feature Management • Build and maintain data pipelines for large-scale structured and unstructured datasets (terabyte-scale) • Collaborate with data teams to design feature engineering systems and maintain data quality for ML training • Implement data versioning and lineage tracking (DVC, Delta Lake, or similar) Infrastructure & Operations • Manage cloud ML infrastructure on AWS (SageMaker), Azure (AML), or GCP (Vertex AI) with cost and performance optimization • Automate infrastructure provisioning using Terraform or CloudFormation for GPU-backed ML environments • Build monitoring, alerting, and observability systems for model performance drift, data quality, and system health • Support HPC schedulers (LSF, Slurm) for large-scale distributed training jobs Collaboration & Leadership • Partner with research scientists to productionize experimental models with engineering rigor • Mentor junior engineers and define ML engineering best practices across the organization • Drive adoption of AI/ML solutions within semiconductor, EDA, and simulation workflows Technology Stack ML Frameworks: PyTorch • TensorFlow • JAX • Hugging Face • scikit-learn • XGBoost MLOps & Pipelines: MLflow • Kubeflow • Airflow • Weights & Biases • DVC • Feast Infrastructure & Cloud: AWS SageMaker / GCP Vertex AI / Azure ML • Terraform • Docker • Kubernetes • Slurm / LSF Languages: Python • Bash • Go • SQL Monitoring & Observability: Prometheus • Grafana • ELK Stack • Evidently AI • Arize Key Competencies • Strong ownership mindset — you drive ML initiatives from prototype to production without being asked • Bias toward automation: if you do it twice, you automate it • Ability to bridge research and engineering — translating papers into production-grade systems • Thrives in fast-paced, ambiguous environments typical of deep-tech and semiconductor companies • Clear communicator who can explain complex ML concepts to non-technical stakeholders Salary Range The pay range below is for Bay Area California only. Actual salary may vary based on a number of factors including job location, job-related knowledge, skills, experiences, trainings, etc. We also offer incentive opportunities that reward employees based on individual and company performance. $149,100 - $215,925 USD We use artificial intelligence to screen, assess, or select applicants for the position. Applicants must be eligible for any required U.S. export authorizations. Qualifications: Required Qualifications Bachelor’s or Master’s degree in Computer Science, Machine Learning, Statistics, or related field and 10+ years of industry experience 10+ years of experience across ML engineering, data science, and MLOps — including frameworks (PyTorch, TensorFlow, JAX, Hugging Face) and production model deployment at scale 8+ years of experience experience with parallelism strategies (FSDP, DeepSpeed, data/model parallelism) 10+ years of experience and proficiency in Python programming 8+ years of experience in cloud ML platforms (AWS, GCP, Azure), Docker/Kubernetes, and CI/CD pipelines 5+ years of hands-on experience with MLflow, W&B, or Neptune for tracking and reproducibility Preferred Qualifications Phd in Computer Science, Machine Learning, Statistics, or related field Experience applying ML/AI to semiconductor, EDA, or chip design domains (e.g., timing prediction, place & route optimization, DRC closure) Familiarity with HPC schedulers such as LSF or Slurm and GPU cluster management for training workloads Knowledge of LLM fine-tuning, Retrieval-Augmented Generation (RAG) architectures, and AI agent frameworks such as LangChain or AutoGen Experience with graph neural networks (GNNs) or geometric deep learning for circuit and netlist analysis Background in reinforcement learning for optimization problems Exposure to zero-trust security, DevSecOps, and compliance automation for ML systems Experience working with large-scale simulation pipelines and synthetic data generation Experience at organizations such as NVIDIA, AMD, Intel, Google DeepMind, or similar AI/HPC-focused companies Published research or open-source contributions in ML, MLOps, or AI for EDA Experience building AI-powered developer tools or copilot-style products Familiarity with Synopsys, Cadence, or Siemens EDA toolchains and associated data formats Job Type: Regular Shift: Shift 1 (United States of America) Primary Location: San Jose, California, United States Additional Locations: Posting Statement: All qualified applicants will receive consideration for employment without regard to race, color, religion, religious creed, sex, national origin, ancestry, age, physical or mental disability, medical condition, genetic information, military and veteran status, marital status, pregnancy, gender, gender expression, gender identity, sexual orientation, or any other characteristic protected by local law, regulation, or ordinance. About Altera Altera: Accelerating Innovators Altera provides leadership programmable solutions that are easy-to-use and deploy in applications from cloud to edge, offering limitless AI possibilities. Our end-to-end broad portfolio of products including FPGAs, CPLDs, Intellectual Property, development tools, System on Modules, SmartNICs and IPUs provide the flexibility to accelerate innovation. Altera is helping to shape the future through pioneering innovation that unlocks extraordinary possibilities for everyone on the planet. Don't see the dream job you are looking for? Click "Get Started" below to drop off your contact information and resume and we will reach out to you if we find the perfect fit.

Vacancy posted 1 day ago
Similar jobs that could be interesting for youBased on the Senior MLOps & AI Infrastructure Engineer in San Jose, CA vacancy
  • A leading AI solutions provider is seeking an experienced MLOps / AI Ops Engineer for a remote 12-month contract. The role involves building and automating CI/CD pipelines for machine learning models, establishing monitoring frameworks, and managing deployment strategies... 
    Senior
    Remote job
    Contract work

    DeWinter Group

    Campbell, CA
    3 days ago
  •  ...hire a deeply technical, creative, and Senior AI Platform Engineer to build, support, and maintain the...  ...be doing: Define and lead AI-native infrastructure roadmaps and cross‑organizational...  ...operating AI/ML platforms, including MLOps, model serving, and GPU‑accelerated... 
    Senior

    NVIDIA Gruppe

    Santa Clara, CA
    3 days ago
  • $180k - $240k

     ...logistics operations. ABOUT THE ROLE We are seeking a Senior AI Infrastructure Engineer to design, build, and scale the high-performance AI...  ...edge cases from raw data. * Model Management & Lifecycle (MLOps) * Automated Lifecycle Management: Design and maintain... 
    Senior
    Odd job
    Full time
    Work at office

    Gatik AI

    Santa Clara, CA
    7 hours ago
  • $126k - $423k

    Decisive Point is seeking a Research Engineer (AI/RL Infrastructure) in Sunnyvale, California to design and operate large-scale ML systems. You will collaborate with leading experts and contribute to next-generation physical AI, impacting self-driving technologies. This... 
    Senior

    Decisive Point

    Sunnyvale, CA
    1 day ago
  • $172.5k - $306.63k

     ...organizations to create exceptional content effortlessly. The AI for Engineering team builds a scalable, production‑grade AI platform that...  ...engines, tools, and data streams into adaptive AI systems. Mentor senior engineers in modern AI system design, LLM orchestration... 
    Senior
    Local area

    Dormont Manufacturing Company

    San Jose, CA
    4 days ago
  • NVIDIA Corporation in Santa Clara is seeking a Senior Software Engineer to lead the optimization of large-scale AI systems. This role will involve profiling and...  ...will have over 8 years of experience in software infrastructure for AI systems, with expert-level programming... 
    Senior

    NVIDIA Corporation

    Santa Clara, CA
    1 day ago
  • $182k - $242k

    CoreWeave is seeking an experienced professional to contribute to building distributed systems and ML infrastructure. The successful candidate will play a pivotal role in designing an optimal research cluster experience, including a Python SDK, while collaborating closely... 
    Senior

    Jobr

    Sunnyvale, CA
    3 days ago
  • $283.4k

    KLA is seeking a Sr. AI Infrastructure Software Engineer in Milpitas, California. This role focuses on C++ programming and involves designing core infrastructure for AI workloads. Join a top-notch team solving complex problems at the intersection of software and hardware... 
    Senior

    Dormont Manufacturing Company

    Milpitas, CA
    4 days ago
  • $229.9k - $262.4k

     ...Senior Lead AI Engineer (GenAI Platform Services) Overview At Capital One, we are creating responsible and reliable AI systems, changing...  ...personalized customer experiences. Our investments in technology infrastructure and world‑class talent — along with our deep experience... 
    Senior
    Local area

    Comfort Systems USA

    San Jose, CA
    2 days ago
  • Drive Capital is seeking a Senior Customer Support Engineer in Campbell, CA. This role involves responding to customer inquiries, managing technical operations, and building strong relationships with customers based on technical excellence. The ideal candidate will have... 
    Senior

    Drive Capital

    Campbell, CA
    2 days ago
  • $172.5k - $306.63k

     ...Staff Engineer - AI For Engineering Adobe empowers individuals and organizations to create exceptional content effortlessly. The AI for...  ..., tools, and data streams into adaptive AI systems. Mentor senior engineers in modern AI system design, LLM orchestration patterns... 
    Senior
    Temporary work
    Local area
    Worldwide

    Adobe

    San Jose, CA
    2 days ago
  • $262k - $365k

    Google Inc. seeks a Senior Staff Software Engineer for AI Infrastructure within Google Cloud. This role involves architecting high-performance, distributed infrastructure for agentic AI workflows, with responsibilities including system reliability and transitioning experimental... 
    Senior

    Google Inc.

    Sunnyvale, CA
    1 day ago
  • $356.5k

    NVIDIA Gruppe is seeking an experienced AI infrastructure software engineer to join its DGX Cloud AI Efficiency Team in Santa Clara, California. This role focuses on developing the infrastructure for optimizing AI workloads and ensuring high availability and efficiency... 
    Senior

    NVIDIA Gruppe

    Santa Clara, CA
    3 days ago
  • $229.9k - $262.4k

     ...Senior Lead AI Engineer (Gen AI Platform Services) Overview: At Capital One, we are creating responsible and reliable AI systems, changing...  ...personalized customer experiences. Our investments in technology infrastructure and world-class talent — along with our deep experience... 
    Senior
    Full time
    Part time
    Local area

    Capital One

    San Jose, CA
    1 day ago
  • $229.9k - $262.4k

     ...Senior Lead AI Engineer (Gen AI Platform Services, Agentic AI) Overview: At Capital One, we are creating responsible and reliable...  ...personalized customer experiences. Our investments in technology infrastructure and world-class talent - along with our deep experience... 
    Senior
    Full time
    Part time
    Local area

    Capital One Financial Corp

    San Jose, CA
    5 days ago
  • $181.1k - $318.4k

     ...for its Special Projects team in Cupertino, California. The role focuses on building innovative applications and robust infrastructure to support AI research. Candidates should excel in programming languages like Go or Swift and have experience with web services and containers... 
    Senior

    Apple Inc.

    Cupertino, CA
    1 day ago
  • $168k - $322k

    NVIDIA Gruppe is seeking a Senior AI Platform Engineer to improve engineering efficiency and data security through AI-powered products. The...  ...involves working with Cloud and AI/ML teams to build and scale infrastructure and shape the technological future of the organization.... 
    Senior

    NVIDIA Gruppe

    Santa Clara, CA
    3 days ago
  • NVIDIA Corporation is seeking a Datacenter Product Engineer to join its Datacenter team in Santa Clara, California. This role focuses on launching AI supercomputing platforms and supporting GPU production. The ideal candidate will collaborate with NPI teams and implement... 
    Senior

    NVIDIA Corporation

    Santa Clara, CA
    1 day ago
  • NVIDIA Gruppe in Santa Clara is looking for an experienced engineer to support our new supercomputers and AI technologies. You will lead collaboration across various teams and work closely with customers to understand their needs and develop tailored features. The ideal... 
    Senior

    NVIDIA Gruppe

    Santa Clara, CA
    3 days ago
  • NVIDIA Gruppe is looking for an experienced GPU Deployment Engineer to tackle end-to-end AI deployment challenges on the NVIDIA RTX AI platform. The role involves analyzing GPU-accelerated applications, improving user experiences, and collaborating with teams to influence... 
    Senior

    NVIDIA Gruppe

    Santa Clara, CA
    3 days ago
  •  ...searching for a high-level DevOps Platform Engineer to enhance its Multi-Cloud Platform. In this role, you will architect AI-driven workflows and lead production environments...  ..., and Azure. You will build self-healing infrastructure and develop advanced CI/CD pipelines while... 
    Senior

    Palo Alto Networks

    Santa Clara, CA
    4 days ago
  • $210k - $295k

     ...EXPLORATION TECHNOLOGIES CORP in Sunnyvale, CA, is seeking a Principal Software Engineer for the Platform Team. This role focuses on building foundational AI tooling and security infrastructure to enhance engineering workflows at SpaceX. The ideal candidate will have... 
    Senior

    SPACE EXPLORATION TECHNOLOGIES CORP

    Sunnyvale, CA
    5 days ago
  • $184k - $356.5k

    NVIDIA Gruppe is looking for skilled software engineers to develop AI inference systems that operate with high efficiency. The role involves architecting high-performance inference frameworks and optimizing GPU processes. Ideal candidates should have extensive programming... 
    Senior

    NVIDIA Gruppe

    Santa Clara, CA
    3 days ago
  • $174k - $253k

    Google is seeking an Applied AI Customer Engineer in Sunnyvale, CA, offering a competitive salary ranging from $174,000 to $253,000 plus a bonus and equity. In this role, you will leverage your technical expertise to assist customers in adopting Conversational AI solutions... 
    Senior

    Google

    Sunnyvale, CA
    4 days ago
  • A technology firm specializing in AI solutions is seeking a Senior AI Engineer in Santa Clara, California. You will design and implement AI-powered software, managing everything from backend to frontend interfaces. Responsibilities include developing production-grade AI... 
    Senior

    Dexmate

    Santa Clara, CA
    5 days ago
  • Joining NVIDIA's DGX Cloud AI Efficiency Team means contributing to the infrastructure that powers our innovative AI research...  ...an AI infrastructure software engineer to join our team. You'll be instrumental...  ...of AI systems. As a senior DGX Cloud AI Infrastructure software... 
    Senior

    NVIDIA Gruppe

    Santa Clara, CA
    3 days ago
  • $174k - $253k

    Google Inc. is seeking a Senior Software Engineer specialized in AI/ML for its Sunnyvale, CA location. The role requires expertise in developing and optimizing machine learning infrastructure, along with deep experience in programming with Python or C++. Candidates should... 
    Senior

    Google Inc.

    Sunnyvale, CA
    3 days ago
  • Lendistry, LLC. is seeking a Senior AI Engineer to lead the delivery of AI solutions, including document intelligence and risk assessment tools. In this role, you will be responsible for mentoring junior engineers and shaping AI-driven workflows, improving the borrower... 
    Senior

    Lendistry, LLC.

    Santa Clara, CA
    2 days ago
  • A leading technology company in Santa Clara is seeking a Senior/Staff Software Engineer specializing in AI and search technologies. Candidates must possess strong skills in Kotlin or Java, have a minimum of 5 years' experience, and be proficient in containerized environments... 
    Senior

    Apple Inc.

    Santa Clara, CA
    2 days ago
  • Apple Inc. is seeking an Applied AI Engineer based in Cupertino, California. In this role, you will build the AI foundation of the company's data platform, developing scalable and trustworthy AI products that enhance data analytics across iCloud. Ideal candidates have over... 
    Senior

    Apple Inc.

    Cupertino, CA
    1 day ago

Do you want to receive more vacancies?

Subscribe and receive similar vacancies to Senior MLOps & AI Infrastructure Engineer. Be the first to apply!