
Senior AI Engineer

$209k

Zoom Video Communications, Inc.

Immigration sponsorship is not available for this position

Responsibilities:

• Develop the Machine Learning Platform management system.

• Design and implement intuitive user interfaces and APIs for seamless interaction with the platform.

• Ensure robust access control and security measures for the Machine Learning Platform.

• Regularly evaluate and enhance platform performance, scalability, and reliability. Integrate tools for data versioning, experiment tracking, and workflow orchestration.

• Build the toolchains, services, and pipelines for the model development workflow and the model serving architecture.

• Create automated pipelines for data preprocessing, feature engineering, and dataset versioning.

• Develop CI/CD pipelines for deploying models into production environments with minimal downtime.

• Enable support for distributed model training and hyperparameter optimization.

• Incorporate A/B testing frameworks for evaluating multiple model deployments.

• Collaborate with data scientists and engineers to streamline the model development lifecycle.

• Prioritize metrics for monitoring model training and inference. Implement logging and monitoring tools to track model performance, resource utilization, and throughput.

• Develop dashboards to visualize key metrics such as latency, accuracy, and drift detection in real time.

• Establish alerting mechanisms to detect and respond to anomalies or performance degradation.

• Continuously refine metric prioritization based on stakeholder feedback and evolving business goals.

• Develop and maintain the high-performance GPU infrastructure and cluster for LLM training.

• Optimize GPU utilization for large-scale training workloads, ensuring minimal resource wastage.

• Implement fault-tolerant and distributed training strategies for handling large language models (LLMs).

• Evaluate and integrate emerging hardware technologies, such as TPUs, into the training infrastructure.

• Regularly update cluster configurations to support new frameworks and model architectures.

• Manage scheduling and resource allocation for multi-tenant GPU clusters.

• Understand autoscaling for inference services and dynamic loading of multiple models.

• Design systems that dynamically allocate resources based on real-time demand for inference services.

• Develop mechanisms for loading and unloading models in memory to optimize latency and resource usage.

• Implement strategies for caching frequently used models to improve inference performance.

• Experiment with serverless architectures to further enhance scalability and cost efficiency.

• Ensure compatibility with edge devices and deploy lightweight models for edge inference.

• Support, troubleshoot, and resolve any issues that arise during training and inference.

• Create detailed runbooks for common troubleshooting scenarios to reduce resolution times.

• Perform root cause analysis for failures and implement long-term fixes to prevent recurrence.

• Collaborate with DevOps and IT teams to ensure the stability of underlying infrastructure.

• Develop self-healing systems that can automatically recover from common training or inference issues.

• Provide technical support and guidance to data scientists and engineers working on the platform.

What we're looking for:

Requires a Bachelor's degree in Communications Engineering, Artificial Intelligence, Software Engineering, or a related field, or a foreign degree equivalent. Must have 2 years of experience in the job offered or a related occupation. Must have 2 years of experience in:

• Designing, implementing, or optimizing large-scale distributed training systems using technologies like Horovod, DeepSpeed, PyTorch Distributed, or Ray;

• Tensor/model parallelism and pipeline parallelism;

• Utilizing cloud-native or on-prem infrastructure (Kubernetes, Docker, Slurm) to support scalable, fault-tolerant, and resource-efficient AI workloads across multi-node GPU clusters;

• Using performance profiling and optimization to diagnose and improve end-to-end training performance by optimizing data pipelines (e.g., DALI, tf.data), minimizing communication overhead (e.g., NCCL, gRPC), and tuning hardware-specific kernels (e.g., CUDA, Triton);

• Systems programming and automation with Python, Bash, and C++ or Go;

• Automating deployment and orchestration of AI workloads and monitoring using Prometheus, Grafana, Weights & Biases.

• Telecommuting work arrangement permitted one day a week. Four days in office required. Position does not require domestic or international travel.

Zoom Communications, Inc.

Salary Range or On Target Earnings:

Minimum:

$209,000.00

Maximum:

$275,400.00

In addition to the base salary and/or OTE listed, Zoom has a Total Direct Compensation philosophy that takes into consideration base salary, bonus, and equity value.

Note: Starting pay will be based on a number of factors and commensurate with qualifications & experience.

We also have a location-based compensation structure; there may be a different range for candidates in this and other locations.

Ways of Working

Our structured hybrid approach is centered around our offices and remote work environments. The work style of each role (Hybrid, Remote, or In-Person) is indicated in the job description/posting.

Benefits

As part of our award-winning workplace culture and commitment to delivering happiness, our benefits program offers a variety of perks, benefits, and options to help employees maintain their physical, mental, emotional, and financial health; support work-life balance; and contribute to their community in meaningful ways. Click Learn for more information.

About Us

Zoomies help people stay connected so they can get more done together. We set out to build the best collaboration platform for the enterprise, and today help people communicate better with products like Zoom Contact Center, Zoom Phone, Zoom Events, Zoom Apps, Zoom Rooms, and Zoom Webinars. We’re problem-solvers, working at a fast pace to design solutions with our customers and users in mind. Find room to grow with opportunities to stretch your skills and advance your career in a collaborative, growth-focused environment.

Our Commitment

At Zoom, we believe great work happens when people feel supported and empowered. We’re committed to fair hiring practices that ensure every candidate is evaluated based on skills, experience, and potential. If you require an accommodation during the hiring process, let us know—we’re here to support you at every step.

We welcome people of different backgrounds, experiences, abilities and perspectives including qualified applicants with arrest and conviction records and any qualified applicants requiring reasonable accommodations in accordance with the law.

If you need assistance navigating the interview process due to a medical disability, please submit an Accommodations Request Form and someone from our team will reach out soon. This form is solely for applicants who require an accommodation due to a qualifying medical disability. Non-accommodation-related requests, such as application follow-ups or technical issues, will not be addressed.

Think of this opportunity as a marathon, not a sprint! We're building a strong team at Zoom, and we're looking for talented individuals to join us for the long haul. No need to rush your application – take your time to ensure it's a good fit for your career goals. We continuously review applications, so submit yours whenever you're ready to take the next step.

Our interviews are supported by BrightHire, a tool that helps us create a consistent and thoughtful interview experience and may include recordings. Please refer to our candidate privacy statement for more information on how we use your data.

We believe that the unique contributions of all Zoomies are the driver of our success. To make sure that our products and culture continue to incorporate everyone's perspectives and experience, we never discriminate on the basis of race, religion, national origin, gender identity or expression, sexual orientation, age, or marital, veteran, or disability status. Zoom is proud to be an equal opportunity workplace and is an affirmative action employer. All your information will be kept confidential according to EEO guidelines.

Vacancy posted 19 hours ago
