Machine Learning Platform Lead Engineer, Training and Inference
$130.2k - $195.3kParamount
#WeAreParamount on a mission to unleash the power of content… you in?We’ve got the brands, we’ve got the stars, we’ve got the power to achieve our mission to entertain the planet – now all we’re missing is… YOU! Becoming a part of Paramount means joining a team of passionate people who not only recognize the power of content but also enjoy a touch of fun and uniqueness. Together, we co-create moments that matter – both for our audiences and our employees – and aim to leave a positive mark on culture. Overview We are seeking a Senior Lead / Lead ML Platform Engineer to architect and own the technical direction for our Training and Inference infrastructure. This is a high-leverage role designed for an expert who understands the deep technical stack required to shift ML models from research to global production. You will be responsible for the "engine room" of the AMLG, ensuring that our MLEs can train massive models efficiently and serve them with sub-millisecond reliability. This role requires a unique blend of expertise in distributed systems and hardware acceleration. You will lead the adoption and optimization of AnyScale (Ray) for distributed training and manage a high-performance Kubernetes-based inference environment. You aren't just managing clusters; you are building a seamless, scalable platform that abstracts the complexity of GPUs and distributed compute for the entire organization. Why This Role Matters The ML Platform Lead is the force-multiplier for every other ML pod. In this role, you will directly shape: The Training Foundation: Establishing AnyScale/Ray as the standard for distributed compute, enabling MLEs to train models on petabytes of data without managing infrastructure. Inference at Scale: Architecting the serving layer that handles billions of requests per day, optimizing for both p99 latency and GPU utilization. Operational Excellence: Setting the organizational standards for how ML models are deployed, monitored, and scaled across the enterprise. Key Responsibilities Technical Roadmap & Strategy: Own the long-term architectural direction for the Training and Inference domains, ensuring the platform scales 10x over a 1–3 year horizon. Distributed Training Leadership: Lead the implementation and optimization of Ray/AnyScale, providing a unified compute layer for batch processing, model training, and reinforcement learning. High-Performance Inference: Design and maintain K8s-based inference servers (e.g., Triton, TorchServe, or vLLM) optimized for GPU memory management and high throughput. Hardware & Cost Optimization: Navigate the trade-offs between different GPU instances (A100s, H100s, T4s), optimizing for cost, availability, and performance. Cross-Team Standardization: Solve high-leverage problems that affect multiple pods (e.g., Entry, Session, Presentation), establishing reusable patterns for CI/CD, model versioning, and canary deployments. Reliability Engineering: Define and enforce SLIs/SLOs for the platform, ensuring that infrastructure failures never interrupt the user-facing personalization experience. Mentorship & Coaching: Act as a technical mentor to senior engineers across the ML Platform and Applied ML pods, raising the bar for system design and operational rigor. Basic Qualifications 6-8+ years of experience in ML Infrastructure, Platform Engineering, or high-scale Backend Engineering. Orchestration & Serving: Extensive experience with Kubernetes (K8s) and serving frameworks for large-scale ML models. Hardware Proficiency: Strong knowledge of GPU architecture, CUDA, and optimizing ML workloads for hardware acceleration. Leadership (IC4/5): Proven track record of owning the technical direction for a major domain anddriving impact across multiple teams. Preferred Qualifications Experience with Infra-as-Code (Terraform/Pulumi) and building automated MLOps pipelines. Distributed Systems Mastery: Deep expertise with Ray (AnyScale) or similar distributed compute frameworks. Familiarity with ML observability tools (Prometheus, Grafana, Weights & Biases, or MLFlow). Experience managing multi-cloud or hybrid-cloud ML environments. Deep knowledge of Python and C++ for performance-critical systems. What Success Looks Like In your first 6–12 months, you will: Unify the Compute Layer: Successfully transition the majority of AMLG training workloads to a governed AnyScale/Ray environment. Optimize Inference ROI: Measurably improve GPU utilization and reduce inference costs through better auto-scaling and server optimization. Establish Durable Standards: Author the "Gold Standard" for ML deployments that is adopted by at least three other pods in the organization. Reduce Systemic Risk: Implement a self-healing infrastructure layer that significantly reduces manual intervention for cluster-related failures.
#LI-KA1
Paramount Streaming, a division within Paramount Global, is the home to the company's direct-to-consumer services spanning free and paid in the form of Pluto TV and Paramount+. Pluto TV is the global leader in free ad-supported TV, delivering more than 1,400 global channels and an extensive library of streaming content, including live and original channels. Paramount+, digital subscription video-on-demand and live streaming service, combines live sports, breaking news, and A Mountain of Entertainment™. Paramount+ features an expansive library of original series, hit shows and popular movies across every genre from world-renowned brands and production studios, including SHOWTIME®.ADDITIONAL INFORMATION
Hiring Salary Range: $130,200.00 - 195,300.00. The hiring salary range for this position applies to New York, California, Colorado, Washington state, and most other geographies. Starting pay for the successful applicant depends on a variety of job-related factors, including but not limited to geographic location, market demands, experience, training, and education. The benefits available for this position include medical, dental, vision, 401(k) plan, life insurance coverage, disability benefits, tuition assistance program and PTO or, if applicable, as otherwise dictated by the appropriate Collective Bargaining Agreement. This position is bonus eligible. What We Offer: Attractive compensation and comprehensive benefits packages. Check out our full list of benefits here: Generous paid time off. An exciting and fulfilling opportunity to be part of one of Paramount’s most dynamic teams. Opportunities for both on-site and virtual engagement events. Unique opportunities to make meaningful connections and build a vibrant community, both inside and outside the workplace. Explore life at Paramount: Paramount is an equal opportunity employer (EOE) including disability/vet. At Paramount, the spirit of inclusion feeds into everything that we do, on-screen and off. From the programming and movies we create to employee benefits/programs and social impact outreach initiatives, we believe that opportunity, access, resources and rewards should be available to and for the benefit of all. Paramount is proud to be an equal opportunity workplace. We are committed to equal employment opportunity regardless of race, color, ethnicity, ancestry, religion, creed, sex, national origin, sexual orientation, age, citizenship status, marital status, disability, gender identity, gender expression, and Veteran status. If you are a qualified individual with a disability or a disabled veteran, you may request a reasonable accommodation if you are unable or limited in your ability to use or access as a result of your disability. You can request reasonable accommodations by calling View phone number on click.appcast.io or by sending an email to View email address on click.appcast.io. Only messages left for this purpose will be returned.$171.6k - $230.1k
...Lead Machine Learning Engineer Technology is at the heart of Disney's past, present, and future... ...and building the products and platforms that will power our media, advertising... ...stores supporting both real-time inference and offline training Partner with product and...Training- ...Senior Data Platform Engineer Los Angeles, California, United States About the Job... ...Construct and maintain efficient ML training and inference pipelines utilizing tools such as... ...Spark. In-depth understanding of machine learning frameworks, libraries, data structures...Training
$120k
...Lead Platform Engineer WME is seeking a Lead Platform Engineer to help build, automate, and support the infrastructure platforms that power... ...of platform capabilities through knowledge sharing, training, and collaboration with engineering teams. Cross-Functional...TrainingTemporary workLocal area$164.64k - $246.96k
...culture. POSITION TITLELead Engineer, Apple LOCATIONIn Office... ...Streaming is seeking a Lead Engineer, Apple to join the... ...with backend, machine learning, data, and platform teams to deliver scalable... ...market demands, experience, training, and education. The benefits...TrainingFull timeWork at officeLocal area$157k - $200k
...seeking a highly motivated lead engineer to join our Applle engineering... ...'s always something new to learn. The ideal candidate is a technically... ...of the iOS and tvOS platforms and deep knowledge with... ...market demands, experience, training, and education. The benefits...TrainingLocal area$164.64k - $246.96k
...Paramount is looking for a hardworking Lead Software Engineer - Android to develop cohesive... ...the delivery and maintenance of those platforms once in place. We are seeking someone... ...location, market demands, experience, training, and education. The benefits available...TrainingWork at officeLocal areaRemote work- A leading AI research organization in San Francisco seeks an Inference Technical Lead to evaluate silicon platforms and work on model deployment for edge devices. You will collaborate with top machine learning researchers to push the boundaries of model capabilities. This...
$190k - $230k
...real-world problems leveraging robotics, machine learning and computer vision, among other... ...collaboratively and respectfully. The Lead Engineer, RL Scaling & Procedural Scenario Generation... ...is responsible for building scalable training pipelines and generating high-fidelity...TrainingLive inLocal areaRemote work$157k - $235k
...a positive mark on culture.Lead Machine Learning Operations EngineerPersonalization... ...Learning Operations Engineer to own the operational excellence... ...ML Engineering, DevOps, Platform Engineering, Data... ...-end ML systems, including training data, features, model artifacts...TrainingFull timeImmediate startShift work$171.6k - $230.1k
...global organization of engineers, product developers,... ...the products and platforms that will power our media... ...Disney's industry-leading ad technology and products... ...across multiple machine learning areas with primary focus... ...via publications and training resources in...Training- ...Sr Software Engineer - Content Platform Engineering Technology is at the heart of Disney's past,... ...strong emphasis on computer vision and machine learning (ML) workflows. Our team develops... ...Required Education, Experience/Skills/Training: ~5+ years relevant industry...TrainingWork experience placementWorldwide
$146.16k - $219.24k
...OverviewWe are looking for a Senior MLOps Engineer (IC3) to join the Platform Engineering pod. Your mission is to... ..., reason, and collaborate. You will lead the development of the MCP (Model... ..., market demands, experience, training, and education. The benefits available...TrainingFull timeShift work$170k - $190k
...outcomes company. Our measurement platform connects convergent TV... ...looking for a Senior Platform Engineer to help accelerate our shift... ...help software, data, and machine learning engineers ship safely and efficiently... ...experience, key skills, training, and business considerations...TrainingFull timeWork experience placementWork at officeImmediate startRemote workFlexible hoursShift work- ...Principal Data Platform Engineer The Principal Data Platform Engineer is a senior individual... ...-ready / feature-ready datasets. Lead platform and architectural design... ...efficient feature engineering, model training, and inference through well-designed data assets....TrainingImmediate start
$203.5k
...architecture and engineering, and client stakeholders... ...significant learning and growth opportunities... ...traditional data platforms), covering... ...data science and machine learning capabilities... ...model selection, training, validation/testing... ...technical stakeholders; lead working sessions,...TrainingFull timeTemporary workApprenticeshipWork at officeLocal areaWork from homeHome office3 days per week$155.7k - $208.7k
...is a global organization of engineers, product developers, designers... ...and building the products and platforms that will power our media,... ...Job Summary: We are seeking a Lead Software Engineer to help build... ...Education, Experience/Skills/Training: Basic Qualifications * 7+...TrainingWork experience placementWork at office$164.64k - $246.96k
...culture. Overview As a Lead Software Engineer, you will be tasked with key areas... ...personalization engine, a real-time inference platform that dynamically determines the optimal... ..., market demands, experience, training, and education. The benefits available...TrainingFull timeContract work$164.64k - $246.96k
...Lead Software Engineer Paramount Skydance Corp. is seeking a Lead Software Engineer to architect... ...enterprise-scale AI tooling and internal platforms that transform how Global Quality... ...location, market demands, experience, training, and education. The benefits available...Training$164.64k - $246.96k
...Lead Software Engineer - Android Paramount is seeking a Lead Software Engineer - Android to develop... ...estimates, and contribute to platform architecture initiatives focused on optimization... ...location, market demands, experience, training, and education. The benefits available...TrainingRemote work$164.64k - $246.96k
...Lead Software Engineer - Web Paramount is looking for a Lead Software Engineer to join our Web... ...frontend architectures, multi-zones, or platform/design system integration Experience... ...location, market demands, experience, training, and education. The benefits available...Training$164.64k - $246.96k
...leave a positive mark on culture. LEAD SOFTWARE ENGINEER - ROKU Location: On-Site - New York... ...the Paramount+ Roku application - the platform, the patterns, the architecture. This... ...location, market demands, experience, training, and education. The benefits available...TrainingWork experience placementLocal areaWeekend work$157k - $190k
...Technology is seeking a highly skilled Lead Cloud Infrastructure Engineer to design, build, and evolve our next-generation cloud platform supportingreal-time data streaming, AI-driven... ...location, market demands, experience, training, and education. The benefits available...Training- Inference Technical Lead, On-Device Transformers Consumer Products - San Francisco About the Team... ...Responsibilities Evaluate and select silicon platforms (GPUs, NPUs, and specialized... ...workloads. Build and lead a team of engineers responsible for implementing the low-...Work at officeRelocation package
$130.2k - $195.3k
...Lead Software Engineer (Java) in Test Paramount Skydance Corp. is seeking a Staff Software Engineer... ...solutions across our enterprise platforms. This senior technical role combines deep... ...location, market demands, experience, training, and education. The benefits available...TrainingContract workRemote work$164.64k - $246.96k
...Lead Software Engineer, Content Systems MAM Pluto TV is seeking an experienced Lead Software Engineer... ..., Node.js, React, and modern cloud platforms such as AWS and GCP. Strong technical... ...location, market demands, experience, training, and education. The benefits available...Training$164.64k - $246.96k
.... Summary We're looking for a Lead Software Engineer who has genuinely changed how they work... ...the services for our Enterprise Platform. This includes our centralized management... ...location, market demands, experience, training, and education. The benefits available...TrainingContract work- ...Lead Cloud Engineering And Production Operations Engineer This role acts as a hands-on technical lead, driving cloud engineering initiatives... ...expertise, demonstrated skill level, relevant experience, geographic location, education, certifications, and training....Training
- ...Senior Product Software Engineer - Ad Platform (Audience Team) Disney Entertainment... ...for Disney's industry-leading ad technology and products... ...areas, including machine learning, big data, microservices,... ...Education, Experience/Skills/Training: Basic Qualifications...TrainingWork experience placement
- ...Technical Lead, Forward Deployed Engineering Known for being a great place to work and build a career... ...systems, with a focus on data, machine learning, and AI-native applications Bachelor... ...; Solid experience with cloud platforms (Azure, GCP, or AWS) is strongly preferred...H1b
- ...into AI technologies with engineering teams. You will own the full... ..., generative AI, model training, inference, pipelines, etc.) to both... ...teams to inform roadmap and platform improvements.Pre-Requisites... ...).Background in AI, machine learning, cloud infrastructure, or...TrainingLocal area
Do you want to receive more vacancies?
Subscribe and receive similar vacancies to Machine Learning Platform Lead Engineer, Training and Inference. Be the first to apply!

