Sign up to access all features of our service.
  • Job search
  • Favorites
  • Create a CV
    New
  • Salaries
  • Subscriptions

Software Engineer, Inference Platform

Cerebras Systems, Inc.

Cerebras Systems builds the world's largest AI chip, 56 times larger than GPUs. This architecture allows Cerebras to deliver industry-leading training and inference speeds; over 10 times faster than GPU-based hyperscale cloud inference services. This order of magnitude increase in speed is transforming the user experience of AI applications, unlocking real-time iteration and increasing intelligence via additional agentic computation. Cerebras works with the leading model labs, global enterprises, and cutting-edge AI-native startups. OpenAI recently announced a multi-year partnership with Cerebras, to deploy 750 megawatts of scale, transforming key workloads with ultra high-speed inference. About the Role We’re hiring a Software Engineer to help contribute to projects on our Inference Platform team. Our team primarily owns the orchestration layer that runs inference on our datacenter clusters, connecting cloud components with machine learning services. We are often the first team to face problems that haven’t been solved yet, leading solutions across Kubernetes operators, service security policies, and CI/CD. If you’re interested in building the next-generation architecture of a globally distributed inference platform, we’d like to talk. Responsibilities Design, develop, test, and maintain production software, with responsibilities spanning testing, continuous development, observability, security, networking, debugging, and productionization. Platform Direction. Help shape the technical direction for the Inference Platform, Kubernetes custom resource definitions, failure domains, service boundaries, and system evolution over time, and own the roadmap for major technical areas. Reliability & Performance. Architect active-active systems with rapid failover, graceful degradation, and clear SLOs. Drive system-level improvements in latency, throughput, capacity efficiency, and resilience under unpredictable demand. Execution on Critical Paths. Write and review production code in the most important parts of the platform. Make high-consequence architectural decisions within your area and set the technical bar through design reviews, code reviews, and sound engineering judgment. Production Leadership. Lead on the hardest production issues and cross-system bottlenecks. Drive observability, incident response, capacity planning, and post-incident improvement with a high standard for operational rigor. Technical Influence. Partner with ML, Product, Infrastructure, and Cloud teams to translate product and business requirements into scalable system designs, and drive alignment on shared technical decisions within your domain and adjacent platform surfaces. Skills & Qualifications 3+ years of experience in software engineering, with experience building and operating large-scale distributed systems or cloud infrastructure. Experience in distributed systems, ideally with Kubernetes. Experience building highly available, latency-sensitive systems at scale. Experience with security (certificates, TLS, mTLS). Experience optimizing latency, throughput, and efficiency in high-QPS systems. Experience with TTFT and tail-latency reduction is a strong plus. Strong proficiency in backend or systems languages such as Go or C++. Preferred Skills & Qualifications Experience with ML inference infrastructure, model serving systems, or GPU-accelerated workloads. Why Join Cerebras People who are serious about software make their own hardware. At Cerebras, we have built a breakthrough architecture that is unlocking new opportunities for the AI industry. With dozens of model releases and rapid growth, we’ve reached an inflection point in our business. Members of our team tell us there are five main reasons they joined Cerebras: Build a breakthrough AI platform beyond the constraints of the GPU. Publish and open source their cutting-edge AI research. Work on one of the fastest AI supercomputers in the world. Enjoy job stability with startup vitality. Our simple, non-corporate work culture that respects individual beliefs. Find out more about what it’s like to work at Cerebras here! Apply today and become part of the forefront of groundbreaking advancements in AI! Cerebras Systems is committed to creating an equal and diverse environment and is proud to be an equal opportunity employer. We celebrate different backgrounds, perspectives, and skills. We believe inclusive teams build better products and companies. We try every day to build a work environment that empowers people to do their best work through continuous learning, growth and support of those around them. This website or its third-party tools process personal data. For more details, click here to review our CCPA disclosure notice. #J-18808-Ljbffr Cerebras Systems, Inc.

Vacancy posted 14 hours ago
Similar jobs that could be interesting for youBased on the Software Engineer, Inference Platform in Sunnyvale, CA vacancy
  • Cerebras is seeking a Staff Engineer to join their Inference Platform team in Sunnyvale, California. This role involves leading and contributing to projects...  ...candidate will have over 8 years of experience in software engineering, particularly in distributed systems and... 
    Software

    Cerebras

    Sunnyvale, CA
    3 days ago
  • Cerebras Systems, Inc. is seeking a Principal Engineer to lead their Inference Cloud Platform team. This pivotal role involves identifying key platform issues...  .... The ideal candidate has over 10 years of software engineering experience and deep expertise in distributed... 
    Software

    Cerebras Systems, Inc.

    Sunnyvale, CA
    14 hours ago
  • $195k - $298k

     ...relocation assistance. About the Team The ML Inference Platform is part of the AI Compute Platforms...  ...are seeking a Staff ML Infrastructure engineer to help build and scale robust Compute...  ...and implement core platform backend software components. Collaborate with ML... 
    Software
    Relocation package
    Flexible hours

    General Motors

    Sunnyvale, CA
    5 days ago
  • Location: Sunnyvale We're hiring a Software Engineer to help contribute to projects on our Inference Platform team. Our team primarily owns the orchestration layer that runs inference on our datacenter clusters which glues together the cloud components to the ML components... 
    Software

    Cerebras

    Sunnyvale, CA
    3 days ago
  • $195k - $298k

     ...relocation assistance. About the Team The ML Inference Platform is part of the AI Compute Platforms...  ...are seeking a Staff ML Infrastructure Engineer to build and scale robust compute...  ...Design and implement core platform backend software components. Collaborate with ML... 
    Software
    Local area
    Relocation package
    Flexible hours

    Israelvcforum

    Sunnyvale, CA
    5 days ago
  • Cerebras Systems, Inc. is looking for an experienced Staff Engineer to join our Inference Platform team in Sunnyvale, California. The role involves designing and maintaining production software that operates at scale, solving complex engineering challenges on the cutting... 
    Software

    Cerebras Systems, Inc.

    Sunnyvale, CA
    14 hours ago
  • We’re looking for a Senior Engineer to help build the next-generation inference platform that supports embedding models used for semantic search, retrieval, and...  ...backend or infrastructure systems at scale Strong software engineering skills in languages such as Go, Rust,... 
    Software
    Local area
    Worldwide

    MongoDB

    Palo Alto, CA
    2 days ago
  • $126k - $248k

     ...About the Role We’re looking for a Senior Engineer to help build the next‑generation inference platform that supports embedding models used for semantic search...  ...backend or infrastructure systems at scale Strong software engineering skills in languages such as Go, Rust,... 
    Software
    Local area

    The Consulting Solutions

    Palo Alto, CA
    3 days ago
  • $128.7k - $261.3k

     ...Team The Model Deployment & Inference Solutions team in GM AV deploys...  ...: build the ML deployment platform that makes model rollouts fast...  ...currently performed manually by engineers. Build the developer...  ...designing clean, well‑tested software with clear interfaces and good... 
    Software
    Local area
    Remote work
    Flexible hours
    Shift work

    General Motors

    Mountain View, CA
    2 days ago
  • $165k - $242k

    A cloud service provider is seeking a Senior Software Engineer II for their Inference team in Sunnyvale, California. In this role, you'll lead design reviews, implement optimizations, and improve service reliability. The ideal candidate has extensive experience with distributed... 
    Software

    CoreWeave

    Sunnyvale, CA
    1 day ago
  • Cerebras is seeking a Software Engineer to join our Inference Platform team in Sunnyvale, California. This role involves developing and leading projects that integrate cloud and ML components. You will contribute to shaping the technical direction and improve system performance... 
    Software

    Cerebras

    Sunnyvale, CA
    3 days ago
  •  ...Inc. is looking for a Sr. Member of Technical Staff to design software features that enhance system resiliency and high availability...  ...distributed environments. The role includes developing scalable AI inference services and deploying cloud-based workflows. Ideal candidates... 
    Software

    Cerebras Systems, Inc.

    Sunnyvale, CA
    14 hours ago
  • $230k - $250k

    Cerebras Systems is seeking a Sr. Member of Technical Staff in Sunnyvale, CA. This role involves designing resilient software features for cloud-based AI inference, leveraging AWS tools and services. Candidates should have a Master’s degree in Computer Science and experience... 
    Software

    Cerebras Systems

    Sunnyvale, CA
    2 days ago
  •  ...Software Engineer, AI Platform - Intern Mountain View, California (HQ) Nuro believes self-driving vehicles are the most immediate and profound...  ...in Nuro and optimize on-cloud training and onboard inference. Our solutions include a distributed training platform, ML... 
    Software
    Internship
    Immediate start
    Flexible hours

    Nuro

    Mountain View, CA
    3 days ago
  • Cerebras Systems, Inc. is looking for a Software Engineer to enhance its Inference Platform. You will design and maintain critical software to support a high-performance AI architecture. As part of your role, you will tackle innovative challenges and help ensure the reliability... 
    Software

    Cerebras Systems, Inc.

    Sunnyvale, CA
    14 hours ago
  •  ...deliver industry-leading training and inference speeds; over 10 times faster than...  .... We're hiring a Principal Engineer for our Inference Cloud Platform. This team owns the cloud layer behind...  ...10+ years of experience in software engineering, with substantial individual... 
    Software

    Cerebras Systems, Inc.

    Sunnyvale, CA
    14 hours ago
  • United States Digital Space LLC in Palo Alto is looking for a Senior Engineer to develop a next-generation inference platform integrated with Atlas. This role involves building scalable infrastructure and collaborating with teams to enhance AI capabilities. Ideal candidates... 
    Software

    United States Digital Space LLC

    Palo Alto, CA
    2 days ago
  • $152k - $241.5k

    Overview AI & Deep Learning Compiler Engineer for NVIDIA’s Deep Learning & AI Compiler (DLC) team. What you’ll be doing Analyze deep learning...  ...optimization algorithms. Collaborate with deep learning software framework teams and GPU architecture teams to accelerate the next... 
    Software

    NVIDIA Gruppe

    Santa Clara, CA
    3 days ago
  • About the Role We're hiring a Staff Engineer to own major areas of the architecture of our Inference Cloud Platform. This team owns the cloud layer behind our Inference Service...  ...& Qualifications 8+ years of experience in software engineering, with substantial individual... 
    Software

    Cerebras Systems, Inc.

    Sunnyvale, CA
    14 hours ago
  •  ...looking for a Senior ML Infrastructure Engineer in Mountain View, California. This...  ...position aims to build and scale robust platforms for ML inference workflows supporting GM’s AI efforts....  ...strategies and handle backend software components. The position demands 5+ years... 
    Software
    Remote job

    Israelvcforum

    Mountain View, CA
    3 days ago
  • General Motors is seeking a Senior ML Infrastructure Engineer to build and scale a robust platform for machine learning inference workflows. You will design backend software components, collaborate with ML engineers, and lead initiatives across GM's ML ecosystem. With... 
    Software
    Remote job

    General Motors

    Sunnyvale, CA
    5 days ago
  • $184k - $287.5k

    NVIDIA is the platform for every new AI-powered application. We seek a senior engineer to own and evolve the core NIM Platform SDK...  ...delivering production-ready AI inference at scale. This is a hands-on,...  ...role involves solving deep software engineering challenges. These... 
    Software

    NVIDIA Gruppe

    Santa Clara, CA
    3 days ago
  • $175k - $215k

     ...also be applied to a range of vehicle platforms and product use cases. The Waymo Driver...  ...Platform team is part of the Marketplace engineering: we provide specialized business...  ...feature engineering, training workflows and inference services; we are working on next-generation... 
    Software
    Full time
    Remote work

    Waymo

    Mountain View, CA
    3 days ago
  •  ...industry-leading training and inference speeds; over 10 times faster...  ...role working directly with Engineering, Product, Infrastructure,...  ...of new capacity management platform Drive org level strategic...  ...People who are serious about software make their own hardware. At... 
    Software

    Cerebras Systems

    Sunnyvale, CA
    3 days ago
  • $208.4k - $365.4k

     ..., California, United States Software and Services Imagine what you...  ...be made? The Apple Services Engineering org is building...  ...and delivery across our AI/ML platform and infrastructure programs....  ...infrastructure, Foundation Model inference platforms, and hybrid-cloud... 
    Software
    Contract work
    Relocation

    Apple Inc.

    Santa Clara, CA
    3 days ago
  • $181.1k - $318.4k

    Senior Software Engineer (Ads Platform)- Experimentation Cupertino, California, United States Software and Services At Apple, we work every day to...  ...Prior experience with advanced statistical and causal inference is a plus Curious business attitude with a proven ability... 
    Software
    Temporary work
    Immediate start
    Relocation

    Apple

    Cupertino, CA
    4 days ago
  • $152k - $241.5k

    We optimize and benchmark GenAI inference on NVIDIA's latest accelerators, defining the...  ...at the intersection of GPU performance engineering and public accountability. What You Will...  ...equivalent experience. 5+ years of relevant software development experience. Strong Python... 
    Software

    NVIDIA Gruppe

    Santa Clara, CA
    3 days ago
  • $184k - $356.5k

    NVIDIA Gruppe is looking for skilled software engineers to develop AI inference systems that operate with high efficiency. The role involves architecting high-performance inference frameworks and optimizing GPU processes. Ideal candidates should have extensive programming... 
    Software

    NVIDIA Gruppe

    Santa Clara, CA
    3 days ago
  •  ...Lead Software Engineer - Distributed Microservices Platform Role overview This role combines technical leadership, system design, and practical engineering contribution. You will guide the technical direction of the team while staying close to implementation—supporting... 
    Software

    Dynamic Yield

    Mountain View, CA
    2 days ago
  • Cerebras Systems, Inc. is hiring a Staff Engineer to oversee critical areas of the architecture for their Inference Cloud Platform. This role focuses on hands-on contributions...  ...ideal candidate will have over 8 years in software engineering with expertise in distributed... 
    Software

    Cerebras Systems, Inc.

    Sunnyvale, CA
    14 hours ago

Do you want to receive more vacancies?

Subscribe and receive similar vacancies to Software Engineer, Inference Platform. Be the first to apply!