Staff Software Engineer, Inference Cloud

Cerebras

Cerebras Systems builds the world's largest AI chip, 56 times larger than GPUs. Our novel wafer-scale architecture provides the AI compute power of dozens of GPUs on a single chip, with the programming simplicity of a single device. This approach allows Cerebras to deliver industry-leading training and inference speeds and empowers machine learning users to effortlessly run large-scale ML applications, without the hassle of managing hundreds of GPUs or TPUs.

Cerebras' current customers include global corporations across multiple industries, national labs, and top-tier healthcare systems. In January, we announced a multi-year, multi-million-dollar partnership with Mayo Clinic, underscoring our commitment to transforming AI applications across various fields. In August, we launched Cerebras Inference, the fastest Generative AI inference solution in the world, over 10 times faster than GPU-based hyperscale cloud inference services.

About The Role

As a software engineer, you will work on our cloud platform for AI model training and inference. In this role, you will be responsible for optimizing deployment for minimal latency and efficient load balancing, and for ensuring high reliability, scalability, security, and observability of our distributed training and inference infrastructure. You will define and document production requirements, implement scaling strategies, and ensure robust API server management and high availability. You will develop tools to map out system bottlenecks and sources of instability, and design and build solutions to address them.

We're looking for talented software engineers who thrive in ambiguity, view change as an opportunity, and have a voracious desire to learn and share knowledge clearly and concisely.

Skills And Qualifications

5+ years as an individual contributor developing production-grade cloud services.
Experience building ML serving and/or training services, preferably for modern generative AI models.
Experience building high-reliability, production-grade cloud services.
Familiarity with the latest AI model architectures and efficient implementation of serving systems for these models.
Strong problem-solving skills and the ability to thrive in ambiguous, rapidly changing conditions.
Excellent communication skills and a passion for continuous learning and knowledge sharing.

Cerebras Systems is committed to creating an equal and diverse environment and is proud to be an equal opportunity employer. We celebrate different backgrounds, perspectives, and skills. We believe inclusive teams build better products and companies. We try every day to build a work environment that empowers people to do their best work through continuous learning, growth and support of those around them.

Vacancy posted 22 hours ago

Similar jobs that could be interesting for you, based on the Staff Software Engineer, Inference Cloud vacancy in Sunnyvale, CA

Staff Software Engineer, Inference Cloud
About the Role We're hiring a Staff Engineer to own major areas of the architecture of our Inference Cloud Platform. This team owns the cloud layer behind our Inference... ...Skills & Qualifications 8+ years of experience in software engineering, with substantial individual...
Suggested
Cerebras Systems, Inc.
Sunnyvale, CA
7 days ago
Staff Software Engineer, Inference Platform
Location: Sunnyvale We're hiring a Staff Engineer to help lead, drive, and contribute to projects on our Inference Platform team. Our team... ...which glues together the cloud components to the ML components... ...Qualifications 8+ years of experience in software engineering, with substantial...
Suggested
Cerebras
Sunnyvale, CA
5 days ago
Staff Software Engineer, Inference
$188k - $275k
What You’ll Do Inference Platform Team The Inference team builds and operates CoreWeave... ...systems. About the Role As a Staff Software Engineer (IC5) on the Inference team, you will... ...operating large‑scale distributed systems or cloud platforms Proven experience leading cross...
Suggested
Permanent employment
Temporary work
Casual work
Work at office
Flexible hours
Dormont Manufacturing Company
Sunnyvale, CA
1 day ago
Staff Software Engineer - Real-Time AI Inference Infra
Cerebras Systems, Inc. is seeking a Software Engineer in Sunnyvale, California to enhance high-performance, low-latency inference infrastructure. This role involves deploying scalable services, optimizing resource allocation, and integrating with containerized environments...
Suggested
Cerebras Systems, Inc.
Sunnyvale, CA
7 days ago
Staff Software Engineer: AI Inference Infra & Kubernetes
Cerebras Systems in Sunnyvale, CA is seeking a Member of Technical Staff (Software Engineer) to implement infrastructure for high-performance, low-latency inference services. Applicants should have a Master’s degree in Computer Science or a related field and at least one...
Suggested
CEREBRAS SYSTEMS INC.
Sunnyvale, CA
4 days ago
Staff Software Engineer, ML Training and Inference Infrastructure
$228k - $285k
...protect it for future generations. Role Summary As a Staff Software Engineer, ML training and inference infrastructure, you will be a member of the... ...providers of background checks, staffing services, and cloud services. Rivian may transfer or store internationally...
Full time
Contract work
Local area
Rivian
Palo Alto, CA
2 days ago
Staff Software Engineer, Inference
$188k - $275k
...Description CoreWeave is The Essential Cloud for AI™. Built for pioneers by... .... Learn more at What You'll Do: Inference Platform Team The Inference team builds... ...inference systems. About the role: As a Staff Software Engineer (IC5) on the Inference team, you will...
Permanent employment
Temporary work
Casual work
Work at office
Flexible hours
CoreWeave
Sunnyvale, CA
4 days ago
Senior/Staff Software Engineer - AI, Search & Knowledge Platforms
$181.1k - $318.4k
Senior/Staff Software Engineer - AI, Search & Knowledge Platforms Santa Clara, California, United States Machine Learning and AI The... ...to over half a billion end-user devices and to Private Cloud Compute inference infrastructure. As a member of the team, you would design...
Relocation
Apple
Santa Clara, CA
4 days ago
Inference Platform Engineer | Kubernetes & Scalable AI
Cerebras Systems, Inc. is looking for a Software Engineer to enhance its Inference Platform. You will design and maintain critical software to support a high-performance AI architecture. As part of your role, you will tackle innovative challenges and help ensure the reliability...
Cerebras Systems, Inc.
Sunnyvale, CA
3 days ago
Principal Engineer, Inference Cloud
...industry‑leading training and inference speeds and empowers machine... ...faster than GPU‑based hyperscale cloud inference services. Location:... ...a deeply technical, hands‑on engineering leader for our Inference... ...Leadership : 6+ years in high‑scale software engineering, with 3+ years...
Cerebras
Sunnyvale, CA
2 days ago
Senior Staff Engineer - AI Inference & Resilient Cloud
...Systems, Inc. is looking for a Sr. Member of Technical Staff to design software features that enhance system resiliency and high... .... The role includes developing scalable AI inference services and deploying cloud-based workflows. Ideal candidates have a master's degree...
Cerebras Systems, Inc.
Sunnyvale, CA
3 days ago
Principal Engineer, Inference Cloud
...industry-leading training and inference speeds; over 10 times faster than GPU‑based hyperscale cloud inference services. This order... .... We're hiring a Principal Engineer for our Inference Cloud Platform... ...10+ years of experience in software engineering, with substantial...
Cerebras Systems, Inc.
Sunnyvale, CA
3 days ago
Senior Platform Engineer, Inference & Kubernetes
Cerebras is seeking a Software Engineer to join our Inference Platform team in Sunnyvale, California. This role involves developing and leading projects that integrate cloud and ML components. You will contribute to shaping the technical direction and improve system performance...
Cerebras
Sunnyvale, CA
5 days ago
Senior Staff Engineer — AI Inference & Cloud Infra
$230k - $250k
Cerebras Systems is seeking a Sr. Member of Technical Staff in Sunnyvale, CA. This role involves designing resilient software features for cloud-based AI inference, leveraging AWS tools and services. Candidates should have a Master’s degree in Computer Science and experience...
Cerebras Systems
Sunnyvale, CA
4 days ago
Software Engineer, Inference Platform
...Cerebras to deliver industry-leading training and inference speeds; over 10 times faster than GPU-based hyperscale cloud inference services. This order of magnitude... ...inference. About the Role We’re hiring a Software Engineer to help contribute to projects on our...
Cerebras Systems, Inc.
Sunnyvale, CA
7 days ago
Staff Engineer, Inference Cloud — Global, Low-Latency
Cerebras Systems, Inc. is hiring a Staff Engineer to oversee critical areas of the architecture for their Inference Cloud Platform. This role focuses on hands-on contributions... ...ideal candidate will have over 8 years in software engineering with expertise in distributed...
Cerebras Systems, Inc.
Sunnyvale, CA
3 days ago
Senior Inference Platform Engineer Kubernetes & Latency
$165k - $242k
A cloud service provider is seeking a Senior Software Engineer II for their Inference team in Sunnyvale, California. In this role, you'll lead design reviews, implement optimizations, and improve service reliability. The ideal candidate has extensive experience with distributed...
CoreWeave
Sunnyvale, CA
4 days ago
Lead Engineer, Inference Platform & Scale
A pioneering AI hardware company in Sunnyvale is looking for an engineering leader to oversee the Inference Service Platform. You will guide a team in scaling LLM inference and architecting low latency systems. Candidates should have substantial experience in distributed...
Cerebras
Sunnyvale, CA
2 days ago
Staff Software Engineer
$160.5k - $240.7k
...Technologies, Inc. Job Area Engineering Group Machine Learning Engineering... ...learning hardware and software. Minimum Qualifications Bachelor... ...hardware, firmware, cloud, and product teams. Experience... ...spanning model architectures, inference pipelines, and runtime frameworks...
Work experience placement
Work from home
Qualcomm
Santa Clara, CA
1 day ago
Staff Engineer, AI Inference Platform & Kubernetes
Cerebras Systems, Inc. is looking for an experienced Staff Engineer to join our Inference Platform team in Sunnyvale, California. The role involves designing and maintaining production software that operates at scale, solving complex engineering challenges on the cutting...
Cerebras Systems, Inc.
Sunnyvale, CA
3 days ago
Senior Staff Software Engineer
$185.9k - $278.9k
...Company Qualcomm Technologies, Inc. Job Area Engineering Group Machine Learning Engineering... ...through machine learning hardware and software. Minimum Qualifications Bachelor's... ...designed with machine learning software) for inference or training solutions. Develops...
Work experience placement
Immediate start
Work from home
Qualcomm
Santa Clara, CA
2 days ago
Senior Engineering Manager AI Inference Platform, Distributed Cloud
$262k - $365k
Senior Engineering Manager AI Inference Platform, Distributed Cloud Location: Sunnyvale, CA, USA Pay US: $262,000 - $365,000 (USD) + 25% bonus target + equity + benefits. About the role In this role, you will be pivotal in architecting and optimizing the serving stack...
Google Inc.
Sunnyvale, CA
4 days ago
Senior Software Development Engineer - SGLang and Inference Stack
...RL training and SOTA LLM and Multimodal inference at scale across multi-GPU and multi-node... ...You will collaborate across internal GPU software teams and engage with open-source... ...software ecosystem. THE PERSON: Skilled engineer with strong technical and analytical expertise in...
Advanced Micro Devices, Inc.
Santa Clara, CA
2 days ago
Senior Software Engineer, Deep Learning Inference - TensorRT
$152k - $241.5k
Senior Software Engineer - Deep Learning Inference What you’ll be doing: Craft and develop robust inferencing software that can be scaled to multiple platforms for functionality and performance Develop components of TensorRT, NVIDIA’s SDK for high-performance deep learning...
NVIDIA Gruppe
Santa Clara, CA
5 days ago
Senior ML Infrastructure Engineer, Inference Platform
$155.42k - $205.9k
...Description Senior ML Infrastructure Engineer (ML Inference Platform). About the Team The ML Inference... ...organization. Our team owns the cloud‑agnostic, reliable, and cost‑efficient... ...Design and implement core platform backend software components. Collaborate with ML...
Local area
Remote work
Relocation
Relocation package
Flexible hours
Dormont Manufacturing Co
Sunnyvale, CA
2 days ago
Lead Principal Engineer, Inference Cloud Platform
Cerebras Systems, Inc. is seeking a Principal Engineer to lead their Inference Cloud Platform team. This pivotal role involves identifying key platform... ...reliability. The ideal candidate has over 10 years of software engineering experience and deep expertise in...
Cerebras Systems, Inc.
Sunnyvale, CA
3 days ago
Senior ML Inference Platform Engineer (Remote)
...looking for a Senior ML Infrastructure Engineer in Mountain View, California. This position... ...build and scale robust platforms for ML inference workflows supporting GM’s AI efforts.... ...model serving strategies and handle backend software components. The position demands 5+...
Remote job
Israelvcforum
Mountain View, CA
5 days ago
ML Engineer — AI Platform & Multimodal Inference
...Mountain View is seeking a Machine Learning Engineer to build and optimize the infrastructure... ...data understanding, optimizing inference pipelines, and collaborating with teams... ...frameworks are required. Knowledge of NLP and cloud ML infrastructure is preferred.
Corvic
Mountain View, CA
3 days ago
Senior ML Inference Engineer - Platform
$128.7k - $261.3k
...Description About the Team The Model Deployment & Inference Solutions team in GM AV deploys machine... ...currently performed manually by engineers. Build the developer experience that ML model... ...Experience designing clean, well-tested software with clear interfaces and good...
Local area
Remote work
Work from home
Relocation package
Flexible hours
Shift work
General Motors
Sunnyvale, CA
3 days ago
Senior ML Inference Platform Engineer (Remote)
Dormont Manufacturing Co is looking for a Senior ML Infrastructure Engineer to help build and scale robust platforms for ML inference workflows. You will collaborate with ML engineers and researchers while shaping the future of AI infrastructure at GM. The ideal candidate...
Remote job
Dormont Manufacturing Co
Sunnyvale, CA
2 days ago