Distributed Software Engineer
CEREBRAS SYSTEMS INC.
Distributed Software Engineer
Bengaluru, Karnataka, India; Sunnyvale CA or Toronto Canada
Cerebras Systems builds the world's largest AI chip, 56 times larger than GPUs. Our novel wafer-scale architecture provides the AI compute power of dozens of GPUs on a single chip, with the programming simplicity of a single device. This approach allows Cerebras to deliver industry-leading training and inference speeds and empowers machine learning users to effortlessly run large-scale ML applications, without the hassle of managing hundreds of GPUs or TPUs.
Cerebras' current customers include top model labs, global enterprises, and cutting-edge AI-native startups. OpenAI recently announced a multi-year partnership with Cerebras, to deploy 750 megawatts of scale, transforming key workloads with ultra high-speed inference.
Thanks to the groundbreaking wafer-scale architecture, Cerebras Inference offers the fastest Generative AI inference solution in the world, over 10 times faster than GPU-based hyperscale cloud inference services. This order of magnitude increase in speed is transforming the user experience of AI applications, unlocking real-time iteration and increasing intelligence via additional agentic computation.
About The Role
Cerebras Systems is a pioneer in large-scale AI Supercomputers. These multi-exaflop supercomputers are deployed in some of the biggest datacenters. These supercomputers are built using our Wafer-Scale Cluster technology - a cluster of several Wafer Scale Engine (WSE) chips. The Cluster engineering team is responsible for delivering software that are all-things related to cluster.
Responsibilities
- Automate bare-metal configuration of networking, OS, and application software in large clusters of Cerebras WSE, servers, and switches.
- Additional push button workflows for cluster upgrades, downgrades, and security patching with key metrics to minimize downtime on clusters.
- An orchestration and scheduler system for resource allocation, job submission C placements for a multi-user environment on a cluster.
- Seamless support for both on-premise and cloud mode deployment and operations.
- A robust system for monitoring, detecting and handling failures for a variety of resources on the clusters (including High Availability of clusters).
- Broad cluster and job monitoring and visualization capabilities, along with alerting systems.
- User facing tools to monitor the status of jobs and collect metrics.
- Administrator facing tools to manage and operate large clusters.
Skills & Qualifications
- Strong track record of software architecture, system design and development.
- Strong track record of development in distributed cluster.
- Strong understanding of Kubernetes (K8s) software ecosystem, Prometheus and Grafana.
- Strong development skills in GoLang, Python, bash.
- Strong debugging skills with distributed systems.
- Strong skill to develop tests for the new features and regress old features.
Why Join Cerebras
People who are serious about software make their own hardware. At Cerebras we have built a breakthrough architecture that is unlocking new opportunities for the AI industry. With dozens of model releases and rapid growth, we've reached an inflection point in our business. Members of our team tell us there are five main reasons they joined Cerebras:
- Build a breakthrough AI platform beyond the constraints of the GPU.
- Publish and open source their cutting-edge AI research.
- Work on one of the fastest AI supercomputers in the world.
- Enjoy job stability with startup vitality.
- Our simple, non-corporate work culture that respects individual beliefs.
Apply today and become part of the forefront of groundbreaking advancements in AI!
Cerebras Systems is committed to creating an equal and diverse environment and is proud to be an equal opportunity employer. We celebrate different backgrounds, perspectives, and skills. We believe inclusive teams build better products and companies. We try every day to build a work environment that empowers people to do their best work through continuous learning, growth and support of those around them.
$215k - $250k
...Onehouse Data Infrastructure Engineer Onehouse is a mission-driven company dedicated... ...created large-scale data systems and globally distributed platforms that sit at the heart of some... ...of our data tech stack by building the software and data features that actually process...SuggestedOdd jobWork at officeLocal areaRemote workRelocationRelocation package$100k
...Software Engineer, TT-Distributed Tenstorrent is leading the industry on cutting-edge AI technology, revolutionizing performance expectations, ease of use, and cost efficiency. With AI redefining the computing paradigm, solutions must evolve to unify innovations in...SuggestedPermanent employment$147k - $211k
Software Engineer, Distributed Rate Limiting Services Experience driving progress, solving problems, and mentoring more junior team members; deeper expertise and applied knowledge within relevant area. Requirements Bachelor’s degree or equivalent practical experience...SuggestedFull time$168k - $270.25k
...using advanced programming skills to build distributed and compute systems, backend services,... ...BS or MS in Computer Science, Computer Engineering or related field (or equivalent... ...experience developing microservices, cloud software and/or tooling roles. Desirable Experience...Suggested$160.36k - $240.54k
...more connected future. About the Role We’re looking for senior engineers to build/scale Nuro's large-scale computing infrastructure in... ...have proven experience in building and developing large-scale distributed applications (e.g. Kubernetes). You’re self-motivated to...Suggested$147k - $211k
Software Engineer, Google Distributed Cloud Infrastructure Cluster corporate_fare Google place Sunnyvale, CA, USA Bachelor’s degree or equivalent practical experience. 2 years of experience programming in Python or Go. Experience with large scale distributed systems,...Full time$140k - $240k
..., best practices, security-first based engineering. Cerebras cluster involves complex HW components... ...integrated cluster management software stack - all the way from a bare-metal deployment... ...leadership/management role in distributed systems security. ~ Proven track record...- A technology company in California is seeking a Software Engineer in Robotics to develop distributed software systems for their innovative micromanufacturing technology. You will work closely with engineering teams to design, debug, and implement software for complex robotics...
- ...your career. THE ROLE: AMD is looking for a strategic software engineering lead who is passionate about improving the performance of... ...Expertise with techniques used to optimize inference like distributed kv-cache, disaggregation, request scheduling etc Ability...
$2,000 per month
...organizations deliver on the promise of AI. What is The Role: We are on the lookout for a Principal Software Engineer I to join our Elasticsearch - Distributed Systems team and focus on how Elasticsearch provides scale, performance, and resilience. This team owns...Local areaFlexible hours$120.5k - $243k
...A leading technology firm is seeking a System Software Engineer to work onsite in Sunnyvale, California. This role will involve designing... ...4 years of software development experience, particularly in distributed systems and microservice architectures. A competitive salary...- Nuro, based in Mountain View, is seeking senior engineers to build and scale its large-scale computing infrastructure. The role involves... ...applications. The ideal candidate has experience with distributed applications and holds a bachelor's degree in Computer Science...
- ...cybersecurity firm in Sunnyvale, CA is looking for a dedicated engineer to develop secure, distributed multi-tenant systems. You will be responsible for... ...science, experience in Go and SQL, and a passion for software development. This role includes mentoring junior staff...
$207k - $300k
A leading technology company in Sunnyvale, CA seeks a Staff Software Engineer for Infrastructure roles. You will drive project priorities, working with distributed teams to design and develop scalable software solutions. Candidates should have over 8 years of software development...Full time$192k - $260k
Databricks is looking for a seasoned engineer with over 8 years of experience in Java, Scala, or C++. The ideal candidate will contribute... ...is required, with a strong foundation in algorithms and distributed systems. We offer a generous salary range of $192,000 — $260,0...$192k - $260k
...growing SaaS companies in the world. Our engineering teams build highly technical products... ...and operate one of the largest scale software platforms. The fleet consists of millions... .... Optional: MS or PhD in databases, distributed systems. Comfortable working towards a...Work at officeLocal area- ...digital future. Requirements We’re looking for a Staff Software Engineer to join our Confidential Computing Management team—an... ...design, build, and own core platform services powering secure, distributed systems at scale. This is a high-impact, hands-on technical...H1bWorldwide
- ...Infrastructure team supports 1000+ chip design engineers by building tools and platforms that... ...infrastructure runs as userspace software on bare‑metal Linux hosts (no sudo, no... ...locks, concurrency, and debugging Solid distributed‑systems thinking: failures, retries/timeouts...
- A leading transportation network company in Sunnyvale, California seeks a Software Engineer to design and develop software applications. The ideal candidate will hold a Master's degree and possess advanced knowledge in programming languages like C++, Python, and Go. Responsibilities...Remote jobWork at office
$166k - $244k
A leading technology company based in Sunnyvale is looking for a Senior Software Engineer to develop next-generation software solutions. The ideal candidate will have 5 years of experience in software development and expertise in C++. Responsibilities include writing and...$136.3k - $231.7k
## Software Engineer II (Distributed / Scalable Systems)Applylocations: Milpitas, CAtime type: Full timeposted on: Posted Todayjob requisition id: 2636655**Company Overview**KLA is a global leader in diversified electronics for the semiconductor manufacturing ecosystem....Minimum wageTemporary workWork experience placementFlexible hours- Mibura in Mountain View is looking for a Junior Software Engineer to join their team. In this role, you will contribute to building and managing scalable infrastructure technologies that support modern applications across the globe. The ideal candidate will have a Bachelor...
- ...in Sunnyvale is seeking an experienced engineer to develop containerized microservices for... ...mentoring junior engineers and owning software features across the development lifecycle... ...have at least 8 years of experience in distributed systems and cloud technologies (AWS, Azure...
$120k - $170k
...Full Stack Software Engineer Sunnyvale, CA The future of defense will be decided by those who field intelligent machines at scale.... ...or similar) ~ Solid understanding of databases, APIs, and distributed systems ~ Experience shipping production software end-to-end...Full timeRelocation package- ...critical assets. Our Team's Vision Our Engineering team has established a culture based on... ...that ranges from operating systems to distributed applications to UI and visualization.... ...testing, ensuring robust and resilient software delivery Partner cross-...Work at officeImmediate start
$175k - $263k
...Technical Lead, Distributed Systems, Portworx Santa Clara, California We're in an unbelievably exciting area of tech and are fundamentally... ...partners in designing highly scalable and production quality software ~ Proven design sensibility and the willingness to work in a...Work at officeFlexible hours- Pure Storage, Inc. is seeking a Senior Software Engineer in Santa Clara to lead the digital transformation of their Modern Data platforms.... ...8 years of experience in systems software, particularly in distributed systems. Join a team that values innovation and supports personal...Flexible hours
$174.4k - $237.8k
...Senior Full Stack Software Engineer, Platform Mountain View, California Intrinsic is an AI robotics group at Google aiming to reimagine... ...and the team will own the design and implementation of a distributed cloud and on-premises system that enables users across the world...Full timeWork experience placementLocal area$207k - $300k
Tech Lead, Kernel and OS, Google Distributed Cloud Google Sunnyvale, CA, USA Bachelor's degree... ...experience. 8 years of experience in software development. 5 years of experience... ...qualifications: Master’s degree or PhD in engineering, computer science, or a related...Full time$125k - $222k
...deploy these complex systems safely. Our software is used by top automotive OEMs,... ...analytics. Our platform gives autonomy engineers complete visibility and control over the... ...infrastructure, platform engineering, or distributed systems * Experience with large-scale...Full timeFor contractorsFor subcontractorCasual workWork at officeRemote workDay shift
Do you want to receive more vacancies?
Subscribe and receive similar vacancies to Distributed Software Engineer. Be the first to apply!
- software developer internship no experience Sunnyvale, CA
- federal - software developer Sunnyvale, CA
- research software engineer Sunnyvale, CA
- software engineer contract Sunnyvale, CA
- software engineer healthcare Sunnyvale, CA
- network software engineer Sunnyvale, CA
- ngo software engineer Sunnyvale, CA
- software development engineer aws Sunnyvale, CA
- software developer internship Sunnyvale, CA
- software developer intern Sunnyvale, CA

