Staff Software Engineer - Real-Time AI Inference Infra
Cerebras Systems, Inc.
Cerebras Systems, Inc. is seeking a Software Engineer in Sunnyvale, California to enhance high-performance, low-latency inference infrastructure. This role involves deploying scalable services, optimizing resource allocation, and integrating with containerized environments like Docker and Kubernetes. The ideal candidate holds a Master’s in Computer Science and has at least one year of development experience, along with expertise in Docker, Kubernetes, Java, and Python. Join us in building revolutionary AI technology!
$132k - $330k
Software Engineer, AI Inference Codesign. The AI inference co-design team's goal is to take research models and make them run efficiently on our AI-ASIC to power real-time inference for Autopilot and Optimus programs. This unique role lies at the intersection of AI research...
Hourly pay, Full time, Temporary work, Flexible hours
- Cerebras Systems in Sunnyvale, CA is seeking a Member of Technical Staff (Software Engineer) to implement infrastructure for high-performance, low-latency inference services. Applicants should have a Master’s degree in Computer Science or a related field and at least one...
$170k - $190k
...is simple: deliver the best AI-powered customer experience, faster... ...to-text, text-to-speech, and real-time streaming audio pipelines.... ...handling, streaming inference, and audio quality, and can translate... ...leadership within the team, mentoring engineers and promoting best practices...
Remote work
- ...the world's largest AI chip, 56 times larger than GPUs. This... ...leading training and inference speeds; over 10 times... ...applications, unlocking real-time iteration and... ...Role We're hiring a Staff Engineer to help lead, drive,... ...maintain production software, with responsibilities...
- Google Inc. is seeking a software engineer to develop next-generation technologies impacting billions of users. The role involves working with real-time communication technologies and contributing to product design with a focus on innovation and scalability. Candidates...
$248.71k - $292.6k
About Groq: Groq delivers fast, efficient AI inference. Our LPU-based system powers GroqCloud™,... ...compute more accessible and affordable. When real-time AI is within reach, anything is possible. Build fast. Sr. Staff Software Engineer - High Performance GPU Inference Systems...
$190k - $250k
A leading financial technology firm in California seeks an AI Inference engineer to join its team. The role involves developing APIs for AI inference, improving system reliability, and optimizing LLM performance. Required qualifications include experience with ML systems...
- A technology startup is seeking a deeply technical software engineer to develop high-performance, real-time software. The successful candidate will design distributed... ...compensation, comprehensive benefits, and equity in the rapidly growing company. Coram AI
$120k - $240k
Lyte AI Inc. is looking for a DSP Engineer in Sunnyvale, CA to develop real-time signal processing software across embedded platforms. You will collaborate with cross-functional teams for system integration and participate in software optimization. Ideal candidates have...
Flexible hours
$215k - $230k
...robotics company developing AI-powered robots to support humanity... ...JOB SUMMARY: As a Senior Software Engineer, you will play a pivotal role... ...precisely and reliably in real-world scenarios. You will be... ...board computer to enable real-time control, seamless integration...
Full time, Local area
$160k - $200k
Zoomcar is seeking a backend developer in Santa Clara to create and maintain media-related services for real-time communication including audio and video. Ideal candidates will have a strong proficiency in C++ and/or Go, a solid understanding of real-time streaming protocols...
- Illumio is hiring a Senior Software Engineer in Sunnyvale, California, to architect high-scale distributed systems, focusing on data processing and real-time analytics. The role requires expertise in backend engineering with Java, Python, or Go, and strong knowledge in...
$188k - $275k
...CoreWeave is The Essential Cloud for AI™. Built for pioneers by... ...at What You'll Do: Inference Platform Team The Inference... ...efficiency, and reliability across real-time inference systems. About the role: As a Staff Software Engineer (IC5) on the Inference team,...
Permanent employment, Temporary work, Casual work, Work at office, Flexible hours
- ...in Sunnyvale, California, is looking for an ambitious Applied AI Engineer to build and maintain AI integrated backend services. You'll collaborate... ..., product managers, and AI researchers to design scalable real-time systems. Ideal candidates will have a background in computer...
- ...Deductive-Ai is seeking a Software Engineering Intern for fall 2024 in Mountain View, CA. This paid, on-site internship offers hands-on experience in... ...machine learning. Competitive compensation and opportunities for full-time roles are also provided.
Full time, Internship
- ...the Role We're hiring a Staff Engineer to own major areas of... ...architecture of our Inference Cloud Platform. This team... ...under bursty AI workloads, performance... ...system evolution over time, and own the roadmap for... ...years of experience in software engineering, with substantial...
- ...Digital Space LLC in Palo Alto is looking for a Senior Engineer to develop a next-generation inference platform integrated with Atlas. This role involves... ...infrastructure and collaborating with teams to enhance AI capabilities. Ideal candidates will have over 5 years...
$193.93k - $291.15k
...scalable driver, combining cutting-edge AI with automotive-grade hardware. Nuro licenses... ...standard protocols. About the Work Engineered Connectivity: Architect a network bonding... ...-safe code and understand the nuances of real-time systems. Protocol Native: You don’t just...
Remote work
$160.5k - $240.7k
...Technologies, Inc. Job Area Engineering Group Machine... ...learning hardware and software. Minimum Qualifications... ...specialization in edge AI, computer vision, or embedded... ...experience developing real‑time edge AI systems with... ...model architectures, inference pipelines, and runtime...
Work experience placement, Work from home
- A leading AI security firm in California is seeking a hands-on Engineering Manager for its Platform team. The role involves defining technical roadmaps, leading... ...projects. Candidates should have strong experience in real-time distributed systems, be proficient in C++, and...
$145k - $235.5k
...innovation and impact, solving real-world problems with... ..., and Inclusion. We weave AI into the fabric of everything... ...work from the office full time, with flexibility when it’... ...We are seeking a Sr Staff Software Engineer for our Global Infra Team to develop and maintain...
Full time, Work at office, Visa sponsorship, Work visa
- A leading AI-powered fraud detection platform in Mountain View... ...seeking experienced platform engineers to design and build advanced... ...machine learning, and developing real-time processing systems. Ideal... ...candidates should possess substantial software development experience in...
$152k - $241.5k
Senior Software Engineer, Deep Learning Inference - TensorRT. Locations: US, CA, Santa Clara. Time type: Full... ...Workflows team and help build the real-time, cost-effective computing platform... ...for an existing vacancy. NVIDIA uses AI tools in its recruiting processes. NVIDIA...
$166k - $244k
Senior Software Engineer, Infra, Vertex Gemini API+ Serving - Sunnyvale, CA, USA. About... ...and technology on the Vertex AI platform, bridging the gap between research and real‑world applications. Design and... ...salary range for this full‑time position is $166,000-$244,000...
Full time
- A leading AI technology company is seeking a Software Engineering Intern for fall 2024 in Mountain View, CA. In this paid on-site role, you will tackle deep technical... ..., competitive compensation, and potential for full-time opportunities in a disruptive startup environment....
Full time, Internship
$200k - $350k
...building a cinematic AI storytelling experience... ...looking for a Senior or Staff Software Engineer who ships product... ...end-to-end. You'll own real surfaces of the product... ...the model layer, or the infra Experience shipping... ...Experience with video, real-time systems, or AI/ML...
Live in
$196.5k - $219.3k
...LLM requests/responses in real time, preventing prompt... ...features. Mentor junior engineers on secure backend development... ...of high‑quality software features while adhering... ...language models or other AI/ML systems (e.g. implementing model inference pipelines, fine‑tuning models...
Full time
- Sanas, located in Palo Alto, is seeking a Software Engineer to lead the development of cross-... ...platform applications for our advanced speech AI models. Candidates should have a... ...high-performance systems and working with real-time capabilities as part of a rapidly growing...
- ...observability, and RCA infrastructure that makes Sage Care’s AI assistant Role Overview: Own and build the full... ...Sage Care’s AI assistant trustworthy and debuggable, in real time and post-call. This engineer builds the visibility layer across telephony, transcription...
Immediate start
$180k
...xAI’s mission is to create AI systems that can... ...motivated, and focused on engineering excellence. This organization... ...tier that powers training, inference, recommendations, and real-time data extraction. We... ...7 years of experience in software development, plus 2+ years...
Temporary work

