Staff Software Engineer - Real-Time AI Inference Infra

Cerebras Systems, Inc.

Cerebras Systems, Inc. is seeking a Software Engineer in Sunnyvale, California to enhance high-performance, low-latency inference infrastructure. This role involves deploying scalable services, optimizing resource allocation, and integrating with containerized environments like Docker and Kubernetes. The ideal candidate holds a Master’s in Computer Science and has at least one year of development experience, along with expertise in Docker, Kubernetes, Java, and Python. Join us in building revolutionary AI technology!

Vacancy posted 11 hours ago
Similar jobs that could be interesting for you
Based on the Staff Software Engineer - Real-Time AI Inference Infra in Sunnyvale, CA vacancy
  • $132k - $330k

    Software Engineer, AI Inference Codesign The AI inference co-design team's goal is to take research models and make them run efficiently on our AI-ASIC to power real-time inference for Autopilot and Optimus programs. This unique role lies at the intersection of AI research... 
    Suggested
    Hourly pay
    Full time
    Temporary work
    Flexible hours

    Tesla Motors, Inc.

    Palo Alto, CA
    3 days ago
  • Cerebras Systems in Sunnyvale, CA is seeking a Member of Technical Staff (Software Engineer) to implement infrastructure for high-performance, low-latency inference services. Applicants should have a Master’s degree in Computer Science or a related field and at least one... 
    Suggested

    CEREBRAS SYSTEMS INC.

    Sunnyvale, CA
    1 day ago
  • $170k - $190k

     ...is simple: deliver the best AI-powered customer experience—faster...  ...to-text, text-to-speech, and real-time streaming audio pipelines....  ...handling, streaming inference, and audio quality, and can translate...  ...leadership within the team, mentoring engineers and promoting best practices... 
    Suggested
    Remote work

    ASAPP

    Mountain View, CA
    2 days ago
  •  ...the world's largest AI chip, 56 times larger than GPUs. This...  ...leading training and inference speeds; over 10 times...  ...applications, unlocking real-time iteration and...  ...Role We're hiring a Staff Engineer to help lead, drive,...  ...maintain production software, with responsibilities... 
    Suggested

    Cerebras Systems, Inc.

    Sunnyvale, CA
    11 hours ago
  • Google Inc. is seeking a software engineer to develop next-generation technologies impacting billions of users. The role involves working with real-time communication technologies and contributing to product design with a focus on innovation and scalability. Candidates... 
    Suggested

    Google Inc.

    Mountain View, CA
    1 day ago
  • $248.71k - $292.6k

    About Groq Groq delivers fast, efficient AI inference. Our LPU-based system powers GroqCloud™,...  ...compute more accessible and affordable. When real-time AI is within reach, anything is possible. Build fast. Sr. Staff Software Engineer - High Performance GPU Inference Systems... 

    Groq

    Palo Alto, CA
    2 days ago
  • $190k - $250k

    A leading financial technology firm in California seeks an AI Inference engineer to join its team. The role involves developing APIs for AI inference, improving system reliability, and optimizing LLM performance. Required qualifications include experience with ML systems... 

    Pantera Capital

    Palo Alto, CA
    1 day ago
  • A technology startup is seeking a deeply technical software engineer to develop high-performance, real-time software. The successful candidate will design distributed...  ...compensation, comprehensive benefits, and equity in the rapidly growing company.

    Coram AI

    Sunnyvale, CA
    4 days ago
  • $120k - $240k

    Lyte AI Inc. is looking for a DSP Engineer in Sunnyvale, CA to develop real-time signal processing software across embedded platforms. You will collaborate with cross-functional teams for system integration and participate in software optimization. Ideal candidates have... 
    Flexible hours

    Lyte AI Inc.

    Sunnyvale, CA
    22 hours ago
  • $215k - $230k

     ...robotics company developing AI-powered robots to support humanity...  .... JOB SUMMARY As a Senior Software Engineer, you will play a pivotal role...  ...precisely and reliably in real-world scenarios. You will be...  ...board computer to enable real-time control, seamless integration... 
    Full time
    Local area

    Apptronik

    Sunnyvale, CA
    13 hours ago
  • $160k - $200k

    Zoomcar is seeking a backend developer in Santa Clara to create and maintain media-related services for real-time communication including audio and video. Ideal candidates will have a strong proficiency in C++ and/or Go, a solid understanding of real-time streaming protocols... 

    Zoomcar

    Santa Clara, CA
    3 days ago
  • Illumio is hiring a Senior Software Engineer in Sunnyvale, California, to architect high-scale distributed systems, focusing on data processing and real-time analytics. The role requires expertise in backend engineering with Java, Python, or Go, and strong knowledge in... 

    Illumio

    Sunnyvale, CA
    3 days ago
  • $188k - $275k

     ...CoreWeave is The Essential Cloud for AI™. Built for pioneers by...  ...at What You'll Do: Inference Platform Team The Inference...  ...efficiency, and reliability across real-time inference systems. About the role: As a Staff Software Engineer (IC5) on the Inference team,... 
    Permanent employment
    Temporary work
    Casual work
    Work at office
    Flexible hours

    CoreWeave

    Sunnyvale, CA
    26 days ago
  •  ...in Sunnyvale, California, is looking for an ambitious Applied AI Engineer to build and maintain AI integrated backend services. You'll collaborate...  ..., product managers, and AI researchers to design scalable real-time systems. Ideal candidates will have a background in computer... 

    Zoomcar

    Sunnyvale, CA
    22 hours ago
  •  ...Deductive-Ai is seeking a Software Engineering Intern for fall 2024 in Mountain View, CA. This paid, on-site internship offers hands-on experience in...  ...machine learning. Competitive compensation and opportunities for full-time roles are also provided.
    Full time
    Internship

    Deductive-Ai

    Mountain View, CA
    3 days ago
  •  ...the Role We're hiring a Staff Engineer to own major areas of...  ...architecture of our Inference Cloud Platform. This team...  ...under bursty AI workloads, performance...  ...system evolution over time, and own the roadmap for...  ...years of experience in software engineering, with substantial... 

    Cerebras Systems, Inc.

    Sunnyvale, CA
    11 hours ago
  •  ...Digital Space LLC in Palo Alto is looking for a Senior Engineer to develop a next-generation inference platform integrated with Atlas. This role involves...  ...infrastructure and collaborating with teams to enhance AI capabilities. Ideal candidates will have over 5 years... 

    United States Digital Space LLC

    Palo Alto, CA
    2 days ago
  • $193.93k - $291.15k

     ...scalable driver, combining cutting-edge AI with automotive-grade hardware. Nuro licenses...  ...standard protocols. About the Work Engineered Connectivity: Architect a network bonding...  ...-safe code and understand the nuances of real-time systems. Protocol Native: You don’t just... 
    Remote work

    Icehouseventures

    Mountain View, CA
    22 hours ago
  • $160.5k - $240.7k

     ...Technologies, Inc. Job Area Engineering Group Machine...  ...learning hardware and software. Minimum Qualifications...  ...specialization in edge AI, computer vision, or embedded...  ...experience developing real‑time edge AI systems with...  ...model architectures, inference pipelines, and runtime... 
    Work experience placement
    Work from home

    Qualcomm

    Santa Clara, CA
    3 days ago
  • A leading AI security firm in California is seeking a hands-on Engineering Manager for its Platform team. The role involves defining technical roadmaps, leading...  ...projects. Candidates should have strong experience in real-time distributed systems, be proficient in C++, and... 

    Coram AI

    Sunnyvale, CA
    3 days ago
  • $145k - $235.5k

     ...innovation and impact, solving real-world problems with...  ..., and Inclusion. We weave AI into the fabric of everything...  ...work from the office full time, with flexibility when it’...  ...We are seeking a Sr Staff Software Engineer for our Global Infra Team to develop and maintain... 
    Full time
    Work at office
    Visa sponsorship
    Work visa

    Palo Alto Networks

    Santa Clara, CA
    2 days ago
  • A leading AI-powered fraud detection platform in Mountain View...  ...seeking experienced platform engineers to design and build advanced...  ...machine learning, and developing real-time processing systems. Ideal...  ...candidates should possess substantial software development experience in... 

    DataVisor

    Mountain View, CA
    2 days ago
  • $152k - $241.5k

    Senior Software Engineer, Deep Learning Inference - TensorRT. Locations: US, CA, Santa Clara. Time type: Full...  ...Workflows team and help build the real-time, cost-effective computing platform...  ...for an existing vacancy. NVIDIA uses AI tools in its recruiting processes. NVIDIA... 

    NVIDIA Corporation

    Santa Clara, CA
    2 days ago
  • $166k - $244k

    Senior Software Engineer, Infra, Vertex Gemini API+ Serving - Sunnyvale, CA, USA. About...  ...and technology on the Vertex AI platform, bridging the gap between research and real‑world applications. Design and...  ...salary range for this full‑time position is $166,000-$244,000... 
    Full time

    Carlsbad Tech

    Sunnyvale, CA
    2 days ago
  • A leading AI technology company is seeking a Software Engineering Intern for fall 2024 in Mountain View, CA. In this paid on-site role, you will tackle deep technical...  ..., competitive compensation, and potential for full-time opportunities in a disruptive startup environment.... 
    Full time
    Internship

    Deductive AI

    Mountain View, CA
    4 days ago
  • $200k - $350k

     ...building a cinematic AI storytelling experience...  ...looking for a Senior or Staff Software Engineer who ships product...  ...end-to-end. You'll own real surfaces of the product...  ...the model layer, or the infra Experience shipping...  ...Experience with video, real-time systems, or AI/ML... 
    Live in

    Utopai Studios, Inc

    Mountain View, CA
    4 days ago
  • $196.5k - $219.3k

     ...LLM requests/responses in real time, preventing prompt...  ...features. Mentor junior engineers on secure backend development...  ...of high‑quality software features while adhering...  ...language models or other AI/ML systems (e.g. implementing model inference pipelines, fine‑tuning models... 
    Full time

    Zoomcar

    Sunnyvale, CA
    3 days ago
  • Sanas, located in Palo Alto, is seeking a Software Engineer to lead the development of cross-...  ...platform applications for our advanced speech AI models. Candidates should have a...  ...high-performance systems and working with real-time capabilities as part of a rapidly growing... 

    Sanas

    Palo Alto, CA
    4 days ago
  •  ...observability, and RCA infrastructure that makes Sage Care’s AI assistant Role Overview Own and build the full...  ...Sage Care’s AI assistant trustworthy and debuggable —in real time and post-call. This engineer builds the visibility layer across telephony, transcription... 
    Immediate start

    Sage Care

    Palo Alto, CA
    2 days ago
  • $180k

     ...xAI’s mission is to create AI systems that can...  ...motivated, and focused on engineering excellence. This organization...  ...tier that powers training, inference, recommendations, and real-time data extraction.  We...  ...7 years of experience in software development, plus 2+ years... 
    Temporary work

    xAI

    Palo Alto, CA
    more than 2 months ago
