Sign up to access all features of our service.
  • Job search
  • Favorites
  • Create a CV
    New
  • Salaries
  • Subscriptions

Senior Inference Performance Engineer

Cerebras

Cerebras seeks a Senior Performance Engineer to join the Product team in Sunnyvale, CA. The role involves developing benchmarks to measure inference performance and creating competitive pricing models. The ideal candidate will have deep knowledge of open-source inference frameworks and at least 5 years of experience in ML systems or high-performance computing. Cerebras offers a unique non-corporate culture in a groundbreaking AI environment, along with job stability and opportunities for growth. #J-18808-Ljbffr Cerebras

Vacancy posted 3 days ago
Similar jobs that could be interesting for youBased on the Senior Inference Performance Engineer in Sunnyvale, CA vacancy
  • $184k - $356.5k

    A leading technology company in California is seeking a Senior DL Algorithms Engineer to drive inference performance for Deep Learning workloads. The role involves implementing advanced model inference and collaborating with co-design teams to optimize performance across... 
    Senior
    Performance

    NVIDIA Corporation

    Santa Clara, CA
    1 day ago
  • $184k - $356.5k

    A leading AI computing company in California is seeking a Senior Deep Learning Software Engineer focused on performance optimization of LLM models. You will analyze and enhance LLM inference performance, working in cross-collaborative teams to implement cutting-edge algorithms... 
    Senior
    Performance
    Full time

    NVIDIA Corporation

    Santa Clara, CA
    4 days ago
  • NVIDIA is seeking a Senior DL Algorithms Engineer to optimize LLM/Omni models and enhance performance across its software stack. The ideal candidate will have a PhD and 3...  ...experience in deep learning, specifically in inference. This role involves profiling, analyzing... 
    Senior
    Performance

    NVIDIA

    Santa Clara, CA
    1 day ago
  • NVIDIA Gruppe in Santa Clara, California is seeking a Senior Software Engineer specializing in Deep Learning Inference. In this role, you will craft and develop high-performance software tailored for scalable platforms while collaborating with experts in the field. The... 
    Senior
    Performance

    NVIDIA Gruppe

    Santa Clara, CA
    1 day ago
  • NVIDIA Gruppe is looking for a skilled engineer to join their TensorRT Edge-LLM team in Santa...  ...role involves developing a state-of-the-art inference framework for large language models and optimizing it for real-time performance on embedded platforms. Candidates should... 
    Senior
    Performance

    NVIDIA Gruppe

    Santa Clara, CA
    1 day ago
  • $152k - $241.5k

    We are looking for a Senior DL Algorithms Engineer for LLM/Omni model optimizations! Seeking senior engineers who are mindful of performance analysis and optimization to help us squeeze every...  ...Cosmos) on NVIDIA’s accelerated inference SW stack. Contribute new features,... 
    Senior
    Performance

    NVIDIA

    Santa Clara, CA
    1 day ago
  • $152k - $287.5k

    NVIDIA Gruppe is seeking a Senior Machine Learning Applications and Compiler Engineer in Santa Clara, California. This role involves developing algorithms for their LPX inference and compiler stack, optimizing the performance of neural network workloads on NVIDIA platforms... 
    Senior
    Performance

    NVIDIA Gruppe

    Santa Clara, CA
    1 day ago
  •  ...allows Cerebras to deliver industry-leading training and inference speeds and empowers machine learning users to effortlessly...  ...agentic computation. About The Role We are hiring a Senior Performance Engineer to join our Product team. You are an expert on state-of-... 
    Senior
    Performance
    Contract work
    Shift work

    CEREBRAS SYSTEMS INC.

    Sunnyvale, CA
    3 days ago
  • $184k - $356.5k

    NVIDIA Gruppe is looking for skilled software engineers to develop AI inference systems that operate with high efficiency. The role involves architecting high-performance inference frameworks and optimizing GPU processes. Ideal candidates should have extensive programming... 
    Senior
    Performance

    NVIDIA Gruppe

    Santa Clara, CA
    1 day ago
  • $135.8k - $237.05k

     ...Mountain View, CA, USA Senior Backend Engineer, ML Inference Systems Location Mountain View, CA, USA Department AI & Machine Learning...  ...power billions of daily decisions, with a focus on the performance, reliability, and scalability of inference systems.... 
    Senior
    Performance
    Work at office
    Worldwide
    Relocation package

    Unity Technologies

    Mountain View, CA
    3 days ago
  • $184k - $287.5k

     ...Overview We are seeking highly skilled and motivated software engineers to join us and build AI inference systems that serve large-scale models with extreme efficiency. You’ll architect and implement high-performance inference stacks, optimize GPU kernels and compilers,... 
    Senior
    Performance

    NVIDIA Gruppe

    Santa Clara, CA
    1 day ago
  • NVIDIA Gruppe is looking for a skilled professional to enhance the performance of large-scale models through advanced optimization techniques in Santa Clara, California. Candidates should have a strong background in DL model training and deployment, ideally with a PhD or... 
    Senior
    Performance

    NVIDIA Gruppe

    Santa Clara, CA
    1 day ago
  • NVIDIA Corporation is seeking a Senior Deep Learning Software Engineer to analyze and improve the performance of NVIDIA's inference ecosystem. This role requires significant expertise in deep learning frameworks and performance optimization. The ideal candidate will have... 
    Senior
    Performance

    NVIDIA Corporation

    Santa Clara, CA
    4 days ago
  • $184k - $287.5k

    NVIDIA is looking for a Senior Deep Learning Software Engineer in Santa Clara, California. This role involves analyzing and improving LLM inference performance using NVIDIA GPUs. Candidates should have extensive software development experience, strong skills in Python/C++... 
    Senior
    Performance

    NVIDIA

    Santa Clara, CA
    1 day ago
  • $152k - $241.5k

    NVIDIA Gruppe in Santa Clara is looking for a Senior Deep Learning Software Engineer to analyze and improve performance in NVIDIA's inference ecosystem. The role involves establishing benchmarking methodologies, contributing to NVIDIA's OSS frameworks, and developing optimized... 
    Senior
    Performance

    NVIDIA Gruppe

    Santa Clara, CA
    1 day ago
  • $190k - $235k

     ...Senior Perception Learning Engineer Sunnyvale, CA Apptronik is a human-centered robotics company developing...  ...to deliver deployable, high-performance perception stacks. You will collaborate...  ..., training infrastructure, and inference frameworks to accelerate model development... 
    Senior
    Performance
    Local area

    Apptronik

    Sunnyvale, CA
    20 days ago
  • $190k - $235k

     ...Senior Learning Perception Engineer - Slam Sunnyvale, C A Apptronik is a human-centered robotics company...  ...to deliver deployable, high-performance SLAM and perception stacks. You will...  ...pipelines, training infrastructure, and inference frameworks to accelerate model... 
    Senior
    Performance
    Local area

    Apptronik

    Sunnyvale, CA
    1 day ago
  • $152k - $241.5k

     ...We are looking for versatile software engineers for our XLA team. NVIDIA is at the center...  ...technology. Come join us to build high-performance, production‑grade software that is at...  ...learning workloads. You will optimize inference and training performance for the JAX framework... 
    Senior
    Performance

    NVIDIA Gruppe

    Santa Clara, CA
    1 day ago
  • A leader in AI technology in Palo Alto is seeking a Senior AI Systems Performance Engineer to optimize the latest foundation models on their innovative platform. This role involves collaborating with cross-functional teams to push the performance limits of AI systems.... 
    Senior
    Performance

    SambaNova

    Palo Alto, CA
    3 days ago
  • $186k - $279k

     ...Senior Storage Benchmarking Engineer Santa Clara, California We're in an unbelievably exciting area...  ...will design, execute, and analyze performance benchmarks spanning both industry-standard...  ..., DLIO, and GPU-driven training/inference data pipelines). As AI has made... 
    Senior
    Performance
    Work at office
    Flexible hours

    Everpure LLC

    Santa Clara, CA
    3 days ago
  • $168k - $270.25k

    NVIDIA is seeking a senior software engineer to automate and optimize performance analysis workflows for AI training and inference. You will design and build tools that enhance efficiency across engineering teams. Ideal candidates will possess an M.S. or PhD in a relevant... 
    Senior
    Performance

    NVIDIA

    Santa Clara, CA
    4 days ago
  • $152k - $241.5k

    Responsibilities Build performance modeling and prediction tools for AI workloads at Data-...  ...and TensorFlow, distributed training and inference Knowledge of GPU cluster job scheduling...  ...makes a candidate stand out Proven SW engineering experience experience in deploying SW... 
    Senior
    Performance

    NVIDIA Gruppe

    Santa Clara, CA
    3 days ago
  • Staff Runtime Systems Engineer Hybrid, working onsite at our Santa Clara, CA, headquarters...  ...focusing on in-memory compute for AI inference in datacenters. This position is for...  ...responsible for all aspects of runtime performance of the silicon product. You will architect... 
    Senior
    Performance
    3 days per week

    d-Matrix inc.

    Santa Clara, CA
    1 day ago
  •  ...developments in Artificial Intelligence, High Performance Computing and Visualization. The GPU,...  ...experience) with 5+ software engineering and HPC/AI experience Development or integration...  ...Frameworks such as PyTorch, JAX, and inference engines such as TRT‑LLM, vLLM, SGLang... 
    Senior
    Performance

    NVIDIA Gruppe

    Santa Clara, CA
    1 day ago
  • A leading technology firm is seeking a Senior Software Engineer for Quantized Inference to implement quantized recipes for advanced model optimization. This role demands strong skills in Python and C++, alongside experience in ML accelerators and software engineering fundamentals... 
    Senior

    NVIDIA Corporation

    Santa Clara, CA
    2 days ago
  • $244.8k

     ...are seeking an experienced Multimodal Model Training and Inference Optimization Engineer with expertise in optimizing AI model training and...  ...work at the cutting edge of AI efficiency, enhancing the performance, scalability, and deployment of large-scale generative AI... 
    Senior
    Performance
    Temporary work
    Local area

    ByteDance

    San Jose, CA
    4 days ago
  • $120.1k - $225.7k

     ...What the Role Entails End-to-End Inference Optimization: Lead the optimization of...  ...Toolchain: Design and implement high-performance inference frameworks; optimize...  ...Ph.D. in Computer Science, Electronic Engineering, AI, or related fields; significant professional... 
    Senior
    Performance
    Relocation package

    Tencent

    Palo Alto, CA
    4 days ago
  • $224k - $356.5k

     ...NVIDIA, a company at the forefront of AI and high-performance computing. As a Senior / Principal Deep Learning Engineer — Model Evaluation & AI Systems, you will play...  ...best practices. Work alongside model training, inference, and product divisions to provide trusted... 
    Senior
    Performance

    NVIDIA Gruppe

    Santa Clara, CA
    1 day ago
  • $128.7k - $261.3k

     ...Job Description Role As a ? Senior System Performance Engineer ?on GM's AV System Performance Team, you will design, build, and optimize reliable...  ...stacks Experience with AI/ML applications or inference software. Background withreal-timeoperating systems.... 
    Senior
    Performance
    Local area
    Remote work
    Work from home
    Relocation package
    Flexible hours

    General Motors

    Sunnyvale, CA
    1 day ago
  • $168k - $270.25k

     ...including NeMo microservices and NVIDIA Inference Microservices (NIM), enabling...  .... We are looking for a senior, technically strong test development engineer to drive quality, automation, and...  ...functional, integration, system, performance, and reliability testing Design,... 
    Senior
    Performance

    NVIDIA Gruppe

    Santa Clara, CA
    1 day ago

Do you want to receive more vacancies?

Subscribe and receive similar vacancies to Senior Inference Performance Engineer. Be the first to apply!