Member of Technical Staff, ML Performance

Odyssey

Odyssey is an AI lab pioneering general-purpose world models: causal, multimodal systems that learn to predict and interact with the world over long horizons, while generating real-time, interactive simulations from any starting point. This foundational technology promises to revolutionize robotics, science, healthcare, education, gaming, defense, and beyond.

What We're Looking For

We're seeking people who are obsessed with squeezing every last drop of performance out of complex systems. We're building inference infrastructure to scale to hundreds of thousands of users within a year, while also working with massive, ever-growing datasets and models in training. Your focus will be ensuring our models deliver exceptional speed, reliability, and scalability in both training and inference, optimizing efficiency to minimize TFLOPS per user and training compute cost.

What You'll Do
  • Optimize models that will be used in real time by hundreds of thousands of users.

  • Design and implement distributed training strategies to reduce training time and resource consumption on large GPU clusters.

  • Partner with our elite team of ML researchers and engineers to ensure model architectures are highly performant from conception.

  • Develop sophisticated tools to identify performance bottlenecks and stability issues in both training and serving environments.

  • Pioneer innovative approaches, frameworks, and system designs that enhance performance metrics across our model development and inference infrastructure.

  • Have significant autonomy in technical decisions.

  • Use the latest-generation GPUs.

Who You Are
  • 8+ years of software engineering experience, with significant work in ML performance.

  • Deep insight into modern machine learning architectures with a natural instinct for performance optimization, particularly distributed training and inference.

  • Track record of owning projects end to end.

  • Problem-solving mindset with the ability to acquire new skills as needed.

  • Proficiency with PyTorch (or TF/JAX) and Triton as well as NVIDIA GPU ecosystems and optimization stacks.

  • Highly metrics-driven.

Vacancy posted 11 hours ago