Sign up to access all features of our service.
  • Job search
  • Favorites
  • Create a CV
    New
  • Salaries
  • Subscriptions

Member of Technical Staff, ML Performance

Odyssey

Who we are Odyssey is an AI lab pioneering general-purpose world models—a new form of multimodal intelligence unlocking entirely new consumer, enterprise, and intelligence applications. World models are the next major frontier in AI, and Odyssey is leading the way with breakthrough models like Odyssey-2 Pro. What we're looking for We’re seeking those who are obsessed with gaining every last drop of performance from complex systems. We're building inference infrastructure to scale to hundreds of thousands of users within a year, while also working with massive, ever-growing datasets and models in training. Your focus will be ensuring our models deliver exceptional speed, reliability, and scalability in both the training and inference phases, optimizing efficiency to minimize TFLOPS per user and training compute cost. What you'll do Optimize models that will be used in real-time by hundreds of thousands of users. Design and implement distributed training strategies to reduce training time and resource consumption on large GPU clusters. Partner with our elite team of ML researchers and engineers to ensure model architectures are highly performant from conception. Develop sophisticated tools to identify performance bottlenecks and stability issues in both training and serving environments. Pioneer innovative approaches, frameworks, and system designs that enhance performance metrics across our model development and inference infrastructure. Have significant autonomy in technical decisions. Use the latest-generation GPUs. Who you are 8+ years of software engineering experience, with significant work in ML performance. Deep insight into modern machine learning architectures with a natural instinct for performance optimization, particularly distributed training and inference. Track record of owning projects end to end. Problem-solving mindset with the ability to acquire new skills as needed. Proficiency with PyTorch (or TF/JAX) and Triton as well as NVIDIA GPU ecosystems and optimization stacks. Highly metric-based. #J-18808-Ljbffr

Vacancy posted 5 days ago
Similar jobs that could be interesting for youBased on the Member of Technical Staff, ML Performance in Santa Clara, CA vacancy
  • $230k

     ...users to effortlessly run large-scale ML applications, without the hassle of managing...  ...Inc. has multiple openings for Sr. Member of Technical Staff. Title: Sr. Member of Technical Staff...  ...low-latency and scalable system performance. Develop Python-based scripts and APIs... 
    Performance
    Remote work

    CEREBRAS SYSTEMS INC.

    Sunnyvale, CA
    5 days ago
  •  ...Member Of Technical Staff, Machine Learning Kernels We are seeking a Member of Technical Staff, Machine...  ...design, optimize, and benchmark high-performance compute kernels for modern machine...  ...implement, and optimize high-performance ML kernels, primarily targeting GPUs (... 
    Performance
    Visa sponsorship
    Relocation package

    Netpreme

    Santa Clara, CA
    1 day ago
  • $169.6k

     ...users to effortlessly run large-scale ML applications, without the hassle of managing...  ...Inc. has multiple openings for Member of Technical Staff (Software Engineer) Title : Member...  ...Implement infrastructure to support high-performance, low-latency inference service.... 
    Performance
    Full time
    Part time
    Internship
    Remote work

    CEREBRAS SYSTEMS INC.

    Sunnyvale, CA
    3 days ago
  •  ...About the Role As a Member of Technical Staff [Platform] at NeoCognition , you’ll design and build the...  ...Implement and monitor observability and performance metrics across services and pipelines....  ...data pipelines , job scheduling , or ML experimentation workflows . Excellent... 
    Performance

    NeoCognition Inc.

    Palo Alto, CA
    5 days ago
  •  ...despite non-deterministic model behavior. Role As a Member of Technical Staff, Machine Learning, you will build core ML components. You will work on real production...  ...‑world and synthetic data. Debug model issues, performance problems, and production incidents. Ship... 
    Performance
    Immediate start

    A1 Services

    Palo Alto, CA
    5 days ago
  •  ...RadixArk is seeking a Member of Technical Staff — Inference to push the limits of large-scale AI inference...  ...frontier models at scale, optimizing performance, latency, throughput, and cost across...  ...intersection of systems engineering, ML infrastructure, and performance... 
    Performance
    Worldwide
    Flexible hours

    RadixArk

    Palo Alto, CA
    4 days ago
  •  ...Member of Technical Staff — Kernel / Compiler / Communication About the Role RadixArk is seeking a Member...  .../ Communication to push the limits of performance for frontier AI systems. You will work...  ...Contributions to kernel/compiler/ML systems open source Experience scaling... 
    Performance
    Flexible hours

    RadixArk

    Palo Alto, CA
    4 days ago
  •  ...Member of Technical Staff Physical AI (Robotics / World Models) Palo Alto, CA About Orbifold AI Orbifold...  ...role focused on real-world system performance , not purely theoretical research. Key...  ...experience building or iterating on applied ML systems at scale Comfortable operating... 
    Performance
    Shift work

    Bonfirevc

    Palo Alto, CA
    5 days ago
  •  ...Member of Technical Staff -- Cluster / Platform About the Role RadixArk is looking for a Member of Technical...  ...and operate highly reliable, high-performance GPU/TPU clusters, build next-...  ...Strong Plus: Experience with large-scale ML/AI workloads Familiarity with RDMA, InfiniBand... 
    Performance
    Flexible hours

    RadixArk

    Palo Alto, CA
    5 days ago
  •  ...partners. We are looking for an exceptional Member of Technical Staff to help design, build, and scale core...  ..., and infrastructure. Emphasis on performance, scalability, and reliability. AI...  ...software, distributed systems, compilers, ML systems, hardware‑software co‑design,... 
    Performance

    DensityAI

    Mountain View, CA
    4 days ago
  •  ...at our Santa Clara, CA or Boston, MA office. Responsibilities Performance modeling for GPU memory subsystems and memory‑semantics interconnects...  ...movement ASICs Familiarity with state‑of‑the‑art distributed ML algorithms Preferred Qualifications Experience architecting... 
    Performance
    Work at office
    Visa sponsorship
    Relocation package

    Netpreme

    Santa Clara, CA
    4 days ago
  •  ...custom ASICs alongside evolving ML workloads, and enable a new...  ...Intel. We're looking for staff/principal-level compiler engineers...  .... What You’ll Do As a Member of the Technical Staff on the Compilers team...  ...silicon team. Feed real workload performance data back into architectural... 
    Performance

    Architect Labs

    Palo Alto, CA
    4 days ago
  • $180k - $250k

     ...Member of Technical Staff -- TPU Systems (JAX / XLA / PALLAS) About the Role RadixArk is looking for a TPU Systems Engineer to build high-performance inference and training systems using JAX, XLA, and Pallas...  ...experience building production ML systems with JAX, XLA, or TPU... 
    Performance
    Full time
    Flexible hours

    RadixArk

    Palo Alto, CA
    5 hours ago
  •  ...Member of Technical Staff — Supercomputing About the Role RadixArk is hiring a Member of Technical Staff...  ..., LLM inference serving, ML systems, or large‑scale training workloads...  ...infrastructure, orchestration, serving, and performance layers. Experience with Python, Bash,... 
    Performance
    Flexible hours

    RadixArk

    Palo Alto, CA
    4 days ago
  •  ...design custom ASICs alongside evolving ML workloads, and enable a new era of...  ..., Apple and Intel. What You’ll Do As Member of the Technical Staff - Software at Architect, you’ll build...  ...full stack, developing intuitive, high‑performance systems using TypeScript, Python, and... 
    Performance

    Architect, Inc.

    Palo Alto, CA
    4 days ago
  • $90k - $130k

     ...Member of Technical Staff - Program Analysis This role is based in Palo Alto, California, and follows...  ...role focused on our static analysis and ML‑for‑code initiatives. As an MTS, you...  ...improve precision, recall, coverage, and performance across supported languages. Help... 
    Performance

    Endor Labs

    Palo Alto, CA
    5 days ago
  •  ...hardware revolution. What You'll Do As a Founding Member of the Technical Staff (ML infra) at Architect, you'll be responsible for the...  ...detect bottlenecks and implement optimizations for high-performance training setups . Collaborate closely with ML researchers... 
    Performance

    Architect Labs

    Palo Alto, CA
    1 day ago
  •  ...About the Role As a Member of Technical Staff [Research] at NeoCognition , you’ll be part of the core...  ...Design and execute experiments, benchmark performance, and analyze model behaviors to...  ...in Python and familiarity with modern ML frameworks (e.g., PyTorch, JAX, or TensorFlow... 
    Performance

    NeoCognition Inc.

    Palo Alto, CA
    4 days ago
  •  ...largest real-world radiology datasets. About the Role As a Member of Technical Staff on the ML Infrastructure team, you will build and operate the...  ...turn training requirements into scalable systems. Debug performance and reliability issues across the ML stack. What You... 
    Performance

    Cognita Imaging Inc.

    Palo Alto, CA
    5 hours ago
  •  ...RadixArk is seeking a Member of Technical Staff — Training to build and scale the systems that train frontier AI models. You will work...  ...of GPUs. This role sits at the intersection of ML, systems, and performance engineering. Your work will directly impact how next-... 
    Performance
    Flexible hours

    RadixArk

    Palo Alto, CA
    4 days ago
  • $180k

     .... About the Role We are looking for a Member of Technical Staff - Mid-Training to lead the development...  ...data mixtures to improve downstream RL performance. Build and optimize distributed...  ...compilers, formal methods, or large-scale ML — rather than post-training specifically... 
    Performance
    Full time

    Hark

    San Jose, CA
    4 days ago
  •  ...design custom ASICs alongside evolving ML workloads, and enable a new era of...  ...Intel. What You’ll Do As a Founding Member of the Technical Staff at Architect, you’ll be at the forefront...  ..., ensuring that theoretical performance translates into production‑ready implementations... 
    Performance

    Architect, Inc.

    Palo Alto, CA
    4 days ago
  • $177k - $387k

     ...memory and storage solutions. The Senior Member of the Technical Staff in the Core Data Center Business Unit...  ...long‑term technology vision for high‑performance storage in evolving system...  ...across diverse workloads, including AI/ML infrastructure. Ability to articulate... 
    Performance
    Local area

    Micron Technology

    San Jose, CA
    4 days ago
  • $180k

     ...and Trino, enabling real‑time ML pipelines, feed ranking,...  ...that require fault tolerance, performance, and absolute reliability. About...  ...phone interview”) during which a member of our team will ask some...  ...process, which consists of 2 technical interviews and 1 project deep... 
    Performance
    Temporary work
    Work at office
    Work from home

    Pantera Capital

    Palo Alto, CA
    4 days ago
  • $180k

     ...Member of Technical Staff - Multimodal Understanding About xAI xAI’s mission is to create AI systems...  ...scaling paradigms for state‑of‑the‑art performance. Build research tooling, user‑friendly...  ...or optimizing large‑scale distributed ML systems (training/inference optimisation... 
    Performance
    Temporary work

    Xai

    Palo Alto, CA
    5 days ago
  • $180k

     ...and Trino, enabling real-time ML pipelines, feed ranking,...  ...that require fault tolerance, performance, and absolute reliability. About...  ...phone interview”) during which a member of our team will ask some...  ...process, which consists of 2 technical interviews and 1 project deep... 
    Performance
    Temporary work
    H1b
    Work at office
    Work from home
    Work visa

    Xai

    Palo Alto, CA
    5 hours ago
  •  ...design custom ASICs alongside evolving ML workloads, and enable a new era of...  ...Intel. What You'll Do As a Founding Member of the Technical Staff on the RTL Design team at Architect,...  ...development on energy-efficient, high-performance HW accelerators on your block-of-expertise... 
    Performance

    Architect Labs

    Palo Alto, CA
    5 days ago
  • $148.5k - $223.9k

     ...Senior Member of Technical Staff - AI ResearchSkip to main content#Senior Member of Technical Staff -...  ...exceptional engineering skills.** *Has deep ML knowledge with meaningful...  ...for correctness, quality, security, and performance** *Strong software engineering fundamentals... 
    Performance
    Work at office

    Salesforce

    Palo Alto, CA
    4 days ago
  • $180k

     ...Member of Technical Staff - Data Platform Palo Alto, CA About xAI xAI's mission is to create...  ...Flink, and Trino, enabling real-time ML pipelines, feed ranking, experimentation...  ...systems that require fault tolerance, performance, and absolute reliability. As a software... 
    Performance
    Temporary work

    Xai

    Palo Alto, CA
    3 days ago
  • $280k - $350k

     ...prototypes, shaping both the technical roadmap and the research culture...  ...run experiments, benchmark performance, and analyze model behaviors...  ...roadmap as one of the first members of the team. What we’re looking...  ...or model evaluation. Strong ML fundamentals across model architectures... 
    Performance
    Internship
    Relocation
    Visa sponsorship
    Relocation package

    Raydar Inc

    Palo Alto, CA
    3 days ago

Do you want to receive more vacancies?

Subscribe and receive similar vacancies to Member of Technical Staff, ML Performance. Be the first to apply!