Member of Technical Staff (Inference)

Xai

Requirements
  • Worked on system optimizations for model serving, such as batching, caching, load balancing, and parallelism
  • Worked on low-level optimizations for inference, such as GPU kernels and code generation
  • Worked on algorithmic optimizations for inference, such as quantization, distillation, speculative decoding, and low-precision numerics
  • Worked on large-scale inference engines or reinforcement learning frameworks
  • Worked on large-scale, high-concurrency production serving
  • Worked on testing, benchmarking, and reliability of inference services

What the job involves
  • Optimizing the latency and throughput of model inference
  • Building reliable production serving systems to serve millions of users
  • Accelerating research on scaling test-time compute
  • Model-hardware co-design for next-generation architectures

Vacancy posted 7 hours ago
Similar jobs that could be interesting for you
Based on the Member of Technical Staff (Inference) in San Francisco, CA vacancy
  • $150k - $300k

     ...position spanning cloud LLM serving, LLM inference optimization and RL systems. You will be...  ...into our RL training stack. Core Technical Responsibilities LLM Serving Multi‑tenant...  ...in open development and encourage team members to contribute to the broader AI community... 
    Suggested
    Work at office
    Remote work
    Visa sponsorship
    Relocation package
    Flexible hours
    Shift work

    Prime Intellect

    San Francisco, CA
    4 days ago
  • $180k

     ...Member Of Technical Staff - Inference Palo Alto, CA About Xai Xai's mission is to create AI systems that can accurately understand the universe and aid humanity in its pursuit of knowledge. Our team is small, highly motivated, and focused on engineering excellence... 
    Suggested
    Temporary work

    Xai

    San Francisco, CA
    2 days ago
  •  ...production workloads built to scale to gigawatt‑class AI datacenters. Mission Gimlet Labs is seeking a Member of Technical Staff focused on ML systems and inference. In this role, you will design and build inference systems that execute full models end‑to‑end under real... 
    Suggested

    Gimlet Labs, Inc.

    San Francisco, CA
    4 days ago
  •  ...Member of Technical Staff, ML Infrastructure & Inference Overview We are a cutting-edge AI infrastructure company building a scalable cloud platform designed for next-generation machine learning workloads ($80M series A). As AI systems continue to grow in complexity... 
    Suggested

    Acceler8 Talent

    San Francisco, CA
    7 hours ago
  • $200k - $350k

     ...Member Of Technical Staff, Inference & Serving Inception creates the world's fastest, most efficient AI models. Our Mercury model is the world's fastest reasoning LLM and first commercially available diffusion LLM, delivering 5x greater speed and efficiency than today... 
    Suggested
    Immediate start
    Flexible hours

    Inception LLC

    San Francisco, CA
    5 days ago
  • $225k

     ...more reliably than humans can alone. Our approach combines frontier‑scale pre‑training, domain‑specific RL, ultra‑long context, and inference‑time compute to achieve this goal. About The Role As a Software Engineer on the Inference & RL Systems team, you will design and... 
    Relocation
    Visa sponsorship

    Magic Inc

    San Francisco, CA
    7 hours ago
  •  ...great products. Join us on our mission and shape the future! Member of Technical Staff, Search Why this role? We are looking for talented...  .... Work closely with the model serving team to ensure that inference is fast and stable. Collaborate with product teams to develop... 
    Full time
    Work at office
    Remote work
    Flexible hours

    Cohere

    San Francisco, CA
    4 days ago
  •  ...Pixeltable Inc. Member of Technical Staff San Francisco, CA·Full time Apply for Member of Technical Staff As a founding member of the engineering...  ...ingestion, transformation, training/fine-tuning, and inference? You will also: Find opportunities to go deep into a wide... 
    Full time
    Part time
    Work at office
    Work from home
    Flexible hours
    2 days per week

    Pixeltable, Inc.

    San Francisco, CA
    7 hours ago
  •  ...What we are looking for? Seeking a Member of Technical Staff - Backend with 5+ years of experience. We are looking for an exceptional builder...  ...scalability of output Design and build the integration of ML inference, monitoring systems, LLM interactions, application layers,... 
    Work experience placement

    RST Recruitment

    San Francisco, CA
    7 hours ago
  • $170k - $220k

     ...Member of Technical Staff – Infrastructure & LLMs Location: San Francisco, CA (Hybrid) Compensation: $170,000 – $220,000 base + 1–3% equity Work...  ...a lean, high-performance team building next-generation inference infrastructure for LLMs. This is an opportunity to own the... 
    Full time
    Temporary work
    Immediate start
    Visa sponsorship
    Work visa

    Amadeus Search

    San Francisco, CA
    7 hours ago
  •  ...uses Shapes every single day, and everyone talks to users. Member of Technical Staff is the title we use for engineers who own hard problems...  ...have experience with LLM training, fine-tuning, evaluation, inference, or RAG at scale High-performance Python backends at scale... 

    Shapes

    San Francisco, CA
    7 hours ago
  •  ...pointing ours at the frontier of science. Role Overview As a Member of Technical Staff you will shape Conductor's core offerings: AI software...  ...Build back‑end services for data collection, labelling, and inference. Integrate with external systems for secure, reliable... 

    Conductor Quantum

    San Francisco, CA
    7 hours ago
  •  ...recognize parts of inputs that are unimportant, reducing inference costs for scale-ups and enterprises that integrate LLMs into...  ...team is 5 people with a research and product focus. As a Member of Technical Staff on our infrastructure team, you'll own the cloud systems that... 
    Visa sponsorship

    The Token Company

    San Francisco, CA
    2 days ago
  •  ...exceptional people to help us get there. The Opportunity Our Edge Inference team compiles Liquid Foundation Models into optimized machine...  ...on-device AI possible. You will work directly with the technical lead on problems that require deep understanding of both ML architectures... 

    Liquid AI

    San Francisco, CA
    7 hours ago
  • $300k

     ...Member of Technical Staff - RL Infrastructure About V max V max is an applied research lab developing AI capable of open-ended learning. We are...  ...at scale: distributed rollouts, training orchestration, inference, evals, data pipelines, observability, and reliability. You... 
    Work at office
    Local area

    VMAX LLC

    San Francisco, CA
    7 hours ago
  • $200k

     ...pre-training, domain-specific RL, ultra-long context, and inference-time compute to achieve this goal. About The Role Evals builds...  ...of many of the company's most important decisions. As a Member of Technical Staff on Evals, you will build both the platform and the... 
    Visa sponsorship
    Relocation package

    Magic Inc

    San Francisco, CA
    7 hours ago
  •  ...Member Of Technical Staff We're looking for a member of technical staff to build and deploy production-grade AI systems. In this role, you...  ...world applications Design scalable pipelines for training, inference, and data processing Improve latency, throughput, cost... 

    ERAGON

    San Francisco, CA
    4 days ago
  •  ...to gigawatt‑class AI datacenters. Gimlet Labs is seeking a Member of Staff focused on AI Research (Intern). As an AI Researcher (Intern)...  ...exploring new model architectures and experimenting with novel inference efficiency techniques such as KV caching and FlashAttention.... 
    Internship

    Gimlet Labs

    San Francisco, CA
    7 hours ago
  •  ...Member of Technical Staff - Post‑Training Join to apply for the Member of Technical Staff - Post‑Training role at Reflection AI. Our Mission...  ...pipelines, reward models, reinforcement learning algorithms, and inference‑time scaling techniques. Collaborate across pre‑training... 
    Full time
    Relocation package

    Reflection AI

    San Francisco, CA
    7 hours ago
  •  ...Member of Technical Staff, Applied Research About Us At Fireworks, we’re building the future of generative AI infrastructure. Our platform delivers...  ...highest-quality models with the fastest and most scalable inference in the industry. We’ve been independently benchmarked as... 

    SupportFinity

    San Francisco, CA
    7 hours ago
  •  ...contributions to developer tools or AI/ML repositories (Desirable) Inference & Hardware Knowledge: Interest in the hardware side of AI—...  ...end‑to‑end What the job involves We are seeking a Member of Technical Staff, Evals & Post‑Training Product to help define how... 

    Fireworks AI

    San Francisco, CA
    7 hours ago
  •  ...Member of Technical Staff, ML Systems Mirendil Mirendil is a tech-first company focused on solving core bottlenecks that unlock step-change acceleration...  ...on (not limited to): Building and scaling training and inference infrastructure (potentially for various chips across... 

    Mirendil

    San Francisco, CA
    7 hours ago
  • $150k

     ...pioneers to lead key initiatives in robotic intelligence. As a Member of Technical Staff, you'll spearhead the development of breakthrough...  ...end‑to‑end vision‑language‑action models, efficient model inference, and video tokenization Design and implement novel deep learning... 
    Local area

    Amazon Science

    San Francisco, CA
    2 days ago
  • $200k

     ...Member of Technical Staff, RL Research & Environments Posted Feb 28, 2026 | Full-time | Advanced (5-10 yrs) Magic’s mission is to build safe...  ...pre-training, domain-specific RL, ultra-long context, and inference-time compute to achieve this goal. About The Role As a Software... 
    Full time
    Relocation
    Visa sponsorship

    Magic Inc

    San Francisco, CA
    2 days ago
  •  ...boundaries of what's possible in robotic intelligence. As a Member of Technical Staff, you'll be at the forefront of developing breakthrough...  ...end‑to‑end vision‑language‑action models, efficient model inference, video tokenization Design and implement novel deep learning... 
    Local area

    Amazon Science

    San Francisco, CA
    1 day ago
  •  ...Activant, 1984 Ventures and Page One. The Role We’re hiring a Member of Technical Staff – AI/ML to design, build, and deploy AI-powered systems...  ...: Develop robust AI pipelines from data ingestion through inference, ensuring reliability, scalability, and maintainability.... 
    Full time
    Flexible hours

    Stuut

    San Francisco, CA
    4 days ago
  •  ...Member of Technical Staff, Applied AI The opportunity We are looking for a Member of Technical Staff with deep expertise in generative modelling...  ...of generative model architectures, training dynamics and inference behaviour. You are a skilful ML developer. You write ML... 
    Flexible hours

    Latent Labs

    San Francisco, CA
    2 days ago
  •  ...design and the responsibility to defend. About the Role As a Member of Technical Staff, Infrastructure & Training Systems at Radical Numerics,...  ...only strong research ideas, but exceptional training and inference systems: infrastructure that makes large-scale experimentation... 
    Local area

    Radical Numerics Inc.

    San Francisco, CA
    20 hours ago
  • $150k - $300k

     ...infrastructure that runs the jobs. Core Technical Responsibilities Hosted Training Infrastructure...  ...operate Kubernetes-based training and inference orchestration across multi-cluster,...  ...in open development and encourage team members to contribute to the broader AI community... 
    Work at office
    Local area
    Remote work
    Visa sponsorship
    Relocation package
    Flexible hours

    Prime-Intellect

    San Francisco, CA
    7 hours ago
  • $150k - $280k

     ...Member of Technical Staff (Backend) San Francisco, CA Compensation: $150,000 – $280,000 + Competitive Equity Type: Full-Time Visa Sponsorship...  ...used by AI agents. Build ML/agent pipelines, distributed inference, and automation frameworks. Own features vertically: design... 
    Full time
    Temporary work
    H1b
    Work at office
    Visa sponsorship
    Relocation package

    Fuku

    San Francisco, CA
    7 hours ago
