Sign up to access all features of our service.
  • Job search
  • Favorites
  • Create a CV
    New
  • Salaries
  • Subscriptions

Member of Technical Staff, Inference & RL Systems

$225k

Magic Inc

Magic’s mission is to build safe AGI that accelerates humanity’s progress on the world’s most important problems. We believe the most promising path to safe AGI lies in automating research and code generation to improve models and solve alignment more reliably than humans can alone. Our approach combines frontier‑scale pre‑training, domain‑specific RL, ultra‑long context, and inference‑time compute to achieve this goal. About The Role As a Software Engineer on the Inference & RL Systems team, you will design and operate the distributed systems that serve our models in production and power large‑scale post‑training workflows. This role sits at the boundary between model execution and distributed infrastructure. You will work on systems that determine inference latency, throughput, stability, and the reliability of RL and post‑training training loops. Magic’s long‑context models introduce demanding execution constraints: KV‑cache scaling, memory pressure under long sequences, batching trade‑offs, long‑horizon trajectory rollouts, and sustained throughput under real‑world workloads. You will own the infrastructure that makes both production inference and large‑scale RL iteration fast and reliable. What you’ll work on Design and scale high‑performance inference serving systems Optimize KV‑cache management, batching strategies, and scheduling Improve throughput and latency for long‑context workloads Build and maintain distributed RL and post‑training infrastructure Improve reliability of rollout, evaluation, and reward pipelines Automate fault detection and recovery for serving and RL systems Profile and eliminate performance bottlenecks across GPU, networking, and storage layers Collaborate with Kernels and Research to align execution systems with model architecture What we’re looking for Strong software engineering and distributed systems fundamentals Experience building or operating large‑scale inference or training systems Deep understanding of GPU execution constraints and memory trade‑offs Experience debugging performance issues in production ML systems Ability to reason about system‑level trade‑offs between latency, throughput, and cost Track record of owning critical production infrastructure Compensation, Benefits, And Perks (US) Annual salary range: $225K - $550K Equity is a significant part of total compensation, in addition to salary 401(k) plan with 6% salary matching Generous health, dental and vision insurance for you and your dependents Unlimited paid time off Visa sponsorship and relocation stipend to bring you to SF, if possible A small, fast‑paced, highly focused team Our culture Integrity. Words and actions should be aligned Hands‑on. At Magic, everyone is building Teamwork. We move as one team, not N individuals Focus. Safely deploy AGI. Everything else is noise Quality. Magic should feel like magic #J-18808-Ljbffr

Vacancy posted 3 days ago
Similar jobs that could be interesting for youBased on the Member of Technical Staff, Inference & RL Systems in San Francisco, CA vacancy
  •  ...first heterogeneous neocloud for AI workloads. As AI systems scale, the industry is hitting fundamental limits...  ...AI datacenters. Mission Gimlet Labs is seeking a Member of Technical Staff focused on ML systems and inference. In this role, you will design and build inference... 
    Suggested

    Gimlet Labs, Inc.

    San Francisco, CA
    3 days ago
  • $150k - $300k

     ...and pair it with the full RL post‑training stack: environments...  ...cloud LLM serving, LLM inference optimization and RL systems. You will be working on...  ...RL training stack. Core Technical Responsibilities LLM Serving...  ...and encourage team members to contribute to the broader... 
    Suggested
    Work at office
    Remote work
    Visa sponsorship
    Relocation package
    Flexible hours
    Shift work

    Prime Intellect

    San Francisco, CA
    3 days ago
  • $300k

     ...Member of Technical Staff - RL Infrastructure About V max V max is an applied research lab developing...  ...open-ended learning. We are building systems to exceed humans in all capacities by...  ...distributed rollouts, training orchestration, inference, evals, data pipelines,... 
    Suggested
    Work at office
    Local area

    VMAX LLC

    San Francisco, CA
    3 days ago
  • $180k

     ...Member Of Technical Staff - Inference Palo Alto, CA About Xai Xai's mission is to create AI systems that can accurately understand the universe and aid humanity in its pursuit of...  ...research on scaling test-time compute, RL rollout, and model-hardware co-design for... 
    Suggested
    Temporary work

    Xai

    San Francisco, CA
    2 days ago
  • $300k

     ...Member of Technical Staff - RL Algorithms About V max V max is an applied research lab developing AI capable of open-ended learning. We are building systems to exceed humans in all capacities by optimising beyond the local maxima of learning from human expertise. About... 
    Suggested
    Work at office
    Local area
    Shift work

    VMAX LLC

    San Francisco, CA
    3 days ago
  • $180k

     ...Member Of Technical Staff - RL Infrastructure Palo Alto, CA About XAI XAI's mission is to create AI systems that can accurately understand the universe and aid humanity in its pursuit of knowledge. Our team is small, highly motivated, and focused on engineering... 
    Temporary work

    Xai

    San Francisco, CA
    1 day ago
  • $225k

     ...Our approach combines frontier-scale pre-training, domain-specific RL, ultra-long context, and inference-time compute to achieve this goal. About the role As a Software Engineer on the Pre-training Systems team, you will design and operate the distributed infrastructure... 
    Relocation
    Visa sponsorship

    Dormont Manufacturing Company

    San Francisco, CA
    3 days ago
  •  ...Member of Technical Staff, ML Systems Mirendil Mirendil is a tech-first company focused on solving core bottlenecks that unlock step-change acceleration...  ...on (not limited to): Building and scaling training and inference infrastructure (potentially for various chips across... 

    Mirendil

    San Francisco, CA
    4 days ago
  •  ...Member of Technical Staff, ML Infrastructure & Inference Overview We are a cutting-edge AI infrastructure company is building a scalable cloud platform designed...  ...machine learning workloads ($80M series A). As AI systems continue to grow in complexity, traditional infrastructure... 

    Acceler8 Talent

    San Francisco, CA
    4 days ago
  •  ...is bringing the rigor of distributed systems, model architecture, and numerics research...  ...to defend. About the Role As a Member of Technical Staff, Infrastructure & Training Systems at...  ...ideas, but exceptional training and inference systems: infrastructure that makes large... 
    Local area

    Radical Numerics Inc.

    San Francisco, CA
    5 hours ago
  • $200k - $350k

     ...Member Of Technical Staff, Inference & Serving Inception creates the world's fastest, most efficient AI models. Our Mercury model is the world's...  ...engineers and scientists to design, optimize, and scale the systems that power our diffusion LLMs in production. Your work... 
    Immediate start
    Flexible hours

    Inception LLC

    San Francisco, CA
    4 days ago
  • $200k

     ...approach combines frontier-scale pre-training, domain-specific RL, ultra-long context, and inference-time compute to achieve this goal. About the Role...  ...and operate the data, evaluation, and environment systems that improve model capabilities after pre-training. This... 
    Relocation
    Visa sponsorship

    Magic Inc

    San Francisco, CA
    1 day ago
  • $200k

     ...scale pre-training, domain-specific RL, ultra-long context, and inference-time compute to achieve this goal...  ...most important decisions. As a Member of Technical Staff on Evals, you will build both the...  ...building trustworthy evaluation systems, you will help Magic make better... 
    Visa sponsorship
    Relocation package

    Magic AI Corp.

    San Francisco, CA
    4 days ago
  • $150k - $350k

    Mission Gimlet Labs is seeking a Member of Technical Staff focused on distributed systems. In this role, you will build the core platform that schedules, routes, and operates AI workloads reliably at production scale. You will work on systems that coordinate execution... 

    Gimlet Labs, Inc.

    San Francisco, CA
    2 days ago
  •  ...building the first heterogeneous neocloud for AI workloads. As AI systems scale, the industry is hitting fundamental limits in power,...  ...to gigawatt-class AI datacenters. Gimlet Labs is seeking a Member of Technical Staff focused on distributed systems. In this role, you will... 

    Gimlet Labs

    San Francisco, CA
    2 days ago
  • Requirements Worked on system optimizations for model serving, such as batching, caching, load balancing, and parallelism , Worked on low-level optimizations for inference, such as GPU kernels and code generation , Worked on algorithmic optimizations for inference, such... 

    xAI

    San Francisco, CA
    2 days ago
  •  ...Member of Technical Staff, Autonomous AI Systems Mirendil The Role We are looking for talented engineers and researchers to join as Members of Technical Staff. This is an intentionally open-ended role. Some example areas you might work on (not limited to): AI for AI systems... 

    Mirendil

    San Francisco, CA
    3 days ago
  • Member of Technical Staff - Post‑Training Join to apply for the Member of Technical...  .... About The Role Build systems that transform powerful pre...  ...learning algorithms, and inference‑time scaling techniques. Collaborate...  ...data, reward modeling, or RL techniques. Evidence of... 
    Full time
    Relocation package

    Reflection AI

    San Francisco, CA
    4 days ago
  • $160k - $250k

     ...of that, we protect uninterrupted time to build. What you'll own & build As a Member of Technical Staff within the Research Tribe, you’ll be one of the early engineers shaping the core systems that power Blok. You won’t just build agents - you’ll design the... 
    Work at office
    Weekend work
    3 days per week

    BLOK

    San Francisco, CA
    3 days ago
  •  ...This Role We're looking for an engineer with deep Rust expertise and strong algorithmic fundamentals to work on performance‑critical systems. You'll build the low‑level infrastructure that powers real‑time robotic perception, planning, and control. Core Responsibilities... 

    dimensional

    San Francisco, CA
    3 days ago
  •  ...What we are looking for? Seeking a Member of Technical Staff - Backend with 5+ years of experience....  ...will be crucial in building the core systems that deploy our machine learning capabilities...  ...and build the integration of ML inference, monitoring systems, LLM interactions... 
    Work experience placement

    RST Recruitment

    San Francisco, CA
    4 days ago
  • $300k

     ...Member of Technical Staff - Mechanistic Interpretability About V max V max is an applied research lab...  ...open-ended learning. We are building systems to exceed humans in all capacities by...  ...internal representations evolve during RL and post-training, and use these insights... 
    Work at office
    Local area

    VMAX LLC

    San Francisco, CA
    3 days ago
  •  ...and enterprises who are building AI systems to power magical experiences like content...  ...on our mission and shape the future! Member of Technical Staff, Search Why this role? We are looking...  ...model serving team to ensure that inference is fast and stable. Collaborate with... 
    Full time
    Work at office
    Remote work
    Flexible hours

    Cohere

    San Francisco, CA
    3 days ago
  •  ...Pixeltable Inc. Member of Technical Staff San Francisco, CA·Full time Apply for Member of Technical...  ...transformation, training/fine-tuning, and inference? You will also: Find opportunities to...  ..., or a related field 5+ years of systems engineering experience in an industry... 
    Full time
    Part time
    Work at office
    Work from home
    Flexible hours
    2 days per week

    Pixeltable, Inc.

    San Francisco, CA
    3 days ago
  •  ...operations for spin and superconducting‑qubit systems, turning raw cryogenic hardware into...  ...of science. Role Overview As a Member of Technical Staff you will shape Conductor's core...  ...for data collection, labelling, and inference. Integrate with external systems for... 

    Conductor Quantum

    San Francisco, CA
    3 days ago
  • $170k - $220k

     ...Member of Technical Staff – Infrastructure & LLMs Location: San Francisco, CA (Hybrid) Compensation...  ...performance team building next-generation inference infrastructure for LLMs. This is an...  ...development of performance-critical systems from day one, working directly on... 
    Full time
    Temporary work
    Immediate start
    Visa sponsorship
    Work visa

    Amadeus Search

    San Francisco, CA
    4 days ago
  •  ...on building safe, advanced AI systems that accelerate progress on...  ...reliably than humans can alone. Our technical approach combines frontier-...  ..., ultra-long context, and inference-time compute. These systems...  ...with training, inference, and RL teams to make large-scale model... 
    Work at office
    Visa sponsorship
    Relocation package
    Flexible hours

    Acceler8 Talent

    San Francisco, CA
    4 days ago
  •  ...MIT CSAIL, we build general-purpose AI systems that run efficiently across deployment targets...  ...us get there. The Opportunity Our Edge Inference team compiles Liquid Foundation Models...  .... You will work directly with the technical lead on problems that require deep understanding... 

    Liquid AI

    San Francisco, CA
    4 days ago
  •  ...to build a living, breathing operating system unique to each user, dynamically generating...  ...to post-train on and unique usage to RL on possess strong opinions about how AI...  ...and 2) to win. think that this role should be renamed "member of tomo staff" #J-18808-Ljbffr... 
    Immediate start

    Tomo

    San Francisco, CA
    4 days ago
  •  ...human attention, and an agentic operating system can lift that ceiling by an order of...  ...to copy from. About the Role Members of Technical Staff (MTS) are the senior engineers who build...  ...observability. Background in offline RL, contextual bandits, or sequential decision... 

    BEACON SOFTWARE COMPANY

    San Francisco, CA
    3 days ago

Do you want to receive more vacancies?

Subscribe and receive similar vacancies to Member of Technical Staff, Inference & RL Systems. Be the first to apply!