Member of Technical Staff, Inference & RL Systems
$225k
Magic Inc
Magic’s mission is to build safe AGI that accelerates humanity’s progress on the world’s most important problems. We believe the most promising path to safe AGI lies in automating research and code generation to improve models and solve alignment more reliably than humans can alone. Our approach combines frontier-scale pre-training, domain-specific RL, ultra-long context, and inference-time compute to achieve this goal.

About The Role

As a Software Engineer on the Inference & RL Systems team, you will design and operate the distributed systems that serve our models in production and power large-scale post-training workflows.

This role sits at the boundary between model execution and distributed infrastructure. You will work on systems that determine inference latency, throughput, and stability, as well as the reliability of RL and post-training loops.

Magic’s long-context models introduce demanding execution constraints: KV-cache scaling, memory pressure under long sequences, batching trade-offs, long-horizon trajectory rollouts, and sustained throughput under real-world workloads. You will own the infrastructure that makes both production inference and large-scale RL iteration fast and reliable.
What you’ll work on

- Design and scale high-performance inference serving systems
- Optimize KV-cache management, batching strategies, and scheduling
- Improve throughput and latency for long-context workloads
- Build and maintain distributed RL and post-training infrastructure
- Improve reliability of rollout, evaluation, and reward pipelines
- Automate fault detection and recovery for serving and RL systems
- Profile and eliminate performance bottlenecks across GPU, networking, and storage layers
- Collaborate with Kernels and Research to align execution systems with model architecture

What we’re looking for

- Strong software engineering and distributed systems fundamentals
- Experience building or operating large-scale inference or training systems
- Deep understanding of GPU execution constraints and memory trade-offs
- Experience debugging performance issues in production ML systems
- Ability to reason about system-level trade-offs between latency, throughput, and cost
- Track record of owning critical production infrastructure

Compensation, Benefits, And Perks (US)

- Annual salary range: $225K - $550K
- Equity is a significant part of total compensation, in addition to salary
- 401(k) plan with 6% salary matching
- Generous health, dental and vision insurance for you and your dependents
- Unlimited paid time off
- Visa sponsorship and relocation stipend to bring you to SF, if possible
- A small, fast-paced, highly focused team

Our culture

- Integrity. Words and actions should be aligned
- Hands-on. At Magic, everyone is building
- Teamwork. We move as one team, not N individuals
- Focus. Safely deploy AGI. Everything else is noise
- Quality. Magic should feel like magic

