Sign up to access all features of our service.
  • Job search
  • Favorites
  • Create a CV
    New
  • Salaries
  • Subscriptions

Member of Technical Staff, Performance and Scale

$200k - $400k

Inferact

Inferact's mission is to grow vLLM as the world's AI inference engine and accelerate AI progress by making inference cheaper and faster. Founded by the creators and core maintainers of vLLM, we sit at the intersection of models and hardware—a position that took years to build. About The Role We're looking for an infrastructure engineer to build the distributed systems that power inference at global scale. You'll design and implement the foundational layers that enable vLLM to serve models across thousands of accelerators with minimal latency and maximum reliability. Tomorrow, deploying a frontier model at scale should be as straightforward as spinning up a serverless database. The complexity doesn't disappear as it gets absorbed into the infrastructure you're building. Skills And Qualifications Minimum qualifications: Bachelor's degree or equivalent experience in computer science, engineering, or similar. Strong systems programming skills in Rust, Go, or C++. Experience designing and building high-performance distributed systems at scale. Understanding of network protocols and high-performance I/O. Ability to debug complex distributed systems issues. Preferred qualifications: Experience with ML serving infrastructure and disaggregated inference architecture. Familiarity with GPU programming models and memory hierarchies. Knowledge of GPU interconnects (NVLink, InfiniBand, RoCE) and their performance characteristics. Track record of improving system reliability and performance at scale. Bonus points if you have: Prior experience in supporting large‑scale model training or inference environments. Logistics Location: This role is based in San Francisco, California. Will consider remote in the US for exceptional candidates. Compensation: Depending on background, skills, and experience, the expected annual salary range for this position is $200,000 - $400,000 USD + equity. Visa sponsorship: We sponsor visas on a case-by-case basis. Benefits: Inferact offers generous health, dental, and vision benefits as well as 401(k) company match. Compensation Range: $200K - $400K #J-18808-Ljbffr

Vacancy posted 7 hours ago
Similar jobs that could be interesting for youBased on the Member of Technical Staff, Performance and Scale in San Francisco, CA vacancy
  •  ...But succeed an unfair amount. Job: You will build, operate, and scale our infrastructure, including our infrastructure around large...  ...: Have deep intuition on distributed systems, cloud platforms, performance tuning, and scalable architecture. You like to reason about trade... 
    Performance

    Parallel Web Systems

    San Francisco, CA
    4 days ago
  • $150k - $300k

     ...Germany with a growing presence in San Francisco, we’re scaling fast while staying true to what makes us different...  ...storage systems and providers Debug and resolve performance bottlenecks in distributed data loading Technical Focus Python, PyTorch DataLoader internals Object... 
    Performance
    Remote work
    Worldwide
    2 days per week

    blackforestlabs

    San Francisco, CA
    3 days ago
  •  ...neocloud for AI workloads. As AI systems scale, the industry is hitting fundamental...  ...component to hardware that best fits its performance and efficiency needs. This approach...  ...datacenters. Gimlet Labs is seeking an Member of Staff focused on AI Research (Intern). As an... 
    Performance
    Internship

    Gimlet Labs

    San Francisco, CA
    7 hours ago
  • $150k - $350k

     ...neocloud for AI workloads. As AI systems scale, the industry is hitting fundamental...  ...to hardware that best fits its performance and efficiency needs. This approach enables...  .... Mission Gimlet Labs is seeking a Member of Technical Staff focused on kernels and GPU... 
    Performance

    Gimlet Labs, Inc.

    San Francisco, CA
    7 hours ago
  • $150k - $350k

     ...Mission Gimlet Labs is seeking a Member of Technical Staff focused on distributed systems. In this...  ...AI workloads reliably at production scale. You will work on systems that coordinate...  ...end‑to‑end system correctness and performance Qualifications Strong software engineering... 
    Performance

    Gimlet Labs, Inc.

    San Francisco, CA
    7 hours ago
  •  ...Member of Technical Staff, ML Systems Mirendil Mirendil is a tech-first company focused on solving...  ...systems, and infrastructure for large-scale experiments. Our team includes researchers...  ...custom ML kernels Optimizing performance (latency, throughput, cost) Developing... 
    Performance

    Mirendil

    San Francisco, CA
    7 hours ago
  •  ...neocloud for AI workloads. As AI systems scale, the industry is hitting fundamental...  ...to hardware that best fits its performance and efficiency needs. This approach enables...  .... Gimlet Labs is seeking a Member of Technical Staff focused on compilers. In this role, you... 
    Performance

    Gimlet Labs

    San Francisco, CA
    7 hours ago
  •  ...neocloud for AI workloads. As AI systems scale, the industry is hitting fundamental...  ...to hardware that best fits its performance and efficiency needs. This approach enables...  .... Mission Gimlet Labs is seeking a Member of Technical Staff focused on ML systems and inference.... 
    Performance

    Gimlet Labs, Inc.

    San Francisco, CA
    4 days ago
  • $100k - $150k

     ...Founding Member of Technical Staff (Security) Location: San Francisco • Singapore • Hyderabad • London...  ...closely with one of the founders to scale our vulnerability research on open-...  ...Create benchmarks to evaluate agent performance on real-world scenarios. Work closely... 
    Performance
    Full time
    For contractors
    Work at office

    Crane Venture Partners

    San Francisco, CA
    7 hours ago
  •  ...settle digital assets safely and at scale, improving digital asset liquidity...  ...driver of the system architecture, technical direction and each team member’s technical skill development. At Anchorage...  ...at Anchorage Digital. We define performance as acquiring, possessing, and... 
    Performance

    Motive Partners

    San Francisco, CA
    4 days ago
  •  ...Member of Technical Staff, ML Infrastructure & Inference Overview We are a cutting-edge AI infrastructure...  ...higher utilization and better performance across multi-vendor systems. The company...  ..., you will build and optimize large-scale inference systems that serve modern AI... 
    Performance

    Acceler8 Talent

    San Francisco, CA
    7 hours ago
  •  ...Job Description As a Member of Technical Staff (Research) at Trajectory, you will design and build the post‑training stack that...  ...or post-training RL for LLMs Experience with high-performance computing or large-scale clusters Contributions to open-source ML research or... 
    Performance

    Trajectory

    San Francisco, CA
    4 days ago
  • $170k - $220k

     ...Member of Technical Staff – Infrastructure & LLMs Location: San Francisco, CA (Hybrid) Compensation...  ...strong engineer to join a lean, high-performance team building next-generation inference...  ..., working directly on problems like: Scaling multi-GPU inference workloads... 
    Performance
    Full time
    Temporary work
    Immediate start
    Visa sponsorship
    Work visa

    Amadeus Search

    San Francisco, CA
    7 hours ago
  •  ...Shapes every single day, and everyone talks to users. Member of Technical Staff is the title we use for engineers who own hard problems...  ...training, fine-tuning, evaluation, inference, or RAG at scale High-performance Python backends at scale Realtime infrastructure (WebRTC... 
    Performance

    Shapes

    San Francisco, CA
    7 hours ago
  •  ...unimportant, reducing inference costs for scale-ups and enterprises that integrate...  ...a research and product focus. As a Member of Technical Staff on our infrastructure team, you'll own...  ..., Docker and CI/CD, and building for performance and reliability at scale Have owned... 
    Performance
    Visa sponsorship

    The Token Company

    San Francisco, CA
    2 days ago
  • $200k

     ...Member of Technical Staff, RL Research & Environments Posted Feb 28, 2026 | Full-time | Advanced (5...  ...alone. Our approach combines frontier-scale pre-training, domain-specific RL, ultra...  ...as Magic scales long‑context model performance and reliability. What you’ll work on... 
    Performance
    Full time
    Relocation
    Visa sponsorship

    Magic Inc

    San Francisco, CA
    2 days ago
  •  ...techniques, and how they relate to model performance (Desirable) Startup DNA: Experience...  ...the job involves We are seeking a Member of Technical Staff, Evals & Post‑Training Product to help...  ...helping internal teams run evals at scale, enabling external developers through... 
    Performance

    Fireworks AI

    San Francisco, CA
    7 hours ago
  •  ...company is seeking an intrepid, polymathic Member of Technical Staff to take on one of the AI industry's...  ...compliance systems that efficiently scale with our growing portfolio of...  ...for recruiting, labor compliance, and performance management. Participate in meetings with... 
    Performance

    United States Digital Space LLC

    San Francisco, CA
    4 days ago
  •  ...responsibility to defend. About the Role As a Member of Technical Staff, Mechanistic Interpretability at...  ..., and manipulate information across scales. Build the tooling for model...  ...representations relate to downstream performance, reasoning, robustness, controllability... 
    Performance
    Local area

    Radical Numerics Inc.

    San Francisco, CA
    20 hours ago
  •  ...Department Modeling Who are we? Our mission is to scale intelligence to serve humanity. We’re training...  ...York but also embrace being remote-friendly! As a Member of Technical Staff, you will: Design and write high-performant and scalable software for training models.... 
    Performance
    Full time
    Work at office
    Remote work
    Flexible hours

    Jaide Health

    San Francisco, CA
    7 hours ago
  • $176k - $253k

     ...Senior Member of Technical Staff, AI Quality Harper is an AI-native commercial insurance company in...  ...before the customer does. That's how we scale judgment without scaling headcount....  ...176,000–$253,000 cash (base + target performance bonus), plus competitive equity. Location... 
    Performance
    Permanent employment
    Work at office
    Relocation

    Harper Group

    San Francisco, CA
    4 days ago
  •  ...Member of Technical Staff, Infrastructure Join us and help shape the future of AI by architecting next...  ...for designing, building, and scaling core infrastructure that powers a high...  ...clusters for cost-effectiveness and performance. Enable external customer deployment... 
    Performance
    Work at office

    LlamaIndex, Inc.

    San Francisco, CA
    4 days ago
  •  ...responsibility to defend. About the Role As a Member of Technical Staff, Pre-Training Science at Radical...  ...world models learn during large-scale training. You will develop new pretraining...  ...-scale experiments, and writing high-performance code that turns ideas into measurable... 
    Performance
    Local area

    Radical Numerics Inc.

    San Francisco, CA
    20 hours ago
  •  ...and more in terms of: How will this perform? How will this scale? Is this simple? Is this reliable? You...  ...primary trait we are looking for is enough technical knowledge to execute without guidance...  ...to start Two Dots. Other team members include: Meta ML alumnus with decades... 
    Performance
    Odd job

    Two Dots Inc

    San Francisco, CA
    7 hours ago
  •  ...manufacturing sectors from Fortune 10 brands to scaling midmarkets. We're backed by top-tier...  ...Page One. The Role We’re hiring a Member of Technical Staff – AI/ML to design, build, and deploy...  ...frameworks to track AI system performance and quantify business outcomes. You Might... 
    Performance
    Full time
    Flexible hours

    Stuut

    San Francisco, CA
    4 days ago
  • $227.5k - $401k

     ...motivated individuals who tackle unique technical challenges at scale and solve them as a team,...  ...financial technology sector. As a Member of Technical Staff, you will operate with a high degree...  ...‑of‑concepts or fixing critical performance issues. Learn and Lead : connect... 
    Performance
    Work at office
    Immediate start
    Relocation
    Flexible hours

    Adyen

    San Francisco, CA
    4 days ago
  • $150k - $300k

     ...reinforcement learning at frontier scale, adapting models to real...  ...our RL training stack. Core Technical Responsibilities LLM Serving...  ...clusters. Inference Optimization & Performance Framework Development:...  ...development and encourage team members to contribute to the broader... 
    Performance
    Work at office
    Remote work
    Visa sponsorship
    Relocation package
    Flexible hours
    Shift work

    Prime Intellect

    San Francisco, CA
    4 days ago
  • $150k

     ...initiatives in robotic intelligence. As a Member of Technical Staff, you'll spearhead the development of...  ...and real‑world deployment at Amazon scale. In this role, you'll combine hands‑...  ...initiatives, ensuring robust performance in production environments Mentor and... 
    Performance
    Local area

    Amazon Science

    San Francisco, CA
    2 days ago
  •  ...billions, of qubits to operate reliably at scale. The result is a quantum computer as...  ...of science. Role Overview As a Member of Technical Staff you will shape Conductor's core offerings...  ...devices to extract quantum features, performance metrics, and optimisation... 
    Performance

    Conductor Quantum

    San Francisco, CA
    7 hours ago
  • $150k - $300k

     ...reinforcement learning at frontier scale, adapting models to real...  ...on a distributed system with performance engineering at its core. The...  ...and reliable at scale. Core Technical Responsibilities Infrastructure...  ...and encourage team members to contribute to the broader... 
    Performance
    Work at office
    Remote work
    Visa sponsorship
    Relocation package
    Flexible hours

    Prime Intellect, Inc.

    San Francisco, CA
    7 hours ago

Do you want to receive more vacancies?

Subscribe and receive similar vacancies to Member of Technical Staff, Performance and Scale. Be the first to apply!