Sign up to access all features of our service.
  • Job search
  • Favorites
  • Create a CV
    New
  • Salaries
  • Subscriptions

Member of Technical Staff - Efficient ML

Embedding VC

Introducing Moonlake, AI for creating world simulations. Scope of Work Training efficiency Dataloaders, fusion, activation remat, gradient checkpointing. FSDP/ZeRO/tensor+pipeline parallel; NCCL tuning. GPU + kernel performance Nsight profiling, Triton/CUDA kernels, fused ops. Flash-attention–style speedups, sequence packing, KV-cache tricks. Inference optimization Low-latency serving, continuous batching, speculative decoding. Quantization (GPTQ/AWQ), distillation, pruning. Infra + reliability SLURM/K8s multi-node jobs, checkpoint hygiene. Determinism, env pinning, GPU failure handling. We are committed to being an on-site, in-person team currently based in San Mateo #J-18808-Ljbffr Embedding VC

Vacancy posted 1 day ago
Similar jobs that could be interesting for youBased on the Member of Technical Staff - Efficient ML in San Francisco, CA vacancy
  • Member of Technical Staff, Model Efficiency Who are we? Our mission is to scale intelligence to serve humanity. We’re training and deploying frontier models...  ...researchers and engineers focused on building reliable ML systems and pushing the boundaries of LLM inference... 
    Suggested
    Full time
    Work at office
    Remote work
    Flexible hours

    Cohere

    San Francisco, CA
    2 days ago
  •  ...function improvements in performance and efficiency. Customers deploy through production-...  ...role Gimlet Labs is seeking an Member of Technical Staff focused on AI research. As an...  ...comparable area of study Experience with AI/ML or applied data science. Strong... 
    Suggested

    Gimlet Labs

    San Francisco, CA
    3 days ago
  •  ...function improvements in performance and efficiency. Customers deploy through production-...  ...role Gimlet Labs is seeking a Member of Technical Staff (Intern) to help develop Gimlet's platform...  ...area of study Experience with AI/ML or distributed systems. Strong... 
    Suggested
    Internship

    Gimlet Labs

    San Francisco, CA
    3 days ago
  • $200k - $350k

     ...Member Of Technical Staff, Training Infra Bay Area Ai Systems Inception creates the world's fastest, most efficient AI models. Our Mercury model is the world's fastest reasoning LLM and first...  ...experience). Understanding of ML frameworks (PyTorch, TensorFlow) from... 
    Suggested
    Immediate start
    Flexible hours

    Inception LLC

    San Francisco, CA
    4 days ago
  • $200k - $350k

     ...Member Of Technical Staff, Inference & Serving Inception creates the world's fastest, most efficient AI models. Our Mercury model is the world's fastest reasoning LLM and first commercially...  ...incident response. Collaborate with ML researchers to translate model advances... 
    Suggested
    Immediate start
    Flexible hours

    Inception LLC

    San Francisco, CA
    2 days ago
  •  ...Member Of Technical Staff We're looking for a member of technical staff to build and deploy production...  ...Improve latency, throughput, cost efficiency, and reliability of systems Work with...  ...in Python and modern engineering or ML frameworks Experience building and... 

    ERAGON

    San Francisco, CA
    1 day ago
  • $180k

     ...Member Of Technical Staff - Pre-Training Palo Alto, CA About XAI XAI's mission is to create...  ...model-hardware co-design. Expert in ML and large model scaling, familiar with...  ...experience on optimizing ML training efficiency. Annual Salary Range $180,000 -... 
    Temporary work

    Xai

    San Francisco, CA
    4 days ago
  •  ...run benchmarks reliably and efficiently. At Vals, we believe in autonomy...  ...reviews for other members of the team Help establish engineering...  ...their needs Requirements Technical 2+ YOE: 2+ years of full-time...  ...own company Interest in AI/ML systems and evaluation What... 
    Full time
    Work experience placement
    Relocation
    Relocation package
    Shift work

    PetsApp

    San Francisco, CA
    4 days ago
  • $150k - $300k

     ...infrastructure to serve LLMs efficiently at scale. Optimization and integration...  ...our RL training stack. Core Technical Responsibilities LLM Serving...  ...Required Experience Building ML Systems at Scale: 3+ years...  ...and encourage team members to contribute to the broader... 
    Work at office
    Remote work
    Visa sponsorship
    Relocation package
    Flexible hours
    Shift work

    Prime-Intellect

    San Francisco, CA
    1 day ago
  •  ...component to hardware that best fits its performance and efficiency needs. This approach enables heterogeneous systems...  ...class AI datacenters. Mission Gimlet Labs is seeking a Member of Technical Staff focused on ML systems and inference. In this role, you will design... 

    Gimlet Labs, Inc.

    San Francisco, CA
    2 days ago
  • Pixeltable Inc. Member of Technical Staff San Francisco, CA·Full time Apply for Member of Technical Staff...  ...stack, allowing data scientists and ML engineers to focus on building...  ...heterogeneous compute resources (CPU and GPU) efficiently? What data model will enable us to... 
    Full time
    Part time
    Work at office
    Work from home
    Flexible hours
    2 days per week

    Pixeltable, Inc.

    San Francisco, CA
    11 hours ago
  • Member of Technical Staff, ML Infrastructure & Inference Overview We are a cutting-edge AI infrastructure company is building a scalable cloud platform...  ...infrastructure models are facing limitations in efficiency, scalability, and cost. The platform addresses these challenges... 

    Acceler8 Talent

    San Francisco, CA
    1 day ago
  • $300k

    Member of Technical Staff - RL Algorithms About V max V max is an applied research lab developing AI capable...  ...LLMs. We are limited by the sample efficiency of the current policy gradient...  ...Experience designing and running rigorous ML experiments, including ablations, baselines... 
    Work at office
    Local area
    Shift work

    Vmax

    San Francisco, CA
    3 days ago
  • Member of Technical Staff - Applied Research Patronus AI is a frontier lab developing simulation research...  ...analysis and new training methods for efficient RL Drive novel research in RL and...  ...reproducible research code in Python and modern ML frameworks. Ability to execute... 

    Patronus AI, Inc.

    San Francisco, CA
    11 hours ago
  •  ...precedents to copy from. About the Role Members of Technical Staff (MTS) are the senior engineers who...  ...the field — will build on every day. ML and agentic systems are part of the stack...  ...made by humans. We use AI to support efficiency and consistency, not to replace human... 

    Beacon Software

    San Francisco, CA
    3 days ago
  • $150k

    Hiring: Member of Technical Staff (AI Systems / Full-Stack Engineering) only U.S. Citizen or Green Card...  ...throughput, and operational efficiency in real time. The company has already...  ...Experience developing and deploying AI/ML applications in production environments... 
    Immediate start

    Carnaby Fox

    San Francisco, CA
    1 day ago
  •  ...possible in robotic intelligence. As a Member of Technical Staff, you'll be at the forefront of...  ...multi‑modal robotic foundation models and efficient, promptable model architectures that can...  ...Master's degree and 4+ years of CS, CE, ML or related field experience... 
    Local area

    Amazon Science

    San Francisco, CA
    11 hours ago
  • Member of Technical Staff - Agents at Prime Intellect - San Francisco Building the Future of Open Source...  ...: Ability to craft intuitive and efficient interfaces for agent communication. Infrastructure...  ...infrastructure. Nice to Have GPU/ML Infrastructure : Understanding how to... 
    Remote work
    Flexible hours

    Victrays

    San Francisco, CA
    4 days ago
  • Member of Technical Staff — Voice & Audio AI Systems Build intelligent voice experiences that transform...  ...enables finance teams to work more efficiently while improving customer interactions....  ...and deploying production systems. AI/ML & Audio Systems Expertise Minimum of two... 
    Full time
    Flexible hours

    Andiamo

    San Francisco, CA
    1 day ago
  • Member of Technical Staff — AI/ML Engineering (Financial Technology) Build intelligent systems that redefine how businesses manage financial operations...  ...Develop and enhance AI‑driven features that improve efficiency and accuracy in financial workflows. Continuously refine... 
    Full time
    Flexible hours

    Andiamo

    San Francisco, CA
    3 days ago
  • $148.5k - $223.9k

     ...systems with customers. With your strong technical competence, strategic thinking and...  ...exceptional engineering skills. Has deep ML knowledge with meaningful implementation...  ...seamlessly into human workflows, driving efficiency and innovation at scale.... 

    Salesforce.Com Inc

    San Francisco, CA
    2 days ago
  • $150k - $280k

     ...Member of Technical Staff (Backend) San Francisco, CA Compensation: $150,000 – $280,000 + Competitive Equity Type: Full-Time Visa Sponsorship...  ...a track record of building and selling successful AI and ML systems. Key company highlights: - Has never lost an... 
    Full time
    Temporary work
    H1b
    Work at office
    Visa sponsorship
    Relocation package

    Fuku

    San Francisco, CA
    1 day ago
  •  ...that run our largest foundation models. You'll collaborate with ML researchers to productionize their models, automate deployment and...  ...of video encoding, chunking, and streaming formats for efficient multi-modal ingestion. Basic front-end experience (React /... 
    Worldwide

    Pear VC

    San Francisco, CA
    4 days ago
  •  ...Description Job Description What we are looking for? Seeking a Member of Technical Staff - Backend with 5+ years of experience. We are looking for an...  ...of output Design and build the integration of ML inference, monitoring systems, LLM interactions, application... 
    Work experience placement

    RST Recruitment

    San Francisco, CA
    14 days ago
  • $200k - $350k

     ...long-term success for both clients and candidates. Member of Technical Staff - Pre-Training Infrastructure Location: San Francisco,...  ...infrastructure for large-scale foundation model development. Build efficient and reproducible multi-GPU and multi-node training... 
    Work at office
    Visa sponsorship

    Recruiting from Scratch

    San Francisco, CA
    3 days ago
  • $150k - $250k

     ...with scalable and reliable AI agents. For example, our agents can efficiently reason about and verify mission-critical code, including...  ...source contributions or production experience with large-scale ML infrastructure Compensation, benefits, and perks We offer... 
    Work at office
    Remote work
    Flexible hours

    Asari AI

    San Francisco, CA
    7 days ago
  •  ...Member Of Technical Staff @ Lotus AI Lotus AI is a groundbreaking primary care app that integrates your medical records, AI, and real doctors...  ...Experience building production systems that power AI or ML workflows Hands-on experience with LLM APIs, prompt engineering... 

    Lotus Health

    San Francisco, CA
    14 days ago
  • $120k - $300k

     ...4× lower costs. The team is tackling three hard technical challenges: browser agents for the invisible web...  ...of daily transactions. About the Role As a Member of Technical Staff, you'll own backend development and ML systems that power Sphinx's financial crime detection... 
    Temporary work
    H1b
    Work at office
    Relocation package

    Transparent Search Group

    San Francisco, CA
    3 days ago
  •  ...advance our understanding of scaling large language models and data efficiency. Implement state-of-the-art methods from the deep learning...  ...training data preparation. Deep understanding of large-scale ML, especially as it relates to language models, distributed... 
    Relocation package

    Reflection AI, Inc

    San Francisco, CA
    8 days ago
  • $225k

     ...thousands of devices, long-running jobs that must survive failures, and efficient sequence packing under hardware constraints. You will own the...  ...offs Experience debugging cross-layer issues in production ML systems Strong ownership mindset and ability to operate... 
    Relocation
    Visa sponsorship

    Magic Inc

    San Francisco, CA
    4 days ago

Do you want to receive more vacancies?

Subscribe and receive similar vacancies to Member of Technical Staff - Efficient ML. Be the first to apply!