Sign up to access all features of our service.
  • Job search
  • Favorites
  • Create a CV
    New
  • Salaries
  • Subscriptions

Research Engineer - Language Model Pre-Training

Full-time

Zyphra

Job Description

Job Description

Zyphra is an artificial intelligence company based in San Francisco, California.

The Role:

As a Research Engineer - Language Model Pre-Training , you'll shape our language model roadmap through end-to-end pretraining development. You will work extremely closely with our pretraining team, who will integrate your insights into our next-generation models.

You'll Work Across:
  • Large-scale training runs and model parallelization

  • Performance optimization of our pretraining stack

  • Dataset collection, processing, and evaluation

  • Architecture and methodology research, including optimizer ablations

What We're Looking For / Requirements:
  • Strong engineering aptitude for rapidly implementing reliable and robust systems

  • Can rapidly learn new fields and are excited to implement new ideas

  • Excellent communication and collaboration skills, and can work effectively on both research and engineering implementation at scale

Qualifications / Additional Skills:
  • Deep expertise and intuition for solving machine learning problems and training models

  • Experience with training on large-scale (multi-node) GPU clusters

  • Deep understanding of model training pipelines – including model/data parallelism, distributed optimizers, etc.

  • Strong grasp of proper experimental methodology for running rigorous ablations and other hypothesis testing

  • Understanding of large-scale, highly parallel data processing pipelines

  • High proficiency with PyTorch and Python.

  • Strong ability to dive into large pre-existing codebases and rapidly get up to speed

  • Published machine learning research in well-respected venues is a plus

  • Postgraduate degree in a scientific subject (Computer Science, EE/EECS, Math, Physics)

Why Work at Zyphra:
  • Our research methodology is to make grounded, methodical steps toward ambitious goals. Both deep research and engineering excellence are equally valued

  • We strongly value new and crazy ideas and are very willing to bet big on new ideas

  • We move as quickly as we can; we aim to minimize the bar to impact as low as possible

  • We all enjoy what we do and love discussing AI

Benefits and Perks:
  • Comprehensive medical, dental, vision, and FSA plans

  • Competitive compensation and 401(k) plan

  • Relocation and immigration support on a case-by-case basis

  • In-office snacks and meals provided

  • Unlimited PTO and company holidays

  • In-person team in San Francisco with a collaborative, high-energy environment

Vacancy posted a month ago
Similar jobs that could be interesting for youBased on the Research Engineer - Language Model Pre-Training in San Francisco, CA vacancy
  • $340k - $425k

     ...is a quickly growing group of committed researchers, engineers, policy experts, and business leaders...  ...seeking a Research Engineer to join our Pre-training team, responsible for developing the next generation of large language models. In this role, you will work at the intersection... 
    Training
    Language
    Work at office
    Visa sponsorship
    Flexible hours

    Menlo Ventures

    San Francisco, CA
    3 days ago
  • $315k

     ...quickly growing group of committed researchers, engineers, policy experts, and business...  ...the role Anthropic's production models undergo sophisticated post-training processes to enhance their capabilities...  ...-tuning, or evaluating large language models Can balance research... 
    Training
    Language
    Work at office
    Visa sponsorship
    Flexible hours

    Menlo Ventures

    San Francisco, CA
    3 days ago
  • Perplexity is seeking a Research Engineering Manager to lead the team of all...  ...for developing the models that drive our products. Our...  ...queries, leveraging cutting‑edge training techniques to scale AI model...  ...skills; versatility across languages and frameworks is a plus.... 
    Training
    Language

    Perplexity AI Inc.

    San Francisco, CA
    5 days ago
  •  ...Francisco, California. The Role: As a Research Engineer - Model Architectures , you will be a core...  ...testing novel model architectures and training methodologies, with a focus on...  ...also work extremely closely with our pre-training team, who will integrate your... 
    Training
    Full time
    Work at office
    Relocation package

    Zyphra

    San Francisco, CA
    a month ago
  • $320k

     ...growing group of committed researchers, engineers, policy experts, and...  ...is the potential for models to interact with...  ...Develop privacy-first training algorithms and...  ...familiarity with large language models, how they work...  ...collecting training datasets, pre-training models, post... 
    Training
    Language
    Full time
    Work at office
    Visa sponsorship
    Flexible hours

    Menlo Ventures

    San Francisco, CA
    1 day ago
  • A leading AI research firm in San Francisco is seeking a Research Engineer specializing in Model Architectures. You will design and rigorously test innovative model architectures...  ...and collaborating closely with pre-training teams. Candidates should exhibit strong research... 
    Training

    Zyphra

    San Francisco, CA
    2 days ago
  •  ...Hedra is a pioneering generative modeling company — first models to...  ...industry and economy use cases. As a Research Engineer on our Physical AI team, you will lead pre-training and post-training on action-...  ...world models and vision-language-action (VLA) models Develop and... 
    Training
    Language
    Work at office

    Hedra

    San Francisco, CA
    5 days ago
  • $180k - $270k

    Research Engineer (Focused on RL) You'll bring reinforcement learning...  ...— building the training infrastructure, reward...  ...systems that make our models meaningfully better at...  ...translate your work into language that engineers,...  ...future-you will thank you Pre-tax benefits — Access... 
    Training
    Language
    Full time
    Temporary work
    Remote work

    Firecrawl

    San Francisco, CA
    1 day ago
  •  ...A leading AI research company in San Francisco is seeking a Staff Research Engineer to enhance the efficiency of large language models. In this role, you will develop and implement advanced techniques to optimize model performance in production. Ideal candidates will hold... 
    Language
    Remote work

    Cohere

    San Francisco, CA
    3 days ago
  •  ...of memory features—from research to production. You’ll fine-tune models for extraction, updates,...  ...from papers; and ship with Engineering to SOTA latency,...  ...’ll Do Fine-tune and train models for memory extraction...  ...for complex vision-and-language tasks (gold sets,... 
    Training
    Language

    Mem0

    San Francisco, CA
    1 day ago
  •  ...digital coworkers. Our research team pushes the frontier of post-training and reinforcement...  ...applied research engineers sit side-by-side...  ...frontier-scale models and develop the methods...  ...frontier-scale language models on enterprise...  ...Background in pre-training or post-training... 
    Training
    Language
    Daily paid
    Work at office
    Visa sponsorship
    Relocation package

    Applied Compute Inc.

    San Francisco, CA
    3 days ago
  • $280k

     ...growing group of committed researchers, engineers, policy experts, and business...  ...to keep highly capable models helpful and honest, even as...  ...of our safety techniques by training language models to subvert our safety...  ...relocating". Team Matching * Pre-training — The Pre-training... 
    Training
    Language
    Contract work
    For contractors
    For subcontractor
    Work at office
    Relocation
    Visa sponsorship
    Work visa
    Flexible hours

    Menlo Ventures

    San Francisco, CA
    5 days ago
  • $180k

     ...motivated, and focused on engineering excellence. This...  ...All engineers and researchers are expected to have strong...  ...engineers to build generative models that can accurately...  ...edge multimodal language models, with a particular...  ...data. Develop and train multimodal transformers... 
    Training
    Language
    Work at office
    Local area
    Relocation

    xAI

    San Francisco, CA
    more than 2 months ago
  • $320k

     ...growing group of committed researchers, engineers, policy experts, and business...  ...the eval against live training checkpoints, to interpreting...  ...leadership use to monitor model health during training, improving...  ...-on experience using large language models such as Claude,... 
    Training
    Language
    Remote job
    Work at office
    Visa sponsorship
    Flexible hours
    San Francisco, CA
    a month ago
  • $166k - $225k

     ...enables companies to develop AI models and systems using their own...  ...technologies ranging from pre‑training LLMs from scratch to...  ...all. Job Description As a research engineer on the Scaling team, you will...  ...training frameworks for large language models, including... 
    Training
    Language
    Worldwide

    Cacheflow

    San Francisco, CA
    5 days ago
  • $320k

    Anthropic in New York City is seeking a Research Engineer to develop evaluations for Claude’s capabilities. The ideal...  ...running evaluations, and debugging results during training runs. The role offers a hybrid work model and competitive compensation ranging from $320,000... 
    Training
    Remote job

    Menlo Ventures

    San Francisco, CA
    3 days ago
  •  ...will ultimately become the perception engine for a company’s physical footprint,...  ...deep-learning based vision, vision-language, and large language models to our world-class distributed perception...  ...collection, labelling, and model re-training platform Driving the design... 
    Training
    Language

    Specter

    San Francisco, CA
    1 day ago
  •  ...Francisco, California. The Role: As a Research Engineer - Agency and Reasoning , you will be...  ...in reinforcement learning, post-training, and human preference learning, and...  ...ideas at scale to our next generation of language models. What We’re Looking For /... 
    Training
    Language
    Work at office
    Relocation package

    Zyphra

    San Francisco, CA
    4 hours ago
  •  ...Today, not even the best models can continuously...  ..., a new primitive for training efficient, large‑scale...  ...innovation and systems engineering paired with a design‑minded...  ...world’s diversity of languages and cultures. We are...  ...scalable systems that bridge research and production.... 
    Training
    Language
    Work at office
    Relocation package

    Cartesia

    San Francisco, CA
    1 day ago
  •  ...Description We are Genmo, a research lab dedicated to...  ..., state-of-the-art models for video generation towards...  ...exceptional Software Engineer to join our research...  ...frameworks and model training ~ Understanding of fundamental...  ...Large Language Models Familiarity... 
    Training
    Language
    Work at office

    Genmo

    San Francisco, CA
    4 hours ago
  •  ...Research Engineer, Interpretability Systems San Francisco Bay Area (On-site...  ...founded by former frontier-model researchers, focused on...  ...interpretability for large language models. They are building experimental...  ..., representations, or post-training systems Strong Python and... 
    Training
    Language

    Acceler8 Talent

    San Francisco, CA
    2 days ago
  •  ...Member of Technical Staff, Model EfficiencyWho are we?Our mission...  ...to serve humanity. We're training and deploying frontier...  ...customers.Cohere is a team of researchers, engineers, designers, and more, who...  ...Experience working with large language models and familiarity with... 
    Training
    Language
    Full time
    Work at office
    Remote work
    Flexible hours

    Cohere

    San Francisco, CA
    3 days ago
  • General Agents is an applied research lab exploring the frontiersof autonomous intelligence...  ...labor.We are a team of researchers, engineers, and operators with expertise in...  ...powered by our foundationvideo-language-action models trained on large-scale behavior data. Role... 
    Training
    Language

    Generalagents

    San Francisco, CA
    3 days ago
  • $320k - $405k

     ...growing group of committed researchers, engineers, policy experts, and business...  ...of adversaries to mis-use models in harmful ways Work...  ...papers on computer security, language modeling, or related topics...  ...combination of education, training, and/or experience Required... 
    Training
    Language
    Work at office
    Visa sponsorship
    Flexible hours

    Anthropic

    San Francisco, CA
    5 days ago
  •  ...Research Engineer, Foundation Models About the Opportunity We are seeking a Research Engineer to help...  ..., focusing on the development, training, evaluation, and deployment of state...  ...directly to cutting-edge work in large language models, reinforcement learning,... 
    Training
    Language
    Visa sponsorship
    Relocation package
    Flexible hours

    Acceler8 Talent

    San Francisco, CA
    9 hours ago
  • $147.4k - $220.9k

     ...AI/ML - Machine Learning Research Engineer, Machine Translation Work Locations (3) Submit...  ...translation (MT) technology and large language model (LLM) technologies. Our mission is to...  ...machine learning and NLP including model training, inference, Large Language Models and... 
    Training
    Language
    Relocation

    Apple

    San Francisco, CA
    4 days ago
  •  ...capability that determines whether a model can pick a signal out of...  ...searcher. We're hiring a Research Engineer to advance the science and...  ...them, and turn search post‑training from a craft into a measurable...  ...experience with RL on large language models — environments,... 
    Training
    Language
    Visa sponsorship

    Nerdleveltech

    San Francisco, CA
    3 days ago
  • Cartesia is seeking a Research Engineer in San Francisco to develop large-scale datasets essential for training our AI models. This role focuses on ensuring data quality and linguistic...  ...to enhance performance across multiple languages. The ideal candidate will have... 
    Training
    Language
    Flexible hours

    Cartesia

    San Francisco, CA
    2 days ago
  •  ...that powers breakthrough AI models at leading research labs and enterprises. Since...  ...to produce high-quality training data at scale Frontier Data...  ...Overview As an Applied Research Engineer, you will be at the...  ...frontier AI models—such as large language models and multimodal... 
    Training
    Language
    Flexible hours

    HRB

    San Francisco, CA
    5 days ago
  •  ...software vulnerabilities. We are training and scaling security AI...  ...We’re seeking an experienced Research Engineer to join our effort in...  ...strong intuition, experience in model evaluation, and benchmarks....  ...deep learning, and/or natural language processing Experience with... 
    Training
    Language
    Full time
    Work at office

    DepthFirst

    San Francisco, CA
    2 days ago

Do you want to receive more vacancies?

Subscribe and receive similar vacancies to Research Engineer - Language Model Pre-Training. Be the first to apply!