Research Engineer - Language Model Pre-Training

Full-time

Zyphra

Job Description

Zyphra is an artificial intelligence company based in San Francisco, California.

The Role:

As a Research Engineer - Language Model Pre-Training , you'll shape our language model roadmap through end-to-end pretraining development. You will work extremely closely with our pretraining team, who will integrate your insights into our next-generation models.

You'll Work Across:

Large-scale training runs and model parallelization
Performance optimization of our pretraining stack
Dataset collection, processing, and evaluation
Architecture and methodology research, including optimizer ablations

What We're Looking For / Requirements:

Strong engineering aptitude for rapidly implementing reliable and robust systems
Can rapidly learn new fields and are excited to implement new ideas
Excellent communication and collaboration skills, and can work effectively on both research and engineering implementation at scale

Qualifications / Additional Skills:

Deep expertise and intuition for solving machine learning problems and training models
Experience with training on large-scale (multi-node) GPU clusters
Deep understanding of model training pipelines – including model/data parallelism, distributed optimizers, etc.
Strong grasp of proper experimental methodology for running rigorous ablations and other hypothesis testing
Understanding of large-scale, highly parallel data processing pipelines
High proficiency with PyTorch and Python.
Strong ability to dive into large pre-existing codebases and rapidly get up to speed
Published machine learning research in well-respected venues is a plus
Postgraduate degree in a scientific subject (Computer Science, EE/EECS, Math, Physics)

Why Work at Zyphra:

Our research methodology is to make grounded, methodical steps toward ambitious goals. Both deep research and engineering excellence are equally valued
We strongly value new and crazy ideas and are very willing to bet big on new ideas
We move as quickly as we can; we aim to minimize the bar to impact as low as possible
We all enjoy what we do and love discussing AI

Benefits and Perks:

Comprehensive medical, dental, vision, and FSA plans
Competitive compensation and 401(k) plan
Relocation and immigration support on a case-by-case basis
In-office snacks and meals provided
Unlimited PTO and company holidays
In-person team in San Francisco with a collaborative, high-energy environment

Apply

Vacancy posted a month ago

Similar jobs that could be interesting for youBased on the Research Engineer - Language Model Pre-Training in San Francisco, CA vacancy

Research Engineer/Research Scientist, Pre-training
$340k - $425k
...is a quickly growing group of committed researchers, engineers, policy experts, and business leaders... ...seeking a Research Engineer to join our Pre-training team, responsible for developing the next generation of large language models. In this role, you will work at the intersection...
Training
Language
Work at office
Visa sponsorship
Flexible hours
Menlo Ventures
San Francisco, CA
3 days ago
Research Engineer, Production Model Post-Training
$315k
...quickly growing group of committed researchers, engineers, policy experts, and business... ...the role Anthropic's production models undergo sophisticated post-training processes to enhance their capabilities... ...-tuning, or evaluating large language models Can balance research...
Training
Language
Work at office
Visa sponsorship
Flexible hours
Menlo Ventures
San Francisco, CA
3 days ago
Research Engineering Manager - Model Training
Perplexity is seeking a Research Engineering Manager to lead the team of all... ...for developing the models that drive our products. Our... ...queries, leveraging cutting‑edge training techniques to scale AI model... ...skills; versatility across languages and frameworks is a plus....
Training
Language
Perplexity AI Inc.
San Francisco, CA
5 days ago
Research Engineer - Model Architectures
...Francisco, California. The Role: As a Research Engineer - Model Architectures , you will be a core... ...testing novel model architectures and training methodologies, with a focus on... ...also work extremely closely with our pre-training team, who will integrate your...
Training
Full time
Work at office
Relocation package
Zyphra
San Francisco, CA
a month ago
Privacy Research Engineer, Safeguards
$320k
...growing group of committed researchers, engineers, policy experts, and... ...is the potential for models to interact with... ...Develop privacy-first training algorithms and... ...familiarity with large language models, how they work... ...collecting training datasets, pre-training models, post...
Training
Language
Full time
Work at office
Visa sponsorship
Flexible hours
Menlo Ventures
San Francisco, CA
1 day ago
Model Architecture Research Engineer — Pioneering AI
A leading AI research firm in San Francisco is seeking a Research Engineer specializing in Model Architectures. You will design and rigorously test innovative model architectures... ...and collaborating closely with pre-training teams. Candidates should exhibit strong research...
Training
Zyphra
San Francisco, CA
2 days ago
Research Engineer
...Hedra is a pioneering generative modeling company — first models to... ...industry and economy use cases. As a Research Engineer on our Physical AI team, you will lead pre-training and post-training on action-... ...world models and vision-language-action (VLA) models Develop and...
Training
Language
Work at office
Hedra
San Francisco, CA
5 days ago
Research Engineer — Reinforcement Learning
$180k - $270k
Research Engineer (Focused on RL) You'll bring reinforcement learning... ...— building the training infrastructure, reward... ...systems that make our models meaningfully better at... ...translate your work into language that engineers,... ...future-you will thank you Pre-tax benefits — Access...
Training
Language
Full time
Temporary work
Remote work
Firecrawl
San Francisco, CA
1 day ago
Staff Research Engineer: AI Model Efficiency & Speed
...A leading AI research company in San Francisco is seeking a Staff Research Engineer to enhance the efficiency of large language models. In this role, you will develop and implement advanced techniques to optimize model performance in production. Ideal candidates will hold...
Language
Remote work
Cohere
San Francisco, CA
3 days ago
Senior Research Engineer
...of memory features—from research to production. You’ll fine-tune models for extraction, updates,... ...from papers; and ship with Engineering to SOTA latency,... ...’ll Do Fine-tune and train models for memory extraction... ...for complex vision-and-language tasks (gold sets,...
Training
Language
Mem0
San Francisco, CA
1 day ago
Research Systems Engineer
...digital coworkers. Our research team pushes the frontier of post-training and reinforcement... ...applied research engineers sit side-by-side... ...frontier-scale models and develop the methods... ...frontier-scale language models on enterprise... ...Background in pre-training or post-training...
Training
Language
Daily paid
Work at office
Visa sponsorship
Relocation package
Applied Compute Inc.
San Francisco, CA
3 days ago
Research Engineer / Scientist, Alignment Science
$280k
...growing group of committed researchers, engineers, policy experts, and business... ...to keep highly capable models helpful and honest, even as... ...of our safety techniques by training language models to subvert our safety... ...relocating". Team Matching * Pre-training — The Pre-training...
Training
Language
Contract work
For contractors
For subcontractor
Work at office
Relocation
Visa sponsorship
Work visa
Flexible hours
Menlo Ventures
San Francisco, CA
5 days ago
Research Engineer - World Model
$180k
...motivated, and focused on engineering excellence. This... ...All engineers and researchers are expected to have strong... ...engineers to build generative models that can accurately... ...edge multimodal language models, with a particular... ...data. Develop and train multimodal transformers...
Training
Language
Work at office
Local area
Relocation
xAI
San Francisco, CA
more than 2 months ago
Research Engineer, Model Evaluations Новое Remote-Friendly (Travel-Required) | San Francisco, CA | New York City, NY
$320k
...growing group of committed researchers, engineers, policy experts, and business... ...the eval against live training checkpoints, to interpreting... ...leadership use to monitor model health during training, improving... ...-on experience using large language models such as Claude,...
Training
Language
Remote job
Work at office
Visa sponsorship
Flexible hours
San Francisco, CA
a month ago
Senior GenAI Research Engineer - Optimization and Kernels
$166k - $225k
...enables companies to develop AI models and systems using their own... ...technologies ranging from pre‑training LLMs from scratch to... ...all. Job Description As a research engineer on the Scaling team, you will... ...training frameworks for large language models, including...
Training
Language
Worldwide
Cacheflow
San Francisco, CA
5 days ago
Research Engineer, Model Evaluations - Remote-Friendly Impact
$320k
Anthropic in New York City is seeking a Research Engineer to develop evaluations for Claude’s capabilities. The ideal... ...running evaluations, and debugging results during training runs. The role offers a hybrid work model and competitive compensation ranging from $320,000...
Training
Remote job
Menlo Ventures
San Francisco, CA
3 days ago
ML Research Engineer
...will ultimately become the perception engine for a company’s physical footprint,... ...deep-learning based vision, vision-language, and large language models to our world-class distributed perception... ...collection, labelling, and model re-training platform Driving the design...
Training
Language
Specter
San Francisco, CA
1 day ago
Research Engineer - Agency and Reasoning
...Francisco, California. The Role: As a Research Engineer - Agency and Reasoning , you will be... ...in reinforcement learning, post-training, and human preference learning, and... ...ideas at scale to our next generation of language models. What We’re Looking For /...
Training
Language
Work at office
Relocation package
Zyphra
San Francisco, CA
4 hours ago
Research Engineer, Data
...Today, not even the best models can continuously... ..., a new primitive for training efficient, large‑scale... ...innovation and systems engineering paired with a design‑minded... ...world’s diversity of languages and cultures. We are... ...scalable systems that bridge research and production....
Training
Language
Work at office
Relocation package
Cartesia
San Francisco, CA
1 day ago
Research Engineer (New Grad)
...Description We are Genmo, a research lab dedicated to... ..., state-of-the-art models for video generation towards... ...exceptional Software Engineer to join our research... ...frameworks and model training ~ Understanding of fundamental... ...Large Language Models Familiarity...
Training
Language
Work at office
Genmo
San Francisco, CA
4 hours ago
Research Engineer
...Research Engineer, Interpretability Systems San Francisco Bay Area (On-site... ...founded by former frontier-model researchers, focused on... ...interpretability for large language models. They are building experimental... ..., representations, or post-training systems Strong Python and...
Training
Language
Acceler8 Talent
San Francisco, CA
2 days ago
Member of Technical Staff, Model Efficiency
...Member of Technical Staff, Model EfficiencyWho are we?Our mission... ...to serve humanity. We're training and deploying frontier... ...customers.Cohere is a team of researchers, engineers, designers, and more, who... ...Experience working with large language models and familiarity with...
Training
Language
Full time
Work at office
Remote work
Flexible hours
Cohere
San Francisco, CA
3 days ago
Research Engineer
General Agents is an applied research lab exploring the frontiersof autonomous intelligence... ...labor.We are a team of researchers, engineers, and operators with expertise in... ...powered by our foundationvideo-language-action models trained on large-scale behavior data. Role...
Training
Language
Generalagents
San Francisco, CA
3 days ago
Offensive Security Research Engineer, Safeguards
$320k - $405k
...growing group of committed researchers, engineers, policy experts, and business... ...of adversaries to mis-use models in harmful ways Work... ...papers on computer security, language modeling, or related topics... ...combination of education, training, and/or experience Required...
Training
Language
Work at office
Visa sponsorship
Flexible hours
Anthropic
San Francisco, CA
5 days ago
Research Engineer
...Research Engineer, Foundation Models About the Opportunity We are seeking a Research Engineer to help... ..., focusing on the development, training, evaluation, and deployment of state... ...directly to cutting-edge work in large language models, reinforcement learning,...
Training
Language
Visa sponsorship
Relocation package
Flexible hours
Acceler8 Talent
San Francisco, CA
9 hours ago
AI/ML - Machine Learning Research Engineer, Machine Translation
$147.4k - $220.9k
...AI/ML - Machine Learning Research Engineer, Machine Translation Work Locations (3) Submit... ...translation (MT) technology and large language model (LLM) technologies. Our mission is to... ...machine learning and NLP including model training, inference, Large Language Models and...
Training
Language
Relocation
Apple
San Francisco, CA
4 days ago
Research Engineer, Search and Knowledge Post-Training
...capability that determines whether a model can pick a signal out of... ...searcher. We're hiring a Research Engineer to advance the science and... ...them, and turn search post‑training from a craft into a measurable... ...experience with RL on large language models — environments,...
Training
Language
Visa sponsorship
Nerdleveltech
San Francisco, CA
3 days ago
Global Data-Centric Research Engineer (Multilingual)
Cartesia is seeking a Research Engineer in San Francisco to develop large-scale datasets essential for training our AI models. This role focuses on ensuring data quality and linguistic... ...to enhance performance across multiple languages. The ideal candidate will have...
Training
Language
Flexible hours
Cartesia
San Francisco, CA
2 days ago
Applied Research Engineer
...that powers breakthrough AI models at leading research labs and enterprises. Since... ...to produce high-quality training data at scale Frontier Data... ...Overview As an Applied Research Engineer, you will be at the... ...frontier AI models—such as large language models and multimodal...
Training
Language
Flexible hours
HRB
San Francisco, CA
5 days ago
Research Engineer
...software vulnerabilities. We are training and scaling security AI... ...We’re seeking an experienced Research Engineer to join our effort in... ...strong intuition, experience in model evaluation, and benchmarks.... ...deep learning, and/or natural language processing Experience with...
Training
Language
Full time
Work at office
DepthFirst
San Francisco, CA
2 days ago

Do you want to receive more vacancies?

Subscribe and receive similar vacancies to Research Engineer - Language Model Pre-Training. Be the first to apply!