Post-Training Research Engineer

Baseten

Baseten Engineer Position

Baseten powers mission-critical inference for the world's most dynamic AI companies. By uniting applied AI research, flexible infrastructure, and seamless developer tooling, we enable companies operating at the frontier of AI to bring cutting-edge models into production. We're growing quickly and recently raised our $300M Series E, backed by investors including BOND, IVP, Spark Capital, Greylock, and Conviction. Join us and help build the platform engineers turn to when they ship AI products.

We are looking for an engineer with strong experience in machine learning and solid foundations in maths and computer science to join our growing Post-Training team at Baseten.

Custom models are instrumental to the success of Baseten customers. By inference volume, the overwhelming majority of traffic at Baseten is to and from models that have been post-trained in some way. The Post-Training team is responsible for the success of our customers' post-trained models, and we employ a wide array of techniques to produce models that are more efficient and higher quality than even the biggest closed-source models for the customer's specific needs.

Your role as a research engineer is to build the in-house tooling to support all of this. We care about training a wide spectrum of model architectures with a variety of techniques, efficiently and at scale. At times this involves zooming deep into a particular technical topic, but more often it involves working across the whole stack: systems-level concepts like Kubernetes, cgroups, storage systems, and networking topologies, as well as PyTorch distributed tensor computation and GPU kernels.

Recent Research

Dense, on-policy or both?
Repeated kv cache for long-running agents
Distillation without the dark – replicating black-box on-policy distillation on Baseten

We don't have a rigid set of skills, but here's some of what we're looking for:

A deep understanding of modern ML techniques and tools for training transformers
Advanced experience in a tensor/array computation library like PyTorch, TensorFlow, JAX, or similar
A detailed understanding of transformer training parallelism strategies like data parallelism, sharded data parallelism, tensor parallelism, pipeline parallelism, context parallelism
The experience and knowledge to profile and improve the performance of a distributed GPU program in PyTorch or a similar library
The ability to perform roofline analysis on a transformer training setup
A willingness to dive into messy problems, work with researchers, derive specifications by asking important questions, and execute
Familiarity with HPC and distributed computing platforms like Slurm, Ray, Kubernetes, and Dask
Familiarity with cluster networking technology like InfiniBand, RoCE, GPUDirect
Solid fundamentals in operating systems concepts like processes, files, kernel drivers, containerisation, and networking protocols
A sense of creativity and willingness to ask difficult questions about our approach, assumptions, and tooling choices

Benefits

Competitive compensation, including meaningful equity
100% coverage of medical, dental, and vision insurance for employees and dependents
Flexible PTO policy, including a company-wide Winter Break (our offices are closed from Christmas Eve to New Year's Day!)
Paid parental leave
Fertility and family-building stipend through Carrot
Company-facilitated 401(k)
Exposure to a variety of ML startups, offering unparalleled learning and networking opportunities

Apply now to embark on a rewarding journey in shaping the future of AI! If you are a motivated individual with a passion for machine learning and a desire to be part of a collaborative and forward-thinking team, we would love to hear from you.

At Baseten, we are committed to fostering a diverse and inclusive workplace. We provide equal employment opportunities to all employees and applicants without regard to race, color, religion, gender, sexual orientation, gender identity or expression, national origin, age, genetic information, disability, or veteran status.

We are an Equal Opportunity Employer and will consider qualified applicants with criminal histories in a manner consistent with applicable law.

Vacancy posted 4 days ago
