Post-Training Research Engineer
Baseten
Baseten Engineer Position
Baseten powers mission-critical inference for the world's most dynamic AI companies. By uniting applied AI research, flexible infrastructure, and seamless developer tooling, we enable companies operating at the frontier of AI to bring cutting-edge models into production. We're growing quickly and recently raised our $300M Series E, backed by investors including BOND, IVP, Spark Capital, Greylock, and Conviction. Join us and help build the platform engineers turn to to ship AI products.
We are looking for an engineer with strong experience in machine learning and solid foundations in maths and computer science to join our growing Post-Training team at Baseten.
Custom models are instrumental to the success of Baseten customers. By inference volume, the overwhelming majority of traffic at Baseten is to and from models that have been post-trained in some way. The Post-Training team is responsible for the success of our customers' post-trained models, and we employ a wide array of techniques to produce models that are more efficient and higher quality than even the biggest closed source models for the customer's specific needs.
Your role as a research engineer is to build the in-house tooling to support all of this. We care about training a wide spectrum of different model architectures with a variety of techniques efficiently and at scale. At times this involves zooming deep into a particular technical topic, but more often it involves working across the stack as a whole - systems-level concepts like Kubernetes, cgroups, storage systems, and networking topologies, as well as PyTorch distributed tensor computation, and GPU kernels.
Recent Research
- Dense, on-policy or both?
- Repeated kv cache for long-running agents
- Distillation without the dark – replicating black-box on-policy distillation on Baseten
We don't have a rigid set of skills, but here's some of what we're looking for:
- A deep understanding of modern ML techniques and tools for training transformers
- Advanced experience in a tensor/array computation library like PyTorch, TensorFlow, Jax, or similar
- A detailed understanding of transformer training parallelism strategies like data parallelism, sharded data parallelism, tensor parallelism, pipeline parallelism, context parallelism
- The experience and knowledge to profile and improve the performance of a distributed GPU program in PyTorch or a similar library
- The ability to perform roofline analysis on a transformer training setup
- A willingness to dive into messy problems, work with researchers, derive specifications by asking important questions, and execute
- Familiarity with HPC and distributed computing platforms like Slurm, Ray, Kubernetes, and Dask
- Familiarity with cluster networking technology like Infiniband, RoCE, GPUDirect
- Solid fundamentals in operating systems concepts like processes, files, kernel drivers, containerisation, and networking protocols
- A sense of creativity and willingness to ask difficult questions about our approach, assumptions, and tooling choices
Benefits
- Competitive compensation, including meaningful equity.
- 100% coverage of medical, dental, and vision insurance for employee and dependents
- Flexible PTO policy including company wide Winter Break (our offices are closed from Christmas Eve to New Year's Day!)
- Paid parental leave
- Fertility and family-building stipend through Carrot
- Company-facilitated 401(k)
- Exposure to a variety of ML startups, offering unparalleled learning and networking opportunities.
Apply now to embark on a rewarding journey in shaping the future of AI! If you are a motivated individual with a passion for machine learning and a desire to be part of a collaborative and forward-thinking team, we would love to hear from you.
At Baseten, we are committed to fostering a diverse and inclusive workplace. We provide equal employment opportunities to all employees and applicants without regard to race, color, religion, gender, sexual orientation, gender identity or expression, national origin, age, genetic information, disability, or veteran status.
We are an Equal Opportunity Employer and will consider qualified applicants with criminal histories in a manner consistent with applicable law.
- ...that enables anyone to create, train, and deploy them. We... ...and pair it with the full rl post-training stack: environments,... ...async RL trainer. We enable researchers, startups and enterprises to... ...deployment contexts. As a Research Engineer in our Reasoning team, you'll...TrainingRemote workWorldwideVisa sponsorshipRelocation packageFlexible hours
$150k - $300k
...that enables anyone to create, train, and deploy them. We... ...and pair it with the full rl post-training stack: environments,... ...async RL trainer. We enable researchers, startups and enterprises to... ...deployment contexts. As a Research Engineer working on Distributed Training...TrainingRemote workWorldwideVisa sponsorshipRelocation packageFlexible hours- ...Research Engineer (Agentic Models) At JetBrains, code is our passion. Ever since we started,... ...you'll be responsible for the models, training loops, and evaluation pipelines that power... ...the intersection of SFT and RL-style post-training, and product-driven evaluation...TrainingRemote work
- ...Privacy Engineering Team Role The Privacy Engineering Team at OpenAI... ...-functional engineering and research partners with the necessary tools... ...insights that inform model-training and product-safety decisions.... ...—from dataset curation to post-deployment monitoring. Collaborate...TrainingRemote workRelocation package
- ...Research Engineer Research has been core to Liquid AI from the beginning. Liquid Labs gives that work a formal home; an internal research... .... You'll design and implement novel architectures, training methods, and inference strategies to redefine what efficient...TrainingImmediate startRemote workWork from home
- ...Research Engineer Joining us as a Research Engineer on the Multimodal team, you'll be at the forefront of building and advancing video... ...interactions every day. The Multimodal team is responsible for training, fine-tuning, and deploying cutting-edge image, audio and...TrainingRemote work
- ...attract the world's most capable talent. They will be on the forefront of applied research, engineering, infrastructure and deployment at scale. They will continue to scale their training to larger & more capable models. They will be given the right to raise large amounts...TrainingRemote workHome officeFlexible hours
$180k - $250k
...5, when Snorkel started as a research project in the Stanford AI Lab... ...to empower scientists, engineers, financial experts, product creators... ...that powers our model training and evaluation work. This is... ...maintain ML training frameworks and post-training pipelines, ensuring...TrainingLocal areaRemote work- ...AI Researcher Tavus is a research lab pioneering human computing. We're building AI Humans... ...~ Knowledge of large-scale model training and optimization. ~ Experience in duplex... ...working across research and engineering. ~(Bonus) Publications at EMNLP, COLING...TrainingWork at officeRemote workFlexible hours
- ...Research Engineer III – Ai For Building Energy Systems The Electricity Infrastructure and Buildings Division, part of the Energy and Environment... ...automation, building controls and operations, workforce training, data mining, and AI-enabled software tools. The...TrainingWork experience placement
- ...a quickly growing group of committed researchers, engineers, policy experts, and business leaders... ...that let researchers iterate quickly on training runs. As a Research Engineer on the team... ...training (RL, pre-training, or post-training) Familiarity with JAX, PyTorch...TrainingWork at officeRemote workVisa sponsorshipFlexible hours
- ...Research Crawling Engineer As a Research Crawling Engineer, you will design and operate large-scale web data acquisition systems for research... ...Construct and maintain datasets for research and model training Monitor crawl performance, coverage, and data quality;...TrainingRemote work
$5,667.67 - $9,583.33 per month
...Job Title Research Engineer IV Agency Texas A&M University System Offices Department Bush Combat Development Complex Proposed... ..., conduct research, commercialize technology, offer training, and deliver services for the people of Texas and beyond....Training$86.28k - $175.47k
...assess the performance of a variety of engineering materials that are used in a wide range... ...team to perform fundamental and applied research through the development of advanced fatigue... ...a PhD. ~0-5 years: Experience and/or training in fracture mechanics, fatigue crack growth...TrainingPermanent employmentContract workWork experience placementRemote work$350k
...a whole. Our team is a quickly growing group of committed researchers, engineers, policy experts, and business leaders working together to build... ...for novel verticals and use cases. The team builds the training environments that fuel RL at scale. This is a unique role that...TrainingWork at officeRemote workVisa sponsorshipFlexible hours- ...Job Title Research Engineer IV Agency Texas A&M Engineering Department Materials Science & Engineering Proposed Minimum... ...off with holidays, vacation and sick leave. Robust free training access through LinkedIn Learning plus professional...TrainingFlexible hours
$6,251 - $10,000 per month
...Job Title Senior Research Engineer I Agency Texas A&M University System Offices Department Bush Combat Development Complex... ...education, conduct research, commercialize technology, offer training, and deliver services for the people of Texas and beyond....Training$180k - $290k
...Research Engineer - Search/IR Research Engineer (Focused on Search/IR) You'll own the search and information retrieval systems at... ...engineering team to connect search/IR improvements with model training and the broader product roadmap. What We're Looking For...TrainingFull timeTemporary workRemote work- ...Research Engineer, Foundation Models About the Opportunity We are seeking a Research Engineer... ..., focusing on the development, training, evaluation, and deployment of state-of... ...Distributed Training, Pretraining, Fine-Tuning, Post-Training, Reinforcement Learning, RLHF,...TrainingVisa sponsorshipRelocation packageFlexible hours
- ...Research Engineer The Research Engineer is an entry-level position in the field of research and development. They work under the guidance... ...(EAP) ~ Paid Time Off (PTO) – (11) Federal Holidays ~ Training and Development Opportunities Your application submission will...TrainingFull timeContract workTemporary workFor contractorsWork at officeImmediate start
$100k - $130k
...global scale. Additionally, the team has engineered sophisticated pipelines for the... ...facilitating dataset creation for frontier research labs. The organization operates as a... ...for research and machine learning model training. Monitor and optimize crawl performance...TrainingFull timeRemote work$155k - $269k
...Research Engineer Waabi, founded by AI visionary Raquel Urtasun, is the leader in Physical AI. With a world-class team, we're unlocking... ...rich generative priors for downstream planning, testing, and training. You will… Design, implement, and scale state-of-...TrainingFull timeWork at officeRemote workWork from homeFlexible hours- ...Research Engineer I, II, III, or Senior Job no: 510665 Position type: Full-Time 12-Month Department: 193603 - Instit... ...rank must have an appropriate degree, or its equivalent in training and experience; a strong commitment to higher education, and...TrainingFull timeWork at officeLocal areaImmediate start
$160k - $240k
...Research Engineer - Evals You'll build the evaluation systems that tell us whether Firecrawl actually works. That sounds simple. It... ...models and RL. Evals here aren't a reporting layer - they're a training signal. You'll work closely with the RL and Search/IR...TrainingFull timeTemporary workRemote work- ...Senior Radar Research Engineer Job No: 26047 Department: Vice Pres for Research... ...usually low to moderate. Required Training and Other Conditions of Employment Every... ...Supervisor Chief Scientist Posting Type Internal and External Dependent...TrainingPermanent employmentFull timePart timeWork at office
- ...Research Engineer III Apply now ( Job No: 25293 20675 Department: Vice Pres for Research... ...competitive grant proposals. Required Training and Other Conditions of Employment Every... ...Research Scientist III Posting Type Internal & External Dependent...TrainingPermanent employmentFull timePart timeWork at officeRemote workShift work
$89.3k
...specific area of scientific research or other function, with its own... ...BS&DG is seeking a Research Engineer II - AI for Building Energy... ...controls and operations, workforce training, data mining, and AI-enabled... ...background investigation post hire and receive a favorable...TrainingFor contractorsWork at officeLocal areaRelocation packageFlexible hours- ...Job Title Research Engineer, Level II - IV Agency Texas A&M University System Offices Department Bush Combat Development... ..., conduct research, commercialize technology, offer training, and deliver services for the people of Texas and beyond....TrainingInternshipShift workNight shift
$115k - $181k
...Overview i3 is seeking a Vulnerability Research Engineer to support the Naval Research Laboratory’s Tactical Electronic Warfare Division... ..., cybersecurity and IT/IA innovative solutions and virtual training, simulation & serious game development and implementation. We...TrainingFull time$189k - $289k
...Your Impact at Lila The AI Research team is tackling one of the most... ..., open problems in AI: training LLMs to run long-horizon scientific... ...Our approach spans the full post-training stack— from SFT to asynchronous... ...rapidly growing our Research Engineering org and seeking talented...TrainingFull timeWork at officeLocal areaRemote workFlexible hours
Do you want to receive more vacancies?
Subscribe and receive similar vacancies to Post-Training Research Engineer. Be the first to apply!
- deep learning research engineer United States
- engineering business analyst United States
- junior research engineer United States
- research software engineer United States
- cyber research engineer United States
- robotics research engineer United States
- research programmer United States
- senior research engineer United States
- engineering analyst United States
- research assistant engineering United States

