Post-Training Research Engineer

Baseten

Baseten Engineer Position

Baseten powers mission-critical inference for the world's most dynamic AI companies. By uniting applied AI research, flexible infrastructure, and seamless developer tooling, we enable companies operating at the frontier of AI to bring cutting-edge models into production. We're growing quickly and recently raised our $300M Series E, backed by investors including BOND, IVP, Spark Capital, Greylock, and Conviction. Join us and help build the platform engineers turn to to ship AI products.

We are looking for an engineer with strong experience in machine learning and solid foundations in maths and computer science to join our growing Post-Training team at Baseten.

Custom models are instrumental to the success of Baseten customers. By inference volume, the overwhelming majority of traffic at Baseten is to and from models that have been post-trained in some way. The Post-Training team is responsible for the success of our customers' post-trained models, and we employ a wide array of techniques to produce models that are more efficient and higher quality than even the biggest closed-source models for the customer's specific needs.

Your role as a research engineer is to build the in-house tooling to support all of this. We care about training a wide spectrum of different model architectures with a variety of techniques efficiently and at scale. At times this involves zooming deep into a particular technical topic, but more often it involves working across the whole stack: systems-level concepts like Kubernetes, cgroups, storage systems, and networking topologies, as well as PyTorch distributed tensor computation and GPU kernels.

Recent Research
  • Dense, on-policy or both?
  • Repeated kv cache for long-running agents
  • Distillation without the dark – replicating black-box on-policy distillation on Baseten

We don't have a rigid set of skills, but here's some of what we're looking for:

  • A deep understanding of modern ML techniques and tools for training transformers
  • Advanced experience in a tensor/array computation library like PyTorch, TensorFlow, JAX, or similar
  • A detailed understanding of transformer training parallelism strategies like data parallelism, sharded data parallelism, tensor parallelism, pipeline parallelism, and context parallelism
  • The experience and knowledge to profile and improve the performance of a distributed GPU program in PyTorch or a similar library
  • The ability to perform roofline analysis on a transformer training setup
  • A willingness to dive into messy problems, work with researchers, derive specifications by asking important questions, and execute
  • Familiarity with HPC and distributed computing platforms like Slurm, Ray, Kubernetes, and Dask
  • Familiarity with cluster networking technology like InfiniBand, RoCE, and GPUDirect
  • Solid fundamentals in operating systems concepts like processes, files, kernel drivers, containerisation, and networking protocols
  • A sense of creativity and willingness to ask difficult questions about our approach, assumptions, and tooling choices

Benefits
  • Competitive compensation, including meaningful equity
  • 100% coverage of medical, dental, and vision insurance for employees and dependents
  • Flexible PTO policy, including a company-wide Winter Break (our offices are closed from Christmas Eve to New Year's Day!)
  • Paid parental leave
  • Fertility and family-building stipend through Carrot
  • Company-facilitated 401(k)
  • Exposure to a variety of ML startups, offering unparalleled learning and networking opportunities

Apply now to embark on a rewarding journey in shaping the future of AI! If you are a motivated individual with a passion for machine learning and a desire to be part of a collaborative and forward-thinking team, we would love to hear from you.

At Baseten, we are committed to fostering a diverse and inclusive workplace. We provide equal employment opportunities to all employees and applicants without regard to race, color, religion, gender, sexual orientation, gender identity or expression, national origin, age, genetic information, disability, or veteran status.

We are an Equal Opportunity Employer and will consider qualified applicants with criminal histories in a manner consistent with applicable law.

Vacancy posted 4 days ago