LLM Training Engineer
$200k - $275kSciforium
Base pay range
$200,000.00/yr - $275,000.00/yr
Sciforium is an AI infrastructure company developing next-generation multimodal AI models and a proprietary, high-efficiency serving platform. Backed by multi-million-dollar funding and direct sponsorship from AMD with hands‑on support from AMD engineers the team is scaling rapidly to build the full stack powering frontier AI models and real‑time applications.
About The Role
As a Research Engineer , you’ll work across the full foundation-model stack: pretraining and scaling , post-training and Reinforcement Learning , sandbox environments for evaluation and agentic learning , and deployment + inference optimization . You’ll build and iterate quickly on research ideas, contribute production‑grade infrastructure, and help deliver models that can serve real‑world use cases at scale.
What You’ll Work On
This role spans multiple tracks – candidates may focus on one or contribute across several. Examples include:
Pretraining & Scaling
- Train large byte-native foundation models across massive, heterogeneous corpora
- Design stable training recipes and scaling laws for novel architectures
- Improve throughput, memory efficiency, and utilization on large GPU clusters
- Build and maintain distributed training infrastructure and fault‑tolerant pipelines
Post-training & RL
- Develop post‑training pipelines (SFT, preference optimization, RLHF/RLAIF, RL)
- Curate and generate targeted datasets to improve specific model capabilities
- Build reward models and evaluation frameworks to drive iterative improvement
- Explore inference‑time learning and compute techniques to enhance performance
Sandbox Environments & Evaluation
- Build scalable sandbox environments for agent evaluation and learning
- Create realistic, high‑signal automated evals for reasoning, tool use, and safety
- Design offline + online environments that support RL‑style training at scale
- Instrument environments for observability, reproducibility, and iteration speed
Deployment & Inference Optimization
- Optimize inference throughput/latency for byte‑native architectures
- Build high‑performance serving pipelines (KV caching, batching, quantization, etc.)
- Improve end‑to‑end model efficiency, cost, and reliability in production
- Profile and optimize GPU kernels, runtime bottlenecks, and memory behavior
Ideal Candidate Credentials
Technical Strength
- Strong general software engineering skills (writing robust, performant systems)
- Experience with training or serving large neural networks (LLMs or similar)
- Solid grasp of deep learning fundamentals and modern literature
- Comfort working in high‑performance environments (GPU, distributed systems, etc.)
Relevant Experience (one or more)
- Pretraining / large‑scale distributed training (FSDP/ZeRO/Megatron‑style systems)
- Post‑training pipelines (SFT, RLHF/RLAIF, preference optimization, eval loops)
- Building RL environments, simulators, or agent frameworks
- Inference optimization, model compression, quantization, kernel‑level profiling
- Building large ETL pipelines for internet‑scale data ingestion and cleaning
- Owning end‑to‑end production ML systems with monitoring and reliability
Research Orientation
- Ability to propose and evaluate research ideas quickly
- Strong experimental hygiene: ablations, metrics, reproducibility, analysis
- Bias toward building – you can turn ideas into working code and results
Benefits Include
- Medical, dental, and vision insurance
- 401k plan
- Daily lunch, snacks, and beverages
- Flexible time off
- Competitive salary and equity
Equal Opportunity
Sciforium is an equal opportunity employer. All applicants will be considered for employment without attention to race, color, religion, sex, sexual orientation, gender identity, national origin, veteran or disability status.
#J-18808-Ljbffr- ...Applied AI team, responsible for building our Forward Deployed Engineering function. This team plays a critical role in bridging our cutting... ...leveraging state‑of‑the‑art models, orchestrating complex LLM workflows, integrating with enterprise infrastructure, and deploying...TrainingFull timeRelocation package
- B Capital is seeking a data engineer to ensure high data quality for training AI models. You will own the upstream data quality for LLM post-training and design automated QA methods in a collaborative environment. Ideal candidates will have strong engineering skills, a...Training
- NeuBird AI seeks an AI engineer based in San Francisco, California. You will lead the development of large-scale language models, focusing on training, deployment, and optimization to enhance the NeuBird platform performance. The ideal candidate holds a Bachelor's or Master...Training
- ...production and research roles focusing on large language models and code generation. Responsibilities include building distributed training systems, implementing quality assurance pipelines, and developing innovative evaluation frameworks. The ideal candidate has a...TrainingFlexible hours
$180k - $270k
...San Francisco. This role involves building advanced audio and speech models and includes responsibilities related to research and engineering. Successful candidates will earn a competitive salary between $180K to $270K plus bonuses and equity. The hybrid work environment...TrainingWork at office- ...for specific customer domains, tasks, and constraints. As a ML Engineer, you will work hands-on with customer data, run fine-tuning workflows... ...for customer-specific use cases: prepare datasets, configure training runs (SFT, preference optimization, reinforcement fine-tuning),...TrainingFull timeRelocation package
$200.8k - $251k
...a machine learning framework for large language models. Candidates should have system optimization experience and solid software engineering skills, particularly in tools like CUDA and Pytorch. This full-time position offers a competitive salary range of $200,800 - $251...TrainingFull time- B Capital is seeking a data engineer focused on ensuring high-quality data for AI model training. You will work closely with top researchers to develop metrics and standards for data quality. The ideal candidate has solid experience in Python and data pipelines and is...Training
- ...date. About the role The Customer Engineer will play a crucial role in the... ...technically curious and passionate about ML/AI, LLM, vLLM and the role of AI in next generation... ...engineer developing data pipelines for training, fine-tuning and inference/serving of LLMs...Training
- ...GPU Kernel Engineer Sciforium is an AI infrastructure company developing next-generation... ...level ML frameworks used for large-scale training and inference. This role is ideal for... ...operations, with a focus on large-scale LLM training and inference. Integrate low-...TrainingFlexible hours
$169.7k - $217.5k
...looking for a highly collaborative Senior Engineer to join our team to help enhance and... ...security subject matter expert - educating, training, and mentoring others Who You Are... ...with AI risk assessment, red teaming of AI/LLM systems, and defenses against prompt injection...TrainingContract workLocal area$127k - $296k
...and collaboration, join us! The Marketing Engineer, AI Deployment will design and build... ...demonstrated track record as an AI builder—agents, LLM workflows, automations, or prompt systems... ...Lead cross-functional enablement, training, and change-management programs that translate...TrainingFull timeRemote workWork from home$110k - $140k
...About the Role The Implementation Engineer is a key bridge between the Business Unit... ...cross-team transparency. Enablement & Training: Educate internal teams on new AI tools,... ...client before. Hands-on exposure to LLM orchestration frameworks and AI toolchains...TrainingWork at office- About the Team: The Backline Engineering Team serves as the critical bridge between Frontline Support... ...and generative AI systems, including LLM -based applications and agent-driven workflows. Strong grasp of model training, evaluation, and deployment in distributed...Training
- We are a team of ex-Google engineers who built the world’s largest defensive moats: Safe Browsing... ...deep in the telemetry, identifying where LLM-backed attackers are finding gaps and... ...the wild are instantly fed back into training sets to harden the system against the next...TrainingLive in
$170k - $350k
Job Title: Founding Engineer Salary: $170k-$350k + Equity Company Description: well-funded... ...infrastructure that powers the training of frontier AI agents. By developing containerized... ...chance to work on the infrastructure layer of LLM-based agent self-improvement. Location:...Training- ...than the status quo. As our ML Performance Engineer, you will be the person who makes that... ...Strong background in model inference and post‑training optimization at scale Fluency in GPU... ...your depth (vLLM, Flash‑Attention, TensorRT‑LLM, PyTorch, or equivalent) Experience with...TrainingFlexible hoursShift work
$350k
...Performance Engineer, Inference Systems San Francisco, CA | New York City, NY | Seattle, WA About Anthropic... ...Qualifications Experience with ML systems, especially training or inference infrastructure or general LLM serving stacks. Direct large-scale inference...TrainingWork at officeVisa sponsorshipFlexible hours- The role As a GTM Engineer, you’ll sit at the center of Applied Compute's commercial engine... ...modern AI tooling and frameworks, including LLM APIs, agent orchestration, evals, and the... ...for companies to build agent workforces trained on proprietary data and institutional...TrainingWork at officeVisa sponsorshipRelocation package
$50k
...Company Datawizz helps companies reduce LLM costs by 85% while improving accuracy by... ...evaluating them on different models, and training custom SLMs for repeated tasks. Datawizz... ...accuracy. The Role As a founding full-stack engineer, you’ll own the application layer of our...TrainingWork at office- ...transition yet. The Role The Forward Deployed Engineer is one of the most important roles at... ...hands-on enablement rather than passive training Monitor deployment health and proactively... ...structured delivery framework Prior exposure to LLM-based tools or AI-assisted workflows in...TrainingH1bImmediate start
- ...one. We're looking for a Forward Deployed Engineer to join our growing team! In this role,... ...proficient in using our solutions. Conduct training sessions and develop enablement materials... ...with kernel debugging. Experience with LLM deployments, even at small scale. Background...TrainingRemote workFlexible hours
$160k - $230k
...LLM Inference Frameworks and Optimization Engineer San Francisco, Singapore, Amsterdam About the Role At Together.ai, we are building state-of-the-art infrastructure to enable efficient and scalable inference for large language models (LLMs). Our mission is to...Full time- ...best: selling. Just as coding agents 10x'd engineering, revenue agents 10x customer work. We... ...of AI-driven product development, from training and evaluating models to integrating... ...multi-step workflows, applying cutting-edge LLM research in production, and driving the...TrainingWork experience placementWork at officeImmediate startRelocation
$136k - $184k
...role is for you. What You’ll Do As a GTM Engineer, you will build the strategy and... ...knowledge, examples, and answers, and develop training programs to increase Retool and AI fluency... ...tools and automations Familiarity with AI/LLM integration in production workflows (prompt...TrainingWork at officeImmediate start- ...thinking peers. Goodwin is seeking a Legal Engineer to help design and build the next... ...processes Contribute to documentation, training, and rollout of new tools Who You Are:... ...with AI/ML, natural language processing, or LLM‑based applications Experience with cloud...Training
- ...We are looking for a Go To Market (GTM) Engineer focusing on AI Adoption, who can code, build... ...quality and security standards for LLM usage. Write and maintain production code... ...Create internal documentation, runbooks, and training content to drive team adoption; conduct...TrainingWork at officeWorldwideHome officeFlexible hoursShift work3 days per week
$205k - $265k
...growing group of committed researchers, engineers, policy experts, and business leaders working... ...automation scripts Experience with AI/LLM integration for financial operations,... ...an equivalent combination of education, training, and/or experience Required field of study...TrainingWork at officeVisa sponsorshipFlexible hours$138.68k - $174.43k
APPLIED AI ENGINEER (1042) - Department of Technology Apply using SmartRecruiters, the City... ...options Career growth opportunities through training, internal mobility, and subsidized... ..., managing vector databases, integrating LLM APIs, and ensuring backend systems are reliable...TrainingPermanent employmentFull timeTraineeshipWork experience placementWork at officeRemote workWork from homeFlexible hoursNight shift1 day per week$220k - $320k
ML Model Serving Engineer Want to build the layer that actually makes AI usable in real time... ...under real-world load. This is not about training models. It’s about making them fast,... ...Building high-performance serving systems for LLM, speech, and vision models Scaling...Training3 days per week
Do you want to receive more vacancies?
Subscribe and receive similar vacancies to LLM Training Engineer. Be the first to apply!

