AI Researcher, Core ML (Turbo)

Gravity Engineering Services Pvt Ltd.

About the Role The Turbo team operates at the intersection of efficient inference (algorithms, architectures, engines) and post‑training / RL systems. We develop and manage the systems powering Together’s API, including high‑performance inference and RL/post‑training engines capable of operating at production scale. Our objective is to advance the frontier of efficient inference and RL‑driven training: making models significantly faster and more cost‑effective to run, while simultaneously enhancing their capabilities through RL-based post‑training (e.g., GRPO‑style objectives). This work involves both algorithms and systems: asynchronous RL, rollout collection, scheduling, and batching all interact with engine design, providing numerous parameters to adjust across the RL algorithm, training loop, and inference stack. A significant portion of the role involves modifying production inference systems—such as SGLang‑ or vLLM‑style serving stacks and speculative decoding systems like ATLAS—rooted in a strong understanding of post‑training and inference theory, rather than solely theoretical algorithm design. You will work across the entire stack—from RL algorithms and training engines to kernels and serving systems—to develop and enhance frontier models using RL pipelines. Team members often exhibit specialized strengths: some are more proficient in RL, while others excel in systems. Depth in one of these areas, coupled with an eagerness to collaborate across disciplines (and grow towards more full‑stack ownership over time), is highly desirable. Requirements We do not expect every candidate to meet every single requirement. Team members typically possess deep expertise in one or more areas and sufficient breadth (or interest) to effectively work across the stack. The closer you are to a full‑stack profile (inference + post‑training/RL + systems), the stronger the fit—however, having deep expertise in one area and a strong desire to grow is perfectly acceptable. You might be a good fit if you: Have strong expertise in at least one of the following, and are excited to collaborate across (and grow into) the others: Systems-first profile: Large‑scale inference systems (e.g., SGLang, vLLM, FasterTransformer, TensorRT, custom engines, or similar), GPU performance, distributed serving. RL-first profile: RL / post‑training for LLMs or large models (e.g., GRPO, RLHF/RLAIF, DPO‑like methods, reward modeling), and using these to train or fine‑tune real models. Model architecture design for Transformers or other large neural networks. Distributed systems / high‑performance computing for ML. Are comfortable working from algorithms to engines: Strong coding ability in Python . Experience profiling and optimizing performance across GPU, networking, and memory layers. Able to take a new sampling method, scheduler, or RL update and transform it into a production‑grade implementation within the engine and/or training stack. Have a solid research foundation in your area(s) of depth: Track record of impactful work in ML systems, RL, or large‑scale model training (papers, open‑source projects, or production systems). Can comprehend new RL / post‑training papers, understand their implications on the stack, and design minimal, correct changes in the appropriate layer (training engine vs. inference engine vs. data / API). Operate well as a full‑stack problem solver: You naturally ask: “Where in the stack is this really bottlenecked?” You enjoy collaborating with infra, research, and product teams, and you value both scientific quality and user‑visible achievements. Minimum qualifications 3+ years of experience working on ML systems, large‑scale model training, inference, or adjacent areas (or equivalent experience via research / open source). Advanced degree in Computer Science, EE, or a related field, or equivalent practical experience. Demonstrated experience owning complex technical projects end‑to‑end. If you’re excited about the role and strong in some of these areas, we encourage you to apply even if you don’t meet every single requirement. Responsibilities Advance inference efficiency end‑to‑end Design and prototype algorithms, architectures, and scheduling strategies for low‑latency, high‑throughput inference. Implement and maintain changes in high‑performance inference engines (e.g., SGLang‑ or vLLM‑style systems and Together’s inference stack), including kernel backends, speculative decoding (e.g., ATLAS), quantization, etc. Profile and optimize performance across GPU, networking, and memory layers to improve latency, throughput, and cost. Unify inference with RL / post‑training Design and operate RL and post‑training pipelines (e.g., RLHF, RLAIF, GRPO, DPO‑style methods, reward modeling) where 90+% of the cost is inference, jointly optimizing algorithms and systems. Make RL and post‑training workloads more efficient with inference‑aware training loops—for example, async RL rollouts, speculative decoding, and other techniques that make large‑scale rollout collection and evaluation cheaper. Use these pipelines to train, evaluate, and iterate on frontier models on top of our inference stack. Co‑design algorithms and infrastructure so that objectives, rollout collection, and evaluation are tightly coupled to efficient inference, and quickly identify bottlenecks across the training engine, inference engine, data pipeline, and user‑facing layers. Run ablations and scale‑up experiments to understand trade‑offs between model quality, latency, throughput, and cost, and feed these insights back into model, RL, and system design. Own critical systems at production scale Profile, debug, and optimize inference and post‑training services under real production workloads. Drive roadmap items that require real engine modification—changing kernels, memory layouts, scheduling logic, and APIs as needed. Establish metrics, benchmarks, and experimentation frameworks to validate improvements rigorously. Provide technical leadership (Staff level) Set technical direction for cross‑team efforts at the intersection of inference, RL, and post‑training. Mentor other engineers and researchers on full‑stack ML systems work and performance engineering. #J-18808-Ljbffr

Apply

Vacancy posted 4 days ago

Similar jobs that could be interesting for youBased on the AI Researcher, Core ML (Turbo) in San Francisco, CA vacancy

AI Researcher, Core ML (Turbo)
$200k - $280k
...About the Role The Turbo team sits at the intersection of efficient... ...high-performance computing for ML. Are comfortable working... ...stack. Have a solid research foundation in your area(s) of depth... .... About Together AI Together AI is a research-...
Suggested
Full time
Together AI
San Francisco, CA
3 days ago
AI Behavior Researcher - Child Safety and Mental Health
$250k
...AI Behavior Researcher - Child Safety and Mental Health Transluce is a fast-moving nonprofit research... ...methods. You don't need to be a pure ML engineer, but you should be comfortable... ...safety and mental health experts. Core responsibility: Identify and measure...
Suggested
Transluce
San Francisco, CA
2 days ago
AI Researcher
$250k - $325k
...was first-to-market with: An AI agent that lives in MS Word and... ...: Why, What, and Who Why AI Researchers are the engine of innovation at... ...engineering team. Advance the core AI platform. Design and... ...Publications at top venues — e.g., ML/AI conferences (NeurIPS, AAAI,...
Suggested
Contract work
Work at office
Immediate start
Remote work
Ivo Inc.
San Francisco, CA
4 days ago
Senior Staff AI Researcher
$212k - $292k
...The Advanced Technology Group (ATG) is the research division of the company. ATG’s mission is... ...and electrical engineering, such as AI/ML, algorithms, digital signal processing, audio... ...technologies by focusing on three core areas:Cutting-Edge ResearchYou will:Lead...
Suggested
Full time
Local area
Worldwide
Flexible hours
Via Licensing Corporation
San Francisco, CA
4 days ago
Principal / Distinguished AI/ML Researcher and/or Engineer, Reasoning and Planning
...About the Role We are seeking a Principal / Distinguished AI/ML Researcher and/or Engineer with deep experience in reasoning, planning, and... ...across disciplines and influence company-wide AI architecture. A core dimension of this role is the design and deployment of multi-...
Suggested
Local area
Gravity Engineering Services Pvt Ltd.
San Francisco, CA
4 days ago
Physician AI Researcher
...in hospitals through advanced AI-powered reporting and analytics... ...founding member of the clinical AI research team at Pharos. As a Physician... ...contribute to Pharos's core mission: improving patient safety... ...prototype, implement, and evaluate AI/ML models for healthcare quality...
Work at office
Pharos
San Francisco, CA
6 days ago
User Researcher, AI Evaluations
$196k - $230k
...UX Researcher Notion is the collaborative AI workspace where teams and agents think together. We're building one... ...automation) and working with Data Science/ML partners on measurement strategy and... .... For some roles, AI fluency is a core requirement — when that's the case,...
Local area
Shift work
Notion, LLC
San Francisco, CA
4 days ago
Principal AI Researcher
...leader in Application Risk Management for the AI era. Powered by trillions of lines of code... .... We are seeking a Principal AI Researcher to join Veracode's AI & Innovation Research... ...evaluation techniques essential for AI / ML projects. Strong understanding of various...
Worldwide
Veracode
San Francisco, CA
3 days ago
Architectural Researcher for Next-Gen AI Models
About Cartesia Our mission is to architect AI that learns from and interacts with the... ...in AI. Your Impact Conduct groundbreaking research in neural network architecture design to advance... ...proven research track record in top‑tier ML/AI venues (e.g., NeurIPS, ICML, ICLR, CVPR...
Work at office
Visa sponsorship
Flexible hours
Cartesia AI, Inc.
San Francisco, CA
2 days ago
Cybersecurity Researcher (Remote) - AI/Threat Intel
A leading cybersecurity firm is seeking a Cybersecurity Researcher to drive advancements in threat analysis and vulnerability investigation. You will collaborate with AI/ML teams to enhance systems, delivering actionable research with a focus on clear results. The ideal...
Remote job
Full time
Work at office
AI Cybersecurity Company
San Francisco, CA
6 days ago
Responsible AI Researcher: Safety, GenAI & ML
A leading technology company is looking for candidates in Machine Learning and Applied Research. This role focuses on Responsible AI and Safety, collaborating with various stakeholders to shape policies and develop groundbreaking solutions. Ideal candidates will have experience...
Apple Inc.
San Francisco, CA
4 days ago
Principal AI Researcher
Requirements 8+ years of experience in AI/ML research or applied science, with a proven history of taking products from applied research to global‑scale production Ph.D. or Master’s degree in Computer Science, Artificial Intelligence, Math, Physics, or equivalent technical...
Workday
San Francisco, CA
6 days ago
Researcher, Health AI
$295k
...the safety, robustness, and reliability of AI models towards their deployment in the... ...We work at the intersection of AI safety research and healthcare applications, aiming to create... ...team stakeholders to integrate methods in core model training and launch safety improvements...
Work at office
Relocation package
OpenAI
San Francisco, CA
6 days ago
Principal AI/ML Researcher: Planning & Multi-Agent
...Gravity Engineering Services Pvt Ltd. is seeking a Principal/Distinguished AI/ML Researcher in San Francisco, California. This role involves advancing AI capabilities, focusing on reasoning, planning, and adaptive decision-making systems. The ideal candidate will possess...
Gravity Engineering Services Pvt Ltd.
San Francisco, CA
4 days ago
Principal AI Security & Risk Researcher
...software that operationalizes responsible AI governance at scale. We're a 4-month-old AI... ...re seeking a Principal AI Security & Risk Researcher to join our founding research team and... ...cybersecurity, with 2+ years focused on AI/ML security, red teaming, or adversarial testing...
Part time
Remote work
Flexible hours
Ciph Lab
San Francisco, CA
2 days ago
Applied ML Researcher: Collaborative AI Agents
...Overview We are an applied AI lab building end-to-end software agents. We’re building collaborative AI teammates that enable engineers to focus on more interesting problems and empower engineering teams to strive for more ambitious goals. Team and Opportunity Our team...
Cognition Corp
San Francisco, CA
5 days ago
Lead AI Security & Risk Researcher Equity & Remote
A pioneering AI governance company is looking for a Principal AI Security & Risk Researcher to join its remote-first team. This role offers the opportunity to build adaptive... ...years in cybersecurity, especially focused on AI/ML, and a strong ability to translate technical...
Remote job
Nashville Public Radio
San Francisco, CA
6 days ago
Senior AI Foundations Researcher — Applied ML Leader
...services organization is seeking a Senior Distinguished Applied Researcher to lead AI innovation. In this role based in San Francisco, you will... ...the opportunity to influence banking for good through AI and ML advancements. #J-18808-Ljbffr Capital One National Association
Capital One National Association
San Francisco, CA
3 days ago
AI Education Researcher: ML, RAG & Multimodal Models
An innovative educational technology company based in San Francisco is seeking a Machine Learning Researcher to enhance K-12 education through AI. This role combines advanced technical skills with a passion for improving teaching experiences. Responsibilities include leveraging...
Flexible hours
Kiddom
San Francisco, CA
3 days ago
Lead AI Researcher for Enterprise Agentic Systems
Join Workday as a Principal AI Researcher, leading the AI Research Team to redefine how LLMs function in enterprise environments. We seek a... ...systems and drive innovative research at scale. Your expertise in AI/ML and commitment to ethical AI will guide impactful contributions...
Workday
San Francisco, CA
5 days ago
Member of Technical Staff (AI Researcher)
About Perplexity Perplexity is seeking top‑tier AI Research Scientists and Engineers to advance our AI products and capabilities. We’re building... ...and expertise, you’ll work on one of three specialized teams: Core Research Team (Horizontal) : Focus on generating and improving...
Gravity Engineering Services Pvt Ltd.
San Francisco, CA
4 days ago
Senior Research Scientist - Series E AI Startup - Up to $360,000 base salary
$360k
...Looking to apply world-class research to some of the most challenging problems in modern AI? A well-funded, venture-backed AI... ...focused on advancing the company's core AI capabilities. You'll identify... ...Programming skills in Python and modern ML frameworks Compensation: Up...
Saragossa
San Francisco, CA
1 day ago
ML Research Scientist: Music AI & Generative Models
$160k - $280k
...winning artists use Suno, but our core user base consists of everyday... ...We are a team of musicians and AI experts, including alumni from... ...for early members of our research team. You’ll work closely with... ...and deploy our state of the art ML models trained with an H100/scientist...
Work at office
Flexible hours
Menlo Ventures
San Francisco, CA
4 days ago
Machine Learning Researcher
$140k - $250k
...invasively. We apply deep learning research to large scale EEG datasets... ...Learning Researcher to join our core R&D team. This role involves... ...experts in neural decoding and AI, pushing the boundaries of what... ...on the latest advancements in ML architectures (e.g., transformers...
Work from home
Visa sponsorship
Alljoined
San Francisco, CA
1 day ago
Researcher, Core Product Strategy
$169k - $250k
...translating designs into code, or iterating with AI. From idea to product, Figma empowers teams to streamline... ...design and collaboration, join us! The Figma Research team is hiring a top-tier researcher to support our core product, Figma Design, and actively develop and...
Full time
Temporary work
Remote work
Work from home
Figma
San Francisco, CA
1 day ago
Machine Learning Researcher, Audio
$160k - $250k
Machine Learning Researcher, Audio Location: San Francisco, CA or Remote... ...to empower enterprises to build AI phone agents at scale. Based in... ...research and development across the core components of our voice stack:... ...systems or telephony. PhD in ML, AI, or a related field, or...
Work at office
Remote work
Bland
San Francisco, CA
6 days ago
Audio AI Researcher: Build Steerable Speech Systems
$350k
A leading AI research company in San Francisco is hiring for a position on their Audio team. The role involves developing and training advanced audio models, optimizing performance, and working collaboratively across teams. Ideal candidates will have strong expertise in...
Flexible hours
Menlo Ventures
San Francisco, CA
6 days ago
Senior AI Research Scientist
..., applications, processes, and AI into a single, governed platform... ...continuous innovation at its core, Workato provides the trusted foundation... ...looking for an exceptional AI Research Scientist to join our growing... ...MS/PhD in Computer Science, ML, or related field—or equivalent...
Remote work
Flexible hours
Workato
San Francisco, CA
5 days ago
Staff AI Research Scientist: Enterprise LLMs & Agentic Systems
...writer.com is seeking a Staff AI Research Scientist based in San Francisco or New York City. This role focuses on leading high-impact research... ...and mentor other researchers. Ideal candidates have extensive ML experience and a Ph.D. or equivalent in a related field. #J-188...
Writer Corporation
San Francisco, CA
1 day ago
Staff AI research scientist
...the world's leading enterprises orchestrate AI-powered work. Our vision is to expand... ...future of work with AI. About the role AI research at WRITER isn't just about publishing papers... ...culture What you need 7+ years of hands‑on ML research experience, with deep expertise in...
Full time
Work at office
Local area
Flexible hours
Writer Corporation
San Francisco, CA
2 days ago

Do you want to receive more vacancies?

Subscribe and receive similar vacancies to AI Researcher, Core ML (Turbo). Be the first to apply!