AI Researcher, Core ML (Turbo)

Gravity Engineering Services Pvt Ltd.

About the Role

The Turbo team operates at the intersection of efficient inference (algorithms, architectures, engines) and post-training / RL systems. We develop and manage the systems powering Together’s API, including high-performance inference and RL/post-training engines capable of operating at production scale.

Our objective is to advance the frontier of efficient inference and RL-driven training: making models significantly faster and more cost-effective to run, while simultaneously enhancing their capabilities through RL-based post-training (e.g., GRPO-style objectives). This work involves both algorithms and systems: asynchronous RL, rollout collection, scheduling, and batching all interact with engine design, providing numerous parameters to adjust across the RL algorithm, training loop, and inference stack. A significant portion of the role involves modifying production inference systems (such as SGLang- or vLLM-style serving stacks and speculative decoding systems like ATLAS), rooted in a strong understanding of post-training and inference theory, rather than solely theoretical algorithm design.

You will work across the entire stack, from RL algorithms and training engines to kernels and serving systems, to develop and enhance frontier models using RL pipelines. Team members often exhibit specialized strengths: some are more proficient in RL, while others excel in systems. Depth in one of these areas, coupled with an eagerness to collaborate across disciplines (and grow towards more full-stack ownership over time), is highly desirable.

Requirements

We do not expect every candidate to meet every single requirement. Team members typically possess deep expertise in one or more areas and sufficient breadth (or interest) to effectively work across the stack. The closer you are to a full-stack profile (inference + post-training/RL + systems), the stronger the fit. However, having deep expertise in one area and a strong desire to grow is perfectly acceptable.

You might be a good fit if you:

  • Have strong expertise in at least one of the following, and are excited to collaborate across (and grow into) the others:
      • Systems-first profile: Large-scale inference systems (e.g., SGLang, vLLM, FasterTransformer, TensorRT, custom engines, or similar), GPU performance, distributed serving.
      • RL-first profile: RL / post-training for LLMs or large models (e.g., GRPO, RLHF/RLAIF, DPO-like methods, reward modeling), and using these to train or fine-tune real models.
      • Model architecture design for Transformers or other large neural networks.
      • Distributed systems / high-performance computing for ML.
  • Are comfortable working from algorithms to engines:
      • Strong coding ability in Python.
      • Experience profiling and optimizing performance across GPU, networking, and memory layers.
      • Able to take a new sampling method, scheduler, or RL update and transform it into a production-grade implementation within the engine and/or training stack.
  • Have a solid research foundation in your area(s) of depth:
      • Track record of impactful work in ML systems, RL, or large-scale model training (papers, open-source projects, or production systems).
      • Can comprehend new RL / post-training papers, understand their implications on the stack, and design minimal, correct changes in the appropriate layer (training engine vs. inference engine vs. data / API).
  • Operate well as a full-stack problem solver:
      • You naturally ask: “Where in the stack is this really bottlenecked?”
      • You enjoy collaborating with infra, research, and product teams, and you value both scientific quality and user-visible achievements.

Minimum qualifications

  • 3+ years of experience working on ML systems, large-scale model training, inference, or adjacent areas (or equivalent experience via research / open source).
  • Advanced degree in Computer Science, EE, or a related field, or equivalent practical experience.
  • Demonstrated experience owning complex technical projects end-to-end.

If you’re excited about the role and strong in some of these areas, we encourage you to apply even if you don’t meet every single requirement.

Responsibilities

Advance inference efficiency end-to-end
  • Design and prototype algorithms, architectures, and scheduling strategies for low-latency, high-throughput inference.
  • Implement and maintain changes in high-performance inference engines (e.g., SGLang- or vLLM-style systems and Together’s inference stack), including kernel backends, speculative decoding (e.g., ATLAS), quantization, etc.
  • Profile and optimize performance across GPU, networking, and memory layers to improve latency, throughput, and cost.

Unify inference with RL / post-training
  • Design and operate RL and post-training pipelines (e.g., RLHF, RLAIF, GRPO, DPO-style methods, reward modeling) where 90+% of the cost is inference, jointly optimizing algorithms and systems.
  • Make RL and post-training workloads more efficient with inference-aware training loops: for example, async RL rollouts, speculative decoding, and other techniques that make large-scale rollout collection and evaluation cheaper.
  • Use these pipelines to train, evaluate, and iterate on frontier models on top of our inference stack.
  • Co-design algorithms and infrastructure so that objectives, rollout collection, and evaluation are tightly coupled to efficient inference, and quickly identify bottlenecks across the training engine, inference engine, data pipeline, and user-facing layers.
  • Run ablations and scale-up experiments to understand trade-offs between model quality, latency, throughput, and cost, and feed these insights back into model, RL, and system design.

Own critical systems at production scale
  • Profile, debug, and optimize inference and post-training services under real production workloads.
  • Drive roadmap items that require real engine modification: changing kernels, memory layouts, scheduling logic, and APIs as needed.
  • Establish metrics, benchmarks, and experimentation frameworks to validate improvements rigorously.

Provide technical leadership (Staff level)
  • Set technical direction for cross-team efforts at the intersection of inference, RL, and post-training.
  • Mentor other engineers and researchers on full-stack ML systems work and performance engineering.

Vacancy posted 4 days ago
Similar jobs that could be interesting for you
Based on the AI Researcher, Core ML (Turbo) in San Francisco, CA vacancy
  • $200k - $280k

     ...About the Role The Turbo team sits at the intersection of efficient...  ...high-performance computing for ML. Are comfortable working...  ...stack. Have a solid research foundation in your area(s) of depth...  .... About Together AI Together AI is a research-... 
    Suggested
    Full time

    Together AI

    San Francisco, CA
    3 days ago
  • $250k

     ...AI Behavior Researcher - Child Safety and Mental Health Transluce is a fast-moving nonprofit research...  ...methods. You don't need to be a pure ML engineer, but you should be comfortable...  ...safety and mental health experts. Core responsibility: Identify and measure... 
    Suggested

    Transluce

    San Francisco, CA
    2 days ago
  • $250k - $325k

     ...was first-to-market with: An AI agent that lives in MS Word and...  ...: Why, What, and Who Why AI Researchers are the engine of innovation at...  ...engineering team. Advance the core AI platform. Design and...  ...Publications at top venues — e.g., ML/AI conferences (NeurIPS, AAAI,... 
    Suggested
    Contract work
    Work at office
    Immediate start
    Remote work

    Ivo Inc.

    San Francisco, CA
    4 days ago
  • $212k - $292k

     ...The Advanced Technology Group (ATG) is the research division of the company. ATG’s mission is...  ...and electrical engineering, such as AI/ML, algorithms, digital signal processing, audio...  ...technologies by focusing on three core areas: Cutting-Edge Research You will: Lead... 
    Suggested
    Full time
    Local area
    Worldwide
    Flexible hours

    Via Licensing Corporation

    San Francisco, CA
    4 days ago
  •  ...About the Role We are seeking a Principal / Distinguished AI/ML Researcher and/or Engineer with deep experience in reasoning, planning, and...  ...across disciplines and influence company-wide AI architecture. A core dimension of this role is the design and deployment of multi-... 
    Suggested
    Local area

    Gravity Engineering Services Pvt Ltd.

    San Francisco, CA
    4 days ago
  •  ...in hospitals through advanced AI-powered reporting and analytics...  ...founding member of the clinical AI research team at Pharos. As a Physician...  ...contribute to Pharos's core mission: improving patient safety...  ...prototype, implement, and evaluate AI/ML models for healthcare quality... 
    Work at office

    Pharos

    San Francisco, CA
    6 days ago
  • $196k - $230k

     ...UX Researcher Notion is the collaborative AI workspace where teams and agents think together. We're building one...  ...automation) and working with Data Science/ML partners on measurement strategy and...  .... For some roles, AI fluency is a core requirement — when that's the case,... 
    Local area
    Shift work

    Notion, LLC

    San Francisco, CA
    4 days ago
  •  ...leader in Application Risk Management for the AI era. Powered by trillions of lines of code...  .... We are seeking a Principal AI Researcher to join Veracode's AI & Innovation Research...  ...evaluation techniques essential for AI / ML projects. Strong understanding of various... 
    Worldwide

    Veracode

    San Francisco, CA
    3 days ago
  • About Cartesia Our mission is to architect AI that learns from and interacts with the...  ...in AI. Your Impact Conduct groundbreaking research in neural network architecture design to advance...  ...proven research track record in top‑tier ML/AI venues (e.g., NeurIPS, ICML, ICLR, CVPR... 
    Work at office
    Visa sponsorship
    Flexible hours

    Cartesia AI, Inc.

    San Francisco, CA
    2 days ago
  • A leading cybersecurity firm is seeking a Cybersecurity Researcher to drive advancements in threat analysis and vulnerability investigation. You will collaborate with AI/ML teams to enhance systems, delivering actionable research with a focus on clear results. The ideal... 
    Remote job
    Full time
    Work at office

    AI Cybersecurity Company

    San Francisco, CA
    6 days ago
  • A leading technology company is looking for candidates in Machine Learning and Applied Research. This role focuses on Responsible AI and Safety, collaborating with various stakeholders to shape policies and develop groundbreaking solutions. Ideal candidates will have experience... 

    Apple Inc.

    San Francisco, CA
    4 days ago
  • Requirements 8+ years of experience in AI/ML research or applied science, with a proven history of taking products from applied research to global‑scale production Ph.D. or Master’s degree in Computer Science, Artificial Intelligence, Math, Physics, or equivalent technical... 

    Workday

    San Francisco, CA
    6 days ago
  • $295k

     ...the safety, robustness, and reliability of AI models towards their deployment in the...  ...We work at the intersection of AI safety research and healthcare applications, aiming to create...  ...team stakeholders to integrate methods in core model training and launch safety improvements... 
    Work at office
    Relocation package

    OpenAI

    San Francisco, CA
    6 days ago
  •  ...Gravity Engineering Services Pvt Ltd. is seeking a Principal/Distinguished AI/ML Researcher in San Francisco, California. This role involves advancing AI capabilities, focusing on reasoning, planning, and adaptive decision-making systems. The ideal candidate will possess... 

    Gravity Engineering Services Pvt Ltd.

    San Francisco, CA
    4 days ago
  •  ...software that operationalizes responsible AI governance at scale. We're a 4-month-old AI...  ...re seeking a Principal AI Security & Risk Researcher to join our founding research team and...  ...cybersecurity, with 2+ years focused on AI/ML security, red teaming, or adversarial testing... 
    Part time
    Remote work
    Flexible hours

    Ciph Lab

    San Francisco, CA
    2 days ago
  •  ...Overview We are an applied AI lab building end-to-end software agents. We’re building collaborative AI teammates that enable engineers to focus on more interesting problems and empower engineering teams to strive for more ambitious goals. Team and Opportunity Our team... 

    Cognition Corp

    San Francisco, CA
    5 days ago
  • A pioneering AI governance company is looking for a Principal AI Security & Risk Researcher to join its remote-first team. This role offers the opportunity to build adaptive...  ...years in cybersecurity, especially focused on AI/ML, and a strong ability to translate technical... 
    Remote job

    Nashville Public Radio

    San Francisco, CA
    6 days ago
  •  ...services organization is seeking a Senior Distinguished Applied Researcher to lead AI innovation. In this role based in San Francisco, you will...  ...the opportunity to influence banking for good through AI and ML advancements.

    Capital One National Association

    San Francisco, CA
    3 days ago
  • An innovative educational technology company based in San Francisco is seeking a Machine Learning Researcher to enhance K-12 education through AI. This role combines advanced technical skills with a passion for improving teaching experiences. Responsibilities include leveraging... 
    Flexible hours

    Kiddom

    San Francisco, CA
    3 days ago
  • Join Workday as a Principal AI Researcher, leading the AI Research Team to redefine how LLMs function in enterprise environments. We seek a...  ...systems and drive innovative research at scale. Your expertise in AI/ML and commitment to ethical AI will guide impactful contributions... 

    Workday

    San Francisco, CA
    5 days ago
  • About Perplexity Perplexity is seeking top‑tier AI Research Scientists and Engineers to advance our AI products and capabilities. We’re building...  ...and expertise, you’ll work on one of three specialized teams: Core Research Team (Horizontal) : Focus on generating and improving... 

    Gravity Engineering Services Pvt Ltd.

    San Francisco, CA
    4 days ago
  • $360k

     ...Looking to apply world-class research to some of the most challenging problems in modern AI? A well-funded, venture-backed AI...  ...focused on advancing the company's core AI capabilities. You'll identify...  ...Programming skills in Python and modern ML frameworks Compensation: Up... 

    Saragossa

    San Francisco, CA
    1 day ago
  • $160k - $280k

     ...winning artists use Suno, but our core user base consists of everyday...  ...We are a team of musicians and AI experts, including alumni from...  ...for early members of our research team. You’ll work closely with...  ...and deploy our state of the art ML models trained with an H100/scientist... 
    Work at office
    Flexible hours

    Menlo Ventures

    San Francisco, CA
    4 days ago
  • $140k - $250k

     ...invasively. We apply deep learning research to large scale EEG datasets...  ...Learning Researcher to join our core R&D team. This role involves...  ...experts in neural decoding and AI, pushing the boundaries of what...  ...on the latest advancements in ML architectures (e.g., transformers... 
    Work from home
    Visa sponsorship

    Alljoined

    San Francisco, CA
    1 day ago
  • $169k - $250k

     ...translating designs into code, or iterating with AI. From idea to product, Figma empowers teams to streamline...  ...design and collaboration, join us! The Figma Research team is hiring a top-tier researcher to support our core product, Figma Design, and actively develop and... 
    Full time
    Temporary work
    Remote work
    Work from home

    Figma

    San Francisco, CA
    1 day ago
  • $160k - $250k

    Machine Learning Researcher, Audio Location: San Francisco, CA or Remote...  ...to empower enterprises to build AI phone agents at scale. Based in...  ...research and development across the core components of our voice stack:...  ...systems or telephony. PhD in ML, AI, or a related field, or... 
    Work at office
    Remote work

    Bland

    San Francisco, CA
    6 days ago
  • $350k

    A leading AI research company in San Francisco is hiring for a position on their Audio team. The role involves developing and training advanced audio models, optimizing performance, and working collaboratively across teams. Ideal candidates will have strong expertise in... 
    Flexible hours

    Menlo Ventures

    San Francisco, CA
    6 days ago
  •  ..., applications, processes, and AI into a single, governed platform...  ...continuous innovation at its core, Workato provides the trusted foundation...  ...looking for an exceptional AI Research Scientist to join our growing...  ...MS/PhD in Computer Science, ML, or related field—or equivalent... 
    Remote work
    Flexible hours

    Workato

    San Francisco, CA
    5 days ago
  •  ...writer.com is seeking a Staff AI Research Scientist based in San Francisco or New York City. This role focuses on leading high-impact research...  ...and mentor other researchers. Ideal candidates have extensive ML experience and a Ph.D. or equivalent in a related field.

    Writer Corporation

    San Francisco, CA
    1 day ago
  •  ...the world's leading enterprises orchestrate AI-powered work. Our vision is to expand...  ...future of work with AI. About the role AI research at WRITER isn't just about publishing papers...  ...culture What you need 7+ years of hands‑on ML research experience, with deep expertise in... 
    Full time
    Work at office
    Local area
    Flexible hours

    Writer Corporation

    San Francisco, CA
    2 days ago
