Sign up to access all features of our service.
  • Job search
  • Favorites
  • Create a CV
    New
  • Salaries
  • Subscriptions

Technical Lead Manager - Training Runtime, Data(set) Movement

$295k

OpenAI

About the Team Training Runtime builds the distributed systems that power OpenAI's largest model training runs - most recently GPT-5.5! The Data Movement area owns the infrastructure that keeps training jobs supplied with the right data at the right time, and keeps model state moving safely and efficiently across large clusters. Our work spans machine-learning systems, distributed storage, high-throughput data loading, reliability engineering, and developer experience. Success means researchers can move quickly while training runs remain fast, reproducible, debuggable, and resilient at scale. About the Role We are looking for a deeply hands‑on Technical Lead Manager to own datasets throughout our training infrastructure. This person will set the direction for how training jobs read data: the APIs, storage contracts, versioning model, benchmarks, debugging tools, and reliability guarantees that make data access consistent across current and future training frameworks. You will begin as the primary technical owner for dataset reads, working directly in the code while aligning researchers, training framework owners, storage teams, and infrastructure partners around a durable platform. The problem is deceptively hard at frontier scale: make enormous, heterogeneous datasets easy to consume, correct across distributed workers, observable when something goes wrong, and flexible enough to support pretraining, reinforcement learning, and multimodal training. In this role, you will Design and build a unified dataset read platform for multiple current and future training frameworks. Define dataset APIs, storage-format expectations, registration/versioning, and migration paths that make data access reproducible and maintainable. Build reliability into the read path, including stateful iteration, caching, fast restart, recovery, and clear operational contracts. Build terminal and web-based visualizers that let teams inspect text, multimodal, and reinforcement learning data late in the pipeline, where bugs are most visible. Write and review production code in core data loading, service, caching, and reliability paths. Partner with teams working on training frameworks, reinforcement learning, multimodal models, storage, runtime, and cluster infrastructure. Over Time The long‑term goal is a team that owns fast, correct, scalable, and reliable in‑cluster data movement for training: data that comes in, data that goes out, and data that moves around inside the cluster. After ramping on datasets, this role will expand to TLM ownership for broader data movement systems, including checkpoint loads/saves and snapshot transfers, while partnering closely with existing technical leads and adjacent infrastructure teams. You might thrive in this role if you: Have built or owned dataset, data loading, storage, or distributed training infrastructure at large scale (e.g. torch.utils.data). Care equally about API design, debugging ergonomics, performance, and bit‑level correctness. Understand the failure modes of large distributed training jobs and know how data systems can create or prevent them. Have experience with stateful iterators, checkpoint/restart semantics, caching, remote services, or high‑throughput storage reads. Are comfortable working across Python and lower‑level systems code; Rust or C++ experience is useful but not required. Have worked with multimodal, video, reinforcement learning, or pretraining data pipelines where small data bugs are expensive and hard to diagnose. Can lead through code and technical judgment before a team exists, and can later manage engineers without losing the hands‑on edge. Obsess over developer experience by eliminating friction, such as manual preprocessing scripts and niche cluster‑specific bugs, ensuring a reliable and efficient experience for researchers. About OpenAI OpenAI is an AI research and deployment company dedicated to ensuring that general‑purpose artificial intelligence benefits all of humanity. We push the boundaries of the capabilities of AI systems and seek to safely deploy them to the world through our products. AI is an extremely powerful tool that must be created with safety and human needs at its core, and to achieve our mission, we must encompass and value the many different perspectives, voices, and experiences that form the full spectrum of humanity. We are an equal opportunity employer, and we do not discriminate on the basis of race, religion, color, national origin, sex, sexual orientation, age, veteran status, disability, genetic information, or other applicable legally protected characteristic. For additional information, please see OpenAI's affirmative action and equal employment opportunity policy statement. Background checks for applicants will be administered in accordance with applicable law, and qualified applicants with arrest or conviction records will be considered for employment consistent with those laws, including the San Francisco Fair Chance Ordinance, the Los Angeles County Fair Chance Ordinance for Employers, and the California Fair Chance Act, for US‑based candidates. For unincorporated Los Angeles County workers: we reasonably believe that criminal history may have a direct, adverse and negative relationship with the following job duties, potentially resulting in the withdrawal of a conditional offer of employment: protect computer hardware entrusted to you from theft, loss or damage; return all computer hardware in your possession (including the data contained therein) upon termination of employment or end of assignment; and maintain the confidentiality of proprietary, confidential, and non‑public information. In addition, job duties require access to secure and protected information technology systems and related data security obligations. To notify OpenAI that you believe this job posting is non‑compliant, please submit a report through this form. No response will be provided to inquiries unrelated to job posting compliance. We are committed to providing reasonable accommodations to applicants with disabilities, and requests can be made via this link. OpenAI Global Applicant Privacy Policy At OpenAI, we believe artificial intelligence has the potential to help people solve immense global challenges, and we want the upside of AI to be widely shared. Join us in shaping the future of technology. Compensation Range: $295K - $445K #J-18808-Ljbffr

Vacancy posted 1 day ago
Similar jobs that could be interesting for youBased on the Technical Lead Manager - Training Runtime, Data(set) Movement in San Francisco, CA vacancy
  • $295k

     ...OpenAI is seeking a Technical Lead Manager to own dataset management in training infrastructure. This role involves designing a unified platform for data access and building tools for debugging and visualization. Ideal candidates have experience with distributed training... 
    Data
    Training

    OpenAI

    San Francisco, CA
    4 days ago
  •  ...upskilling, from freelance AI training gigs to first...  ...This unique value is leading to unparalleled growth...  ...problems around human data, evals, and AI systems...  ...scale. As a Tech Lead Manager, you're first and foremost...  ...a builder. You'll set technical direction for a small... 
    Data
    Training
    Full time
    Freelance
    Internship
    Work at office
    Remote work
    Flexible hours

    AI Chopping Block, Inc.

    San Francisco, CA
    4 days ago
  • $177k - $256.5k

     ...As the Sr. Staff Technical Lead and People Manager for the Identity Services team, you...  ...identity graph to ensure input data is paired with the most...  ...planning and priority setting (StackRank), helping to define...  ...in recruiting, hiring, training, promotion or other employment... 
    Data
    Training
    Work at office
    Work from home
    Flexible hours

    Dormont Manufacturing Company

    San Francisco, CA
    5 days ago
  •  ...Staff Technical Lead Manager, ML Sensor Validation Mar 02, 2026 Waymo is an autonomous driving...  ...to millions of miles of driving data from a diverse set of sensors, enabling researchers like...  ...to automatically generate suitable training data for rare instance sensor degradation... 
    Data
    Training
    Full time
    Work at office
    Immediate start
    Remote work

    Waymo

    San Francisco, CA
    4 days ago
  •  ...access to millions of miles of driving data from a diverse set of sensors, enabling engineers like...  ..., to(2) develop models and model training at scale, to(3) analyze real-world...  ...role you will report to a Sr Staff Technical Lead Manager. Responsibilities Lead the multi-... 
    Data
    Training
    Local area
    Remote work

    Waymo

    San Francisco, CA
    4 days ago
  • $199k - $241k

     ...About The Role Veho’s Data Science team is...  .... What You’ll Do Lead and grow a team of...  ...downstream systems. Set the roadmap for...  ...every project is technically sound and lessons...  ...platform experience: training and serving infrastructure...  ...experience managing impactful, high... 
    Data
    Training
    Full time
    Temporary work
    H1b

    Veho

    San Francisco, CA
    4 days ago
  •  ...Computer Vision , 3+ year in managing ML engineering or research...  ...of miles of driving data from a diverse set of sensors, enabling engineers...  ...develop models and model training at scale, to (3) analyze real...  ...will report to a Sr Staff Technical Lead Manager , Own object... 
    Data
    Training
    Temporary work

    Waymo

    San Francisco, CA
    2 days ago
  •  ...safely. The Trust & Safety Data Engineering team builds the...  ...About the Role We are hiring a Technical Lead Manager to lead and grow the Trust &...  ...role for someone who can set strategy, shape data architecture...  ...including features, labels, training data, backtesting,... 
    Data
    Training
    Relocation package

    United States Digital Space LLC

    San Francisco, CA
    4 days ago
  • $248.8k - $311k

     ...Scale AI is the data engine for the entire AI industry. Our mission is to accelerate...  ...of automation. Role Overview As the Technical Lead Manager (TLM) for the Physical AI team of...  ...best utilize massive datasets for pre‑training and fine‑tuning generalist policies. VLA... 
    Data
    Training
    Full time

    Scale AI

    San Francisco, CA
    2 days ago
  • $238k - $302k

     ...Technical Lead Manager (TLM), ML Simulation Waymo is an autonomous driving technology company with...  ...large-scale machine learning and data systems, simulation workflows, and insight...  ...machine learning models to deliver training and evaluation data for the Waymo driver... 
    Data
    Training
    Full time
    Remote work

    Waymo

    San Francisco, CA
    5 days ago
  • $251k - $310k

     ...Technical Lead Manager, Perception, Vehicle Understanding Waymo is an autonomous driving technology...  ...models, large-scale 3rd party data, and partner teams in Research, Oracles...  ...the team's key pipelines (e.g., data, training, evaluation, onboard) for high performance... 
    Data
    Training
    Full time
    Temporary work
    Immediate start
    Remote work

    Waymo

    San Francisco, CA
    3 days ago
  • $255k - $345k

     ...intellectually curious, deeply technical leaders eager to shape...  ...ML at Whatnot. You’ll lead the development and...  ...to distributed training and high‑throughput GPU...  ...GPUs and both model and data parallelism. Optimize system performance by managing resource utilization and... 
    Data
    Training
    Work experience placement
    Work at office
    Local area
    Remote work
    Work from home
    Home office

    Whatnot

    San Francisco, CA
    5 days ago
  • $251k - $310k

     ...Staff Technical Lead Manager, Planner Reasoning Waymo is an autonomous driving technology company...  ...evaluation techniques to drive data driven development Work closely with...  ...exact work location, experience, relevant training and education, and skill level. Your recruiter... 
    Data
    Training
    Full time
    Remote work

    Waymo

    San Francisco, CA
    5 days ago
  • $290k - $365k

     ...Technical Program Manager, Inference Performance About Anthropic...  ...across inference runtime and accelerator...  ...& Coordination : Lead cross‑functional...  ...and using data to drive technical...  ...combination of education, training, and/or...  ...with colleagues. As set forth in Anthropic... 
    Data
    Training
    Work at office
    Visa sponsorship
    Flexible hours

    Anthropic

    San Francisco, CA
    5 days ago
  • $225k - $325k

     ...the fastest-growing AI data business in history....  ...various data-intensive post-training techniques. We believe...  ...partners. FDEs are technical builders: they ship...  ...technical teams. As a Tech Lead Manager, Forward Deployed...  ...Deployed Engineers — set expectations, unblock,... 
    Data
    Training
    Full time
    Work at office
    Remote work
    Flexible hours

    Handshake

    San Francisco, CA
    20 days ago
  •  ...2+ years of experience managing medium-size teams Track...  ...research or applied research Technical experience working with...  ...as pretraining, mid-training, post-training,...  ...to a Research Director Lead and manage a team to build...  ...applications for the Waymo Driver Set the technical direction... 
    Training

    Waymo

    San Francisco, CA
    4 days ago
  •  ...years of engineering management experience of a full stack...  ...attitude; comfortable leading projects or learning...  ..., permissions, and technical complexity all matter...  ...industry’s broadest range of data: enterprise and world,...  ..., governance, search settings, AI features, and... 
    Data

    Glean.info

    San Francisco, CA
    4 days ago
  •  ...Engineering, or a related technical field 5+ years of...  ...as a Technical Program Manager in a software engineering...  ...machine learning models, training pipelines, or...  ...modeling using sensor data (Desirable) Master\'s degree...  ...strategic planning, objective setting, and technical roadmap... 
    Data
    Training

    Waymo

    San Francisco, CA
    5 days ago
  • $117k - $150k

     ...Responsibilities Build and scale technical enablement programs: Design and manage global technical...  ...platform capabilities into training, labs, and technical...  ...technical enablement sessions. Data‑driven mindset with the...  ...Postman offers a comprehensive set of benefits, including... 
    Data
    Training
    Flexible hours

    Postman

    San Francisco, CA
    4 days ago
  •  ...the Role As a tech lead, you will be...  ...practice include: Setting north star goals and milestones...  ...teams to ensure different technical approaches work...  ...such as RLHF, adversarial training, robustness, and more....  ...possession (including the data contained therein) upon... 
    Data
    Training
    Work at office
    Relocation package

    OpenAI

    San Francisco, CA
    3 days ago
  • $140k - $170k

     ...breakthrough AI models at leading research labs and...  ...been pioneering data‑centric approaches...  ...high‑quality training data at scale Frontier...  ...contributions. Technical Excellence : Work...  ...Technical Program Manager owns the day‑to‑day...  ...of a project: setting it up, bringing data... 
    Data
    Training
    For contractors
    Work at office
    Flexible hours
    Shift work
    3 days per week

    Labelbox

    San Francisco, CA
    4 days ago
  •  ...richest and most complex data type in the world....  ...top of it. They are the technical center of the platform:...  ...You will own both. You set the strategy and roadmap...  ...evaluation, release cadence and management, ranking quality, the...  ...quality: eval rubrics, training data investments,... 
    Data
    Training
    Work at office
    Worldwide
    Flexible hours
    Day shift
    2 days per week
    Weekday work

    Twelve Labs Inc.

    San Francisco, CA
    2 days ago
  • $160k - $225k

     ...passionate about enabling data teams to solve the...  ...their business. Training and customizing state...  ...Mosaic AI mission. AI Runtime (AIR) is our managed platform for large‑scale...  ...will help shape the technical direction for AIR, mentor...  ...training jobs. Lead end‑to‑end engineering... 
    Data
    Training

    Cacheflow

    San Francisco, CA
    18 hours ago
  • $140k - $170k

     ...breakthrough AI models at leading research labs and...  ...been pioneering data-centric approaches...  ...high‑quality training data at scale Frontier...  ...contributions. Technical Excellence : Work...  ...the Role The TPM Manager leads and grows...  ...of each project — set up, bringing data... 
    Data
    Training
    For contractors
    Work at office
    Flexible hours
    Shift work
    3 days per week

    Labelbox

    San Francisco, CA
    4 days ago
  • $332k - $421k

     ...deeply and solve complex technical challenges in areas...  ...and valuable piece of data that can be leveraged into...  ...(auto-)labeling, model training and evaluation, all the...  ...lifecycle. The Area Technical Lead for the Waymo machine...  ...technical teams and setting technical directions in... 
    Data
    Training
    Full time
    Remote work

    Waymo

    San Francisco, CA
    4 days ago
  • $250k - $300k

     ...accessing the industry’s broadest range of data: enterprise and world, structured and...  ...company. About the Role The Tech Lead Manager of the Agentic Runtime team builds the low‑latency, reliable...  ..., age, disability, or race. As set forth in Glean’s Equal Employment Opportunity... 
    Data
    Home office
    Flexible hours

    aijoblist

    San Francisco, CA
    5 days ago
  • $224k - $336k

     ...quality, purpose-built AI data. We're looking for a Senior or Principal Technical Program Manager to join our AI...  ...that data into the model training pipeline. You don't need...  ...organizations; set up the organizational...  ...field. Proven experience leading cross-functional programs... 
    Data
    Training
    Full time
    Work at office
    Local area
    Flexible hours

    Lila Sciences

    San Francisco, CA
    4 days ago
  • $214k - $230k

    ## Cloud Data Technical Platform LeadApplylocations: San Francisco...  ...: R0000612Develop and manage the modernization of...  ...a hybrid cloud model; leading team activities...  ...Data technologies; 3) setting up and maintaining cloud...  ...recruitment, selection, training, promotion, transfer,... 
    Data
    Training
    Work experience placement
    Work at office
    Remote work
    1 day per week

    Cox Worldwide Funds plc

    San Francisco, CA
    18 hours ago
  • $192k - $306k

     ...Senior / Principal Technical Program Manager, Life Sciences AI Cambridge...  ...experimental data — that power automated...  ...organizations (model training, experimental science...  ...product, and leadership); set up the organizational...  ...~ Proven experience leading cross-functional... 
    Data
    Training
    Full time
    Work at office
    Local area
    Flexible hours

    Lila Sciences

    San Francisco, CA
    1 day ago
  • $120k - $150k

     ...Technical Customer Success Manager, AI & Ops Tread is the AI-native operating system...  ..., and real P&L against our data every day. Tread is the most...  ...a bug, a workflow issue, a training gap, and a product gap. ~...  ...location, experience, skill set, and alignment with the role... 
    Data
    Training
    For contractors

    Higher People

    San Francisco, CA
    2 days ago

Do you want to receive more vacancies?

Subscribe and receive similar vacancies to Technical Lead Manager - Training Runtime, Data(set) Movement. Be the first to apply!