Sign up to access all features of our service.
  • Job search
  • Favorites
  • Create a CV
    New
  • Salaries
  • Subscriptions

Software Engineer, ML Systems & Training Architecture

$295k - $380k

OpenAI

Software Engineer, ML Systems & Training Architecture Robotics - San Francisco About the Team The OpenAI Robotics team is focused on unlocking general-purpose robotics and pushing towards AGI-level intelligence in dynamic, real-world settings. Working across the entire model stack, we integrate cutting-edge hardware and software to explore a broad range of robotic form factors. We strive to seamlessly blend high-level AI capabilities with the constraints of physical systems to improve peoples’ lives. About the Role As a Senior Software Engineer, ML Systems & Training Infrastructure, you will be a deeply hands‑on engineering force multiplier for the robotics team. You will help keep the training framework and surrounding infrastructure healthy, review and improve code quickly, debug failures across ML systems and infrastructure, and unblock researchers and engineers when the path from idea to working training job gets rough. We’re looking for people who love writing, reading, reviewing, and fixing code; who can get productive quickly in unfamiliar systems; and who bring strong practical judgment without a lot of ego or process overhead. This role will be based in San Francisco, CA and be expected in office 5 days per week and offer relocation assistance to new employees. In this role, you will: Review, improve, and clean up code across training frameworks and adjacent infrastructure. Identify risky or low-quality changes before they land, and raise the code quality bar without slowing the team down. Debug issues across ML training systems, GPUs, clusters, networking, and related infrastructure. Help researchers and engineers unblock broken training jobs, flaky workflows, and brittle internal tooling. Improve the reliability, maintainability, and usability of the robotics team’s training framework. Move quickly on practical engineering problems that directly affect team velocity. You might thrive in this role if you: Have strong software engineering fundamentals and excellent code review judgment. Have experience with ML systems, training frameworks, GPUs, distributed systems, infrastructure, or similarly complex technical environments. Read and debug unfamiliar codebases quickly, and enjoy getting to root cause. Ship high-quality code with strong velocity and pragmatic judgment. Are low-ego, responsive, and motivated by helping researchers and engineers move faster. Prefer being a highly effective hands‑on IC over driving broad process‑heavy initiatives. Have experience reviewing messy, fast‑moving, or AI-generated codebases. Compensation: $295K – $380K USD + equity. Equal Opportunity Employer OpenAI is an equal opportunity employer, and we do not discriminate on the basis of race, religion, color, national origin, sex, sexual orientation, age, veteran status, disability, genetic information, or other applicable legally protected characteristic. Background checks for applicants will be administered in accordance with applicable law, and qualified applicants with arrest or conviction records will be considered for employment consistent with those laws. #J-18808-Ljbffr

Vacancy posted 5 days ago
Similar jobs that could be interesting for youBased on the Software Engineer, ML Systems & Training Architecture in San Francisco, CA vacancy
  • $300k - $405k

     ...and steerable AI systems. We want AI to be...  ...committed researchers, engineers, policy experts,...  ...and reliably for training and serving frontier...  ...Work with our ML engineers to understand...  ...related low-level software engineering...  ...familiar with modern CPU architectures and memory systems... 
    Training
    Work at office
    Visa sponsorship
    Flexible hours

    Anthropic

    San Francisco, CA
    1 day ago
  • $146.5k

     ...About the team: The ML Data Engineering team powers metadata extraction...  ...of users worldwide. Our systems operate at massive scale, supporting...  ...We're seeking a Senior Software Engineer with deep...  ...sets; relevant education or training; and other business and organizational... 
    Training
    For contractors
    Local area
    Worldwide
    Home office
    Flexible hours

    Scribd

    San Francisco, CA
    1 day ago
  • $248.4k - $310.5k

     ...contributor building production systems for robotics data collection, model training pipelines, and...  ...vehicle datasets Build ML training and fine-tuning...  ...quality Collaborate with ML engineers and researchers to bring...  ...3+ years of software engineering experience in... 
    Training
    Full time

    Scale AI

    San Francisco, CA
    9 hours ago
  • $180k - $225k

    Software Engineer - Robotics & Autonomous Systems Scale's Robotics business unit is dedicated to solving the data bottleneck...  ...robotics data collection, model training pipelines, and evaluation...  ...autonomous vehicle datasets Build ML training and fine-tuning pipelines... 
    Training
    Full time

    Scale AI, Inc.

    San Francisco, CA
    4 days ago
  • $146.5k - $228k

     ...attitude. About the team: The ML Data Engineering team powers metadata...  ...millions of users worldwide. Our systems operate at massive scale,...  ...Overview: We’re seeking a Senior Software Engineer with deep...  ...sets; relevant education or training; and other business and organizational... 
    Training
    Temporary work
    Local area
    Worldwide
    Home office
    Flexible hours

    Scribd

    San Francisco, CA
    2 days ago
  • $380k

     ...data that enable our training and scaling efforts,...  ...optimization techniques, model architectures, and efficiency...  ...co-designing model-system interfaces with the...  ...We're looking for a Software Engineer focused on building and...  ...with embedding-based or ML-powered systems.... 
    Training

    OpenAI

    San Francisco, CA
    3 days ago
  • $147k - $211k

    Software Engineer, Agentic AI Systems, Cloud Security Google San Francisco, CA, USA Apply X Applicants in San...  ..., LLMs, Agentic development etc) or ML platform/infrastructure (e.g., model...  ..., and relevant education or training. Your recruiter can share more about... 
    Training
    Full time
    Worldwide

    Google Inc.

    San Francisco, CA
    1 day ago
  • Staff Software Engineer, ML Infra & Distributed Systems About the Role: As a Staff Software Engineer on the ML Infrastructure...  ...projects. This role grants architectural freedom to explore new...  ...Feast) Understanding of ML model training pipelines and model internals. Experience... 
    Training

    Tubi Tv

    San Francisco, CA
    2 days ago
  •  ...world-class scientists, ML researchers, and engineers to work together to...  ...frontier of model architectures for AI x Chemistry:...  ...of machine learning systems architecture and distributed...  ...data generation, training, and evaluations for...  ...systems design and software architecture.... 
    Training
    Work at office

    Achira

    San Francisco, CA
    9 hours ago
  •  ...Luma AI Infrastructure Engineer Luma's mission is...  ...aware, capable and useful systems, the next step...  ...So we are working on training and scaling up multimodal...  ...and integrate new model architectures from our research team...  ...performance, large-scale ML systems (managing ~1... 
    Training

    Luma AI

    San Francisco, CA
    2 days ago
  •  ...pioneering the model architectures that will make this possible...  ...a new primitive for training efficient, large-...  ...model innovation and systems engineering paired with a design‑...  ...we’re looking for a Software Engineer to help...  ...the training data and ML data infrastructure at... 
    Training
    Work at office
    Visa sponsorship
    Flexible hours

    Cartesia, Inc.

    San Francisco, CA
    1 day ago
  • $180k - $250k

     ...Staff Software Engineer, ML Performance & Systems San Francisco fal is the generative media ecosystem powering the next generation of AI products...  ...and implement novel approaches to model serving architecture on top of our in-house inference engine, focusing on... 
    Currently hiring
    Relocation package

    fal

    San Francisco, CA
    20 days ago
  • $218.4k - $365.2k

     .... Job Category Software Engineering Job Details About...  ...the most critical architectural initiatives for Spiff...  ...high-scale, agentic systems that move beyond static...  .... Experience with ML/AI model deployment and...  ...promotion, benefits, training, assessment of job... 
    Training
    Contract work
    Flexible hours

    Salesforce.Com Inc

    San Francisco, CA
    2 days ago
  • $218.4k - $365.2k

     ...Management (ICM) software that drives commissions...  .... As a Software Engineering Architect...  ...the most critical architectural initiatives for Spiff...  ...-scale, agentic systems that move beyond...  ....Experience with ML/AI model deployment...  ..., benefits, training, assessment of job... 
    Training
    Contract work
    Flexible hours

    Salesforce

    San Francisco, CA
    4 days ago
  • $230k - $385k

     ...integrate cutting-edge hardware and software to explore a broad range of...  ...the constraints of physical systems to improve peoples' lives....  ...the Role As a Software Engineer, Distributed Data Systems, you...  ...large-scale multimodal training and evaluation at OpenAI. You... 
    Training
    Work at office
    Relocation package

    OpenAI

    San Francisco, CA
    3 days ago
  •  ...AI/ML Engineer (RL & Physical Systems) FLUIX is building the AI Operating System for data centers. We deploy...  ...environments to accelerate training, testing, and Sim2Real deployment....  ...meet. Collaborate with controls, software, and field engineering teams to integrate... 
    Training
    Weekend work

    Fluix AI

    San Francisco, CA
    3 days ago
  • $192k - $260k

     ...BI, and all the way up to ML/AI with a unified platform...  ...believe the data warehouse architecture as we know it today will...  ...generation (decoupled) query engine and structured storage system that can outperform...  ...relevant certifications and training, and specific work location... 
    Training
    Local area
    Worldwide

    Databricks

    San Francisco, CA
    5 days ago
  • $255k - $405k

     ...aligned with our mission of broad societal benefit. About the Role As a Software Engineer, Distributed Data Systems, you will design and scale the infrastructure that powers large‑scale multimodal training and evaluation at OpenAI. You’ll manage distributed data pipelines,... 
    Training
    Full time
    Work at office
    Local area
    Relocation package
    Flexible hours

    Slope

    San Francisco, CA
    1 day ago
  •  ...seeking a Machine Learning Platform Engineer to help build scalable systems that support model training for their Machine Learning...  ...productivity of data scientists and ML engineers. The ideal candidate...  ...grasp of ML concepts and software development experience. Responsibilities... 
    Training

    CVFine by Instrovate Technologies

    San Francisco, CA
    2 days ago
  •  ...sites today. Backed by Accel. Our system runs a Multi Agent Action Expert architecture: classical precision algorithms orchestrated...  ...1: from data collection and model training through edge deployment on Jetson...  .... BS/MS/PhD in CS, Robotics, ML, or equivalent experience shipping... 
    Training

    Origin

    San Francisco, CA
    4 days ago
  •  ...Applied AI Engineer Valthos Inc. Valthos...  ...We build and deploy software and biological AI systems to safeguard...  ...The same AI architectures that enable self-driving...  ...applied biological ML engineers from MIT's...  ...including adapting and post-training biological frontier... 
    Training
    Work at office

    Valthos

    San Francisco, CA
    3 days ago
  • $125k - $195k

     ...exceptional, hands-on engineers to make this happen. Mechanical...  ...stack from atoms to architecture. Our team is...  ...Team The Fab Software team builds the product...  ...monitoring and controlling systems in real time,...  ...process driven by AI and ML orchestration. About... 
    Work at office
    Visa sponsorship
    Night shift

    Atomic Semi

    San Francisco, CA
    5 days ago
  • $218.4k - $365.2k

     ...Job Category Software Engineering Job Details About...  ...help shape how agentic systems are built, deployed,...  ...customers, and setting architectural direction for how enterprise...  ...) Background in ML infrastructure, data...  ...promotion, benefits, training, assessment of job... 
    Training

    Salesforce.Com Inc

    San Francisco, CA
    6 days ago
  •  ...Application Security Engineer at vCluster Labs,...  ...our multi-tenant architecture. Threat Modeling:...  .... Developer training : Make complex topics...  ...built for running AI, ML, and GPU-intensive...  ...for DGX systems. Benefits We offer...  ...We don't just ship software; we define the state... 
    Training
    Remote work
    Flexible hours
    Shift work

    vCluster Labs

    San Francisco, CA
    3 days ago
  • $212.5k - $250k

     ...partners through world‑class software, purpose‑built for everyone in...  .... Ink is Carta’s Design System – the underlying foundation for...  .... As a Senior Design Systems Engineer, you’ll work to: Architect and...  ...improve animation fluidity, CSS architecture, and system modularity,... 
    Full time
    Work at office

    Menlo Ventures

    San Francisco, CA
    1 day ago
  •  ...native financial operating system for health systems,...  ...partner closely with engineers and leadership to...  ...unaddressed. From multi-tenant architecture and security...  ...for a systems-minded software engineer who cares deeply...  ...computing, storage, and ML-enabled applications as... 
    Contract work

    MidStream PA

    San Francisco, CA
    5 days ago
  • $350k

     ...Software Engineer, Systems Generalist Thinking Machines Lab's mission is to empower humanity through...  ...Infrastructure: We support teams that train, research, and ultimately serve AI models...  ...performance profiling. Familiarity with GPU/ML workflows or large‑scale data/eval... 
    Local area
    Immediate start
    Visa sponsorship
    Work visa
    Relocation package
    Flexible hours

    Thinking Machines Lab

    San Francisco, CA
    2 days ago
  • $122k - $209k

     ...industries by unleashing the power of software and data. We enable organizations...  ...a highly scalable yet observable system for customers and engineers. The Atlas Search product is quickly...  ...team's roadmap and help determine the architecture of our system Success measures:... 
    Work at office
    Local area
    Worldwide
    Flexible hours

    MongoDB

    San Francisco, CA
    3 days ago
  • $320k

     ...interpretable, and steerable AI systems. We want AI to be safe and...  ...group of committed researchers, engineers, policy experts, and business...  ...for a systems-oriented Software Engineer to push the performance...  ...equivalent combination of education, training, and/or experience... 
    Training
    Work experience placement
    Work at office
    Visa sponsorship
    Flexible hours

    Anthropic

    San Francisco, CA
    1 day ago
  •  ...evaluating and deploying AI systems. Our mission is to help enterprises...  ...role We're looking for a software engineer who loves to build high...  ...analyze in real-time. Our unique architecture allows them to store this...  ...expertise in database and ML systems, and you'll be empowered... 
    Flexible hours

    Brain Trust Inc

    San Francisco, CA
    5 days ago

Do you want to receive more vacancies?

Subscribe and receive similar vacancies to Software Engineer, ML Systems & Training Architecture. Be the first to apply!