Software Engineer, ML Systems & Training Architecture
$295k - $380kSlope
About the Team The OpenAI Robotics team is focused on unlocking general‑purpose robotics and pushing towards AGI‑level intelligence in dynamic, real‑world settings. Working across the entire model stack, we integrate cutting‑edge hardware and software to explore a broad range of robotic form factors. We strive to seamlessly blend high‑level AI capabilities with the constraints of physical systems to improve peoples' lives. About the Role As a Senior Software Engineer, ML Systems & Training Infrastructure, you will be a deeply hands‑on engineering force multiplier for the robotics team. You will help keep the training framework and surrounding infrastructure healthy, review and improve code quickly, debug failures across ML systems and infrastructure, and unblock researchers and engineers when the path from idea to working training job gets rough. We’re looking for people who love writing, reading, reviewing, and fixing code; who can get productive quickly in unfamiliar systems; and who bring strong practical judgment without a lot of ego or process overhead. This role will be based in San Francisco, CA and be expected in office 5 days per week and offer relocation assistance to new employees. In this role, you will: Review, improve, and clean up code across training frameworks and adjacent infrastructure. Identify risky or low‑quality changes before they land, and raise the code quality bar without slowing the team down. Debug issues across ML training systems, GPUs, clusters, networking, and related infrastructure. Help researchers and engineers unblock broken training jobs, flaky workflows, and brittle internal tooling. Improve the reliability, maintainability, and usability of the robotics team's training framework. Move quickly on practical engineering problems that directly affect team velocity. You might thrive in this role if you: Have strong software engineering fundamentals and excellent code review judgment. Have experience with ML systems, training frameworks, GPUs, distributed systems, infrastructure, or similarly complex technical environments. Read and debug unfamiliar codebases quickly, and enjoy getting to root cause. Ship high‑quality code with strong velocity and pragmatic judgment. Are low‑ego, responsive, and motivated by helping researchers and engineers move faster. Prefer being a highly effective hands‑on IC over driving broad process‑heavy initiatives. Have experience reviewing messy, fast‑moving, or AI‑generated codebases. Compensation Range
$295K - $380K USD
About OpenAI OpenAI is an AI research and deployment company dedicated to ensuring that general‑purpose artificial intelligence benefits all of humanity. We push the boundaries of the capabilities of AI systems and seek to safely deploy them to the world through our products. AI is an extremely powerful tool that must be created with safety and human needs at its core, and to achieve our mission, we must encompass and value the many different perspectives, voices, and experiences that form the full spectrum of humanity. We are an equal opportunity employer, and we do not discriminate on the basis of race, religion, color, national origin, sex, sexual orientation, age, veteran status, disability, genetic information, or other applicable legally protected characteristic. For additional information, please see OpenAI's affirmative action and equal employment opportunity policy statement. Background checks for applicants will be administered in accordance with applicable law, and qualified applicants with arrest or conviction records will be considered for employment consistent with those laws, including the San Francisco Fair Chance Ordinance, the Los Angeles County Fair Chance Ordinance for Employers, and the California Fair Chance Act, for US‑based candidates. For unincorporated Los Angeles County workers: we reasonably believe that criminal history may have a direct, adverse and negative relationship with the following job duties, potentially resulting in the withdrawal of a conditional offer of employment: protect computer hardware entrusted to you from theft, loss or damage; return all computer hardware in your possession (including the data contained therein) upon termination of employment or end of assignment; and maintain the confidentiality of proprietary, confidential, and non‑public information. In addition, job duties require access to secure and protected information technology systems and related data security obligations. To notify OpenAI that you believe this job posting is non‑compliant, please submit a report through this form. No response will be provided to inquiries unrelated to job posting compliance. We are committed to providing reasonable accommodations to applicants with disabilities, and requests can be made via this link. OpenAI Global Applicant Privacy Policy At OpenAI, we believe artificial intelligence has the potential to help people solve immense global challenges, and we want the upside of AI to be widely shared. Join us in shaping the future of technology. #J-18808-Ljbffr Slope$166k - $225k
...business. Founded by engineers — and customer obsessed... ...SQL query engines. As a software engineer on the... ...storage and processing systems that can outperform specialized... ...data engineering architecture. Delta Pipelines : It'... ...relevant certifications and training, and specific work...TrainingLocal areaWorldwide$140k - $225k
...embedding AI directly into architectural workflows via Autodesk... .... What You'll Own Agent engineering across context design,... ...and drafting generation systems for documentation and drawing... ...Is NOT For Traditional ML researchers focused on model training only Pure computer...TrainingFull timeH1bWork at officeVisa sponsorship$300k - $405k
...and steerable AI systems. We want AI to be... ...committed researchers, engineers, policy experts,... ...and reliably for training and serving frontier... ...Work with our ML engineers to understand... ...related low-level software engineering... ...familiar with modern CPU architectures and memory systems...TrainingWork at officeVisa sponsorshipFlexible hours$180k - $225k
...Software Engineer - Robotics & Autonomous Systems Scale's Robotics business unit is dedicated to solving the data... ...for robotics data collection, model training pipelines, and evaluation... ...autonomous vehicle datasets Build ML training and fine-tuning pipelines...TrainingFull time$146.5k
...About the team: The ML Data Engineering team powers metadata extraction... ...of users worldwide. Our systems operate at massive scale, supporting... ...We're seeking a Senior Software Engineer with deep... ...sets; relevant education or training; and other business and organizational...TrainingFor contractorsLocal areaWorldwideHome officeFlexible hours$146.5k - $228k
...attitude. About the team: The ML Data Engineering team powers metadata... ...millions of users worldwide. Our systems operate at massive scale,... ...Overview: We’re seeking a Senior Software Engineer with deep... ...sets; relevant education or training; and other business and organizational...TrainingTemporary workLocal areaWorldwideHome officeFlexible hours$248.4k - $310.5k
...contributor building production systems for robotics data collection, model training pipelines, and... ...vehicle datasets Build ML training and fine-tuning... ...quality Collaborate with ML engineers and researchers to bring... ...3+ years of software engineering experience in...TrainingFull time$380k
...data that enable our training and scaling efforts,... ...optimization techniques, model architectures, and efficiency... ...co-designing model-system interfaces with the... ...We're looking for a Software Engineer focused on building and... ...with embedding-based or ML-powered systems....Training$147k - $211k
Software Engineer, Agentic AI Systems, Cloud Security Google San Francisco, CA, USA Apply X Applicants in San... ..., LLMs, Agentic development etc) or ML platform/infrastructure (e.g., model... ..., and relevant education or training. Your recruiter can share more about...TrainingFull timeWorldwide- Staff Software Engineer, ML Infra & Distributed Systems About the Role: As a Staff Software Engineer on the ML Infrastructure... ...projects. This role grants architectural freedom to explore new... ...Feast) Understanding of ML model training pipelines and model internals. Experience...Training
$213k - $263k
...safely and efficiently. The system architecture team handles the onboard... ...real-world problems with ML and engineering solutions. Use state of... ..., ACL, or EMNLP Prior software development or ML research... ..., experience, relevant training and education, and skill level...TrainingFull timeContract workInternshipRemote work$166k - $225k
Senior Software Engineer - Database Engine Internals P-97 Our... ...and all the way up to ML/AI with a unified platform... ...the data warehouse architecture as we know it today will... ...structured storage system that can outperform specialized... ...certifications and training, and specific work...TrainingPermanent employmentContract workFor contractorsFor subcontractorWork at officeLocal areaWorldwideRelocationWork visa- ...world-class scientists, ML researchers, and engineers to work together to... ...frontier of model architectures for AI x Chemistry:... ...of machine learning systems architecture and distributed... ...data generation, training, and evaluations for... ...systems design and software architecture....TrainingWork at office
- ...Luma AI Infrastructure Engineer Luma's mission is... ...aware, capable and useful systems, the next step... ...So we are working on training and scaling up multimodal... ...and integrate new model architectures from our research team... ...performance, large-scale ML systems (managing ~1...Training
$218.4k - $365.2k
.... Job Category Software Engineering Job Details About... ...the most critical architectural initiatives for Spiff... ...high-scale, agentic systems that move beyond static... .... Experience with ML/AI model deployment and... ...promotion, benefits, training, assessment of job...TrainingContract workFlexible hours$180k - $250k
...Staff Software Engineer, ML Performance & Systems San Francisco fal is the generative media ecosystem powering the next generation of AI products... ...and implement novel approaches to model serving architecture on top of our in-house inference engine, focusing on...Currently hiringRelocation package- ...pioneering the model architectures that will make this possible... ...a new primitive for training efficient, large-... ...model innovation and systems engineering paired with a design‑... ...we’re looking for a Software Engineer to help... ...the training data and ML data infrastructure at...TrainingWork at officeVisa sponsorshipFlexible hours
$245k - $385k
...framework components to power our ML training systems. We work on building... ...As a Distributed Systems engineer, you will work to deliver powerful... ...of our training systems architectures. This role is based in... ...burden Have strong software engineering skills and are...TrainingWork at officeLocal areaRelocation package$170k - $216k
...15+ U.S. states. Software Engineering builds the brains of... ...collaborating with hardware and systems engineers. If you’re... ...different hardware architectures Be core to Waymo’s... ...systems Deploy ML systems in new areas... ...experience, relevant training and education, and...TrainingFull timeWork experience placementRemote work$218.4k - $365.2k
...Management (ICM) software that drives commissions... .... As a Software Engineering Architect... ...the most critical architectural initiatives for Spiff... ...-scale, agentic systems that move beyond... ....Experience with ML/AI model deployment... ..., benefits, training, assessment of job...TrainingContract workFlexible hours$230k - $385k
...integrate cutting-edge hardware and software to explore a broad range of... ...the constraints of physical systems to improve peoples' lives.... ...the Role As a Software Engineer, Distributed Data Systems, you... ...large-scale multimodal training and evaluation at OpenAI. You...TrainingWork at officeRelocation package- ...AI/ML Engineer (RL & Physical Systems) FLUIX is building the AI Operating System for data centers. We deploy... ...environments to accelerate training, testing, and Sim2Real deployment.... ...meet. Collaborate with controls, software, and field engineering teams to integrate...TrainingWeekend work
$102.5k - $187.9k
...AI Finance - Front-end Software Engineer - Senior EY.AI Finance... ...grounded in rigorous ML, and scenario planning... ...a custom multi-agent architecture. The opportunity This... ...and deliver effective system architecture solutions... ...analysis and delivering training. Ability to manage...TrainingSummer holidayFlexible hours- ...Experienced backend engineer. 5-7+ years of professional software engineering experience... ...backend and distributed systems; you’ve led complex... ...) Prior experience training and deploying your own ML models (including RL... ...to‑end, contribute to architecture for high‑impact AI features...TrainingWork at officeLocal area
$192k - $260k
...BI, and all the way up to ML/AI with a unified platform... ...believe the data warehouse architecture as we know it today will... ...generation (decoupled) query engine and structured storage system that can outperform... ...relevant certifications and training, and specific work location...TrainingLocal areaWorldwide$190.9k - $232.8k
...Role As a staff software engineer for GenAI inference, you will lead the architecture, development, and optimization... ...and orchestration systems. What You Will Do... ...Deep understanding of ML inference internals:... ...relevant certifications and training, and specific work...TrainingLocal areaWorldwide$255k - $405k
...aligned with our mission of broad societal benefit. About the Role As a Software Engineer, Distributed Data Systems, you will design and scale the infrastructure that powers large‑scale multimodal training and evaluation at OpenAI. You’ll manage distributed data pipelines,...TrainingFull timeWork at officeLocal areaRelocation packageFlexible hours- ...seeks a candidate to own the intelligence layer for a multi-agent system. Responsibilities include designing and improving the system... ...reliability. Ideal candidates will have experience with model training in production and a strong understanding of agent design principles...Training
- ...Applied AI Engineer Valthos Inc. Valthos... ...We build and deploy software and biological AI systems to safeguard... ...The same AI architectures that enable self-driving... ...applied biological ML engineers from MIT's... ...including adapting and post-training biological frontier...TrainingWork at office
- ...sites today. Backed by Accel. Our system runs a Multi Agent Action Expert architecture: classical precision algorithms orchestrated... ...1: from data collection and model training through edge deployment on Jetson... .... BS/MS/PhD in CS, Robotics, ML, or equivalent experience shipping...Training
Do you want to receive more vacancies?
Subscribe and receive similar vacancies to Software Engineer, ML Systems & Training Architecture. Be the first to apply!
- software sales engineer San Francisco, CA
- software engineer full time San Francisco, CA
- facebook software engineer San Francisco, CA
- startup software engineer San Francisco, CA
- intermediate software engineer San Francisco, CA
- research software engineer San Francisco, CA
- software developer no experience San Francisco, CA
- rust software engineer San Francisco, CA
- freelance software developer San Francisco, CA
- work from home software developer San Francisco, CA

