Software Engineer, Data Infrastructure - Research
Slope
About the Team The Workload team is responsible for designing and running OpenAI’s LLM training and inference infrastructure that powers frontier models at massive scale. Our systems unify how researchers train and serve models, abstracting away the complexity of performance, parallelism, and execution across vast GPU/accelerator fleets. By providing this foundation, the Workload team ensures that researchers can focus on advancing model capabilities while we handle the scale, efficiency, and reliability required to bring those models to life. About the Role We are looking for an engineer to design and implement the dataset infrastructure that powers OpenAI’s next-generation training stack. You will be responsible for building standardized dataset interfaces, scaling pipelines across thousands of GPUs, and proactively testing performance bottlenecks. In this role, you will collaborate closely with the multimodal researchers, and other infra groups to ensure datasets are unified, efficient, and easy to consume. In this role, you will: Design and maintain standardized dataset APIs, including for multimodal (MM) data that cannot fit in memory. Build proactive testing and scale validation pipelines for dataset loading at GPU scale. Collaborate with teammates to integrate datasets seamlessly into training and inference pipelines, ensuring smooth adoption and a great user experience. Document and maintain dataset interfaces so they are discoverable, consistent, and easy for other teams to adopt. Establish safeguards and validation systems to ensure datasets remain reproducible and unchanged once standardized. Debug and resolve performance bottlenecks in distributed dataset loading (e.g., straggler systems slowing global training). Provide visualization and inspection tools to surface errors, bugs, or bottlenecks in datasets. You might thrive in this role if you: Have strong engineering fundamentals with experience in distributed systems, data pipelines, or infrastructure. Have experience building APIs, modular code, and scalable abstractions, while recognizing that abstractions ultimately serve the users and UX is an important part of the abstractions design. Are comfortable debugging bottlenecks across large fleets of machines. Take pride in building infrastructure that “just works,” and find joy in being the guardian of reliability and scale. Are collaborative, humble, and excited to own a foundational (if not glamorous) part of the ML stack. Bonus points if you: Have background knowledge in data math, probability, or distributed data theory. Have worked with GPU-scale distributed systems or dataset scaling for real-time data About OpenAI OpenAI is an AI research and deployment company dedicated to ensuring that general-purpose artificial intelligence benefits all of humanity. We push the boundaries of the capabilities of AI systems and seek to safely deploy them to the world through our products. AI is an extremely powerful tool that must be created with safety and human needs at its core, and to achieve our mission, we must encompass and value the many different perspectives, voices, and experiences that form the full spectrum of humanity. We are an equal opportunity employer, and we do not discriminate on the basis of race, religion, color, national origin, sex, sexual orientation, age, veteran status, disability, genetic information, or other applicable legally protected characteristic. For additional information, please see OpenAI’s Affirmative Action and Equal Employment Opportunity Policy Statement. Qualified applicants with arrest or conviction records will be considered for employment in accordance with applicable law, including the San Francisco Fair Chance Ordinance, the Los Angeles County Fair Chance Ordinance for Employers, and the California Fair Chance Act. For unincorporated Los Angeles County workers: we reasonably believe that criminal history may have a direct, adverse and negative relationship with the following job duties, potentially resulting in the withdrawal of a conditional offer of employment: protect computer hardware entrusted to you from theft, loss or damage; return all computer hardware in your possession (including the data contained therein) upon termination of employment or end of assignment; and maintain the confidentiality of proprietary, confidential, and non-public information. In addition, job duties require access to secure and protected information technology systems and related data security obligations. To notify OpenAI that you believe this job posting is non-compliant, please submit a report through this form. No response will be provided to inquiries unrelated to job posting compliance. We are committed to providing reasonable accommodations to applicants with disabilities, and requests can be made via this link. OpenAI Global Applicant Privacy Policy At OpenAI, we believe artificial intelligence has the potential to help people solve immense global challenges, and we want the upside of AI to be widely shared. Join us in shaping the future of technology. #J-18808-Ljbffr
$250k - $380k
...LLM training and inference infrastructure that powers frontier models... ...scale. Our systems unify how researchers train and serve models, abstracting... ...We are looking for an engineer to design and implement the... ...including for multimodal (MM) data that cannot fit in memory....SuggestedWork at officeLocal areaRelocation packageFlexible hours$200k - $400k
...Senior Data Infrastructure Engineer Decagon is the leading conversational AI platform empowering every brand to deliver concierge customer experiences... ...across ClickHouse, BigQuery, or similar. Partner with research and product teams to architect data solutions, evaluate...SuggestedFull timeWork at officeLocal area- ...London and Amsterdam. Making data driven decisions is key to... ...and guidance to teams across engineering, product, and business and... ...effectively. Engineers on Data Infrastructure are domain experts in Data... ...Qualifications 5+ years of software engineering experience...SuggestedWork experience placementLocal area
- ...innovation and systems engineering paired with a design‑... ...in AI. About the Role Data is the lifeblood of... ...and we’re looking for a Software Engineer to help build... ...training data and ML data infrastructure at Cartesia. This role... ...partners closely with research and inference teams....SuggestedWork at officeVisa sponsorshipFlexible hours
$160k - $225k
...agentic platform synthesizes complex employee data, pinpoints risky behaviors, and deploys... ...Join Us Build and scale the foundational data infrastructure powering a category-defining product Work closely with engineering, data science, and product teams to operationalize...SuggestedWork experience placementRelocation packageFlexible hours- ...for exceptional people to join us! About the Role As an engineer on the Data Infrastructure team at Persona, you will play a key role in designing,... ...What you’ll bring to Persona 3+ years of experience in software engineering, with a focus on data infrastructure or large...Full timeFor contractorsInternship
$140k - $200k
...include frontend and backend engineers, AI research scientists, and others from... ...re looking to hire for our Data side of our AI team at... ...through a tight integration of infrastructure, engineering, and research... ...are looking for a skilled Software Engineer to join us. What...Full timeWork at officeShift work$175k
...building: models that are trained to use software and take actions just as a person... ...-scale models requires performant data infrastructure to create and store the datasets used... ...for company value Partner with engineers and research scientists to facilitate progress for...Work at officeRemote work- ...planning. The Schwab Asset Management (SAM) Engineering organization is a part of Schwab... ...'s investment management, operations, data, and research platforms. Everything that SAM... ...clients reach their financial goals.As a Software Engineer in the SAM Engineering Investment...Internship
$162k - $216k
...Software Engineer - Infrastructure, Data Platform San Francisco, California, United States Who We Are Baton is Ryder's in-house product development group focused on harnessing emerging technologies to redefine transportation and logistics. With $10B in freight...Full timeWork at officeImmediate startRemote workMonday to Friday- Charles Schwab is seeking a Software Engineer to join their SAM Engineering team in San Francisco. You will partner with researchers and product teams to build and enhance data pipelines used for quantitative research, while also implementing data-quality controls. The...
$160k - $220k
...turns siloed and disconnected data into operational... ...internationally. Team As an engineering team, we believe strongly that... ...We are looking for a Data Infrastructure Engineer to join our growing... ...Airflow or similar tools Strong software engineering fundamentals in...Full timeWork at officeLocal area$300k - $405k
...is a quickly growing group of committed researchers, engineers, policy experts, and business leaders working... ...Engineer on the Economic Research Data Platform team, you will design, build, and maintain critical infrastructure that powers the company's research on AI'...Visa sponsorship$119k - $299.93k
...governance and risk management processes and related controls. Those in data, analytics and technology solutions at PwC will assist clients... ...Degree - At least 8 years of professional AI/ML development, engineering, or testing experience What Sets You Apart - Master's...Full timeH1b$160k - $200k
...every Quest completed, there's data, petabytes of it, telling... ...excited about building data infrastructure at massive scale and cares deeply... ...Discord users and Discord engineers. We’re building the next... ...world. If you're the kind of Software Engineer who lights up when...Full timeWorldwideRelocationRelocation package$140k - $260k
...Software Engineer, Data Platform Profound is on a mission to help companies understand and control their AI presence. We are looking for... ...Engineer, Data Platform to design, build, and scale the infrastructure that powers data across our organization. You will architect...Work at officeVisa sponsorship- ...Role As a Data Platform Software Engineer working at OpenEvidence, you will build end-to-end systems powering critical product and research workflows. Your work will focus on performance... ...granting you full autonomy over the infrastructure that helps doctors navigate...Full time
$200k - $220k
...only vertically integrated AI infrastructure company built from the ground... ...energy, manufacturing, data center construction, and cloud... ...Crusoe Energy as a Senior Data Engineer, an early and pivotal hire on... ...Engineering Teams: Partner with software engineers, data scientists,...Full timeTemporary workWork at officeRemote work$200k - $236k
...Software Engineer, Data Platform Hybrid - SF Bay Area GlossGenius is the AI-powered system behind the world's most meaningful appointments... ...the Data Platform team, you'll own the architecture and infrastructure that moves data from raw ingestion to model-ready, at the...Work at officeFlexible hoursNight shift3 days per week- ...and operating the next generation of data infrastructure at Mistral AI. You will be a core contributor... ...governed data access for MLOps and research. You will take full lifecycle... ...anticipates exabyte growth. • Platform Engineering: Contribute to the development of our...Work at officeVisa sponsorship
$230k - $265k
...financial tools they need. About the Position: We're looking for a seasoned software engineer to join Parafin's Infrastructure team and lead the development of our next-generation Data Platform. This role is critical to ensuring that our data infrastructure is...Work from homeFlexible hours- ...Whatnot updates on our news and engineering blogs and join us as we... ...commerce. Role The Data Platform team at Whatnot builds... ..., multi-tenant streaming infrastructure rather than simply building... ...understanding You As our next Software Engineer, Data Platform, you...Local areaRemote workWork from homeHome office
- ...the role We are hiring a Senior Software Engineer to own the data platform that powers Plenful's... ...experience building backend or data infrastructure in production ~ Deep expertise in... ...consume structured data, not pure ML research Comfort in customer-facing technical...Work at officeFlexible hours2 days per week
$170k - $230k
...Senior Software Engineer - Data Platform San Francisco About Highnote Founded in 2020 by a team of leaders from Braintree, PayPal, and... ...compensation packages are competitive based on robust market research and are a combination of a cash salary, equity, and...Work at officeLocal areaHome officeFlexible hours- ...Making data driven decisions is key to Plaid's culture. To support... ...guidance to teams across engineering, product, and business and... ...the data and machine learning infrastructure to enable Plaid engineers to... ...~5+ years of software engineering experience ~ Extensive...
$81k - $150k
...Is Where Your Career Begins If you're a Java developer, software programmer, data scientist, or data analyst struggling to break into the... ...stack developers, Python/Java developers, Data analysts/Data Engineers/ Data Scientists, Machine Learning engineers for full...Full timeH1b- Description Slack is looking for a Staff Software Engineer to join the Data Infrastructure team within the broader Data Engineering organization. The mission of our team is to build secure, reliable, performant, scalable, and cost-efficient infrastructure that powers Slack...Permanent employment
- ...expertise in model innovation and systems engineering paired with a design‑minded product... ...global AI, our models must be trained on data that reflects the world’s diversity of languages... ...building scalable systems that bridge research and production. What We Offer...Work at officeRelocation package
$147.4k - $272.1k
Sr Software Engineer, AI & Data Platforms (AiDP) San Francisco Bay Area, California, United States Software and Services Description Join our team... ...You would support our mission by applying groundbreaking research in this rapidly evolving and exciting space to our daily...Relocation$130.6k - $192k
About the Team DoorDash is a data‑driven organization that... ...Platform organization owns all the infrastructure necessary to run an... ...simplify data workflows for engineers, analysts, and ML practitioners... ...Codex, Cursor) throughout the software development lifecycle, including...Hourly payWork at officeLocal areaRelocationFlexible hours
Do you want to receive more vacancies?
Subscribe and receive similar vacancies to Software Engineer, Data Infrastructure - Research. Be the first to apply!
- software engineer amazon San Francisco, CA
- experienced software developer San Francisco, CA
- federal - software developer San Francisco, CA
- software developer internship San Francisco, CA
- senior software engineer San Francisco, CA
- software developer fintech San Francisco, CA
- part time software developer remote San Francisco, CA
- software developer intern San Francisco, CA
- software data engineer San Francisco, CA
- fall software engineering internship San Francisco, CA


