
Software Engineer, Data Acquisition

$325k - $405k

OpenAI

Software Engineer, Data Acquisition | OpenAI Foundations – San Francisco

Overview:
The Data Acquisition team within the Foundations organization at OpenAI is responsible for all aspects of data collection to support our model training operations. Our team manages web crawling and GPTBot services and works closely with Data Processing, Architecture, and Scaling teams. We are looking for a skilled Software Engineer to join our Data Acquisition team.

Responsibilities:
  • Own and lead engineering projects in the area of data acquisition including web crawling, data ingestion, and search.
  • Collaborate with other sub-teams, such as Data Processing, Architecture, and Scaling, to ensure smooth data flow and system operability.
  • Work closely with the legal team to handle any compliance or data privacy-related matters.
  • Develop and deploy highly scalable distributed systems capable of handling petabytes of data.
  • Architect and implement algorithms for data indexing and search capabilities.
  • Build and maintain backend services for data storage, including work with key-value databases and synchronization.
  • Deploy solutions in a Kubernetes Infrastructure-as-Code environment and perform routine system checks.
  • Conduct and analyze experiments on data to provide insights into system performance.

Qualifications:
  • BS/MS/PhD in Computer Science or a related field.
  • 4+ years of industry experience in software development.
  • Experience with large web crawlers a plus.
  • Strong expertise in large stateful distributed systems and data processing.
  • Proficiency in Kubernetes and Infrastructure-as-Code concepts.
  • Willingness and enthusiasm for trying new approaches and technologies.
  • Ability to handle multiple tasks and adapt to changing priorities.
  • Strong communication skills, both written and verbal.

About OpenAI
OpenAI is an AI research and deployment company dedicated to ensuring that general-purpose artificial intelligence benefits all of humanity. We push the boundaries of the capabilities of AI systems and seek to safely deploy them to the world through our products. AI is an extremely powerful tool that must be created with safety and human needs at its core, and to achieve our mission, we must encompass and value the many different perspectives, voices, and experiences that form the full spectrum of humanity.

We are an equal opportunity employer, and we do not discriminate on the basis of race, religion, color, national origin, sex, sexual orientation, age, veteran status, disability, genetic information, or other applicable legally protected characteristic. For additional information, please see OpenAI’s affirmative action and equal employment opportunity policy statement.

Qualified applicants with arrest or conviction records will be considered for employment in accordance with applicable law, including the San Francisco Fair Chance Ordinance, the Los Angeles County Fair Chance Ordinance for Employers, and the California Fair Chance Act.

For unincorporated Los Angeles County workers: we reasonably believe that criminal history may have a direct, adverse and negative relationship with the following job duties, potentially resulting in the withdrawal of a conditional offer of employment: protect computer hardware entrusted to you from theft, loss or damage; return all computer hardware in your possession (including the data contained therein) upon termination of employment or end of assignment; and maintain the confidentiality of proprietary, confidential, and non-public information. In addition, job duties require access to secure and protected information technology systems and related data security obligations.

To notify OpenAI that you believe this job posting is non-compliant, please submit a report through this form. No response will be provided to inquiries unrelated to job posting compliance.

We are committed to providing reasonable accommodations to applicants with disabilities, and requests can be made via this link.

Compensation: $325K – $405K + Offers Equity

Vacancy posted 3 days ago
Similar jobs that could be interesting for you
Based on the Software Engineer, Data Acquisition in San Francisco, CA vacancy
  • $121.5k - $145.5k

     ...Dallas, TX; San Francisco Bay Area, CA; and Seattle/WA. The Data Acquisition Team is the entry point to WEX's Data-as-a-Service (DaaS)...  ...systems and third-party providers. As a Senior Software Engineer, you'll play a key role in designing and building robust,... 
    Suggested
    Remote work
    Flexible hours

    WEX

    San Francisco, CA
    4 days ago
  • $168.93k - $192.5k

     ...all people to have a secure digital identity. To learn more, visit Role Overview ID.me is seeking a Software Development Engineer III to join the Data Acquisition & Normalization team. This team is responsible for building and operating the integrations that power the... 
    Suggested
    Full time
    Temporary work
    Work at office
    Remote work
    Flexible hours

    ID.me

    San Francisco, CA
    5 days ago
  • $140k - $200k

     ...office. These include frontend and backend engineers, AI research scientists, and others from...  ...Overview We're looking to hire for our Data side of our AI team at Speechify. This role...  ...work. We are looking for a skilled Software Engineer to join us. What You’ll Do Be... 
    Suggested
    Full time
    Work at office
    Shift work

    Clutch Canada

    San Francisco, CA
    5 days ago
  •  ...Python and Kubernetes Software Engineer - Data, AI/ML & Analytics. Join to apply for the Python and Kubernetes Software Engineer - Data, AI/ML & Analytics role at Canonical... 
    Suggested
    Full time
    Freelance
    Internship
    Local area
    Remote work
    Work from home
    Worldwide

    Canonical

    San Francisco, CA
    4 days ago
  •  ...About the Team Data is at the foundation of DoorDash success. The Data Engineering team builds database solutions for various use cases including reporting, product analytics, marketing optimization and financial reporting. By implementing pipelines, data structures, and... 
    Suggested
    Hourly pay
    Local area
    Flexible hours

    Fairygodboss

    San Francisco, CA
    3 days ago
  • $144k - $216k

     ...About the Role & Team The Data Warehouse team builds the systems that connect Amplitude to the broader data ecosystem, including...  ...the lifecycle of those connections and credentials. As a Software Engineer II on the Data Warehouse team, you will help build and scale... 
    Work at office
    Home office
    Flexible hours

    Amplitude

    San Francisco, CA
    4 days ago
  • $124k - $329.2k

     ...the world's leading platform for agentic software development - powered by Copilot to build...  ...United States Overview As a software engineer at GitHub, you will enhance the...  ...particularly), Azure Redis Cache, Azure Data Explorer Clusters. Experience operating... 
    Ongoing contract
    Remote work

    GitHub

    San Francisco, CA
    1 day ago
  • $179.5k - $221.5k

     ...gets done. At Airtable, we're passionate about democratizing software creation — empowering anyone to build powerful, flexible...  ...full apps and deploy AI agents directly into their workflows. Data engineering plays a critical role in this evolution by delivering the insights... 
    Live in
    Remote work
    Flexible hours
    Shift work

    Airtable

    San Francisco, CA
    3 days ago
  • $144k - $216k

     ...Report, Amplitude is the best-in-class solution for product, data, and marketing teams. Learn more at amplitude.com. As an organization...  ...the lifecycle of those connections and credentials. As a Software Engineer II on the Data Warehouse team, you'll help build and scale... 
    Work at office
    Home office
    Flexible hours

    Amplitude

    San Francisco, CA
    1 day ago
  • $168k - $210k

     ...see your impact and unlock incredible career growth opportunities, join us, and build real world value. As a Senior Software Engineer on the Data Engineering team, you are a core builder of the Caspian Data Platform - Ripple's centralized lakehouse that powers analytics... 
    Full time
    Work at office
    Local area

    Ripple

    San Francisco, CA
    5 days ago
  • $180k - $220k

     ...Software Engineer, Data Los Angeles, Palo Alto, San Francisco About HeyGen At HeyGen, our mission is to make visual storytelling accessible to all. Over the last decade, visual content has become the preferred method of information creation, consumption, and retention... 
    Work experience placement

    HeyGen

    San Francisco, CA
    1 day ago
  •  ...bring those models to life. About the Role We are looking for an engineer to design and implement the dataset infrastructure that powers...  ...maintain standardized dataset APIs, including for multimodal (MM) data that cannot fit in memory. Build proactive testing and scale... 

    Slope

    San Francisco, CA
    4 days ago
  • $200k - $400k

     ...Senior Data Infrastructure Engineer Decagon is the leading conversational AI platform empowering every brand to deliver concierge customer experiences. Our technology enables industry-defining enterprises like Avis Budget Group, Block's Cash App and Square, Chime,... 
    Full time
    Work at office
    Local area

    Decagon

    San Francisco, CA
    22 days ago
  •  ...cutting-edge tech startup in San Francisco is looking for a Software Engineer New Grad to contribute to their distributed query engine and...  ...candidates are recent graduates with programming skills in Python or Rust and a keen interest in data infrastructure... 
    Full time

    Eventual

    San Francisco, CA
    4 days ago
  •  ...related technical field 2+ years of experience working in a Software Engineering role and 1+ years of experience with Java Strong foundation...  ...algorithms, and software application design Passionate about data processing, solving challenging problems, and iterating quickly... 

    Amplitude

    San Francisco, CA
    5 days ago
  • $168k - $210k

    Senior Software Engineer, Data Engineering Location: San Francisco, CA, United States. We’re building a world where value moves like information does today. Through our crypto solutions for financial institutions, businesses, governments and developers, we are improving... 
    Full time
    Local area

    Ripple

    San Francisco, CA
    5 days ago
  •  ...understand and reflect human preferences — the Human Data team is at the heart of that effort. The Human Data engineering team creates the systems that enable scalable,...  ...loops. About the Role We’re looking for software engineers to join the Human Data team and build... 
    Work at office
    Relocation package

    Slope

    San Francisco, CA
    3 days ago
  • $130k - $196.5k

    Senior Software Engineer - Big Data. Locations: San Francisco. Time type: Full time. Posted on: Posted Today. Job requisition ID: JR012086. LiveRamp is the data collaboration platform of choice for the world’s most innovative... 
    Work at office
    Work from home
    Worldwide
    Flexible hours
    Night shift

    LiveRamp

    San Francisco, CA
    2 days ago
  • $191k - $225k

    Senior Software Engineer, Unified Data Store Airbnb 23 July 2025 Airbnb was born in 2007 when two hosts welcomed three guests to their San Francisco home, and has since grown to over 5 million hosts who have welcomed over 2 billion guest arrivals in almost every country... 
    Work experience placement
    Casual work
    Live in
    Work at office
    Remote work
    Flexible hours

    TechBrains

    San Francisco, CA
    3 days ago
  •  ...Development: Design, implement, and optimize data processing algorithms and AI models that...  ...meaningful insights from data. Feature Engineering: Identify relevant features and...  ...solutions. Collaboration: Work closely with software engineering, product, and research teams... 

    Suptask

    San Francisco, CA
    4 days ago
  • $123.7k - $254.67k

     ...platform purpose-built for performance marketers. We leverage massive data and cutting‑edge science to automate and optimize TV advertising...  ...advertisers can trust to grow their business. As a Senior Data Engineer at tvScientific, you will be a key player in implementing the... 
    Work at office
    Local area
    Relocation
    Relocation package

    Pinterest

    San Francisco, CA
    2 days ago
  • A leading AI research company in San Francisco seeks software engineers for their Human Data team. The role focuses on building robust systems for gathering and evaluating human feedback that improve AI models. Ideal candidates are strong in full-stack development and enjoy... 
    Work at office
    Flexible hours

    OpenAI

    San Francisco, CA
    3 days ago
  •  ...be sent from @Rippling.com addresses. Why This Role Rare Opportunity Rippling is looking for a seasoned Senior Software Engineer to join the Payroll Data team, one of the most foundational teams in the Global Payroll organization. While other teams build features on... 
    Work at office
    3 days per week

    Rippling

    San Francisco, CA
    2 days ago
  • $160.65k - $217.35k

     ...Organizations use Mapbox applications, data, SDKs and APIs to create customized and...  ...service. This role will be focused on data engineering for feature expansion, improving...  ...quality data to our customers Mentor other software engineers to develop all aspects of their... 

    Mapbox

    San Francisco, CA
    4 days ago
  • $216k - $270k

    Scale AI, Inc. is seeking a software engineer to design, build, and maintain scalable systems within its Generative AI Data Engine. As part of a dynamic hybrid team based in San Francisco or New York City, you will play a crucial role in producing high-quality AI data while... 

    Scale AI, Inc.

    San Francisco, CA
    1 day ago
  •  ...Report, Amplitude is the best-in-class solution for product, data, and marketing teams. Learn more at amplitude.com . As an organization...  ...reliability of everything in between. As a Senior Software Engineer, you'll take on complex infrastructure challenges: designing... 

    Amplitude

    San Francisco, CA
    2 days ago
  • A technology company in San Francisco is seeking a skilled software engineer to build and scale data pipelines for delivering high-quality datasets. The ideal candidate has strong Python development skills and potential experience in Go or Typescript. Responsibilities include... 

    Sieve

    San Francisco, CA
    5 days ago
  •  ...About The Team/Role As WEX continues to scale its Data-as-a-Service (DaaS) platform, the Data Acquisition Team plays a critical role in enabling secure,...  ...external sources. We are looking for a Senior Staff Software Engineer to architect and lead the next evolution of our... 
    Remote work
    Flexible hours

    WEX

    San Francisco, CA
    2 days ago
  • A data processing company is seeking a passionate generalist software engineer to work on various projects including web applications and data transformation. The ideal candidate will have strong skills in Python, PostgreSQL, and Linux, with familiarity in Rust and TypeScript... 
    Remote job

    MixRank

    San Francisco, CA
    2 days ago
  • Voiceflow in San Francisco is looking for a Senior Software Engineer to join our ambitious team. As a founding engineer, you will work on distributed...  ...and a desire to innovate in building a next generation data platform. The ideal candidate has experience in async systems... 
    Work at office

    Voiceflow

    San Francisco, CA
    3 days ago
