Sign up to access all features of our service.
  • Job search
  • Favorites
  • Create a CV
    New
  • Salaries
  • Subscriptions

Software Engineer, Data Infrastructure & Acquisition - Cape Town, South Africa

Speechify

Software Engineer, Data Infrastructure & Acquisition - Cape Town, South Africa

The mission of Speechify is to make sure that reading is never a barrier to learning.

Over 50 million people use Speechify's text-to-speech products to turn whatever they're reading – PDFs, books, Google Docs, news articles, websites – into audio, so they can read faster, read more, and remember more. Speechify's text-to-speech reading products include its iOS app, Android App, Mac App, Chrome Extension, and Web App. Google recently named Speechify the Chrome Extension of the Year and Apple named Speechify its 2025 Design Award winner for Inclusivity.

Today, nearly 200 people around the globe work on Speechify in a 100% distributed setting – Speechify has no office. These include frontend and backend engineers, AI research scientists, and others from Amazon, Microsoft, and Google, leading PhD programs like Stanford, high growth startups like Stripe, Vercel, Bolt, and many founders of their own companies.

Overview

We're looking to hire for our Data side of our AI team at Speechify. This role is responsible for all aspects of data collection to support our model training operations. We are able to build high-quality datasets at petabyte-scale and low cost through a tight integration of infrastructure, engineering, and research work. We are looking for a skilled Software Engineer to join us.

What You'll Do
  • Be scrappy to find new sources of audio data and bring it into our ingestion pipeline
  • Operate and extend the cloud infrastructure for our ingestion pipeline, currently running on GCP and managed with Terraform.
  • Collaborate closely with our Scientists to shift the cost/throughput/quality frontier, delivering richer data at bigger scale and lower cost to power our next-generation models.
  • Collaborate with others on the AI Team and Speechify Leadership to craft the AI Team's dataset roadmap to power Speechify's next-generation consumer and enterprise products.
An Ideal Candidate Should Have
  • BS/MS/PhD in Computer Science or a related field.
  • 5+ years of industry experience in software development.
  • Proficiency with bash/Python scripting in Linux environments
  • Proficiency in Docker and Infrastructure-as-Code concepts and professional experience with at least one major Cloud Provider (we use GCP)
  • Experience with web crawlers, large-scale data processing workflows is a plus
  • Ability to handle multiple tasks and adapt to changing priorities.
  • Strong communication skills, both written and verbal.
What We Offer
  • A fast-growing environment where you can help shape the company and product.
  • An entrepreneurial-minded team that supports risk, intuition, and hustle.
  • A hands-off management approach so you can focus and do your best work.
  • An opportunity to make a big impact in a transformative industry.
  • Competitive salaries, a friendly and laid-back atmosphere, and a commitment to building a great asynchronous culture.
  • Opportunity to work on a life-changing product that millions of people use.
  • Build products that directly impact and support people with learning differences like dyslexia, ADD, low vision, concussions, autism, and more.
  • Work in one of the fastest-growing sectors of tech, the intersection of artificial intelligence and audio.
Vacancy posted 1 day ago
Similar jobs that could be interesting for youBased on the Software Engineer, Data Infrastructure & Acquisition - Cape Town, South Africa in United States vacancy
  •  ...Software Engineer, Platform Cape Town, South Africa The mission of Speechify is to make sure that reading is never a barrier to learning. Over 50 million people use Speechify's text-to-speech products to turn whatever they're reading – PDFs, books, Google Docs... 
    Suggested
    Work at office
    Remote work

    Speechify

    United States
    4 days ago
  •  ...Software Engineer, iOS Core Product - Cape Town, South Africa The mission of Speechify is to make sure that reading is never a barrier to learning. Over...  ...Multithreading Programming Working with CI/CD infrastructure Experience with Fastlane SOLID principles,... 
    Suggested
    Work at office
    Remote work

    Speechify

    United States
    1 day ago
  •  ...processes millions of data points across search...  ...growing our tech team in South Africa as part of our...  ...Python, BigQuery, cloud infrastructure, and AI/LLM integrations...  ...in South Africa - Cape Town. What you’ll do We are...  ...for a Senior Data Engineer to make a significant... 
    Suggested
    Remote work

    DIGITAL LUXURY GROUP, DLG SA

    New York, NY
    4 days ago
  • $140k - $200k

    Overview We’re looking to hire for the data side of our AI team at Speechify. This...  ...cost through a tight integration of infrastructure, engineering, and research work. What You’ll Do Be...  .... 5+ years of industry experience in software development. Proficiency with bash or... 
    Suggested
    Full time
    Shift work

    TryApplyNow

    Providence, RI
    2 days ago
  •  ...Join OLX Software innovation is the driving force behind the success of Property24. We...  ...open as remote for all candidates based in South Africa. We are looking for someone who:...  ...with us! OLX will process your personal data to assess your fit for the applied... 
    Suggested
    Remote work
    Worldwide

    Olx India

    United States
    1 day ago
  •  ...Specialist Marketing Cloud Solution Engineer - South Africa Salesforce is looking for a motivated and technically strong Solution Engineer to...  ...requirements into solution designs that support personalized, data-driven engagement. Responsibilities: Business... 
    Remote work
    Flexible hours

    Salesforce

    United States
    2 days ago
  •  ...Software Engineer, iOS Core Product - Johannesburg, South Africa Johannesburg, South Africa The mission of Speechify is to make sure that reading is never...  ...Programming Working with CI/CD infrastructure Experience with Fastlane SOLID principles... 
    Work at office
    Remote work

    Speechify

    United States
    3 days ago
  •  ...Software Engineer - Data Infrastructure Home based - EMEA Canonical is building a comprehensive automation suite to provide multi-cloud and on-premise data solutions for the enterprise. The data platform team is a collaborative team that develops a full range of... 
    Remote work
    Work from home

    Canonical

    United States
    2 days ago
  • $144.1k - $180.1k

     ...Senior Software Engineer - Data Infrastructure Remote, USA Marqeta is looking for a talented Senior Software Engineer to independently identify and deliver software solutions on our Data Infrastructure team through a set of milestones spanning a specific platform... 
    Work at office
    Remote work
    Flexible hours

    Marqueta Referrals

    United States
    5 days ago
  •  ...To support a growing data infrastructure team, the full-time remote Senior Software Engineer, Data Infrastructure will design and maintain high-performance databases, optimize data pipelines, and collaborate cross-functionally to enhance data workflows. Key responsibilities... 
    Full time
    Remote work

    Virtual Vocations Inc

    United States
    5 days ago
  •  ...Software Engineer Arena Intelligence is seeking a Software Engineer to join our team and build the data pipelines and infrastructure that powers real-world AI evaluation. You'll play a crucial role in designing and building the data pipelines that process and analyze... 
    Permanent employment
    Remote work

    Arena AI

    United States
    2 days ago
  •  ...unique demands. About the Role: The AI Data Infrastructure team sits between Stack's data and its engineers. We provide tooling and infrastructure to process...  ...status. This position may also involve working with software and technologies subject to U.S. export control... 
    Remote work

    Stack AV

    United States
    2 days ago
  • $200k - $220k

     ...Senior Software Engineer, Data Infrastructure (RDBMS) United States Build to Protect Civilization TRM is a blockchain intelligence company that’s on a mission to build a safer financial system for billions of people. We’re a lean, high‑impact team tackling some of the... 
    Remote work
    Shift work

    TRM

    New York, NY
    4 days ago
  • $170k - $230k

     ...Senior Software Engineer - Data Infrastructure Bluesky's mission is to transition the social web from platforms to protocols. We're building a federated social network where users have more power. Our team has decades of combined experience building distributed applications... 
    Remote work

    Blue Sky MD

    United States
    2 days ago
  • $213k - $263k

     ...Waymo ML Platform team, builds tools and infrastructure to realize the ML flywheel at Waymo....  ...Develop and contribute to Waymo's data infrastructure platform to enable plant...  ...professional experience in the field of software engineering ~ Experience programming in C++ ~... 
    Full time
    Remote work

    Waymo

    Mountain View, CA
    2 days ago
  •  ...Persona Data Infrastructure Engineer Persona is the configurable identity platform built for businesses in a digital-first world. Verifying individuals...  ...You'll Bring to Persona ~3+ years of experience in software engineering, with a focus on data infrastructure or large-... 
    Full time
    For contractors
    Internship
    Remote work

    Persona

    United States
    2 days ago
  •  ...Notion Data Engineering Infrastructure Engineer Notion's Data Engineering Infrastructure team keeps the platform under our data pipelines healthy...  ...team. Required skills include: ~7+ years as a software or infrastructure engineer with strong DevOps experience.... 
    Local area
    Remote work

    Notion, LLC

    United States
    2 days ago
  • $160.36k - $240.54k

     ...Software Engineer, ML Data Infrastructure Mountain View, California (HQ) Nuro is a self-driving technology company on a mission to make autonomy accessible to all. Founded in 2016, Nuro is building the world's most scalable driver, combining cutting-edge AI with... 
    Work experience placement

    Nuro

    Mountain View, CA
    3 days ago
  • $250k - $380k

     ...running OpenAI's LLM training and inference infrastructure that powers frontier models at massive...  ...About the Role We are looking for an engineer to design and implement the dataset infrastructure...  ...APIs, including for multimodal (MM) data that cannot fit in memory. Build... 

    OpenAI

    San Francisco, CA
    2 days ago
  • $193.93k - $291.15k

     ...Sr. Software Engineer, Perception Data Infrastructure Mountain View, California (HQ) About the Role We are a team of high-output generalists where ML and systems engineering converge to push autonomy performance forward. As a Senior Perception ML Data Infrastructure... 

    Nuro

    Mountain View, CA
    5 days ago
  •  ...Data Infrastructure Engineer The Data Infrastructure teams are responsible for building and maintaining data storage technologies across the...  .... What We're Looking For We're looking for talented software engineers to help us build the vision of making our database... 

    Roberts Recruiting

    Cambridge, MA
    3 days ago
  • $109k - $160k

     ...CoreWeave combines superior infrastructure performance with deep technical...  ...at About The Role: The Data Platforms Team serves as the...  ...: We are seeking a senior engineer with specialization in database...  ...2~5 years of experience in a software or infrastructure engineering... 
    Permanent employment
    Temporary work
    Casual work
    Work at office
    Remote work
    Flexible hours

    CoreWeave

    Bellevue, WA
    3 days ago
  • $117.2k - $223.9k

     ...duplicating efforts. Job Category Software Engineering Job Details About Salesforce...  ...experienced engineers to join our Core Infrastructure organization, where you'll help design...  ...pipelines that process and transform data for Slack's search infrastructure.... 

    Salesforce.Com Inc

    Atlanta, GA
    2 days ago
  •  .... Cohere is a team of researchers, engineers, designers, and more, who are passionate...  ...Why this role? We're building the data infrastructure behind some of the most demanding AI...  ...training and evaluation jobs. As a Software Engineer, Data Infrastructure, you will... 
    Full time
    Work at office
    Remote work
    Flexible hours

    Cohere

    New York, NY
    3 days ago
  • $350k

     ...Software Engineer, Data Infrastructure Thinking Machines Lab's mission is to empower humanity through advancing collaborative general intelligence. We're building a future where everyone has access to the knowledge and tools to make AI work for their unique needs and... 
    Local area
    Immediate start
    Visa sponsorship
    Work visa
    Relocation package

    Thinking Machines Lab

    San Francisco, CA
    2 days ago
  • $153k - $376k

     ...design and collaboration, join us! The Data Platform team at Figma builds and...  ...including AI researchers, machine learning engineers, data scientists, product engineers, and...  ...ML Datalake, orchestration and pipeline infrastructure, and large‑scale data ingestion and processing... 
    Full time
    Remote work
    Work from home

    Figma

    San Francisco, CA
    4 days ago
  • $195.5k - $247.25k

     ...Sr. Software Engineer (Data Acquisition) Our healthcare system is the leading cause of personal bankruptcy in the U.S. Every year, over 50 million Americans suffer adverse financial consequences as a result of seeking care, from lower credit scores to garnished wages... 
    Work at office
    Remote work
    Work from home
    Flexible hours

    Cedar

    United States
    1 day ago
  • $210k - $267k

     ...we do. We ingest large-scale data-weather, prices, load, and grid...  ...: We're looking for an engineer to help lead the scaling and reliability of our data infrastructure, which is core to the ML work...  ..., or Temporal. Strong software engineering skills. Being able... 
    Work at office
    Remote work
    Work from home
    Home office
    Flexible hours
    3 days per week

    Gridmatic

    Cupertino, CA
    3 days ago
  •  ...About the Role As a Data Infrastructure Engineer in Research at Luma, you will play a critical role in building and scaling the data infrastructure that supports our cutting-edge multimodal AI systems. Your work will focus on developing high-throughput, large-scale... 

    Luma AI

    Redwood City, CA
    4 days ago
  •  ...Software Engineer, Data Infrastructure Our mission is to scale intelligence to serve humanity. We're training and deploying frontier models for developers and enterprises who are building AI systems to power magical experiences like content generation, semantic search... 
    Full time
    Work at office
    Remote work
    Flexible hours

    Cohere

    United States
    5 days ago

Do you want to receive more vacancies?

Subscribe and receive similar vacancies to Software Engineer, Data Infrastructure & Acquisition - Cape Town, South Africa. Be the first to apply!