Software Engineer, Data Infrastructure

Anysphere

Our mission is to automate coding. The first step in our journey is to build the best tool for professional programmers, using a combination of inventive research, design, and engineering. Our organization is very flat, and our team is small and talent dense. We particularly like people who are truth-seeking, passionate, and creative. We enjoy spirited debate, crazy ideas, and shipping code.

About the Role

Cursor ships daily. Every release leaves signals behind: telemetry, prompts, completions, agent runs, sessions. Those signals power model improvement, evals, and experimentation. Data infrastructure is what turns them into something teams can trust.

A lot of systems here started simple so we could move fast. Over time, the constraints change and the "good enough" version becomes the bottleneck. This role owns the full ladder: patch what should be patched, redesign what should be redesigned, ship the replacement, and operate it.

Privacy guarantees are part of correctness. What we can retain and use depends on Privacy Mode and org configuration, and getting that wrong breaks a product promise. We choose work by business impact: what blocks product and model teams today, and what will block them next month.

Sample projects include...

  • A core pipeline started as a pragmatic reuse of infrastructure built for something else. It works, but it cannot guarantee properties downstream consumers now need (for example, point-in-time consistency). You design and ship the replacement while keeping the existing system running.

  • A new product surface ships without instrumentation. You talk to the team, define what needs to be captured, and wire it through before the absence becomes anyone else's problem.

  • Eval coverage drops. You trace it to an instrumentation gap introduced weeks ago by a product change nobody flagged. You fix the gap, add a contract so it cannot recur, and ship the dashboard that would have caught it earlier.

  • Multiple consumers depend on overlapping data. You design schema evolution and validation so changes in one place do not silently degrade the others.

  • Storage costs rise faster than usage. You decide what is worth keeping, implement retention and compression, and delete what is not.

What We're Looking For

We're looking for someone who has built real systems at scale and cares about correctness, cost, and ergonomics.

Strong signals include:

  • Deep experience with Spark (Databricks or open-source Spark both count)

  • Production experience with Ray Data

  • Hands-on ownership of large data pipelines and storage systems

  • Comfort debugging performance issues across client instrumentation, streaming, storage, and model-facing workflows, as well as across the compute, storage, and networking layers

  • Clear thinking about data modeling and long-term maintainability

  • Good judgment about when to patch and when to rebuild

Nice to have:

  • Experience running or scaling ClickHouse

  • Familiarity with dbt, Dagster, or similar orchestration and modeling tools

We're in-person with cozy offices in North Beach, San Francisco and Manhattan, New York, replete with well-stocked libraries.

Applying

If there appears to be a fit, we'll reach out to schedule 2-3 short technicals. Afterward, we'll schedule an onsite in our office, where you'll work on a small project, discuss ideas, and meet the team.

Vacancy posted 4 days ago