Research Engineer, Data Infrastructure

Mistral AI

Job Description

About Mistral 

 

At Mistral AI, we believe in the power of AI to simplify tasks, save time, and enhance learning and creativity. Our technology is designed to integrate seamlessly into daily working life.

 

We democratize AI through high-performance, optimized, open-source and cutting-edge models, products and solutions. Our comprehensive AI platform is designed to meet enterprise as well as personal needs. Our offerings include Le Chat, La Plateforme, Mistral Code and Mistral Compute - a suite that brings frontier intelligence to end-users.

 

We are a dynamic, collaborative team passionate about AI and its potential to transform society. Our diverse workforce thrives in competitive environments and is committed to driving innovation. Our teams are distributed between France, USA, UK, Germany and Singapore. We are creative, low-ego and team-spirited.

 

Join us to be part of a pioneering company shaping the future of AI. Together, we can make a meaningful impact. See more about our culture on

Role Summary 

 

This role focuses on building and operating the next generation of data infrastructure at Mistral AI. You will be a core contributor to our evolution, helping us design and scale massive compute fleets and storage systems built for high performance.
You will help us move toward a future of decoupled control and data planes, scaling big data compute and storage platforms while ensuring secure and governed data access for MLOps and research. You will take full lifecycle ownership: from architecting the migration away from legacy orchestrators to implementing production-grade pipelines and participating in on-call rotations for critical training jobs.

 

 

What you will do

 

Build & Scale: Help us reach our goal of operating massive distributed compute and storage systems.
Global Orchestration: Architect and maintain multi-cluster orchestration layers to optimize workload placement across diverse hardware and regions.
Design Future-Proof Storage: Architect our transition to modern storage formats to handle fine-tuning datasets at a scale that anticipates exabyte growth.
Platform Engineering: Contribute to the development of our internal training platform, ensuring seamless model training and fine-tuning capabilities across Kubernetes and SLURM-based environments.
Metadata & Lineage: Implement and manage systems to provide clear visibility and lineage as our data and model pipelines grow in complexity.
Operational Excellence: Use modern deployment workflows to manage cloud-native deployments, ensuring our data platform can scale by orders of magnitude while remaining reliable and efficient.

 

About you

 

• Have 4+ years of experience in Data Infrastructure, MLOps, or Infrastructure Engineering.
• Have experience or a strong interest in supporting foundational compute and storage platforms.
• Are proficient in Python and enjoy solving the "brittle data lake" problem with modern, columnar storage standards.
• Are well-versed in Kubernetes-native tooling and excited to debug large-scale distributed systems across multi-cluster environments.
• Take pride in building and operating scalable, reliable, and secure systems from the ground up.
• Are comfortable with ambiguity and the challenges of building high-scale infrastructure in a rapid-growth AI environment.

 

 

What we offer
  • 💰 Competitive salary and equity.
  • 🚑 Healthcare: Medical/Dental/Vision covered for you and your family.
  • 👴🏻 Pension: 401K (6% matching)
  • 🏝️ PTO: 18 days
  • 🚗 Transportation: Reimburse office parking charges, or $120/month for public transport
  • 🏀 Sport: $120/month reimbursement for gym membership
  • 🥕 Meal stipend: $400 monthly allowance for meals (solution might evolve as we grow bigger)
  • 🌎 Visa sponsorship
  • 🤝 Coaching: we offer BetterUp coaching on a voluntary basis

 

By applying, you agree to our Applicant Privacy Policy.

Vacancy posted 3 days ago