Research Engineer, Data Infrastructure
Mistral AI
Job Description
About Mistral
At Mistral AI, we believe in the power of AI to simplify tasks, save time, and enhance learning and creativity. Our technology is designed to integrate seamlessly into daily working life.
We democratize AI through high-performance, optimized, open-source and cutting-edge models, products and solutions. Our comprehensive AI platform is designed to meet enterprise as well as personal needs. Our offerings include Le Chat, La Plateforme, Mistral Code and Mistral Compute - a suite that brings frontier intelligence to end-users.
We are a dynamic, collaborative team passionate about AI and its potential to transform society. Our diverse workforce thrives in competitive environments and is committed to driving innovation. Our teams are distributed between France, USA, UK, Germany and Singapore. We are creative, low-ego and team-spirited.
Join us to be part of a pioneering company shaping the future of AI. Together, we can make a meaningful impact. See more about our culture on
Role Summary
This role focuses on building and operating the next generation of data infrastructure at Mistral AI. You will be a core contributor to our evolution, helping us design and scale massive compute fleets and storage systems built for high performance.
You will help us move toward a future of decoupled control and data planes, scaling big data compute and storage platforms while ensuring secure and governed data access for MLOps and research. You will take full lifecycle ownership: from architecting the migration away from legacy orchestrators to implementing production-grade pipelines and participating in on-call rotations for critical training jobs.
What you will do
• Build & Scale: Help us reach our goal of operating massive distributed compute and storage systems.
• Global Orchestration: Architect and maintain multi-cluster orchestration layers to optimize workload placement across diverse hardware and regions.
• Design Future-Proof Storage: Architect our transition to modern storage formats to handle fine-tuning datasets at a scale that anticipates exabyte growth.
• Platform Engineering: Contribute to the development of our internal training platform, ensuring seamless model training and fine-tuning capabilities across Kubernetes- and SLURM-based environments.
• Metadata & Lineage: Implement and manage systems to provide clear visibility and lineage as our data and model pipelines grow in complexity.
• Operational Excellence: Use modern deployment workflows to manage cloud-native deployments, ensuring our data platform can scale by orders of magnitude while remaining reliable and efficient.
About you
• Have 4+ years of experience in Data Infrastructure, MLOps, or Infrastructure Engineering.
• Have experience or a strong interest in supporting foundational compute and storage platforms.
• Are proficient in Python and enjoy solving the "brittle data lake" problem with modern, columnar storage standards.
• Are well-versed in Kubernetes-native tooling and excited to debug large-scale distributed systems across multi-cluster environments.
• Take pride in building and operating scalable, reliable, and secure systems from the ground up.
• Are comfortable with ambiguity and the challenges of building high-scale infrastructure in a rapid-growth AI environment.
What we offer
- 💰 Competitive salary and equity.
- 🚑 Healthcare: Medical/Dental/Vision covered for you and your family.
- 👴🏻 Pension: 401K (6% matching)
- 🏝️ PTO: 18 days
- 🚗 Transportation: Reimburse office parking charges, or $120/month for public transport
- 🏀 Sport: $120/month reimbursement for gym membership
- 🥕 Meal stipend: $400 monthly allowance for meals (solution might evolve as we grow bigger)
- 🌎 Visa sponsorship
- 🤝 Coaching: we offer BetterUp coaching on a voluntary basis
By applying, you agree to our Applicant Privacy Policy.

