Research Engineer, Data Infrastructure
Mistral AI
About Mistral At Mistral AI, we believe in the power of AI to simplify tasks, save time, and enhance learning and creativity. Our technology is designed to integrate seamlessly into daily working life. We democratize AI through high-performance, optimized, open-source and cutting-edge models, products and solutions. Our comprehensive AI platform is designed to meet enterprise as well as personal needs. Our offerings include Le Chat, La Plateforme, Mistral Code and Mistral Compute - a suite that brings frontier intelligence to end-users. We are a dynamic, collaborative team passionate about AI and its potential to transform society. Our diverse workforce thrives in competitive environments and is committed to driving innovation. Our teams are distributed between France, USA, UK, Germany and Singapore. We are creative, low-ego and team-spirited. Join us to be part of a pioneering company shaping the future of AI. Together, we can make a meaningful impact. See more about our culture on Role Summary This role focuses on building and operating the next generation of data infrastructure at Mistral AI. You will be a core contributor to our evolution, helping us design and scale massive compute fleets and storage systems designed for high performance and scalability. You will help us move toward a future of decoupled control and data planes, scaling big data compute and storage platforms while ensuring secure and governed data access for MLOps and research. You will take full lifecycle ownership: from architecting the migration away from legacy orchestrators to implementing production-grade pipelines and participating in on-call rotations for critical training jobs. What will you do Build & Scale: Help us reach our goal of operating massive distributed compute and storage systems Global Orchestration: Architect and maintain multi-cluster orchestration layers to optimize workload placement across diverse hardware and regions. Design Future-Proof Storage: Architect our transition to modern storage formats to handle fine-tuning datasets at a scale that anticipates exabyte growth. Platform Engineering: Contribute to the development of our internal training platform, ensuring seamless model training and fine-tuning capabilities across Kubernetes and SLURM based environments. Metadata & Lineage: Implement and manage systems to provide clear visibility and lineage as our data and model pipelines grow in complexity. Operational Excellence: Use modern deployment workflows to manage cloud-native deployments, ensuring our data platform can scale by o About you Have 4+ years of experience in Data Infrastructure, MLOps, or Infrastructure Engineering. Have experience or a strong interest in supporting foundational compute and storage platforms. Are proficient in Python and enjoy solving the 'brittle data lake' problem with modern, columnar storage standards. Are well-versed in Kubernetes-native tooling and excited to debug large-scale distributed systems across multi-cluster environments. Take pride in building and operating scalable, reliable, and secure systems from the ground up. Are comfortable with ambiguity and the challenges of building high-scale infrastructure in a rapid-growth AI environment. What we offer Competitive salary and equity. Healthcare: Medical/Dental/Vision covered for you and your family. Pension: 401K (6% matching) PTO: 18 days. Transportation: Reimburse office parking charges, or $120/month for public transport. Sport: $120/month reimbursement for gym membership. Meal stipend: $400 monthly allowance for meals (solution might evolve as we grow bigger). Visa sponsorship. Coaching: we offer BetterUp coaching on a voluntary basis. By applying, you agree to our Applicant Privacy Policy. #J-18808-Ljbffr Mistral AI
- Senior Research Engineer, Training Data Infrastructure in Foundation Models Cupertino, California, United States - Software and Services Our team is dedicated to solving the high-quality training data problem at the scale required to train advanced Foundation Models. We...Suggested
- Orbifold AI in Palo Alto is seeking a Research Engineer - Multimodal AI to develop advanced AI models and optimize data platforms. The ideal candidate will have a strong background in computer vision and a passion for multimodal advancements. This position also offers the...Suggested
$160.36k - $240.54k
...diversity of its training and evaluation data. The team plays a crucial role in the... ...by creating a scalable and reliable data infrastructure. This infrastructure is designed to produce... ...team collaborates closely with system engineers to thoroughly validate the autonomous...SuggestedWork experience placement$193.93k - $291.15k
About the Role We are a team of high-output generalists where ML and systems engineering converge to push autonomy performance forward. As a Senior Perception ML Data Infrastructure Engineer, you will own the critical bridge between our autonomous vehicle hardware, our...Suggested$185k - $230k
The Opportunity We are looking for a Senior Data Engineer to join our Data Platform team and build the core data foundations that power analytics, experimentation, and decision‑making across the company. In this role, you will design and own foundational data models, pipelines...Suggested$228.6k - $314.25k
Databricks is looking for an experienced engineer to join the ManagedTables team. You'll drive the development of storage solutions, optimize large production clusters, and mentor fellow engineers. With 15+ years in distributed systems, you’ll work on enhancing database...$228.6k - $314.25k
Databricks is seeking an experienced software engineer to work on enterprise-grade analytical data systems, focusing on distributed systems and performance optimization. In this role, you will be responsible for delivering scalable architectures and mentoring team members...$126k - $423k
...Valley company is creating the digital infrastructure needed to bring intelligence to every... ...team We are looking for a passionate Research Engineer (AI/RL Infrastructure) to join the Research... ...can access millions of miles of data from large fleets, and deploy methods...Full timeFor contractorsFor subcontractorCasual workWork at officeImmediate startRemote workDay shift$224k - $356.5k
...searching for a senior or principal engineer who specializes in building cutting‑edge infrastructure for large‑scale foundation... ...the Generalist Embodied Agent Research (GEAR) group. Our team is leading... ...datasets. Implement scalable data loaders and preprocessors tailored...Full time$180k - $300k
DatologyAI in Redwood City, CA is seeking a Research Engineer to drive innovative research and contribute to product development. The ideal... ...along with comprehensive benefits. Join us to help optimize data curation for developing advanced AI models while enjoying unlimited...$19 - $65 per hour
...Ready to get hands‑on with real‑world, large‑scale data challenges? We’re seeking a Software Engineer Intern to help build and improve an event mining framework... ...for backend development and automation. Backend & infrastructure fundamentals: Solid understanding of backend...Hourly payInternship- ...Founded by a team of Stanford researchers and entrepreneurs with deep... ...innovation and systems engineering with a design-minded product... ...models are only as good as the data that trains them. As a Staff... ...Data Engineer, you'll own the infrastructure that takes raw audio —...
- Rhoda AI is looking for Data Infrastructure MLEs in Palo Alto to develop systems that manage immense data volumes essential for robotics. This role requires expertise in designing large-scale data infrastructure to optimize the processing of billions of video clips, ensuring...
- GXL is seeking a Lead Data Engineer in Palo Alto to own the data infrastructure for their AI products. The role involves designing and maintaining scalable ETL/ELT pipelines, enhancing database performance, and collaborating with product teams. The ideal candidate has 2...Visa sponsorship
- A leading tech firm is seeking a senior leader in Data Security to enhance security for their data analytics platform. The role requires over 7 years of experience in Data Security, expertise in areas like Cryptography and Web Security, and a strong leadership background...
$160.36k - $240.54k
...diversity of its training and evaluation data. The team plays a crucial role in the... ...by creating a scalable and reliable data infrastructure. This infrastructure is designed to produce... ...team collaborates closely with system engineers to thoroughly validate the autonomous driving...Work experience placement- PlusAI in Santa Clara is seeking a Software Engineer Intern to contribute to the development of advanced metrics dashboards. The intern... ...while collaborating across domains to enhance backend infrastructure. This role requires strong programming ability and is ideal for...Internship
$180k - $250k
...own large models on their own data. The current industry... ...at worst. There is compelling research showing that smarter data selection... ...an experienced Data Platform Engineer to join as a member of our core... ...Engineering / Platform / Infrastructure Team. Experience building ML...Work at officeVisa sponsorshipRelocation package$153k - $222k
...the Silicon Valley company is creating the digital infrastructure needed to bring intelligence to every moving... ...About the role We are looking for infrastructure engineers with expertise in scaling open-source data infrastructure to join the Data & ML infra group....Full timeFor contractorsFor subcontractorCasual workWork at officeRemote workDay shift$150k - $300k
As Staff Software Engineer for data infrastructure, you will play a crucial role in designing and implementing the systems that process, analyze, and serve our satellite constellation’s data to end‑users. You will have the opportunity to shape highly reliable backend infrastructure...Permanent employmentFull timeRemote work- ...Position Summary: At HeyGen, we are at the forefront of developing applications powered by our cutting-edge AI research. As a Data Infrastructure Engineer, you will lead the development of fundamental data systems and infrastructure. These systems are essential for...
- ...A pioneering AI company in California is seeking a Data Infrastructure Engineer to build and operate large-scale data systems. The role involves architecting multi-cluster systems for optimized performance and maintaining modern storage solutions. Ideal candidates have...
- ...models that leverage our large-scale, high-quality, real-world data collection system. At the same time, we’re building a new... ...more time on the things they value most. As a Machine Learning Research Engineer, you will work on the software and algorithms that enable our...
- A pioneering AI company in California is seeking a Research Engineer for ML to enhance large-scale learning systems and collaborate with Research Scientists. The ideal candidate will have a Master's or PhD in Computer Science, over four years of experience in ML codebases...
$180k - $250k
A tech-driven AI company in Redwood City is seeking an Infrastructure Engineer to develop core infrastructure and support multi-cloud environments. The ideal candidate has experience in large-scale infrastructure, proficiency with tools such as Kubernetes, and a passion...$162.8k - $203.5k
Rivian is searching for a Staff Software Engineer on the Data team, responsible for expertise in cloud and data engineering. The role requires... ...of the AWS Cloud Data Platform, leading critical infrastructure services for the ADAS team. Key qualifications include 5+...$139.8k - $205.04k
...driven to create a better, more sustainable future, then this is the right place for you. Role Description The Engineering Manager, Data Engineering & Infrastructure leads a team of data engineers and infrastructure engineers supporting Powertrain, Battery, and...Immediate start- Keywords to look for: Linux, Networking, Automation, Python/Java, Data Analytics Job Description: Experienced Analytics and Automation Engineer, preferably with experience in the telecom industry. The ideal candidate will have a strong analytics and automation background...
$174k - $252k
Senior Software Engineer, Infrastructure, Google Cloud Data Management Google Sunnyvale, CA, USA Qualifications Bachelor’s degree or equivalent practical experience. 5 years of experience with software development in C++, C, or Python. 3 years of experience testing, maintaining...Full time$207k - $300k
AI Innovation and Research Software Engineer, Platforms and Devices Google | Mountain View, CA, USA... ...technical field. 8 years of experience with data structures and algorithms. 3 years... .... Experience with Machine Learning Infrastructure. Experience with Machine Learning...Full time
Do you want to receive more vacancies?
Subscribe and receive similar vacancies to Research Engineer, Data Infrastructure. Be the first to apply!
- ai research engineer Palo Alto, CA
- research engineer Palo Alto, CA
- research programmer Palo Alto, CA
- deep learning research engineer Palo Alto, CA
- research software engineer Palo Alto, CA
- remote data engineer Palo Alto, CA
- entry level big data engineer Palo Alto, CA
- big data devops engineer Palo Alto, CA
- data engineer Palo Alto, CA
- software data engineer Palo Alto, CA

