Research Engineer, Data Infrastructure
Mistral AI
About Mistral

At Mistral AI, we believe in the power of AI to simplify tasks, save time, and enhance learning and creativity. Our technology is designed to integrate seamlessly into daily working life. We democratize AI through high-performance, optimized, open-source and cutting-edge models, products and solutions. Our comprehensive AI platform is designed to meet enterprise as well as personal needs. Our offerings include Le Chat, La Plateforme, Mistral Code and Mistral Compute - a suite that brings frontier intelligence to end-users.

We are a dynamic, collaborative team passionate about AI and its potential to transform society. Our diverse workforce thrives in competitive environments and is committed to driving innovation. Our teams are distributed between France, USA, UK, Germany and Singapore. We are creative, low-ego and team-spirited. Join us to be part of a pioneering company shaping the future of AI. Together, we can make a meaningful impact. See more about our culture on

Role Summary

This role focuses on building and operating the next generation of data infrastructure at Mistral AI. You will be a core contributor to our evolution, helping us design and scale massive compute fleets and storage systems designed for high performance and scalability. You will help us move toward a future of decoupled control and data planes, scaling big data compute and storage platforms while ensuring secure and governed data access for MLOps and research. You will take full lifecycle ownership: from architecting the migration away from legacy orchestrators to implementing production-grade pipelines and participating in on-call rotations for critical training jobs.

What will you do

- Build & Scale: Help us reach our goal of operating massive distributed compute and storage systems.
- Global Orchestration: Architect and maintain multi-cluster orchestration layers to optimize workload placement across diverse hardware and regions.
- Design Future-Proof Storage: Architect our transition to modern storage formats to handle fine-tuning datasets at a scale that anticipates exabyte growth.
- Platform Engineering: Contribute to the development of our internal training platform, ensuring seamless model training and fine-tuning capabilities across Kubernetes and SLURM based environments.
- Metadata & Lineage: Implement and manage systems to provide clear visibility and lineage as our data and model pipelines grow in complexity.
- Operational Excellence: Use modern deployment workflows to manage cloud-native deployments, ensuring our data platform can scale by o...

About you

- Have 4+ years of experience in Data Infrastructure, MLOps, or Infrastructure Engineering.
- Have experience or a strong interest in supporting foundational compute and storage platforms.
- Are proficient in Python and enjoy solving the 'brittle data lake' problem with modern, columnar storage standards.
- Are well-versed in Kubernetes-native tooling and excited to debug large-scale distributed systems across multi-cluster environments.
- Take pride in building and operating scalable, reliable, and secure systems from the ground up.
- Are comfortable with ambiguity and the challenges of building high-scale infrastructure in a rapid-growth AI environment.

What we offer

- Competitive salary and equity.
- Healthcare: Medical/Dental/Vision covered for you and your family.
- Pension: 401K (6% matching).
- PTO: 18 days.
- Transportation: Reimburse office parking charges, or $120/month for public transport.
- Sport: $120/month reimbursement for gym membership.
- Meal stipend: $400 monthly allowance for meals (solution might evolve as we grow bigger).
- Visa sponsorship.
- Coaching: we offer BetterUp coaching on a voluntary basis.

By applying, you agree to our Applicant Privacy Policy.

