Sign up to access all features of our service.
  • Job search
  • Favorites
  • Create a CV
    New
  • Salaries
  • Subscriptions

Research Engineer, Data Infrastructure

Mistral AI

About Mistral At Mistral AI, we believe in the power of AI to simplify tasks, save time, and enhance learning and creativity. Our technology is designed to integrate seamlessly into daily working life. We democratize AI through high-performance, optimized, open-source and cutting-edge models, products and solutions. Our comprehensive AI platform is designed to meet enterprise as well as personal needs. Our offerings include Le Chat, La Plateforme, Mistral Code and Mistral Compute - a suite that brings frontier intelligence to end-users. We are a dynamic, collaborative team passionate about AI and its potential to transform society. Our diverse workforce thrives in competitive environments and is committed to driving innovation. Our teams are distributed between France, USA, UK, Germany and Singapore. We are creative, low-ego and team-spirited. Join us to be part of a pioneering company shaping the future of AI. Together, we can make a meaningful impact. See more about our culture on Role Summary This role focuses on building and operating the next generation of data infrastructure at Mistral AI. You will be a core contributor to our evolution, helping us design and scale massive compute fleets and storage systems designed for high performance and scalability. You will help us move toward a future of decoupled control and data planes, scaling big data compute and storage platforms while ensuring secure and governed data access for MLOps and research. You will take full lifecycle ownership: from architecting the migration away from legacy orchestrators to implementing production-grade pipelines and participating in on-call rotations for critical training jobs. What will you do Build & Scale: Help us reach our goal of operating massive distributed compute and storage systems Global Orchestration: Architect and maintain multi-cluster orchestration layers to optimize workload placement across diverse hardware and regions. Design Future-Proof Storage: Architect our transition to modern storage formats to handle fine-tuning datasets at a scale that anticipates exabyte growth. Platform Engineering: Contribute to the development of our internal training platform, ensuring seamless model training and fine-tuning capabilities across Kubernetes and SLURM based environments. Metadata & Lineage: Implement and manage systems to provide clear visibility and lineage as our data and model pipelines grow in complexity. Operational Excellence: Use modern deployment workflows to manage cloud-native deployments, ensuring our data platform can scale by o About you Have 4+ years of experience in Data Infrastructure, MLOps, or Infrastructure Engineering. Have experience or a strong interest in supporting foundational compute and storage platforms. Are proficient in Python and enjoy solving the 'brittle data lake' problem with modern, columnar storage standards. Are well-versed in Kubernetes-native tooling and excited to debug large-scale distributed systems across multi-cluster environments. Take pride in building and operating scalable, reliable, and secure systems from the ground up. Are comfortable with ambiguity and the challenges of building high-scale infrastructure in a rapid-growth AI environment. What we offer Competitive salary and equity. Healthcare: Medical/Dental/Vision covered for you and your family. Pension: 401K (6% matching) PTO: 18 days. Transportation: Reimburse office parking charges, or $120/month for public transport. Sport: $120/month reimbursement for gym membership. Meal stipend: $400 monthly allowance for meals (solution might evolve as we grow bigger). Visa sponsorship. Coaching: we offer BetterUp coaching on a voluntary basis. By applying, you agree to our Applicant Privacy Policy. #J-18808-Ljbffr Mistral AI

Vacancy posted 2 days ago
Similar jobs that could be interesting for youBased on the Research Engineer, Data Infrastructure in Palo Alto, CA vacancy
  • Senior Research Engineer, Training Data Infrastructure in Foundation Models Cupertino, California, United States - Software and Services Our team is dedicated to solving the high-quality training data problem at the scale required to train advanced Foundation Models. We... 
    Suggested

    Apple Inc.

    Cupertino, CA
    1 day ago
  • $213k - $263k

     ...Waymo Ml Ops Engineer Waymo is an autonomous driving technology company with the mission...  ...ML Platform team, builds tools and infrastructure to realize the ML flywheel at Waymo. This...  ...Develop and contribute to Waymo's data infrastructure platform to enable plant... 
    Suggested

    Latent Logic

    Mountain View, CA
    2 days ago
  • Orbifold AI in Palo Alto is seeking a Research Engineer - Multimodal AI to develop advanced AI models and optimize data platforms. The ideal candidate will have a strong background in computer vision and a passion for multimodal advancements. This position also offers the... 
    Suggested

    Bonfirevc

    Palo Alto, CA
    5 days ago
  • $185k - $230k

    The Opportunity We are looking for a Senior Data Engineer to join our Data Platform team and build the core data foundations that power analytics, experimentation, and decision‑making across the company. In this role, you will design and own foundational data models, pipelines... 
    Suggested

    Cacheflow

    Mountain View, CA
    3 days ago
  • $193.93k - $291.15k

    About the Role We are a team of high-output generalists where ML and systems engineering converge to push autonomy performance forward. As a Senior Perception ML Data Infrastructure Engineer, you will own the critical bridge between our autonomous vehicle hardware, our... 
    Suggested

    Kindredventures

    Mountain View, CA
    5 days ago
  • $160.36k - $240.54k

     ...diversity of its training and evaluation data. The team plays a crucial role in the...  ...by creating a scalable and reliable data infrastructure. This infrastructure is designed to produce...  ...team collaborates closely with system engineers to thoroughly validate the autonomous... 
    Work experience placement

    Icehouseventures

    Mountain View, CA
    1 day ago
  • $228.6k - $314.25k

    Databricks is seeking an experienced software engineer to work on enterprise-grade analytical data systems, focusing on distributed systems and performance optimization. In this role, you will be responsible for delivering scalable architectures and mentoring team members... 

    Menlo Ventures

    Mountain View, CA
    4 days ago
  • $228.6k - $314.25k

    Databricks is looking for an experienced engineer to join the ManagedTables team. You'll drive the development of storage solutions, optimize large production clusters, and mentor fellow engineers. With 15+ years in distributed systems, you’ll work on enhancing database... 

    I did my part and supported the Regular Toilet

    Mountain View, CA
    5 days ago
  • $126k - $423k

     ...Valley company is creating the digital infrastructure needed to bring intelligence to every...  ...team We are looking for a passionate Research Engineer (AI/RL Infrastructure) to join the Research...  ...can access millions of miles of data from large fleets, and deploy methods... 
    Full time
    For contractors
    For subcontractor
    Casual work
    Work at office
    Immediate start
    Remote work
    Day shift

    Decisive Point

    Sunnyvale, CA
    2 days ago
  • $224k - $356.5k

     ...searching for a senior or principal engineer who specializes in building cutting‑edge infrastructure for large‑scale foundation...  ...the Generalist Embodied Agent Research (GEAR) group. Our team is leading...  ...datasets. Implement scalable data loaders and preprocessors tailored... 
    Full time

    NVIDIA Gruppe

    Santa Clara, CA
    5 days ago
  • $180k - $300k

    DatologyAI in Redwood City, CA is seeking a Research Engineer to drive innovative research and contribute to product development. The ideal...  ...along with comprehensive benefits. Join us to help optimize data curation for developing advanced AI models while enjoying unlimited... 

    datologyai

    Redwood City, CA
    2 days ago
  • $19 - $65 per hour

     ...Ready to get hands‑on with real‑world, large‑scale data challenges? We’re seeking a Software Engineer Intern to help build and improve an event mining framework...  ...for backend development and automation. Backend & infrastructure fundamentals: Solid understanding of backend... 
    Hourly pay
    Internship

    Medium

    Santa Clara, CA
    1 day ago
  • $165k - $242k

     ...Senior Software Engineer, Data Center Infrastructure Tooling CoreWeave is The Essential Cloud for AI™. Built for pioneers by pioneers, CoreWeave delivers a platform of technology, tools, and teams that enables innovators to build and scale AI with confidence. Trusted... 

    CoreWeave

    Sunnyvale, CA
    4 days ago
  • GXL is seeking a Lead Data Engineer in Palo Alto to own the data infrastructure for their AI products. The role involves designing and maintaining scalable ETL/ELT pipelines, enhancing database performance, and collaborating with product teams. The ideal candidate has 2... 
    Visa sponsorship

    GXL

    Palo Alto, CA
    1 day ago
  • Rhoda AI is looking for Data Infrastructure MLEs in Palo Alto to develop systems that manage immense data volumes essential for robotics. This role requires expertise in designing large-scale data infrastructure to optimize the processing of billions of video clips, ensuring... 

    Rhoda AI

    Palo Alto, CA
    1 day ago
  •  ...Founded by a team of Stanford researchers and entrepreneurs with deep...  ...innovation and systems engineering with a design-minded product...  ...models are only as good as the data that trains them. As a Staff...  ...Data Engineer, you'll own the infrastructure that takes raw audio —... 

    Sanas

    Palo Alto, CA
    1 day ago
  •  ...Infrastructure Engineer Applied Intuition, Inc. is powering the future of physical AI. Founded in 2017 and now valued at $15 billion, the Silicon...  ...engineers with expertise in scaling open-source data infrastructure to join the Data & ML infra group. This role... 

    Applied Intuition

    Sunnyvale, CA
    2 days ago
  • $165k - $242k

     ...Senior Software Engineer - Data Infrastructure Services Sunnyvale, CA / Bellevue, WA CoreWeave is The Essential Cloud for AI™. Built for pioneers by pioneers, CoreWeave delivers a platform of technology, tools, and teams that enables innovators to build and scale AI... 
    Permanent employment
    Temporary work
    Casual work
    Work at office
    Flexible hours

    CoreWeave

    Sunnyvale, CA
    4 days ago
  • A leading tech firm is seeking a senior leader in Data Security to enhance security for their data analytics platform. The role requires over 7 years of experience in Data Security, expertise in areas like Cryptography and Web Security, and a strong leadership background... 

    Menlo Ventures

    Mountain View, CA
    4 days ago
  • $160.36k - $240.54k

     ...diversity of its training and evaluation data. The team plays a crucial role in the...  ...by creating a scalable and reliable data infrastructure. This infrastructure is designed to produce...  ...team collaborates closely with system engineers to thoroughly validate the autonomous driving... 
    Work experience placement

    Icehouseventures

    Mountain View, CA
    6 days ago
  • PlusAI in Santa Clara is seeking a Software Engineer Intern to contribute to the development of advanced metrics dashboards. The intern...  ...while collaborating across domains to enhance backend infrastructure. This role requires strong programming ability and is ideal for... 
    Internship

    PlusAI, Inc.

    Santa Clara, CA
    3 days ago
  • $153k - $222k

     ...the Silicon Valley company is creating the digital infrastructure needed to bring intelligence to every moving...  ...About the role We are looking for infrastructure engineers with expertise in scaling open-source data infrastructure to join the Data & ML infra group.... 
    Full time
    For contractors
    For subcontractor
    Casual work
    Work at office
    Remote work
    Day shift

    Decisive Point

    Sunnyvale, CA
    3 days ago
  • $180k - $250k

     ...own large models on their own data. The current industry...  ...at worst. There is compelling research showing that smarter data selection...  ...an experienced Data Platform Engineer to join as a member of our core...  ...Engineering / Platform / Infrastructure Team. Experience building ML... 
    Work at office
    Visa sponsorship
    Relocation package

    datologyai

    Redwood City, CA
    10 days ago
  • $140k - $220k

     ...private by design, with all data processing performed by the robot...  ...for a motivated perception engineer to join us on the ground...  ...Build scalable machine learning infrastructure to train on large datasets and...  ...Track record of product-focused research applied to real-world... 
    Immediate start
    Work from home

    Matic

    Menlo Park, CA
    1 day ago
  •  ...See more about our culture on Role Summary About the Research Engineering team The team spans Platform (shared infra & clean...  ...- Platform RE Team: Enhance the shared training framework, data pipelines and cluster tooling used by every team; or - Embedded... 
    Work at office
    Visa sponsorship

    Mistral AI

    Palo Alto, CA
    4 days ago
  • $150k - $300k

    As Staff Software Engineer for data infrastructure, you will play a crucial role in designing and implementing the systems that process, analyze, and serve our satellite constellation’s data to end‑users. You will have the opportunity to shape highly reliable backend infrastructure... 
    Permanent employment
    Full time
    Remote work

    ArrayLabs, LLC

    Redwood City, CA
    4 days ago
  • $160k - $240k

     ...robots are private by design, with all data processing performed by the robot...  ...the role We're seeking a Senior Research Engineer to lead cutting-edge perception research...  ...Build and scale robust machine learning infrastructure supporting large-scale training while... 
    Work from home

    Matic

    Menlo Park, CA
    3 days ago
  • A pioneering AI company in California is seeking a Research Engineer for ML to enhance large-scale learning systems and collaborate with Research Scientists. The ideal candidate will have a Master's or PhD in Computer Science, over four years of experience in ML codebases... 

    Mistral AI

    Palo Alto, CA
    2 days ago
  • $176k - $253k

     ...At Toyota Research Institute (TRI), we're on a mission to improve...  ...with senior researchers and engineers to develop methods that make...  ...learned policies and simulation infrastructure to assess interpretability,...  ...information about how your data is processed, please contact... 
    Local area
    Shift work

    Toyota Research Institute

    Los Altos, CA
    3 days ago
  • $217.57k - $260k

     ...identity. To learn more, visit [ ROLE OVERVIEW ID.me is seeking a Staff Software Engineer - Data Platform to lead the design, build, and operation of the core data infrastructure that underpins our identity platform. This engineer will be responsible for ensuring... 
    Full time
    Temporary work
    Work at office
    Remote work
    Flexible hours

    ID.me

    Mountain View, CA
    4 days ago

Do you want to receive more vacancies?

Subscribe and receive similar vacancies to Research Engineer, Data Infrastructure. Be the first to apply!