Sign up to access all features of our service.
  • Job search
  • Favorites
  • Create a CV
    New
  • Salaries
  • Subscriptions

Research Engineer, Data Infrastructure

Mistral AI

About Mistral At Mistral AI, we believe in the power of AI to simplify tasks, save time, and enhance learning and creativity. Our technology is designed to integrate seamlessly into daily working life. We democratize AI through high-performance, optimized, open-source and cutting-edge models, products and solutions. Our comprehensive AI platform is designed to meet enterprise as well as personal needs. Our offerings include Le Chat, La Plateforme, Mistral Code and Mistral Compute - a suite that brings frontier intelligence to end-users. We are a dynamic, collaborative team passionate about AI and its potential to transform society. Our diverse workforce thrives in competitive environments and is committed to driving innovation. Our teams are distributed between France, USA, UK, Germany and Singapore. We are creative, low-ego and team-spirited. Join us to be part of a pioneering company shaping the future of AI. Together, we can make a meaningful impact. See more about our culture on Role Summary This role focuses on building and operating the next generation of data infrastructure at Mistral AI. You will be a core contributor to our evolution, helping us design and scale massive compute fleets and storage systems designed for high performance and scalability. You will help us move toward a future of decoupled control and data planes, scaling big data compute and storage platforms while ensuring secure and governed data access for MLOps and research. You will take full lifecycle ownership: from architecting the migration away from legacy orchestrators to implementing production-grade pipelines and participating in on-call rotations for critical training jobs. What will you do Build & Scale: Help us reach our goal of operating massive distributed compute and storage systems Global Orchestration: Architect and maintain multi-cluster orchestration layers to optimize workload placement across diverse hardware and regions. Design Future-Proof Storage: Architect our transition to modern storage formats to handle fine-tuning datasets at a scale that anticipates exabyte growth. Platform Engineering: Contribute to the development of our internal training platform, ensuring seamless model training and fine-tuning capabilities across Kubernetes and SLURM based environments. Metadata & Lineage: Implement and manage systems to provide clear visibility and lineage as our data and model pipelines grow in complexity. Operational Excellence: Use modern deployment workflows to manage cloud-native deployments, ensuring our data platform can scale by o About you Have 4+ years of experience in Data Infrastructure, MLOps, or Infrastructure Engineering. Have experience or a strong interest in supporting foundational compute and storage platforms. Are proficient in Python and enjoy solving the 'brittle data lake' problem with modern, columnar storage standards. Are well-versed in Kubernetes-native tooling and excited to debug large-scale distributed systems across multi-cluster environments. Take pride in building and operating scalable, reliable, and secure systems from the ground up. Are comfortable with ambiguity and the challenges of building high-scale infrastructure in a rapid-growth AI environment. What we offer Competitive salary and equity. Healthcare: Medical/Dental/Vision covered for you and your family. Pension: 401K (6% matching) PTO: 18 days. Transportation: Reimburse office parking charges, or $120/month for public transport. Sport: $120/month reimbursement for gym membership. Meal stipend: $400 monthly allowance for meals (solution might evolve as we grow bigger). Visa sponsorship. Coaching: we offer BetterUp coaching on a voluntary basis. By applying, you agree to our Applicant Privacy Policy. #J-18808-Ljbffr Mistral AI

Vacancy posted 1 day ago
Similar jobs that could be interesting for youBased on the Research Engineer, Data Infrastructure in Palo Alto, CA vacancy
  • Senior Research Engineer, Training Data Infrastructure in Foundation Models Cupertino, California, United States - Software and Services Our team is dedicated to solving the high-quality training data problem at the scale required to train advanced Foundation Models. We... 
    Suggested

    Apple Inc.

    Cupertino, CA
    15 hours ago
  • Orbifold AI in Palo Alto is seeking a Research Engineer - Multimodal AI to develop advanced AI models and optimize data platforms. The ideal candidate will have a strong background in computer vision and a passion for multimodal advancements. This position also offers the... 
    Suggested

    Bonfirevc

    Palo Alto, CA
    4 days ago
  • $160.36k - $240.54k

     ...diversity of its training and evaluation data. The team plays a crucial role in the...  ...by creating a scalable and reliable data infrastructure. This infrastructure is designed to produce...  ...team collaborates closely with system engineers to thoroughly validate the autonomous... 
    Suggested
    Work experience placement

    Icehouseventures

    Mountain View, CA
    15 hours ago
  • $193.93k - $291.15k

    About the Role We are a team of high-output generalists where ML and systems engineering converge to push autonomy performance forward. As a Senior Perception ML Data Infrastructure Engineer, you will own the critical bridge between our autonomous vehicle hardware, our... 
    Suggested

    Kindredventures

    Mountain View, CA
    4 days ago
  • $185k - $230k

    The Opportunity We are looking for a Senior Data Engineer to join our Data Platform team and build the core data foundations that power analytics, experimentation, and decision‑making across the company. In this role, you will design and own foundational data models, pipelines... 
    Suggested

    Cacheflow

    Mountain View, CA
    2 days ago
  • $228.6k - $314.25k

    Databricks is looking for an experienced engineer to join the ManagedTables team. You'll drive the development of storage solutions, optimize large production clusters, and mentor fellow engineers. With 15+ years in distributed systems, you’ll work on enhancing database... 

    I did my part and supported the Regular Toilet

    Mountain View, CA
    4 days ago
  • $228.6k - $314.25k

    Databricks is seeking an experienced software engineer to work on enterprise-grade analytical data systems, focusing on distributed systems and performance optimization. In this role, you will be responsible for delivering scalable architectures and mentoring team members... 

    Menlo Ventures

    Mountain View, CA
    3 days ago
  • $126k - $423k

     ...Valley company is creating the digital infrastructure needed to bring intelligence to every...  ...team We are looking for a passionate Research Engineer (AI/RL Infrastructure) to join the Research...  ...can access millions of miles of data from large fleets, and deploy methods... 
    Full time
    For contractors
    For subcontractor
    Casual work
    Work at office
    Immediate start
    Remote work
    Day shift

    Decisive Point

    Sunnyvale, CA
    1 day ago
  • $224k - $356.5k

     ...searching for a senior or principal engineer who specializes in building cutting‑edge infrastructure for large‑scale foundation...  ...the Generalist Embodied Agent Research (GEAR) group. Our team is leading...  ...datasets. Implement scalable data loaders and preprocessors tailored... 
    Full time

    NVIDIA Gruppe

    Santa Clara, CA
    4 days ago
  • $180k - $300k

    DatologyAI in Redwood City, CA is seeking a Research Engineer to drive innovative research and contribute to product development. The ideal...  ...along with comprehensive benefits. Join us to help optimize data curation for developing advanced AI models while enjoying unlimited... 

    datologyai

    Redwood City, CA
    1 day ago
  • $19 - $65 per hour

     ...Ready to get hands‑on with real‑world, large‑scale data challenges? We’re seeking a Software Engineer Intern to help build and improve an event mining framework...  ...for backend development and automation. Backend & infrastructure fundamentals: Solid understanding of backend... 
    Hourly pay
    Internship

    Medium

    Santa Clara, CA
    15 hours ago
  •  ...Founded by a team of Stanford researchers and entrepreneurs with deep...  ...innovation and systems engineering with a design-minded product...  ...models are only as good as the data that trains them. As a Staff...  ...Data Engineer, you'll own the infrastructure that takes raw audio —... 

    Sanas

    Palo Alto, CA
    15 hours ago
  • Rhoda AI is looking for Data Infrastructure MLEs in Palo Alto to develop systems that manage immense data volumes essential for robotics. This role requires expertise in designing large-scale data infrastructure to optimize the processing of billions of video clips, ensuring... 

    Rhoda AI

    Palo Alto, CA
    15 hours ago
  • GXL is seeking a Lead Data Engineer in Palo Alto to own the data infrastructure for their AI products. The role involves designing and maintaining scalable ETL/ELT pipelines, enhancing database performance, and collaborating with product teams. The ideal candidate has 2... 
    Visa sponsorship

    GXL

    Palo Alto, CA
    15 hours ago
  • A leading tech firm is seeking a senior leader in Data Security to enhance security for their data analytics platform. The role requires over 7 years of experience in Data Security, expertise in areas like Cryptography and Web Security, and a strong leadership background... 

    Menlo Ventures

    Mountain View, CA
    3 days ago
  • $160.36k - $240.54k

     ...diversity of its training and evaluation data. The team plays a crucial role in the...  ...by creating a scalable and reliable data infrastructure. This infrastructure is designed to produce...  ...team collaborates closely with system engineers to thoroughly validate the autonomous driving... 
    Work experience placement

    Icehouseventures

    Mountain View, CA
    5 days ago
  • PlusAI in Santa Clara is seeking a Software Engineer Intern to contribute to the development of advanced metrics dashboards. The intern...  ...while collaborating across domains to enhance backend infrastructure. This role requires strong programming ability and is ideal for... 
    Internship

    PlusAI, Inc.

    Santa Clara, CA
    2 days ago
  • $180k - $250k

     ...own large models on their own data. The current industry...  ...at worst. There is compelling research showing that smarter data selection...  ...an experienced Data Platform Engineer to join as a member of our core...  ...Engineering / Platform / Infrastructure Team. Experience building ML... 
    Work at office
    Visa sponsorship
    Relocation package

    datologyai

    Redwood City, CA
    9 days ago
  • $153k - $222k

     ...the Silicon Valley company is creating the digital infrastructure needed to bring intelligence to every moving...  ...About the role We are looking for infrastructure engineers with expertise in scaling open-source data infrastructure to join the Data & ML infra group.... 
    Full time
    For contractors
    For subcontractor
    Casual work
    Work at office
    Remote work
    Day shift

    Decisive Point

    Sunnyvale, CA
    2 days ago
  • $150k - $300k

    As Staff Software Engineer for data infrastructure, you will play a crucial role in designing and implementing the systems that process, analyze, and serve our satellite constellation’s data to end‑users. You will have the opportunity to shape highly reliable backend infrastructure... 
    Permanent employment
    Full time
    Remote work

    ArrayLabs, LLC

    Redwood City, CA
    3 days ago
  •  ...Position Summary: At HeyGen, we are at the forefront of developing applications powered by our cutting-edge AI research. As a Data Infrastructure Engineer, you will lead the development of fundamental data systems and infrastructure. These systems are essential for... 

    HeyGen

    Palo Alto, CA
    15 hours ago
  •  ...A pioneering AI company in California is seeking a Data Infrastructure Engineer to build and operate large-scale data systems. The role involves architecting multi-cluster systems for optimized performance and maintaining modern storage solutions. Ideal candidates have... 

    Mistral AI

    Palo Alto, CA
    15 hours ago
  •  ...models that leverage our large-scale, high-quality, real-world data collection system. At the same time, we’re building a new...  ...more time on the things they value most. As a Machine Learning Research Engineer, you will work on the software and algorithms that enable our... 

    Sunday Robotics

    Mountain View, CA
    1 day ago
  • A pioneering AI company in California is seeking a Research Engineer for ML to enhance large-scale learning systems and collaborate with Research Scientists. The ideal candidate will have a Master's or PhD in Computer Science, over four years of experience in ML codebases... 

    Mistral AI

    Palo Alto, CA
    1 day ago
  • $180k - $250k

    A tech-driven AI company in Redwood City is seeking an Infrastructure Engineer to develop core infrastructure and support multi-cloud environments. The ideal candidate has experience in large-scale infrastructure, proficiency with tools such as Kubernetes, and a passion... 

    Datology

    Redwood City, CA
    2 days ago
  • $162.8k - $203.5k

    Rivian is searching for a Staff Software Engineer on the Data team, responsible for expertise in cloud and data engineering. The role requires...  ...of the AWS Cloud Data Platform, leading critical infrastructure services for the ADAS team. Key qualifications include 5+... 

    Rivian

    Palo Alto, CA
    15 hours ago
  • $139.8k - $205.04k

     ...driven to create a better, more sustainable future, then this is the right place for you. Role Description The Engineering Manager, Data Engineering & Infrastructure leads a team of data engineers and infrastructure engineers supporting Powertrain, Battery, and... 
    Immediate start

    Lucid Motors

    Newark, CA
    1 day ago
  • Keywords to look for: Linux, Networking, Automation, Python/Java, Data Analytics Job Description: Experienced Analytics and Automation Engineer, preferably with experience in the telecom industry. The ideal candidate will have a strong analytics and automation background... 

    TechDigital Group

    Mountain View, CA
    2 days ago
  • $174k - $252k

    Senior Software Engineer, Infrastructure, Google Cloud Data Management Google Sunnyvale, CA, USA Qualifications Bachelor’s degree or equivalent practical experience. 5 years of experience with software development in C++, C, or Python. 3 years of experience testing, maintaining... 
    Full time

    Google Inc.

    Sunnyvale, CA
    2 days ago
  • $207k - $300k

    AI Innovation and Research Software Engineer, Platforms and Devices Google | Mountain View, CA, USA...  ...technical field. 8 years of experience with data structures and algorithms. 3 years...  .... Experience with Machine Learning Infrastructure. Experience with Machine Learning... 
    Full time

    Google Inc.

    Mountain View, CA
    2 days ago

Do you want to receive more vacancies?

Subscribe and receive similar vacancies to Research Engineer, Data Infrastructure. Be the first to apply!