Research Engineer, Machine Learning

> 4 years of experience
Permanent contract
ML Engineer
Deep learning
CUDA
Machine Learning

✨ About Mistral ✨

At Mistral AI, we believe in the power of AI to simplify tasks, save time, and enhance learning and creativity. Our technology is designed to integrate seamlessly into daily working life.

We democratize AI through high-performance, optimized, open-source and cutting-edge models, products and solutions. Our comprehensive AI platform is designed to meet enterprise as well as personal needs. Our offerings include Le Chat, La Plateforme, Mistral Code and Mistral Compute - a suite that brings frontier intelligence to end-users.

We are a dynamic, collaborative team passionate about AI and its potential to transform society. Our diverse workforce thrives in competitive environments and is committed to driving innovation. Our teams are distributed across France, the USA, the UK, Germany and Singapore. We are creative, low-ego and team-spirited.

Join us to be part of a pioneering company shaping the future of AI. Together, we can make a meaningful impact. See more about our culture on https://mistral.ai/careers.

🚀 Role Summary 🚀

💡 About the Research Engineering team 💡

The team spans Platform (shared infra & clean code) and Embedded (inside research squads). Engineers can move along the research↔production spectrum as needs or interests evolve.

As a Research Engineer – ML track, you’ll build and optimise the large-scale learning systems that power our open-weight models. Working hand-in-hand with Research Scientists, you’ll join either:

  • Platform RE Team: Enhance the shared training framework, data pipelines and cluster tooling used by every team; or
  • Embedded RE Team: Sit inside a research squad (Alignment, Pre-training, Multimodal, …) and turn fresh ideas into repeatable, scalable code.

🌟 What will you do 🌟

  • Accelerate researchers by taking on the heavy parts of large-scale ML pipelines and building robust tools.
  • Interface cutting-edge research with production: integrate checkpoints, streamline evaluation, and expose APIs.
  • Conduct experiments on the latest deep-learning techniques (sparsified 70B+ runs, distributed training on thousands of GPUs).
  • Design, implement and benchmark ML algorithms; write clear, efficient code in Python.
  • Deliver prototypes that become production-grade components for Le Chat and our enterprise API.

👤 About you 👤

  • Master’s or PhD in Computer Science (or equivalent proven track record).
  • 4+ years working on large-scale ML codebases.
  • Hands-on with PyTorch, JAX or TensorFlow; comfortable with distributed training (DeepSpeed / FSDP / SLURM / K8s).
  • Experience in deep learning, NLP or LLMs; bonus for CUDA or data-pipeline chops.
  • Strong software-design instincts: testing, code review, CI/CD.
  • Self-starter, low-ego, collaborative.

🎁 What we offer 🎁

  • 💰 Competitive salary and equity.
  • 🚑 Healthcare: Medical/Dental/Vision covered for you and your family.
  • 👴🏻 Pension: 401(k) (6% matching)
  • 🏝️ PTO: 18 days
  • 🚗 Transportation: Reimbursement of office parking charges, or $120/month for public transport
  • 🏀 Sport: $120/month reimbursement for gym membership
  • 🥕 Meal stipend: $400 monthly allowance for meals (this arrangement may change as we grow)
  • 🌎 Visa sponsorship
  • 🤝 Coaching: we offer BetterUp coaching on a voluntary basis

By applying, you agree to our Applicant Privacy Policy.

Reference: mistral-lever+Mistral-AI-Research-Engineer-Machine-Learning

Skills

Data
Deep learning
CUDA
Machine Learning
Pytorch
TensorFlow
No code
Make
Backend
Python
