Mid-level systems engineer role focused on operating and scaling HPC and Linux infrastructure for AI at Mistral AI, remote-friendly in APAC.
This role suits mid-level engineers with strong Linux systems administration experience in large-scale HPC or cloud environments. Candidates should excel at automation, troubleshooting, and scaling clusters to thousands of nodes while handling petabyte-scale storage. Ideal applicants can act as an effective bridge between infrastructure, HPC, and research teams.
As published by Mistral AI on their official careers page.
About Mistral
At Mistral AI, we build high-performance, open, and efficient AI systems designed to power the next generation of applications. Our infrastructure combines large-scale distributed systems, cloud platforms, and HPC environments to support cutting-edge research and production workloads.
We are a collaborative, low-ego, and highly technical team, operating across Europe, the US, and beyond. As we scale rapidly, we are building the foundational infrastructure to support thousands of nodes and petabyte-scale systems.
Join us to be part of a pioneering company shaping the future of AI. Together, we can make a meaningful impact. See more about our culture on https://mistral.ai/careers.
We are looking for Systems Engineers / System Administrators to help design, operate, and scale the infrastructure behind Mistral’s AI platforms.
This is a hands-on, hybrid role combining:
You’ll work closely with infrastructure, HPC, and research teams to ensure our clusters and platforms run reliably at scale.
Help scale clusters toward hundreds to thousands of nodes
Work on systems handling petabyte-scale storage
Improve performance, reliability, and resource utilisation
Automate operational tasks using tools like Python, Bash, Ansible, or Terraform
Improve deployment, provisioning, and system lifecycle management
Contribute to system design and architecture decisions
Work closely with:
HPC / infrastructure teams
Platform / DevOps engineers
Research teams
Act as a bridge between users and infrastructure
Strong Linux systems administration experience (core requirement)
Experience working in large-scale environments:
HPC clusters or cloud infrastructure
Experience with job schedulers (e.g. Slurm)
Solid troubleshooting skills across systems, hardware, and networks
We are not expecting everything — strong depth in one area is valuable.
Containers / orchestration (e.g. Kubernetes)
Storage systems (e.g. Ceph, Lustre, NFS)
Networking fundamentals (Ethernet; InfiniBand is a plus)
Infrastructure as Code / automation tooling
GPU or AI/ML experience
Pragmatic problem solver who can operate in fast-scaling environments
Comfortable working across multiple domains (“Swiss army knife” mindset)
Able to go deep in one area while learning others
Low-ego, collaborative, and hands-on
--------------------------------------------------------------------
Impact: Play a pivotal role in scaling Mistral’s cutting-edge AI infrastructure.
Growth: Opportunity to shape data centre operations from the ground up in a high-growth startup environment.
Collaboration: Work with a talented, cross-functional team passionate about AI and technology.
Flexibility: Competitive compensation, benefits, and the chance to contribute to revolutionary projects.