Systems Engineer, HPC
<div class="show-more-less-html__markup show-more-less-html__markup--clamp-after-5 relative overflow-hidden"> <strong>About Mistral<br/><br/></strong>At Mistral AI, we build high-performance, open, and efficient AI systems designed to power the next generation of applications. Our infrastructure combines large-scale distributed systems, cloud platforms, and HPC environments to support cutting-edge research and production workloads.<br/><br/>We are a collaborative, low-ego, and highly technical team, operating across Europe, the US, and beyond. As we scale rapidly, we are building the foundational infrastructure to support thousands of nodes and petabyte-scale systems.<br/><br/>Join us to be part of a pioneering company shaping the future of AI. Together, we can make a meaningful impact. See more about our culture on https://mistral.ai/careers.<br/><br/><strong>About The Role<br/><br/></strong>We are looking for Systems Engineers / System Administrators to help design, operate, and scale the infrastructure behind Mistral’s AI platforms.<br/><br/>This is a hands-on, hybrid role combining:<br/><br/>Systems administration (operating and troubleshooting large-scale Linux environments)<br/><br/>Systems engineering (automation, scalability, and performance improvements)<br/><br/>You’ll work closely with infrastructure, HPC, and research teams to ensure our clusters and platforms run reliably at scale.<br/><br/><strong><strong>What You’ll Work On<br/><br/></strong></strong><strong>Core Systems Operations<br/><br/></strong><ul><li>Operate and maintain large-scale Linux environments (bare metal, clusters, cloud) </li><li>Monitor system health, troubleshoot incidents, and ensure high availability </li><li>Support production and research workloads across multiple environments <br/><br/><br/></li></ul><strong> Scaling Infrastructure<br/><br/></strong><ul><li>Help scale clusters toward hundreds to thousands of nodes </li><li>Work on systems handling petabyte-scale storage 
</li><li>Improve performance, reliability, and resource utilisation <br/><br/><br/></li></ul><strong> Automation &amp; Engineering<br/><br/></strong><ul><li>Automate operational tasks using tools such as Python, Bash, Ansible, or Terraform </li><li>Improve deployment, provisioning, and system lifecycle management </li><li>Contribute to system design and architecture decisions <br/><br/><br/></li></ul><strong> Cross-Functional Collaboration</strong> <br/><br/><ul><li>Work closely with:</li><ul><li>HPC / infrastructure teams </li><li>Platform / DevOps engineers </li><li>Research teams <br/></li></ul><li>Act as a bridge between users and infrastructure <br/><br/><br/></li></ul><strong><strong>What We’re Looking For<br/><br/></strong></strong><strong><strong>Must-have</strong></strong> <br/><br/><ul><li>Strong Linux systems administration experience (core requirement) </li><li>Experience working in large-scale environments:</li><ul><li>HPC clusters or cloud infrastructure <br/></li></ul><li>Experience with job schedulers (e.g. Slurm) </li><li>Solid troubleshooting skills across systems, hardware, and networks <br/><br/><br/></li></ul><strong><strong>Nice-to-have (any of these)<br/><br/></strong></strong>We are not expecting everything; strong depth in one area is valuable.<br/><br/><ul><li>Containers / orchestration (e.g. Kubernetes) </li><li>Storage systems (e.g. 
Ceph, Lustre, NFS) </li><li>Networking fundamentals (Ethernet; InfiniBand is a plus) </li><li>Infrastructure as Code / automation tooling </li><li>GPU or AI/ML experience <br/><br/><br/></li></ul><strong><strong>Profile We Value<br/><br/></strong></strong><ul><li>Pragmatic problem solver who can operate in fast-scaling environments </li><li>Comfortable working across multiple domains (“Swiss army knife” mindset) </li><li>Able to go deep in one area while learning others </li><li>Low-ego, collaborative, and hands-on <br/><br/><br/></li></ul><hr/><br/><br/><strong><strong>Why Join Mistral?<br/><br/></strong></strong><ul><li>Impact: Play a pivotal role in scaling Mistral’s cutting-edge AI infrastructure. </li><li>Growth: Opportunity to shape data centre operations from the ground up in a high-growth startup environment. </li><li>Collaboration: Work with a talented, cross-functional team passionate about AI and technology. </li><li>Flexibility: Competitive compensation, benefits, and the chance to contribute to revolutionary projects.</li></ul> </div>