Site Reliability Engineer

Vollzeit  •  IT & Software  •  Zürich, Switzerland

<div class="show-more-less-html__markup show-more-less-html__markup--clamp-after-5 relative overflow-hidden"> <p>Our client, a leading proprietary trading firm specialising in both systematic and discretionary strategies, is seeking a Site Reliability Engineer to join their Zurich office. This is a unique opportunity to evolve and enhance a highly sophisticated production trading environment, ensuring exceptional uptime and performance. The role focuses on delivering code-driven solutions while partnering closely with developers and traders to strengthen reliability, observability, and overall operational maturity within a low-latency, high-performance ecosystem.</p><p> </p><p> </p><p>The ideal candidate will bring deep experience supporting highly available, performance-critical, latency-sensitive systems, alongside a strong understanding of Linux internals and networking. A solid background in reliability engineering is essential, with a clear automation-first mindset and hands-on experience with containerisation technologies.</p><p> </p><p> </p><p> </p><p>Key responsibilities:</p><p>* Reliability &amp; Production Ownership: Own availability, stability, and performance of Linux-based trading systems (RedHat, Rocky, Ubuntu).</p><p>* Incident Response: Lead incident management, on-call, and blameless post-mortems, driving automation to prevent recurrence.</p><p>* Operational Processes: Maintain runbooks, documentation, and standards for consistent production support.</p><p>* Production Readiness: Partner with developers and traders to ensure reliable, high-performance system design and deployment.</p><p>* Linux Systems &amp; Performance: Perform low-level tuning (CPU, IRQ, memory, networking) for latency-sensitive workloads.</p><p>* Performance Diagnostics: Troubleshoot using perf, ftrace, tcpdump, and eBPF.</p><p>* Automation &amp; Infrastructure: Deliver infrastructure as code with Ansible, Terraform, Python, and shell scripting.</p><p> </p><p> </p><p>Required Qualifications:</p><p>* Experience in Site Reliability Engineering, Linux engineering, DevOps, or infrastructure-focused roles.</p><p>* Production Systems: Proven experience supporting highly available, performance-sensitive production environments.</p><p>* Linux Expertise: Deep knowledge of Linux internals, including scheduling, memory management, interrupts, filesystems, and storage.</p><p>* Networking: Strong understanding of TCP/IP, UDP, multicast, and distributed systems networking.</p><p>* Automation &amp; Tooling: Proficiency with Ansible, Terraform, Python, shell scripting, YAML/JSON, and Git-based workflows.</p><p>* Containers &amp; Observability: Experience with Docker (or similar) and familiarity with observability tools such as Prometheus, Grafana, ELK, or equivalent.</p><p></p> </div>

Job Overview
  • Datum der Veröffentlichung

    Mai 07, 2026

  • Kategorie

    IT & Software

  • Job Type

    Vollzeit

  • Standort

    Zürich, Switzerland

  • Arbeitgeber

    Selby Jennings

  • Source

    LinkedIn