Help to improve the resilience, automation, and observability of production systems that power a mission-critical quant trading platform for a systematic hedge fund.
This isn’t your typical ops role - they're looking for Engineers who can write code to eliminate toil, improve reliability and automate release, monitoring and recovery processes.
You'll build and maintain automated tools in Python for deployment, health checks, alerts and runbooks whilst focusing on reliability engineering and incident management.
You'll monitor the health of their trading systems whilst owning and improving incident response whilst also supporting their trading operations during market hours.
It's a lean, global team here. There's plenty of scope to take ownership, modernise tooling, and influence infrastructure direction.
Up to £90k + bonus
Central London 5x days on site (4x after passing probation)