IT Service Performance and Reliability Manager

Spectrum IT Recruitment
Southampton, United Kingdom
3 days ago

Role details

Contract type
Permanent contract
Employment type
Full-time (> 32 hours)
Working hours
Regular working hours
Languages
English
Compensation
£ 60K

Job location

Southampton, United Kingdom

Tech stack

Microsoft Windows
API
Amazon Web Services (AWS)
Azure
Cloud Computing
Communications Protocols
Data Integration
Linux
DevOps
Python
Linux Servers
Network Architecture
Node.js
Powershell
Zabbix
Scripting (Bash/Python/Go/Ruby)
Grafana
Kibana
Webhooks

Job description

  • Take ownership of performance, capacity, and resilience across critical IT services
  • Lead observability across services by ensuring effective monitoring and actionable insights
  • Manage capacity and performance through forecasting and trend analysis
  • Identify risks early and drive improvements in service performance
  • Ensure resilience and availability are built into services from the outset
  • Support continuity planning and risk management
  • Work closely with technical teams and stakeholders to resolve issues
  • Deliver ongoing service improvements

Technologies:

  • AWS
  • OpenSearch
  • Azure
  • DevOps
  • Grafana
  • Support
  • ITIL
  • Kibana
  • Linux
  • Network
  • PowerShell
  • Python
  • Windows
  • Zabbix
  • NodeJS
  • Cloud

More:

We are seeking an IT Service Performance & Reliability Manager to join our team. In this role, you will focus on keeping customer-facing services fast, reliable, and fully observable while driving continuous improvement. If you are looking for a position where you can make a tangible impact on service performance and resilience, we encourage you to apply.

Requirements

  • Experience managing capacity and performance in IT environments
  • Hands-on experience with AWS and Azure
  • Strong knowledge of ITIL v3/v4 (certification required)
  • Experience with monitoring/observability tools (e.g. Zabbix, Grafana, Kibana, OpenSearch)
  • Knowledge of Windows and Linux server environments
  • Scripting skills (e.g. Python, PowerShell, Node.js)
  • Experience integrating data via APIs, webhooks, or messaging
  • Strong analytical, problem-solving, and stakeholder management skills
  • Desirable: DevOps exposure
  • Desirable: Network infrastructure and communications protocols knowledge
  • Desirable: Experience with social alarm platforms

Apply for this position