IT Service Performance and Reliability Manager
Spectrum IT Recruitment
Southampton, United Kingdom
3 days ago
Role details
Contract type
Permanent contract
Employment type
Full-time (> 32 hours)
Working hours
Regular working hours
Languages
English
Compensation
£60K
Job location
Southampton, United Kingdom
Tech stack
Microsoft Windows
API
Amazon Web Services (AWS)
Azure
Cloud Computing
Communications Protocols
Data Integration
Linux
DevOps
Python
Linux Servers
Network Architecture
Node.js
PowerShell
Zabbix
Scripting (Bash/Python/Go/Ruby)
Grafana
Kibana
Webhooks
Job description
- Take ownership of performance, capacity, and resilience across critical IT services
- Lead observability across services by ensuring effective monitoring and actionable insights
- Manage capacity and performance through forecasting and trend analysis
- Identify risks early and drive improvements in service performance
- Ensure resilience and availability are built into services from the outset
- Support continuity planning and risk management
- Work closely with technical teams and stakeholders to resolve issues
- Deliver ongoing service improvements
Technologies:
- AWS
- OpenSearch
- Azure
- DevOps
- Grafana
- Support
- ITIL
- Kibana
- Linux
- Network
- PowerShell
- Python
- Windows
- Zabbix
- Node.js
- Cloud
More:
We are seeking an IT Service Performance & Reliability Manager to join our team. In this role, you will focus on keeping customer-facing services fast, reliable, and fully observable while driving continuous improvement. If you are looking for a position where you can make a tangible impact on service performance and resilience, we encourage you to apply.
Requirements
- Experience managing capacity and performance in IT environments
- Hands-on experience with AWS and Azure
- Strong knowledge of ITIL v3/v4 (certification required)
- Experience with monitoring/observability tools (e.g. Zabbix, Grafana, Kibana, OpenSearch)
- Knowledge of Windows and Linux server environments
- Scripting skills (e.g. Python, PowerShell, Node.js)
- Experience integrating data via APIs, webhooks, or messaging
- Strong analytical, problem-solving, and stakeholder management skills
- Desirable: DevOps exposure
- Desirable: Network infrastructure and communications protocols knowledge
- Desirable: Experience with social alarm platforms