Cloud SRE
Role details
Job location
Tech stack
Job description
- Work at project inception to capture requirements for software and infrastructure development
- Ensure platform reliability, standards, safety and security objectives are all met
- Design, implement and maintain monitoring, alerting and dashboarding solutions using Grafana and Open Telemetry
- Own SLOs, SLIs and error budgets for the DSE WebNet IoT platform
- Provision and manage cloud infrastructure across GCP, Azure or AWS using Terraform and/or Ansible
- Design and maintain cloud networking components including VPCs, load balancers, DNS and firewalls
- Manage containerised workloads using Docker and Kubernetes, including cluster lifecycle and resource optimisation
- Build and maintain CI/CD pipelines using GitHub Actions from code commit through to production deployment
- Transform the engineering teams SDLC from a manually driven process to a fully automated one
- Develop platform-level services and tooling in one or more of PHP, Golang, NodeJS (TypeScript) or C#
- Maintain and extend WebNets MQTT broker infrastructure and WebSocket/HTTP API layers
- With a focus on IoT protocols and technologies, remain up to date with the latest developments and make suggestions on how these can be applied
- Be open and innovative in the use of new programming platforms and tools
- Review software and infrastructure designs with regard to coding standards and best practice
- Demonstrate diligent use of company version control systems and excellent Git discipline
- Support software deliveries through test, production and commercial deployment
- Produce professional documentation as a key part of the full development lifecycle
- Participate in on-call rotation and lead post-incident reviews with actionable follow-ups
Requirements
Working as an individual and as part of a team; the successful Cloud SRE will be proficient in one or more object-oriented languages such as PHP, Golang, NodeJS (TypeScript) or C#, as well as web-based languages including TypeScript, HTML and CSS. The role requires hands-on experience building monitoring solutions, CI/CD pipelines and cloud infrastructure. A strong communicator, this Cloud SRE will excel at requirements capture, design, development and verification.
The primary development you will be involved in is around DSEs IoT platform (DSE WebNet). In addition, there will be a number of ancillary tools and business systems that require development.
The companys products operate in the energy control and automation markets and are designed to provide energy throughout a wide range of forms such as generator sets, renewables, microgrid as well as off-highway machinery.
The successful engineer will be keen to be involved in all development areas. You should have a proven track record in both problem solving and standards-compliant, quality-focused software and infrastructure development. You will be required to follow all existing processes and working practices, contributing suggestions to their improvement.
Working as part of a multidisciplinary team, the successful engineer will have good communication skills and a can-do attitude. A genuine passion for innovative technology will be advantageous. This opportunity will enable the successful candidate to gain valuable knowledge of industrial controls, IoT infrastructure and cloud-native best practices., * Proficiency in one or more of PHP, Golang, NodeJS (TypeScript) or C#
- Proficiency in MySQL or NoSQL fundementals
- API design and programming (JSON/REST)
- Proficient user of Git for source control management
- Understanding of IoT technologies and protocols such as MQTT and edge devices
- WebSocket and HTTP fundamentals
- Excellent team player with collaborative problem solving and troubleshooting capabilities
- Accustomed to breaking down tasks for more accurate estimates of effort
- Used to working in a high-pace environment
- Enthusiastic and optimistic with a genuine passion for cloud-native and IoT technology
- Strong communicator across technical and non-technical stakeholders
- Docker and Kubernetes in a production environment
- Infrastructure-as-code using Terraform and/or Ansible
- CI/CD pipeline development with GitHub Actions
- Experience with one or more of GCP, Microsoft Azure or AWS
- Understanding of networking in a high availability environment
- Hypervisor setup, configuration and management (vSphere/ESXi)
- Linux server setup and health monitoring (CentOS/Rocky Linux or other RedHat-based distro)
- Qualification or Equivalent Experience in Software Engineering or Cloud Engineering
Benefits & conditions
- 25 days holiday + Bank Holidays
- New flexible holiday scheme
- Bupa Healthcare package
- Life Insurance
- Enhanced Maternity/Paternity pay
- 5% Pension contributions
- Companywide performance based annual bonus scheme
Pay: £33,769.15-£45,940.33 per year
Benefits:
- Company events
- Company pension
- Cycle to work scheme
- Enhanced maternity leave
- Enhanced paternity leave
- Flexitime
- Free parking
- On-site parking
- Private medical insurance
- Work from home
Ability to commute/relocate:
- Sutton-In-Ashfield NG17 5LD: reliably commute or plan to relocate before starting work (required)
Work authorisation:
- United Kingdom (preferred)