Cloud Infrastructure Engineer

Black Pen Recruitment
5 days ago

Role details

Contract type
Permanent contract
Employment type
Full-time (> 32 hours)
Working hours
Shift work
Languages
English, Afrikaans
Experience level
Senior
Compensation
£ 78K

Job location

Remote

Tech stack

Amazon Web Services (AWS)
Azure
Ubuntu (Operating System)
Software as a Service
Cloud Computing
Configuration Management
Computer Programming
Continuous Integration
Github
Identity and Access Management
Python
Linux System Administration
Ansible
Prometheus
Server Administration
Zabbix
Datadog
Pulumi
Grafana
Containerization
Kubernetes
TeamCity
Lxc
Terraform
Docker
Jenkins

Job description

Our client is seeking experienced mid-to-senior software engineers or infrastructure specialists who are excited by the challenge of building and operating a complex, business-critical managed service used daily by hundreds of users.

You will join a highly skilled team and apply your expertise in infrastructure engineering, automation tooling, systems design, and operational resilience. Their environment offers opportunities to work on everything from CI/CD pipelines and container orchestration to configuration management, infrastructure as code, and observability tooling. While you may bring experience in specific tools or platforms, you will be expected to contribute broadly across our infrastructure landscape.

Our client's core product is a comprehensive administration platform delivered as software-as-a-service to leading South African investment and wealth management firms. Their mission is to provide a platform that leads the market in accuracy, resilience, and functionality.

As a cloud-hosted solution, their stack includes modern tooling across CI/CD, containerization, monitoring, and cloud infrastructure. You will help build, maintain, and evolve this environment to meet high standards of performance, reliability, and security.

Requirements

They make extensive use of automation tools such as Terraform and Ansible, alongside programming in Python. Their environments are entirely based on Ubuntu Linux. Experience with server monitoring software (e.g. Prometheus, Grafana, Zabbix, Datadog) and a solid understanding of security principles and best practices (including hardening, access control, auditing, and incident response) is highly valued.

This is a remote-first role, and they are looking for individuals who can thrive in a distributed work environment. You must be able to work independently, manage your time effectively, and take initiative in progressing your tasks. Clear communication and self-direction are essential., Proven experience delivering complex infrastructure and automation projects, including:

  • Infrastructure as code (Terraform, Pulumi)
  • Configuration management with Ansible
  • Cloud platforms (AWS, Azure)
  • Containerization (LXC, LXD, Docker, Kubernetes)
  • CI/CD tooling (TeamCity, Jenkins, GitHub Actions)
  • Server monitoring and alerting systems (Prometheus, Grafana, Zabbix, Datadog)
  • Strong Python programming skills
  • Solid Linux administration and general networking knowledge
  • Understanding of infrastructure security best practices, including secure configuration, identity and access management, and compliance controls
  • Experience with monitoring, alerting, and system observability
  • Background in financial services infrastructure is advantageous but not required

About the company

Our client is headquartered in Johannesburg, South Africa, but operates as a fully distributed team. You can work from anywhere with a timezone within 3 hours of South African Standard Time. You must be available during our core business hours (Monday to Friday, 09:00-16:00 SAST). The full team meets annually for planning and social events, with additional in-person collaboration among Johannesburg-based staff.

Apply for this position