Cloud Infrastructure Engineer
Job description
Our client is seeking experienced mid-to-senior software engineers or infrastructure specialists who are excited by the challenge of building and operating a complex, business-critical managed service used daily by hundreds of users.
You will join a highly skilled team and apply your expertise in infrastructure engineering, automation tooling, systems design, and operational resilience. Their environment offers opportunities to work on everything from CI/CD pipelines and container orchestration to configuration management, infrastructure as code, and observability tooling. While you may bring experience in specific tools or platforms, you will be expected to contribute broadly across their infrastructure landscape.
Our client's core product is a comprehensive administration platform delivered as software-as-a-service to leading South African investment and wealth management firms. Their mission is to provide a platform that leads the market in accuracy, resilience, and functionality.
The platform is cloud-hosted, and its stack includes modern tooling across CI/CD, containerization, monitoring, and cloud infrastructure. You will help build, maintain, and evolve this environment to meet high standards of performance, reliability, and security.
Requirements
They make extensive use of automation tools such as Terraform and Ansible, alongside programming in Python. Their environments are entirely based on Ubuntu Linux. Experience with server monitoring software (e.g. Prometheus, Grafana, Zabbix, Datadog) and a solid understanding of security principles and best practices (including hardening, access control, auditing, and incident response) are highly valued.
This is a remote-first role, and they are looking for individuals who can thrive in a distributed work environment. You must be able to work independently, manage your time effectively, and take initiative in progressing your tasks. Clear communication and self-direction are essential.
Proven experience delivering complex infrastructure and automation projects, including:
- Infrastructure as code (Terraform, Pulumi)
- Configuration management with Ansible
- Cloud platforms (AWS, Azure)
- Containerization (LXC, LXD, Docker, Kubernetes)
- CI/CD tooling (TeamCity, Jenkins, GitHub Actions)
- Server monitoring and alerting systems (Prometheus, Grafana, Zabbix, Datadog)
- Strong Python programming skills
- Solid Linux administration and general networking knowledge
- Understanding of infrastructure security best practices, including secure configuration, identity and access management, and compliance controls
- Experience with monitoring, alerting, and system observability
- Background in financial services infrastructure is advantageous but not required