Platform Engineer
Role details
Job location
Tech stack
Job description
CAIS is seeking a Platform Engineer to join our high-performing team responsible for architecting, engineering, and maintaining the foundation of our cloud platform. The team designs and operates AWS infrastructure, manages CI/CD solutions, and builds observability systems that ensure reliability, scalability, and performance across CAIS's technology ecosystem., * Design, build, and operate Kubernetes clusters on Amazon EKS, with an emphasis on reliability, scalability, and security
- Troubleshoot and resolve issues within the Kubernetes environment, including networking, workloads, and cluster operations
- Support the automation and maintenance of AWS infrastructure using infrastructure as code principles
- Implement and manage Role-Based Access Control (RBAC) and least-privilege access across clusters and supporting AWS services
- Collaborate with developers and product teams to improve the efficiency and reliability of deployment workflows
- Contribute to the design and implementation of observability practices, ensuring systems are monitored effectively and meaningful metrics are captured to measure reliability and performance
- Participate in incident response and post-incident reviews to continuously enhance platform resilience and operational maturity
- Help define and implement best practices for platform engineering, including deployment standards, environment consistency, and operational excellence
- Be open to learning and leveraging AI tools and techniques to improve platform reliability, automation, and efficiency
Requirements
Ideal candidates are experienced in platform management, including source control, SDLC tooling, infrastructure as code, and CI/CD pipelines, with an understanding of how to measure and maintain platform health through effective monitoring and alerting. This is a great opportunity to join a growing team of innovators, problem solvers, and collaborators dedicated to elevating and scaling our platform., * Strong understanding of Kubernetes and hands-on experience operating and troubleshooting clusters in Amazon EKS
- Working knowledge of AWS core services such as IAM, S3, RDS, and networking fundamentals
- Experience implementing infrastructure as code using Terraform or similar tools
- Familiarity with CI/CD concepts and tools, with GitHub Actions preferred
- Experience administering artifact management systems such as Nexus, Artifactory, or equivalent
- Understanding of observability principles including monitoring, logging, metrics, and tracing
- Solid grasp of Git-based workflows and version control best practices
- Foundational understanding of programming concepts; familiarity with languages used by CAIS developers-including Java/Kotlin, JavaScript/TypeScript (React), and Python-is a plus
- Comfortable collaborating with developers and cross-functional teams to support deployment and platform needs
Qualifications:
- Relevant industry experience in cloud infrastructure or platform engineering roles
- Hands-on experience supporting production workloads in AWS and Kubernetes environments
- Strong problem-solving skills with the ability to troubleshoot distributed systems and collaborate across teams
- Excellent communication and documentation skills, with an emphasis on clarity and operational excellence
- Bachelor's degree in Computer Science, Engineering, or equivalent practical experience
Whilst we spend a lot of our time working remotely, we believe there's no substitute (yet) for in-person collaboration, so we strongly encourage you to be in our London office on a weekly basis to spend time with your colleagues.