Cloud Operations Engineer
Job description
The Cloud Operations Engineer plays a vital role in overseeing one of the key customer-facing elements of our business. The engineer will embrace the "customer first" ethos of the Customer Care division and share it with colleagues.
The role focuses on monitoring and managing our customer cloud deployments and customer-facing internal deployments. Responsibilities include managing the release of applications to cloud resources, monitoring running deployments to ensure that Service Level Agreements are met, and reviewing and analyzing the existing setup to optimize costs.
The role also entails promoting the work of the Cloud Operations team and helping other business teams in the region understand the value the team provides.
Key Responsibilities
- Cloud Deployment Management: Manage cloud deployments by developing and refining processes for deploying applications across a variety of cloud resource types, with a focus on optimization and achieving zero-downtime updates.
- Infrastructure as Code (IaC): Support the development and ongoing maintenance of Infrastructure as Code for AWS environments using Terraform.
- Platform Monitoring: Define, monitor, and respond to infrastructure and application alerts that may impact customer experience and service level agreements across all deployments.
- Operational Analysis: Collect, record, and analyze operational system data at both infrastructure and application levels. Ensure cloud resources are right-sized to optimize costs and that application summary data and analytics are readily available.
- Incident Management: Manage operational incidents through initial mitigation and resolution, and support the identification of corrective and preventive actions. This role will initially cover standard working hours and will transition to supporting out-of-hours coverage for other time zones.
- Case Management & Reporting: Utilize Salesforce (SFDC) to accurately track, manage, and report cloud operations cases.
- Perform other duties as assigned.
Requirements
- 3+ years of experience in Cloud Operations or a related role.
- Cloud & Infrastructure Expertise
  - Strong understanding of AWS services, including EC2, S3, RDS, Lambda, and related technologies.
  - Hands-on experience with Infrastructure as Code tools such as Terraform and/or CloudFormation.
  - Proficiency with monitoring and observability tools (e.g., Amazon CloudWatch, Datadog).
  - Experience integrating multiple cloud services and managing complex cloud environments.
  - Knowledge of IT infrastructure concepts, including passive and active networks, edge computing, and cloud services.
- Operational & Analytical Skills
  - Experience analyzing cloud and system data to identify performance, reliability, and cost-optimization opportunities.
  - Ability to clearly document and communicate findings and recommendations to stakeholders in concise, professional language.
  - Familiarity with incident management processes and working within established operational frameworks while suggesting and implementing improvements.
- Communication & Collaboration
  - Strong communication skills, with the ability to engage professionally with global clients in both technical and non-technical contexts.
  - Ability to motivate, mentor, and support team members, with the expectation of contributing to a future global team.
  - Self-driven, highly motivated, and accountable, with the ability to take full ownership of assigned tasks and deliverables.
- Education & Certifications
  - Bachelor's degree in Computer Science, Information Technology, or a related discipline.
  - Industry certifications (such as AWS Certified SysOps Administrator) are a plus.
Preferred / Additional Qualifications (Optional)
- Understanding of interoperable integrations using open protocols such as Modbus, BACnet, and OPC, commonly used in Smart Buildings and Manufacturing environments.
- Knowledge of Smart Building technologies, including common use cases and user personas.
- Awareness of IoT solutions, particularly in industrial contexts such as machine-vision occupancy sensing and indoor air quality monitoring.
- Some experience in a Software Development or Quality Assurance role, with exposure to:
  - Networking concepts including security groups, load balancers (ALB/ELB), and Route 53.
  - CI/CD pipelines using AWS CodePipeline, CodeBuild, or similar tools.
  - Containerization and orchestration with Docker, Amazon ECS, or Amazon EKS.
  - Database services such as Amazon RDS and Amazon DynamoDB.
  - Monitoring, logging, and observability using Amazon CloudWatch and AWS CloudTrail.
  - Scripting or automation experience using Python and/or PowerShell (a bonus).
Benefits & conditions
401(k) matching, Paid time off, Vision insurance, Dental insurance, Paid sick time, Flexible spending account, Employee assistance program, Paid holidays