Senior Cloud Operations Engineer (10PM- 6AM)
Role details
Job location
Tech stack
Job description
We are seeking a highly skilled Cloud Operations Engineer to join our team. The ideal candidate will possess deep technical expertise in Linux systems, AWS cloud infrastructure, container orchestration platforms, and database administration.
In this role, you will be responsible for designing, implementing, operating, and optimizing complex cloud environments on AWS. You will manage containerized workloads running on Amazon EKS and ECS, maintain production-grade Linux systems and databases, and contribute to the reliability, scalability, and security of our cloud platforms.
This is an excellent opportunity to join a collaborative, engineering-driven organization where technical excellence, innovation, and continuous improvement are highly valued.
How will you make an impact?
- Design, implement, and operate scalable, secure, and highly available AWS cloud infrastructure leveraging services such as EC2, EKS, ECS, RDS, S3, VPC, Lambda, and IAM.
- Drive the reliability and performance of containerized applications by managing Amazon EKS and ECS environments, including cluster operations, networking, scaling, and troubleshooting.
- Ensure the stability, security, and efficiency of production Linux environments through system administration, performance tuning, storage management, networking, and incident resolution.
- Maintain and optimize relational databases (PostgreSQL, MySQL, Aurora) and NoSQL platforms (DynamoDB, Redis), ensuring high availability, performance, and disaster recovery readiness.
- Strengthen the organization's cloud security posture through effective management of IAM, network security controls, secrets management, and compliance best practices.
- Enhance platform observability and operational excellence by implementing and improving monitoring, logging, alerting, and performance analytics using CloudWatch, Prometheus, and Grafana.
- Take ownership of production incidents by participating in on-call rotations, leading troubleshooting efforts, performing root cause analysis, and driving continuous improvement initiatives.
- Partner closely with software engineering, DevOps, and platform teams to improve deployment processes, application reliability, and operational efficiency.
- Identify and implement cloud cost optimization opportunities through resource right-sizing, capacity planning, automation, and governance best practices., At NICE, we work according to the NICE-FLEX hybrid model, which enables maximum flexibility: 2 days working from the office and 3 days of remote work, each week. Naturally, office days focus on face-to-face meetings, where teamwork and collaborative thinking generate innovation, new ideas, and a vibrant, interactive atmosphere.
Requirements
Do you have experience in Ubuntu?, Do you have a Bachelor's degree?, * 4-5 years in a cloud operation, infrastructure engineering, or SRE role with a strong hands-on technical focus, * Deep hands-on experience with core AWS services: EC2, EKS, ECS, RDS/Aurora, S3, VPC, IAM, Lambda, CloudWatch, Route 53, and ALB/NLB
- Proven ability to design and troubleshoot complex AWS networking topologies (VPCs, subnets, transit gateways, security groups)
- Solid understanding of AWS IAM - roles, policies, permission boundaries, and cross-account access
Container Orchestration
- Hands-on production experience managing workloads on Amazon EKS and ECS - cluster lifecycle, node group management, networking (CNI, service mesh basics), and autoscaling
- Strong Docker fundamentals: image builds, registries (ECR), multi-stage builds, and container security
Linux
- Strong Linux administration skills: Bash/Python scripting, process and memory management, filesystem and storage operations, kernel parameters, and network diagnostics
- Experience managing and hardening Linux servers in production environments (RHEL, Ubuntu, or Amazon Linux)
Infrastructure as Code & Configuration Management
- Proficient in Terraform - module design, state management, remote backends, and workspace strategies
- Hands-on experience with Puppet for configuration management, node classification, and enforcing system state at scale
Databases
- Hands-on experience with relational databases: PostgreSQL, MySQL, or AWS RDS/Aurora - schema management, query optimisation, replication, backups, and failover
- Familiarity with NoSQL databases: DynamoDB, Redis, or MongoDB - data modelling, performance tuning, and operational monitoring
General
-
Familiarity with CI/CD pipelines (GitHub Actions, Jenkins, or AWS CodePipeline)
-
Experience with observability tooling: CloudWatch, Datadog, Prometheus, or Grafana
-
Education Level: Bachelor's in computer science/IT preferred (or any engineering field considered) or equivalent.
-
Certifications: AWS Certified DevOps Engineer, Certified Kubernetes Administrator (CKA), or similar certifications.
-
Ability to work effectively with distributed team
-
Must be detail oriented, task driven, and have excellent communication skills. Customer service focus is key.
-
Ability to work effectively with staff, peers, and others in and outside the organization to accomplish goals, objectives and to identify and resolve problems, Join an ever-growing, market disrupting, global company where the teams - comprised of the best of the best - work in a fast-paced, collaborative, and creative environment! As the market leader, every day at NICE is a chance to learn and grow, and there are endless internal career opportunities across multiple roles, disciplines, domains, and locations. If you are passionate, innovative, and excited to constantly raise the bar, you may just be our next NICEr!