Cloud Engineer - Observability Platform
Role details
Job location
Tech stack
Requirements
Extensive knowledge of infrastructure as code (Terraform, CFT, CDK, etc.).
Hands-on experience with continuous integration and continuous delivery/deployment using ALM tools such as Jenkins., 5+ years of hands-on experience deploying and/or supporting highly distributed multi-tiered systems at scale.
Deep understanding of distributed systems and telemetry architecture.
Experience choosing/modeling the right technique for the job (e.g., anomaly detection, ranking/recommendation, NLP), and knowing when a heuristic beats a model
Experience operating observability pipelines in Kubernetes or similar orchestration environments.
Hands-on experience in 2 or more languages (Python, Java, Go etc.).
Hands-on experience with Open Telemetry (OTEL) or any opensource Observability implementations.
5+ years of experience leading projects and designing, analyzing, and troubleshooting distributed systems
Create and maintain Grafana dashboards, visualizations, and alerts for real-time operational insights
Hands-on experience designing and building scalable and resilient applications in the cloud.
Good knowledge on networking concepts such as DNS, Load Balancers, routers , Linux etc.
Experience on containerization technologies such as Docker, Kubernetes etc.
qualifications:
Experience in deploying applications in AWS platforms such as EC2 & EKS or equivalent platforms from any other cloud provider such as Google Cloud Platform, Azure etc
Experience delivering software using engineering best practices and principles.